CVE-2026-33298

HIGH EPSS 37.6%

Published Mar 24, 20263mo ago · Modified Jun 17, 20262w ago

7.8 CVSS 3.1

High

Published Mar 24, 2026 3mo ago

Last Modified Jun 17, 2026 2w ago

Description

llama.cpp is an inference of several LLM models in C/C++. Prior to b7824, an integer overflow vulnerability in the `ggml_nbytes` function allows an attacker to bypass memory validation by crafting a GGUF file with specific tensor dimensions. This causes `ggml_nbytes` to return a significantly smaller size than required (e.g., 4MB instead of Exabytes), leading to a heap-based buffer overflow when the application subsequently processes the tensor. This vulnerability allows potential Remote Code Execution (RCE) via memory corruption. b7824 contains a fix.

CVSS Details

Base Score

7.8

Exploitability

1.8

Impact

5.9

Vector string

CVSS:3.1/AV:L/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H

Attack Vector Local

Attack Complexity Low

Privileges Required None

User Interaction Required

Scope Unchanged

Confidentiality High

Integrity High

Availability High

Threat Intelligence

EPSS Exploit Probability

37.6% percentile

Exploit & Patch Status

Public Exploit Known

No Patch Available

Weaknesses 2

CWE-122

CWE-190 Integer Overflow or Wraparound Numeric Error

Affected Products 1

Vendor	Product	Version	Range
ggml	llama.cpp	*	<b7824

References 2

github.com https://github.com/ggml-org/llama.cpp/releases/tag/b7824

Release Notes
github.com https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-96jg-mvhq-q7q7

ExploitVendor Advisory

Remediation

No remediation data recorded yet

Check vendor advisories and the NVD entry for patch availability.