CVE-2026-33298
HIGH EPSS 37.6%
Published Mar 24, 20263mo ago · Modified Jun 17, 20262w ago
7.8 CVSS 3.1
Published Mar 24, 2026 3mo ago
Last Modified Jun 17, 2026 2w ago
Description
llama.cpp is an inference of several LLM models in C/C++. Prior to b7824, an integer overflow vulnerability in the `ggml_nbytes` function allows an attacker to bypass memory validation by crafting a GGUF file with specific tensor dimensions. This causes `ggml_nbytes` to return a significantly smaller size than required (e.g., 4MB instead of Exabytes), leading to a heap-based buffer overflow when the application subsequently processes the tensor. This vulnerability allows potential Remote Code Execution (RCE) via memory corruption. b7824 contains a fix.
CVSS Details
Base Score
Exploitability
Impact
Vector string
CVSS:3.1/AV:L/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H Attack Vector Local
Attack Complexity Low
Privileges Required None
User Interaction Required
Scope Unchanged
Confidentiality High
Integrity High
Availability High
Threat Intelligence
EPSS Exploit Probability
37.6% percentile
Exploit & Patch Status
Public Exploit Known
No Patch Available
Weaknesses 2
CWE-122
CWE-190 Integer Overflow or Wraparound Numeric Error
Affected Products 1
| Vendor | Product | Version | Range |
|---|---|---|---|
| ggml | llama.cpp | * | <b7824 |
References 2
- github.com https://github.com/ggml-org/llama.cpp/releases/tag/b7824
- github.com https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-96jg-mvhq-q7q7
Remediation
No remediation data recorded yet
Check vendor advisories and the NVD entry for patch availability.