Search CVEs

Top 20 matches

CVE-2026-34159

CRITICAL CVSS 9.8

llama.cpp is an inference of several LLM models in C/C++. Prior to version b8492, the RPC backend's deserialize_tensor() skips all bounds validation when a tensor's buffer field is 0. An unauthenticat

ggml

CVE-2024-42477

HIGH CVSS 7.5

llama.cpp provides LLM inference in C/C++. The unsafe `type` member in the `rpc_tensor` structure can cause `global-buffer-overflow`. This vulnerability may lead to memory data leakage. The vulnerabil

ggml

CVE-2024-42478

CRITICAL CVSS 9.8

llama.cpp provides LLM inference in C/C++. The unsafe `data` pointer member in the `rpc_tensor` structure can cause arbitrary address reading. This vulnerability is fixed in b3561.

ggml

CVE-2024-42479

CRITICAL CVSS 9.8

llama.cpp provides LLM inference in C/C++. The unsafe `data` pointer member in the `rpc_tensor` structure can cause arbitrary address writing. This vulnerability is fixed in b3561.

ggml

CVE-2026-33298

HIGH CVSS 7.8

llama.cpp is an inference of several LLM models in C/C++. Prior to b7824, an integer overflow vulnerability in the `ggml_nbytes` function allows an attacker to bypass memory validation by crafting a G

ggml

CVE-2026-27940

HIGH CVSS 7.8

llama.cpp is an inference of several LLM models in C/C++. Prior to b8146, the gguf_init_from_file_impl() in gguf.cpp is vulnerable to an Integer overflow, leading to an undersized heap allocation. Usi

ggml
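
The integer-overflow entries above describe a size computation that wraps around and yields an undersized allocation. The sketch below is not llama.cpp code; it only simulates unsigned 64-bit wraparound in Python to show the general failure mode. The function names `wrapped_alloc_size` and `checked_alloc_size` and the example values are hypothetical, chosen for illustration.

```python
# Illustrative sketch only: simulates uint64 arithmetic, not the real gguf.cpp logic.
U64_MAX = (1 << 64) - 1


def wrapped_alloc_size(count: int, elem_size: int) -> int:
    """Multiply the way unchecked unsigned 64-bit arithmetic would (wraps modulo 2**64)."""
    return (count * elem_size) & U64_MAX


def checked_alloc_size(count: int, elem_size: int) -> int:
    """Multiply with an explicit overflow check before the size is used."""
    if count < 0 or elem_size < 0:
        raise ValueError("count and elem_size must be non-negative")
    total = count * elem_size  # Python ints do not overflow
    if total > U64_MAX:
        raise OverflowError("allocation size does not fit in 64 bits")
    return total


# Hypothetical attacker-chosen element count taken from a file header.
count = 2**61 + 1
elem_size = 8

# The true product is 2**64 + 8, which wraps to 8 bytes.
print(wrapped_alloc_size(count, elem_size))
```

A buffer sized from the wrapped value would be far smaller than the data later written into it, which is why a checked multiplication (or an equivalent bound on the element count) belongs before the allocation.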

CVE-2026-21869

CRITICAL CVSS 9.8

llama.cpp is an inference of several LLM models in C/C++. In commits 55d4206c8 and prior, the n_discard parameter is parsed directly from JSON input in the llama.cpp server's completion endpoints with

ggml

CVE-2025-49847

HIGH CVSS 8.8

llama.cpp is an inference of several LLM models in C/C++. Prior to version b5662, an attacker‐supplied GGUF model vocabulary can trigger a buffer overflow in llama.cpp’s vocabulary‐loading code. Speci

ggml

CVE-2025-52566

HIGH CVSS 8.8

llama.cpp is an inference of several LLM models in C/C++. Prior to version b5721, there is a signed vs. unsigned integer overflow in llama.cpp's tokenizer implementation (llama_vocab::tokenize) (src/l

ggml

CVE-2025-53630

HIGH CVSS 8.9

llama.cpp is an inference of several LLM models in C/C++. Integer Overflow in the gguf_init_from_file_impl function in ggml/src/gguf.cpp can lead to Heap Out-of-Bounds Read/Write. This vulnerability i

CVE-2024-41130

MEDIUM CVSS 6.5

llama.cpp provides LLM inference in C/C++. Prior to b3427, llama.cpp contains a null pointer dereference in gguf_init_from_file. This vulnerability is fixed in b3427.

ggml

CVE-2025-62164

HIGH CVSS 8.8

vLLM is an inference and serving engine for large language models (LLMs). From versions 0.10.2 to before 0.11.1, a memory corruption vulnerability could lead to a crash (denial-of-service) and potenti

vllm

CVE-2026-53923

MEDIUM CVSS 5.3

vLLM is an inference and serving engine for large language models (LLMs). From 0.5.5 until 0.23.1rc0, integer truncation of tensor dimensions in vLLM's GGUF dequantize kernels (csrc/quantization/gguf/

CVE-2025-32444

CRITICAL CVSS 9.8

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.6.5 and prior to 0.8.5, having vLLM integration with mooncake, are vulnerable to remote c

vllm

CVE-2026-54235

MEDIUM CVSS 6.9

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, all temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN
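
The NaN behavior this entry mentions can be reproduced in a few lines. The sketch below is not vLLM code: `naive_temperature_ok`, `strict_temperature_ok`, and the upper bound of 2.0 are assumptions made up for illustration. It shows why a gate built only from `<` and `>` lets NaN through, and one possible stricter check.

```python
import math

# Hypothetical upper bound, chosen only for this illustration.
MAX_TEMPERATURE = 2.0


def naive_temperature_ok(temperature: float) -> bool:
    """Range gate using only < and >; every comparison with NaN is False."""
    if temperature < 0.0:
        return False
    if temperature > MAX_TEMPERATURE:
        return False
    return True


def strict_temperature_ok(temperature: float) -> bool:
    """Reject NaN and infinities explicitly before the range check."""
    if not math.isfinite(temperature):
        return False
    return 0.0 <= temperature <= MAX_TEMPERATURE


nan = float("nan")
print(naive_temperature_ok(nan))   # NaN passes the naive gate
print(strict_temperature_ok(nan))  # NaN is rejected by the strict gate
```

Writing the check as a positive condition (`0.0 <= t <= MAX`) also rejects NaN on its own, since that comparison is False for NaN; the explicit `math.isfinite` call just makes the intent visible.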

CVE-2024-12704

NONE

A vulnerability in the LangChainLLM class of the run-llama/llama_index repository, version v0.12.5, allows for a Denial of Service (DoS) attack. The stream_complete method executes the llm using a thr

llamaindex

CVE-2025-23254

HIGH CVSS 8.8

NVIDIA TensorRT-LLM for any platform contains a vulnerability in python executor where an attacker may cause a data validation issue by local access to the TRTLLM server. A successful exploit of this

CVE-2026-34756

MEDIUM CVSS 6.5

vLLM is an inference and serving engine for large language models (LLMs). From 0.1.0 to before 0.19.0, a Denial of Service vulnerability exists in the vLLM OpenAI-compatible API server. Due to the lac

vllm

CVE-2025-61784

HIGH CVSS 8.1

LLaMA-Factory is a tuning library for large language models. Prior to version 0.9.4, a Server-Side Request Forgery (SSRF) vulnerability in the chat API allows any authenticated user to force the serve

hiyouga

CVE-2025-53002

CRITICAL CVSS 9.8

LLaMA-Factory is a tuning library for large language models. A remote code execution vulnerability was discovered in LLaMA-Factory versions up to and including 0.9.3 during the LLaMA-Factory training

hiyouga
