CVE-2026-33298

7.8

HIGH CVSS 3.1

llama.cpp has a Heap Buffer Overflow via Integer Overflow in GGUF Tensor Parsing

Overview

Description

llama.cpp is an inference of several LLM models in C/C++. Prior to b7824, an integer overflow vulnerability in the `ggml_nbytes` function allows an attacker to bypass memory validation by crafting a GGUF file with specific tensor dimensions. This causes `ggml_nbytes` to return a significantly smaller size than required (e.g., 4MB instead of Exabytes), leading to a heap-based buffer overflow when the application subsequently processes the tensor. This vulnerability allows potential Remote Code Execution (RCE) via memory corruption. b7824 contains a fix.

Details

INFO

Published Date :

March 24, 2026, 1:17 a.m.

Last Modified :

June 17, 2026, 10:37 a.m.

Remotely Exploit :

No

Source :

[email protected]

Impact

Affected Products

The following products are affected by CVE-2026-33298 vulnerability. Even if cvefeed.io is aware of the exact versions of the products that are affected, the information is not represented in the table below.

ID	Vendor	Product	Action
1	Ggml	llama.cpp

: Total Affected Vendor : 1 | Products : 1

Scoring

CVSS Scores

The Common Vulnerability Scoring System is a standardized framework for assessing the severity of vulnerabilities in software and systems. We collect and displays CVSS scores from various sources for each CVE.

Score	Version	Severity	Vector	Exploitability Score	Impact Score	Source
	CVSS					134c704f-9b21-4f2e-91b3-4a467353bcc0
	CVSS 3.1	HIGH				[email protected]

Solution

Update llama.cpp to version b7824 or later to fix heap-based buffer overflow.

Update llama.cpp to b7824 or later.
Validate GGUF file tensor dimensions.

Public PoC/Exploit Available at Github

CVE-2026-33298 has a 3 public PoC/Exploit available at Github. Go to the Public Exploits tab to see the list.

References

References to Advisories, Solutions, and Tools

Here, you will find a curated list of external links that provide in-depth information, practical solutions, and valuable tools related to CVE-2026-33298.

URL	Resource
https://github.com/ggml-org/llama.cpp/releases/tag/b7824	Release Notes
https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-96jg-mvhq-q7q7	Exploit Vendor Advisory

CWE - Common Weakness Enumeration

While CVE identifies specific instances of vulnerabilities, CWE categorizes the common flaws or weaknesses that can lead to vulnerabilities. CVE-2026-33298 is associated with the following CWEs:

CWE-122: Heap-based Buffer Overflow

CWE-190: Integer Overflow or Wraparound

Common Attack Pattern Enumeration and Classification (CAPEC)

Common Attack Pattern Enumeration and Classification (CAPEC) stores attack patterns, which are descriptions of the common attributes and approaches employed by adversaries to exploit the CVE-2026-33298 weaknesses.

CAPEC-92: Forced Integer Overflow Forced Integer Overflow CAPEC-92: Forced Integer Overflow Forced Integer Overflow

We scan GitHub repositories to detect new proof-of-concept exploits. Following list is a collection of public exploits and proof-of-concepts, which have been published on GitHub (sorted by the most recently updated).

learjet5/UIUC-CS598-bench

Collect and construct AI-powered open-source projects and their vulns/bugs for CS598 course project.

Python Shell C C++

Updated: 2 months, 4 weeks ago

0 stars 0 fork 0 watcher

Born at : April 6, 2026, 4:45 p.m. This repo has been linked 15 different CVEs too.

Aviral2642/vllm-integer-truncation-audit

Security audit documenting 221 silent int64-to-int32 truncation sites in vLLM's CUDA/C++ extensions that enable GPU buffer overflow via crafted GGUF model files.

Updated: 3 months, 3 weeks ago

4 stars 0 fork 0 watcher

Born at : April 4, 2026, 6:52 p.m. This repo has been linked 9 different CVEs too.

holmanholdings/lionguard

Cathedral-Grade Security for AI Agents. Attack vectors updated daily. Local-first, zero API cost. MIT licensed.

Python

Updated: 1 week, 4 days ago

6 stars 0 fork 0 watcher

Born at : March 13, 2026, 5:10 p.m. This repo has been linked 119 different CVEs too.

Results are limited to the first 15 repositories due to potential performance issues.

The following list is the news that have been mention CVE-2026-33298 vulnerability anywhere in the article.

Results are limited to the first 20 news articles due to potential performance issues.

The following table lists the changes that have been made to the CVE-2026-33298 vulnerability over time.

Vulnerability history details can be useful for understanding the evolution of a vulnerability, and for identifying the most recent changes that may impact the vulnerability's severity, exploitability, or other characteristics.

CVE Modified by [email protected]

Jun. 17, 2026

Action	Type	Old Value	New Value
Added	Affected		[{'vendor': 'ggml-org', 'product': 'llama.cpp', 'versions': [{'status': 'affected', 'version': '< b7824'}]}]

CVE Modified by 134c704f-9b21-4f2e-91b3-4a467353bcc0

Jun. 17, 2026

Action	Type	Old Value	New Value
Added	SSVC		{'id': 'CVE-2026-33298', 'role': 'CISA Coordinator', 'options': [{'exploitation': 'poc'}, {'automatable': 'no'}, {'technicalImpact': 'total'}], 'version': '2.0.3', 'timestamp': '2026-03-24T00:00:00+00:00'}

Initial Analysis by [email protected]

Apr. 30, 2026

Action	Type	New Value
Added	CPE Configuration	OR cpe:2.3:a:ggml:llama.cpp::::::::* versions up to (excluding) b7824
Added	Reference Type	GitHub, Inc.: https://github.com/ggml-org/llama.cpp/releases/tag/b7824 Types: Release Notes
Added	Reference Type	GitHub, Inc.: https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-96jg-mvhq-q7q7 Types: Exploit, Vendor Advisory

New CVE Received by [email protected]

Mar. 24, 2026

Action	Type	New Value
Added	Description	llama.cpp is an inference of several LLM models in C/C++. Prior to b7824, an integer overflow vulnerability in the `ggml_nbytes` function allows an attacker to bypass memory validation by crafting a GGUF file with specific tensor dimensions. This causes `ggml_nbytes` to return a significantly smaller size than required (e.g., 4MB instead of Exabytes), leading to a heap-based buffer overflow when the application subsequently processes the tensor. This vulnerability allows potential Remote Code Execution (RCE) via memory corruption. b7824 contains a fix.
Added	CVSS V3.1	AV:L/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H
Added	CWE	CWE-190
Added	CWE	CWE-122
Added	Reference	https://github.com/ggml-org/llama.cpp/releases/tag/b7824
Added	Reference	https://github.com/ggml-org/llama.cpp/security/advisories/GHSA-96jg-mvhq-q7q7

EPSS is a daily estimate of the probability of exploitation activity being observed over the next 30 days. Following chart shows the EPSS score history of the vulnerability.