A newly disclosed set of critical vulnerabilities in NVIDIA’s Triton Inference Server has put organizations operating AI workloads at significant risk.

August 4, 2025No CommentsCybersecurity News

TL;DR

Researchers at Wiz have identified multiple flaws in both the Windows and Linux versions of NVIDIA’s Triton open-source platform, enabling potential attackers to seize control of vulnerable servers without the need for authentication.

Overview of the Vulnerabilities

Three key vulnerabilities are at the heart of the issue, all residing in the Python backend of the Triton Inference Server, which is designed to process inference requests for AI models built with frameworks such as PyTorch and TensorFlow:

CVE-2025-23319 (CVSS 8.1): This vulnerability allows for an out-of-bounds write operation when a specially crafted request is processed, potentially leading to remote code execution or data tampering.
CVE-2025-23320 (CVSS 7.5): By sending an excessively large request, an attacker can exceed the shared memory limit, resulting in information leakage from protected memory areas.
CVE-2025-23334 (CVSS 5.9): An out-of-bounds read vulnerability that could be used to leak sensitive information from the server.

Exploitation Scenario and Impact

The Wiz research team demonstrated that chaining these vulnerabilities could escalate the threat from an information disclosure issue to a complete system compromise. Using CVE-2025-23320, an attacker can first uncover the unique internal name of Triton’s inter-process communication (IPC) shared memory region—a detail that should remain confidential. Leveraging the other two vulnerabilities, the attacker is then able to gain full remote control over the inference server.

Successful exploitation poses several risks, including theft of proprietary AI models, exposure of sensitive data, manipulation of AI responses, denial of service, and a potential entry point for wider network attacks. All these can occur without attackers needing valid credentials.

Additional Bugs and Remediation

NVIDIA’s latest security bulletin also details fixes for three more critical vulnerabilities (CVE-2025-23310, CVE-2025-23311, and CVE-2025-23317), each capable of enabling remote code execution, service disruption, information exposure, or unauthorized data alteration.

NVIDIA has released version 25.07 of the Triton Inference Server, addressing all noted vulnerabilities. At present, there are no reports indicating exploitation of these flaws in the wild.

NVIDIA

Last updated on August 4, 2025

Comments

No comments yet. Why don’t you start the discussion?

Leave a ReplyCancel reply

United States	~59% of ransomware attacks globally Thousands per year
Poland	1,000+ per week
Russia	Highest cybercrime threat level
China	Thousands per year
India	115% surge in attacks Q2 2024
Ukraine	Significant surge since 2022
Brazil	Among top countries for blocked attacks
Mexico	65% of businesses hit in 2024
Germany	High targeted rate (EU)
France	High targeted rate (EU)

AS Name	ASN
Bharat Sanchar Nigam Ltd	9829
No.31,Jin-rong Street	4134
CHINA UNICOM China169 Backbone	4837
DigitalOcean, LLC	14061
HUAWEI INTERNATIONAL PTE. LTD.	136907
Amazon.com, Inc.	14618
Alibaba (US) Technology Co., Ltd.	45102
Google LLC	396982
Amazon.com, Inc.	16509
3xK Tech GmbH	200373

IP Address	Notable Exploits/Context
104.238.159.149	SharePoint zero-day, broad exploitation
107.191.58.76	SharePoint zero-day, government targets
96.9.125.147	SharePoint, previously Ivanti exploits
139.162.47.194	Exploits on CitrixBleed 2
38.180.148.215	CitrixBleed 2 campaigns
185.224.128.17	High activity, Netherlands
89.248.163.200	High activity, Netherlands
15.235.218.150	Associated with APT, active C2
45.9.148.114	Associated with C2, malicious netflow
91.107.150.184	C2 infrastructure, recent IoC

Overview of the Vulnerabilities

Exploitation Scenario and Impact

Additional Bugs and Remediation

Share this:

Related

Comments

Leave a ReplyCancel reply