NVIDIA Triton Inference Server: Critical Auth Bypass Puts AI Workloads at Risk

May 20, 2026 07:16 /HIGH /CVSS 7.3/10

Written by SCW Vulnerability Desk Vulnerability Intelligence

SCW vulnerabilitycvehigh-severityauthentication-bypasscwe-288

The National Vulnerability Database has disclosed CVE-2026-24206, an authentication bypass vulnerability in NVIDIA Triton Inference Server. This flaw carries a CVSSv3.1 score of 7.3 (High) and is categorized under CWE-288, indicating an authentication bypass using an alternate path or channel.

Successful exploitation of this vulnerability could lead to significant security compromises, including escalation of privileges, denial of service, and information disclosure. While specific affected product versions were not detailed by the National Vulnerability Database, the broad impact categories suggest a fundamental flaw that could undermine the integrity and confidentiality of AI/ML inference workloads.

For defenders, this means a critical review of NVIDIA Triton Inference Server deployments is necessary. An authentication bypass at this level can grant attackers unfettered access, effectively neutralizing perimeter defenses and allowing direct manipulation or exfiltration of sensitive models and data. The attacker’s calculus here is straightforward: bypass the gate, own the inference engine, and by extension, the data it processes.

What This Means For You

If your organization utilizes NVIDIA Triton Inference Server, you must prioritize identifying all instances within your environment. Verify your current version and be prepared to apply patches immediately once they are released. Proactive monitoring for unusual access patterns or activity on these servers is also crucial to detect potential pre-patch exploitation attempts. This isn't theoretical – an auth bypass is a direct path to compromise.

title: NVIDIA Triton Inference Server Auth Bypass Attempt - Free Tier id: scw-2026-05-20-ai-1 status: experimental level: critical description: | This rule detects attempts to access the Triton Inference Server's model inference endpoint ('/v2/models' with 'infer' in query) that result in an authentication error (401 or 403). This is a primary indicator of an attempt to exploit CVE-2026-24206, which allows for authentication bypass. The vulnerability could lead to unauthorized access and further compromise of AI workloads. author: SCW Feed Engine (AI-generated) date: 2026-05-20 references: - https://shimiscyberworld.com/posts/nvd-CVE-2026-24206/ tags: - attack.initial_access - attack.t1190 logsource: category: webserver detection: selection: cs-uri|contains: - '/v2/models' cs-uri-query|contains: - 'infer' sc-status: - 401 - 403 condition: cs-uri AND cs-uri-query AND sc-status falsepositives: - Legitimate administrative activity

ID	Type	Indicator
CVE-2026-24206	Auth Bypass	NVIDIA Triton Inference Server
CVE-2026-24206	Privilege Escalation	NVIDIA Triton Inference Server
CVE-2026-24206	DoS	NVIDIA Triton Inference Server
CVE-2026-24206	Information Disclosure	NVIDIA Triton Inference Server

Type

Indicator

CVE-2026-24206

Auth Bypass

NVIDIA Triton Inference Server

CVE-2026-24206

Privilege Escalation

NVIDIA Triton Inference Server

CVE-2026-24206

DoS

NVIDIA Triton Inference Server

CVE-2026-24206

Information Disclosure

NVIDIA Triton Inference Server

Source Platform	NVD
Channel	National Vulnerability Database
Published	May 20, 2026 at 07:16 UTC

NVIDIA Triton Inference Server: Critical Auth Bypass Puts AI Workloads at Risk

What This Means For You

Related ATT&CK Techniques

🛡️ Detection Rules

NVIDIA Triton Inference Server Auth Bypass Attempt - Free Tier

Indicators of Compromise

What This Means For You

Related ATT&CK Techniques

🛡️ Detection Rules

NVIDIA Triton Inference Server Auth Bypass Attempt - Free Tier

Indicators of Compromise

Related coverage

CVE-2026-20240 — Denial of Service

Splunk Enterprise, Cloud Vulnerability Exposes Session Cookies, Sensitive Data

CVE-2026-20238 — In Splunk AI Toolkit versions below 5.7.3, a low-privileged