CVE-2025-46560 MEDIUM

CVE-2025-46560: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service

Vendor Vllm-Project

Product vllm

Weakness CWE-1333

Published April 30, 2025

Last update April 30, 2025

View on NVD All CVEs

CVSS base score

6.5/10

Attack vector Network

Attack complexity Low

Privileges required Low

User interaction None

Confidentiality None

Integrity None

CVSS vector

CVSS:3.1/AV:N/AC:L/PR:L/UI:N/S:U/C:N/I:N/A:H

What the vulnerability does

01Description

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal tokenizer. The code dynamically replaces placeholder tokens (e.g., <|audio_|>, <|image_|>) with repeated tokens based on precomputed lengths. Due to inefficient list concatenation operations, the algorithm exhibits quadratic time complexity (O(n²)), allowing malicious actors to trigger resource exhaustion via specially crafted inputs. This issue has been patched in version 0.8.5.

Key dates

02Disclosure timeline

April 30, 2025 CVE published

April 30, 2025 Record updated

External resources

03References

NVD — National Vulnerability Database https://nvd.nist.gov/vuln/detail/CVE-2025-46560 CWE — Common Weakness Enumeration https://cwe.mitre.org/data/definitions/1333.html

CVE-2025-46560: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing​ leads to denial of service

01Description

02Disclosure timeline

03References

CVE-2025-46560: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service