CVE-2025-46560
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal tokenizer. The code dynamically replaces placeholder tokens (e.g., <|audio_|>, <|image_|>) with repeated tokens based on precomputed lengths. Due to inefficient list concatenation operations, the algorithm exhibits quadratic time complexity (O(n²)), allowing malicious actors to trigger resource exhaustion via specially crafted inputs. This issue has been patched in version 0.8.5.
EPSS 0.57% · 69.0th percentile
Risk Scores
Affected Products
| Vendor | Product | Versions |
|---|---|---|
| vllm | vllm | 0.8.0 |
| vllm-project | vllm | >= 0.8.0, < 0.8.5 |
| PyPI | vllm | 0.8.0 |
Timeline
- Jan 21, 1970 Security Advisory
- Apr 29, 2025 CVE Published
- Apr 30, 2025 CVE Updated
- Apr 30, 2025 EPSS Score
- Apr 30, 2025 PoC Published
- May 4, 2025 Coalition ESS Score
- May 12, 2025 EPSS Score
- May 24, 2025 EPSS Score
- May 29, 2025 Coalition ESS Score
- May 30, 2025 Coalition ESS Score
- Jun 5, 2025 EPSS Score
- Jun 16, 2025 EPSS Score
References
- https://github.com/vllm-project/vllm/security/advisories/GHSA-vc6m-hm49-g9qg url
- https://github.com/vllm-project/vllm/blob/8cac35ba435906fb7eb07e44fe1a8c26e8744f4e/vllm/model_executor/models/phi4mm.py#L1182-L1197 url
- https://nvd.nist.gov/vuln/detail/CVE-2025-46560 advisory
- https://github.com/vllm-project/vllm package