LLMOps
1 Min Read

FireAttention

Subhajeet Dey
January 5, 2025

Serving Open Source Models 4x faster than vLLM