Capacity planning for reserved LLM throughput: latency, headroom, and synthetic workload simulation.
pip install slosizer