gptqmodel
Production-ready LLM compression/quantization toolkit with hardware-accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.
Version: 4.2.5
Downloads: 332.16k
License: Apache-2.0
Author: ModelCloud
Updated: Sep 16, 2025
Downloads
Weekly, last 90d.
Includes CI traffic.
Selected total: 132.78k
All-time: 332.2k
Last 30 days: 35.6k
Last 24 h: 866
Per second: 0.01/s
Version distribution
Share of downloads by released version. Computed over the last quarter.
- 01. 2.2.0: 17.8k downloads (16.9%)
- 02. 6.0.3: 12.2k downloads (11.6%)
- 03. 7.0.0: 11.2k downloads (10.6%)
- 04. 5.7.0: 9.4k downloads (8.9%)
- 05. 5.8.0: 9.1k downloads (8.7%)
- 06. 5.6.12: 4.0k downloads (3.8%)
- 07. 5.4.2: 3.6k downloads (3.4%)
- 08. 4.2.5: 3.2k downloads (3.1%)
- 09. Other: 34.7k downloads (33.0%)
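
The share figures can be roughly reproduced from the listed counts. The sketch below is only an illustration: it assumes each share is the version's count divided by the sum of all listed counts, and the names `downloads_k` and `shares` are hypothetical. Because the page shows counts rounded to the nearest 0.1k, a recomputed share may differ from the displayed one by about 0.1 percentage point.

```python
# Download counts in thousands, as displayed in the version distribution.
# These are rounded values; the exact counts are not shown on the page.
downloads_k = {
    "2.2.0": 17.8,
    "6.0.3": 12.2,
    "7.0.0": 11.2,
    "5.7.0": 9.4,
    "5.8.0": 9.1,
    "5.6.12": 4.0,
    "5.4.2": 3.6,
    "4.2.5": 3.2,
    "Other": 34.7,
}


def shares(counts):
    """Return each entry's share of the total, in percent, to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


result = shares(downloads_k)
for name, pct in result.items():
    print(f"{name:>7}  {pct:4.1f}%")
```

The listed counts sum to about 105.2k for the quarter, so 2.2.0 alone accounts for roughly one download in six.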