gptqmodel
Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
gptqmodel has been downloaded 359,599 times in total on PyPI, including 35,577 in the last 30 days. The latest version is 4.2.5, released Sep 16, 2025. It is distributed under the Apache-2.0 license.
Version4.2.5
Downloads
359.60k
LicenseApache-2.0
AuthorModelCloud
UpdatedSep 16, 2025
Downloads
Weekly, last 90d.
Includes CI traffic.
VersionsTotal7.*6.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 57 series
Selected total146.20k
359.6kAll-time
35.6kLast 30 days
1.3kLast 24 h
0.02/sPer second
Sponsored
Sponsorships keep pepy free to read
Version distribution
Share of downloads by released version. Computed over the last quarter.
- 0114.5%
2.2.0
15.9k downloadsDownloads15.9k14.5% - 0213.2%
7.0.0
14.4k downloadsDownloads14.4k13.2% - 0312.4%
6.0.3
13.5k downloadsDownloads13.5k12.4% - 047.3%
5.8.0
8.0k downloadsDownloads8.0k7.3% - 054.6%
7.1.0
5.0k downloadsDownloads5.0k4.6% - 063.9%
5.7.0
4.3k downloadsDownloads4.3k3.9% - 073.7%
5.4.2
4.0k downloadsDownloads4.0k3.7% - 083.6%
6.0.0
3.9k downloadsDownloads3.9k3.6% - 0936.9%
Other
40.3k downloadsDownloads40.3k36.9%
Guess the next day
Thirteen recent days of gptqmodel downloads. Drag the green handle on the right to guess where day fourteen lands.