gptqmodel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

gptqmodel has been downloaded 359,599 times in total on PyPI, including 35,577 in the last 30 days. The latest version is 4.2.5, released Sep 16, 2025. It is distributed under the Apache-2.0 license.

Version4.2.5
Downloads
359.60k
LicenseApache-2.0
AuthorModelCloud
UpdatedSep 16, 2025

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal7.*6.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 57 series
Selected total146.20k
359.6kAll-time
35.6kLast 30 days
1.3kLast 24 h
0.02/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    2.2.0

    15.9k downloads
    14.5%
  • 02

    7.0.0

    14.4k downloads
    13.2%
  • 03

    6.0.3

    13.5k downloads
    12.4%
  • 04

    5.8.0

    8.0k downloads
    7.3%
  • 05

    7.1.0

    5.0k downloads
    4.6%
  • 06

    5.7.0

    4.3k downloads
    3.9%
  • 07

    5.4.2

    4.0k downloads
    3.7%
  • 08

    6.0.0

    3.9k downloads
    3.6%
  • 09

    Other

    40.3k downloads
    36.9%

Guess the next day

Thirteen recent days of gptqmodel downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH1.3k1.8kWEDTHUFRISATSUNMONTUEWEDTHUFRISATSUNMON