gptqmodel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Version4.2.5
Downloads
332.16k
LicenseApache-2.0
AuthorModelCloud
UpdatedSep 16, 2025

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal7.*6.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included3 / 56 series
Selected total132.78k
332.2kAll-time
35.6kLast 30 days
866Last 24 h
0.01/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    2.2.0

    17.8k downloads
    16.9%
  • 02

    6.0.3

    12.2k downloads
    11.6%
  • 03

    7.0.0

    11.2k downloads
    10.6%
  • 04

    5.7.0

    9.4k downloads
    8.9%
  • 05

    5.8.0

    9.1k downloads
    8.7%
  • 06

    5.6.12

    4.0k downloads
    3.8%
  • 07

    5.4.2

    3.6k downloads
    3.4%
  • 08

    4.2.5

    3.2k downloads
    3.1%
  • 09

    Other

    34.7k downloads
    33.0%

Guess the next day

Thirteen recent days of gptqmodel downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH866641MONTUEWEDTHUFRISATSUNMONTUEWEDTHUFRISAT