thaw-vllm

The fork primitive for LLM inference. Snapshot a running session — weights + KV cache + scheduler state — and hydrate it into N divergent children that skip prefill. For RL rollouts, parallel coding agents, agent branching. Supports vLLM and SGLang.

Version0.5.1
Downloads
7.28k
LicenseApache-2.0
AuthorNils Matteson
UpdatedApr 23, 2026

Downloads

Weekly, last 90d.
Includes CI traffic.

VersionsTotal0.*
Range
View
Granularity
Group by
CI traffic
Stack: OffCI: Included2 / 16 series
Selected total14.55k
7.3kAll-time
1.4kLast 30 days
20Last 24 h
<0.01/sPer second

Version distribution

Share of downloads by released version. Computed over the last quarter.

  • 01

    0.3.0

    740 downloads
    10.2%
  • 02

    0.1.0

    627 downloads
    8.6%
  • 03

    0.2.1

    606 downloads
    8.3%
  • 04

    0.5.1

    602 downloads
    8.3%
  • 05

    0.1.4

    480 downloads
    6.6%
  • 06

    0.2.0

    468 downloads
    6.4%
  • 07

    0.1.2

    455 downloads
    6.3%
  • 08

    0.1.3

    451 downloads
    6.2%
  • 09

    Other

    2.8k downloads
    39.1%

Guess the next day

Thirteen recent days of thaw-vllm downloads. Drag the green handle on the right to guess where day fourteen lands.

TRUTH204SATSUNMONTUEWEDTHUFRISATSUNMONWEDTHUFRI