transformer-tricks

0.3.4
19.69k

A collection of tricks to speed up LLMs, see our transformer-tricks papers on arXiv