trl

1.0.0
35.14M

Train transformer language models with reinforcement learning.