LLM Reinforcement Learning framework developed from ByteDance.[GitHub](https://github.com/volcengine/verl) [Docs](https://verl.readthedocs.io/en/latest/)