--- name: 🐛 Bug report about: Create a report to help us improve title: "[Bug]" labels: bug assignees: '' --- **Describe the bug** A clear and concise description of what the bug is. Please fill out the following sections and provide a minimal reproduction script so that we can provide a solution as quickly as possible! 1. **Environment Setup:** - OS: [e.g., Ubuntu 20.04] - Python Version: [e.g., 3.8.10] - Frameworks/Libraries: please paste output of `pip freeze` here - `colab` / script - was this run in `colab` or as a script? 2. **Dataset Details:** - Dataset Name: - Data Preprocessing Steps: [e.g., tokenization, formatting funcs, data collators, etc.] 3. **Model Details:** - Model ID: - Model Configuration: [e.g., lora params, quantization, etc.] 4. **Training Configuration:** - Trainer Args: `SFTConfig`, `GRPOConfig` 5. **Reproduction Steps:** - Minimal script to reproduce error - If using a `colab`, please provide the link to the notebook and describe any changes made. 6. **Expected Behavior:** 7. **Actual Behavior:** - [e.g., Description of the error, unexpected results, or performance issues encountered] - [e.g., Error messages or logs] 8. **Additional notes:** - Any additional information that might help us reproduce the bug.