mirror of
https://github.com/NVIDIA-NeMo/DataDesigner
synced 2026-05-24 09:48:29 +00:00
Add support for capturing full conversation traces during LLM generation, enabling debugging and fine-tuning dataset creation.

Changes:

- Add `with_trace` field to LLMTextColumnConfig for per-column trace control
- Add `debug_override_save_all_column_traces` to RunConfig for global trace
- Introduce ChatMessage dataclass for structured message representation
- Update ModelFacade.generate() to return full message trace
- Rename trace column postfix from `__reasoning_trace` to `__trace`
- Add comprehensive traces documentation

Traces capture system/user/assistant messages in order, enabling visibility into the full generation conversation including correction retries.
21 lines
489 B
Markdown
# Run Config

The `run_config` module defines runtime settings that control dataset generation behavior,
including early shutdown thresholds, batch sizing, and non-inference worker concurrency.

## Usage
```python
import data_designer.config as dd
from data_designer.interface import DataDesigner

data_designer = DataDesigner()
data_designer.set_run_config(dd.RunConfig(
    buffer_size=500,
    max_conversation_restarts=3,
))
```
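
To make "batch sizing" concrete, here is a minimal standalone sketch. It does not import or reproduce the library: `SketchRunConfig` and `plan_batches` are hypothetical names invented for illustration, and the idea that `buffer_size` caps the number of records handled per batch is an assumption that this page does not confirm. Consult the API reference for the actual field semantics.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SketchRunConfig:
    """Hypothetical stand-in for illustration; not the library's RunConfig."""

    buffer_size: int = 500
    max_conversation_restarts: int = 3

    def __post_init__(self) -> None:
        # Reject values that could never describe a usable run.
        if self.buffer_size <= 0:
            raise ValueError("buffer_size must be positive")
        if self.max_conversation_restarts < 0:
            raise ValueError("max_conversation_restarts must be non-negative")


def plan_batches(num_records: int, config: SketchRunConfig) -> list[int]:
    """Split num_records into batch sizes no larger than config.buffer_size.

    Assumption: buffer_size acts as an upper bound on records per batch.
    """
    full, remainder = divmod(num_records, config.buffer_size)
    return [config.buffer_size] * full + ([remainder] if remainder else [])


config = SketchRunConfig(buffer_size=500, max_conversation_restarts=3)
batches = plan_batches(1200, config)
print(batches)  # [500, 500, 200]
```

Under that assumption, a smaller `buffer_size` would mean more, smaller batches for the same number of records.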
## API Reference

::: data_designer.config.run_config