* removing required resources
* fix tests
* add get required resources method to base column generator
* move classification functions to engine; remove required resources
* drop single from subclass names
* update model config logging
* fix unit test
* typo
* update type hint
* move tests
* Add generation type to ModelConfig
* pass tests
* added generate_text_embeddings
* tests
* remove sensitive=True old artifact no longer needed
* Slight refactor
* slight refactor
* Added embedding generator
* chunk_separator -> chunk_pattern
* update tests
* rename for consistency
* Restructure InferenceParameters -> CompletionInferenceParameters, BaseInferenceParameters, EmbeddingInferenceParameters
* Remove purpose from consolidated kwargs
* WithModelConfiguration.inference_parameters should be typed with BaseInferenceParameters
* Type as WithModelGeneration
* Add image generation modality
* update return type for generate_kwargs
* make generation_type a field of ModelConfig as opposed to a prop resolved based on the type of InferenceParameters
* remove regex based chunking from embedding generator
* Remove image generation for now
* more tests and updates
* column_type_is_llm_generated -> column_type_is_model_generated
* change set to list: fix flaky tests
* CompletionInferenceParameters -> ChatCompletionInferenceParameters for consistency with generation_type
* Update docs
* fix deprecation warning originating from cli model settings
* update display of inference parameters in cli list
* save prog on inference parameter
* updates for the config builder
* update cli readme
* update cli for inference parameters
* update inference parameter names
* flip order of vars
* WithCompletion -> WithChatCompletion
* specify InferenceParamsT
* Update columns.md with EmbeddingColumnConfig info
* make generation_type a discriminator field in inference params. add configuration support for max_parallel_requests and timeout
* DRY out some stuff in field.py
* docs for custom model settings
* Update nomenclature. prompt tokens -> input tokens, completion tokens -> output tokens in column statistics for consistency
* Add nvidia-embedding and openai-embedding to default model configs
* Fix typo in docs
* Make generate colab notebooks
* Address PR comments
* fine-tune -> adjust
* Add example notebook showing how to use image contexts
* change 101 -> tutorial
* update _README.md with info on the new tutorial
* add reference in mkdocs.yml
* simplify vlm tutorial
* update num_records on tutorials. Update .gitignore
* update readme info
* add models module to code reference
* fix links to generated ipynb
* change vlm in example tutorial to llama4-scout
* adding basic jupytext structure
Co-authored-by: Johnny Greco <jogreco@nvidia.com>
* few fixes
* first test for ci
* adding error intentionally to check workflow behavior
* test calling from other workflows
* typo
* trying as job instead
* couple of fixes
* checking path
* trying to fix path
* wrapping up
---------
Co-authored-by: Johnny Greco <jogreco@nvidia.com>
* Update persona docs
* Updated person sampling docs based on feedback
* remove nemotron personas sampling
* Remove nemotron personas sampling
* Update docs/concepts/person_sampling.md
---------
Co-authored-by: Johnny Greco <jogreco@nvidia.com>
* vibe it baby
* clean up
* iterate with claude
* Save prog
* Update info pipeline
* Fix tests
* Fix typo
* remove redundant overload
* Add support for multiple default model providers and config
* pull user-defined model configs and providers if available
* Added tests for default model settings
* save progress
* refactor cli to be modular and use OOP
* new tests for cli components
* config_dir -> config_path
* simplify list
* list tests
* stranded commit
* tests for commands
* tests for field.py
* tests for form.py
* more tests
* deleting providers should delete associated model configs
* add readme.md for cli
* clean up
* Fix tests
* feat: (FTUE) pull user-defined (via cli) model configs and providers (#24)
* added docs for quick start and default model settings
* Updates per chat
* update quickstart.md
* update default-model-settings.md
* add check for interface.py as well
* move default model config resolution to src/data_designer/__init__.py
* Revert "move default model config resolution to src/data_designer/__init__.py"
This reverts commit 806a81dc93.
* docs for cli
* update default-model-settings.md
* docs for model provider
* more docs
* add new tests for get provider name
* add lru cache
* remove non doc related changes
* PR feedback
* update reset info
* tip for settings files
* update
* update info about default inference providers
* DATA_DESIGNER_HOME_DIR -> DATA_DESIGNER_HOME
---------
Co-authored-by: Johnny Greco <jogreco@nvidia.com>