DataDesigner

mirror of https://github.com/NVIDIA-NeMo/DataDesigner synced 2026-05-24 09:48:29 +00:00

Author	SHA1	Message	Date
Johnny Greco	6e6efc009f	docs: some updates for nano3 (#149) * some fixes * generate colab notebooks	2025-12-17 18:24:39 -05:00
Nabin Mulepati	8d4c6c12b4	chore: Update nvidia text default model alias to nano v3 (#133)	2025-12-15 15:03:12 -07:00
Nabin Mulepati	8370e4a00b	feat: support native embedding generation (#106) * Add generation type to ModelConfig * pass tests * added generate_text_embeddings * tests * remove sensitive=True old artifact no longer needed * Slight refactor * slight refactor * Added embedding generator * chunk_separator -> chunk_pattern * update tests * rename for consistency * Restructure InferenceParameters -> CompletionInferenceParameters, BaseInferenceParameters, EmbeddingInferenceParameters * Remove purpose from consolidated kwargs * WithModelConfiguration.inference_parameters should be typed with BaseInferenceParameters * Type as WithModelGeneration * Add image generation modality * update return type for generate_kwargs * make generation_type a field of ModelConfig as opposed to a prop resolved based on the type of InferenceParameters * remove regex based chunking from embedding generator * Remove image generation for now * more tests and updates * column_type_is_llm_generated -> column_type_is_model_generated * change set to list: fix flaky tests * CompletionInferenceParameters -> ChatCompletionInferenceParameters for consistency with generation_type * Update docs * fix deprecation warning originating from cli model settings * update display of inference parameters in cli list * save prog on inference parameter * updates for the config builder * update cli readme * update cli for inference parameters * update inference parameter names * flip order of vars * WithCompletion -> WithChatCompletion * specify InferenceParamsT * Update columns.md with EmbeddingColumnConfig info * make generation_type a discriminator field in inference params. add configuration support for max_parallel_requests and timeout * DRY out some stuff in field.py * Update nomenclature. prompt tokens -> input tokens, completion tokens -> output tokens in column statistics for consistency * Add nvidia-embedding and openai-embedding to default model configs * Fix typo in docs * Make generate colab notebooks * fine-tune -> adjust	2025-12-15 11:03:33 -07:00
Andre Manoel	68533c78be	docs: fix links on notebooks and add %%capture on install cell (#134)	2025-12-15 14:41:01 -03:00
Andre Manoel	7fa9a413ac	docs: add option to open notebook directly in Colab (#126)	2025-12-12 15:15:26 -03:00
Mike Knepper	32515ba724	style: Sort imports traditionally instead of within sections (#103)	2025-12-08 09:01:58 -06:00
Nabin Mulepati	1de2262b94	docs: add models module to code reference (#101) * Add example notebook showing how to use image contexts * change 101 -> tutorial * update _README.md with info on the new tutorial * add reference in mkdocs.yml * simplify vlm tutorial * update num_records on tutorials. Update .gitignore * update readme info * add models module to code reference * fix links to generated ipynb * change vlm in example tutorial to llama4-scout	2025-12-05 10:41:43 -07:00
Nabin Mulepati	8ccb724fb3	docs: Add example notebook showing how to use image contexts (#97)	2025-12-04 15:39:58 -07:00
Andre Manoel	6d921c48ba	fix: small typo on text file (#95) Notebooooks Also changing from "Jupytext Format" to "`.py` Format"	2025-12-03 18:31:35 -03:00
Nabin Mulepati	8e3080241b	docs: move models docs to concepts > models (#93)	2025-12-03 14:10:01 -07:00
Andre Manoel	60a898181a	fix: add download links to notebooks (#94)	2025-12-03 18:01:57 -03:00
Andre Manoel	5d4ad10b11	chore: moving notebooks to jupytext and cleaning up workflows (#91) * adding basic jupytext structure Co-authored-by: Johnny Greco <jogreco@nvidia.com> * few fixes * first test for ci * adding error intentionally to check workflow behavior * test calling from other workflows * typo * trying as job instead * couple of fixes * checking path * trying to fix path * wrapping up --------- Co-authored-by: Johnny Greco <jogreco@nvidia.com>	2025-12-03 17:29:07 -03:00

12 commits