Commit graph

10 commits

Author SHA1 Message Date
Lorenzo Stella
050d600f64
Bound number of workers by number of datasets (#157)
*Issue #, if available:* Fixes #154

*Description of changes:* Prior to the fix, some workers had no dataset
to consume when `dataloader_num_workers > len(training_data_paths)`.
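
A minimal sketch of the bound described above, not the actual change in the training script. The helper name `bound_num_workers` and the example paths are hypothetical; only `dataloader_num_workers` and `training_data_paths` come from the description.

```python
def bound_num_workers(requested_workers: int, num_datasets: int) -> int:
    """Cap the worker count so that no worker is left without a dataset."""
    if num_datasets < 1:
        raise ValueError("at least one dataset is required")
    return min(requested_workers, num_datasets)


# Hypothetical example: four workers requested, only two datasets available.
training_data_paths = ["first.arrow", "second.arrow"]
dataloader_num_workers = bound_num_workers(4, len(training_data_paths))
print(dataloader_num_workers)
```

With the cap in place, a request for more workers than datasets is reduced to the number of datasets, while smaller requests pass through unchanged.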


By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.
2024-07-23 10:37:14 +02:00
Abdul Fatir
d2e0c9d6d5
Set drop_prob = 0 for causal models (#125)
*Description of changes:* This PR sets `drop_prob = 0` when training
causal models. Missing values are problematic for causal model training.
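
The override can be pictured roughly as follows. This is a sketch under assumptions: the helper `effective_drop_prob` is hypothetical, and the `"causal"` and `"seq2seq"` labels for the model type are assumed here rather than taken from the description.

```python
def effective_drop_prob(model_type: str, drop_prob: float) -> float:
    """Return the drop probability to use for training.

    Assumption: model_type is "causal" for decoder-only models and
    "seq2seq" otherwise. Causal models always get 0.0.
    """
    if model_type == "causal":
        return 0.0
    return drop_prob


print(effective_drop_prob("causal", 0.2))
print(effective_drop_prob("seq2seq", 0.2))
```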


2024-06-16 17:28:47 +02:00
Abdul Fatir
2f92a126d3
Add support for causal models (#113)
*Description of changes:* This PR adds support for training
causal/decoder-only models.



Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.de>
2024-06-13 17:37:04 +02:00
Abdul Fatir
f49c4ee4b5
Remove print statements from train.py (#101)
*Description of changes:* Removes print statements that were left over
from a debugging session.


2024-06-07 23:36:45 +02:00
Abdul Fatir
6bcd4584a3
Enhance training script: auto tf32 detection and reorder default seed setting (#91)
*Description of changes:* Automatically set `tf32` to `False` when
running on an older NVIDIA GPU. Reorder the seed setting so that the
seed is saved as part of the training config.
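
One way such a check could look, as a sketch rather than the code from this commit. TF32 is available on NVIDIA GPUs with compute capability 8.0 (Ampere) or newer, so the decision reduces to a comparison on the major version. The helper `resolve_tf32` is hypothetical; in a PyTorch script the capability tuple could be obtained from `torch.cuda.get_device_capability()` when a GPU is present.

```python
from typing import Optional, Tuple


def resolve_tf32(requested: bool, capability: Optional[Tuple[int, int]]) -> bool:
    """Decide whether tf32 should stay enabled.

    capability is the (major, minor) compute capability of the GPU,
    or None when no CUDA device is available.
    """
    if not requested or capability is None:
        return False
    major, _minor = capability
    # TF32 needs compute capability 8.0 (Ampere) or newer.
    return major >= 8


print(resolve_tf32(True, (8, 0)))  # Ampere-class GPU
print(resolve_tf32(True, (7, 5)))  # older GPU, tf32 is switched off
```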



---------

Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.de>
2024-05-31 15:13:49 +02:00
Abdul Fatir
223e576e2e
Split input_transform into context_input_transform and label_input_transform (#82)
*Description of changes:* This splits `input_transform` into
`context_input_transform` and `label_input_transform`. Previously,
`input_transform` was used for both the context and the label during
training, which led to incorrect results when
`prediction_length > context_length`.
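
The description does not spell out the failure mode. The sketch below illustrates one plausible reading, assuming the shared transform truncated its input to the last `context_length` steps and computed a mean-absolute scale from it. Both function bodies are simplified stand-ins on plain lists, not the library's implementation, and they assume a non-empty context and `context_length >= 1`.

```python
from typing import List, Tuple


def context_input_transform(
    context: List[float], context_length: int
) -> Tuple[List[float], float]:
    """Truncate the context to the last context_length steps and scale it."""
    context = context[-context_length:]
    # Mean absolute value as the scale; fall back to 1.0 for an all-zero context.
    scale = sum(abs(x) for x in context) / len(context) or 1.0
    return [x / scale for x in context], scale


def label_input_transform(label: List[float], scale: float) -> List[float]:
    """Scale the label with the context's scale, without truncating it."""
    return [y / scale for y in label]


# prediction_length (3) is larger than context_length (2).
scaled_context, scale = context_input_transform([2.0, 4.0, 6.0, 8.0], 2)
scaled_label = label_input_transform([7.0, 14.0, 21.0], scale)
print(scaled_context, scale, scaled_label)
```

Under this assumption, passing the label through the context transform would have cut it down to `context_length` steps and rescaled it with its own statistics; a separate label transform keeps all `prediction_length` steps and reuses the scale from the context.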

TODO:

- [x] Update docstrings
- [x] Test the training script


---------

Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.com>
2024-05-28 09:58:22 +02:00
Abdul Fatir
16f927ccfe
Save training job info (#80)
*Description of changes:* This PR updates the training script to also
save the training details in the final checkpoint.



---------

Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.de>
2024-05-27 09:57:18 +02:00
Abdul Fatir
6732b1c2e3
Add a README file for the scripts (#67)
*Description of changes:* Adds usage examples for `scripts/`. 



---------

Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.com>
2024-05-10 16:04:24 +02:00
Lorenzo Stella
069df04e01
Add missing headers (#65)
2024-05-10 09:30:58 +02:00
Lorenzo Stella
b4e8085c7f
Add training script (#63)
*Description of changes:* Adds a training script and config files. The
script can be used for pre-training or adapted for fine-tuning Chronos
models.



---------

Co-authored-by: Abdul Fatir <Abdulfatirs@gmail.com>
2024-05-09 17:52:01 +02:00