Commit graph

4 commits

Author SHA1 Message Date
HugoSenetaire
3fe24ff8cd
Fix output transform, add test to enforce tokenizer consistency (#73)
*Description of changes:* 

The bin indexes were shifted by one between input transform and output
transform. Subtracting 1 to the sampled tokens in output transform lead
to the correct reconstruction of the signal.

Add a test to ensure the consistency of the Chronos Tokenizer.

By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.

Co-authored-by: Lorenzo Stella <stellalo@amazon.com> and Abdul Fatir
Ansari <ansarnd@amazon.com>
2024-05-17 15:29:18 +02:00
Lorenzo Stella
4b1d1c818b
Fix types, add mypy to workflow (#42)
*Description of changes:* Fix some type checking issues, add mypy to
github workflow, apply black


By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.
2024-04-05 15:36:39 +02:00
Abdul Fatir
0595bd872b
Add pipeline.embed (#24)
*Description of changes:* This PR adds `pipeline.embed` which extracts
encoder embeddings from the model. These embeddings may be useful for
some downstream tasks such as classification, so this is useful to have.


By submitting this pull request, I confirm that you can use, modify,
copy, and redistribute this contribution, under the terms of your
choice.

---------

Co-authored-by: Abdul Fatir Ansari <ansarnd@amazon.de>
2024-03-25 13:18:50 +01:00
Lorenzo Stella
7ba945c995 Upload code 2024-03-13 09:58:39 +01:00