Building a dataset automatically from OPUS

Hello, I have seen that Argos Train needs a dataset to train a new model. A good feature could be to build that dataset automatically from OPUS. That could be useful for adding more languages supported by LibreTranslate. I have made this code as a demo:
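For readers without access to the linked demo, here is a rough sketch of the idea. It is not the demo code itself. It assumes you have already downloaded and unpacked an OPUS corpus as two line-aligned plain-text files (one sentence per line, same line count), and that the training side wants a `source` file, a `target` file, and a small `metadata.json`. The function name `build_dataset` and the metadata keys are hypothetical, so check the Argos Train documentation for the exact layout it expects.

```python
import json
from pathlib import Path


def build_dataset(src_path, tgt_path, out_dir, src_code, tgt_code):
    """Pair two line-aligned files into source/target files (hypothetical helper).

    Returns the number of sentence pairs written.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    kept = 0
    with open(src_path, encoding="utf-8") as src, \
         open(tgt_path, encoding="utf-8") as tgt, \
         open(out / "source", "w", encoding="utf-8") as out_src, \
         open(out / "target", "w", encoding="utf-8") as out_tgt:
        # zip() walks both files in lockstep, so line N of one file
        # is treated as the translation of line N of the other.
        for s, t in zip(src, tgt):
            s, t = s.strip(), t.strip()
            # Skip pairs where either side is empty.
            if not s or not t:
                continue
            out_src.write(s + "\n")
            out_tgt.write(t + "\n")
            kept += 1
    # Assumed metadata keys; adjust to whatever the trainer actually reads.
    metadata = {"from_code": src_code, "to_code": tgt_code, "size": kept}
    (out / "metadata.json").write_text(
        json.dumps(metadata, indent=2), encoding="utf-8"
    )
    return kept
```

Calling `build_dataset("corpus.en", "corpus.fr", "data-en_fr", "en", "fr")` (file names are placeholders) would write the three files into `data-en_fr`. Real OPUS corpora are noisy, so a usable version would probably also need deduplication and length filtering.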



Thanks for building this!

I added a link to this project in the Argos Train Readme:
