I’ve been thinking about ways to further improve translation quality in argos-translate and LT. Wondering if anyone has tried to perform transfer learning from existing open-source models that perform well but are released under restrictive licenses.
In short:
- Pick a list of source-language sentences from an existing corpus (e.g. OPUS)
- Translate that list into the target language using the existing model, OR validate/filter the corpus's existing translations with that model to discard bad ones (this might be tricky to automate).
- Train a new model on the resulting sentence pairs using argos-train
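To make the first two steps concrete, here is a rough Python sketch of what I have in mind. Everything in it is an assumption for illustration: `teacher_translate` is a hypothetical callable wrapping whichever existing model you pick, the length-ratio filter is just a cheap heuristic (not a real quality check), and the two line-aligned output files are a common parallel-corpus layout, not necessarily what argos-train expects as input.

```python
from typing import Callable, Iterable, List, Tuple


def keep_pair(src: str, tgt: str, max_ratio: float = 3.0) -> bool:
    """Cheap sanity filter for one sentence pair (a heuristic, not a quality metric)."""
    src, tgt = src.strip(), tgt.strip()
    if not src or not tgt:
        return False  # one side is empty
    if "\n" in src or "\n" in tgt:
        return False  # embedded newlines would break line alignment
    if src == tgt:
        return False  # the teacher probably just copied the input
    ratio = max(len(src), len(tgt)) / min(len(src), len(tgt))
    return ratio <= max_ratio  # drop pairs with wildly different lengths


def build_synthetic_corpus(
    sentences: Iterable[str],
    teacher_translate: Callable[[str], str],  # hypothetical wrapper around the existing model
    max_ratio: float = 3.0,
) -> List[Tuple[str, str]]:
    """Translate each unique source sentence with the teacher and keep plausible pairs."""
    pairs: List[Tuple[str, str]] = []
    seen = set()
    for src in sentences:
        src = src.strip()
        if not src or src in seen:
            continue  # skip blanks and duplicates
        seen.add(src)
        tgt = teacher_translate(src).strip()
        if keep_pair(src, tgt, max_ratio):
            pairs.append((src, tgt))
    return pairs


def write_parallel_files(
    pairs: Iterable[Tuple[str, str]], source_path: str, target_path: str
) -> None:
    """Write line-aligned source/target files (line i of one matches line i of the other)."""
    with open(source_path, "w", encoding="utf-8") as fs, \
            open(target_path, "w", encoding="utf-8") as ft:
        for src, tgt in pairs:
            fs.write(src + "\n")
            ft.write(tgt + "\n")
```

Usage would be something like `write_parallel_files(build_synthetic_corpus(lines, my_model_fn), "source.txt", "target.txt")`, where `lines` and `my_model_fn` are whatever you have on hand. The filter is where the real work is; a length ratio only catches the most obvious failures.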
Licensing-wise I think this should be allowed (as far as I understand, the output of a model is not subject to the licensing terms of the model itself).
Thoughts?