The idea with seq2seq sentence boundary detection is to run an input like this through the network:
<detect-sentence-boundary> This is the first sentence. This is more text that i
Which gives an output like this:
This is the first sentence. <sentence-boundary>
This is currently used for the Mac app which can’t support Stanza. If there’s enough improvement in ML hardware/software by the time 2.0 makes sense this could not be necessary and instead translate entire paragraphs or more at a time.