Crow Translate
Kevin Kofler
kevin.kofler at chello.at
Sun Jul 14 01:31:00 BST 2024
Hennadii Chernyshchyk wrote:
> Regardless of the warning. I think that the only translation engine that
> fully respects privacy is LibreTranslate.
> But it doesn't support automatic language detection and can't do TTS. It's
> also disabled on the main Mozhi instance (instances can disable specific
> engines if they want).
> So I can add a warning, but there is not much to choose from :(
> Do you still think it's a good idea?
The issue is inherently that your application is relying on a web service
(Mozhi) to do the translations. That service then either forwards the
requests to yet another (typically proprietary) web service (e.g., Google
Translate) or to something local to the Mozhi server instance (e.g., a
LibreTranslate instance running on the same server as the Mozhi instance).
In the first case, there are two services that see your data, in the second
case, there is still one service that sees your data and that you have to
trust.
The only thing that is guaranteed to be privacy-friendly is to run (either
as a separate process or embedded as a library) a local instance of
something like LibreTranslate (or Argos Translate directly) or Bergamot (or
Marian directly) on the same device as your application. And libmozhi is
probably not the library you would want to use for that (though technically
I guess you could theoretically run and point libmozhi to a local Mozhi
instance and in turn point that to a local LibreTranslate instance).
As for:
> But it doesn't support automatic language detection and can't do TTS.
there are separate libraries for that.
For automatic language detection, see, e.g., cld2 or cld3, or one of several
Python or Perl modules.
For TTS, see eSpeak NG (or its library libespeakng) if you want something
simple and compact that supports many languages out of the box, or one of
the local modern AI-based TTS tools (Mozilla TTS, OpenAI Whisper, etc.) if
you want something that sounds better.
And, while you did not ask for it, for STT, there is the old PocketSphinx,
the newer Mozilla DeepSpeech, etc.
The Free Software implementations that you can run locally all focus on
doing one thing well, not on offering a complete solution like the Google
Translate web service does. Integrating that all into a complete solution is
then your job as the application developer.
Kevin Kofler
More information about the kde-core-devel
mailing list