MT based workflow

Thu May 18 09:39:28 BST 2023

On 5/18/23 00:23, Albert Astals Cid wrote:
> El dimecres, 17 de maig de 2023, a les 16:12:48 (CEST), Frederik Schwarzer va 
> escriure:
>> On Tuesday, May 16, 2023 9:51:28 PM CEST Oliver Kellogg wrote:
>>
>> Hi,
>>
>>> I had been exploring how to integrate https://translate.google.com into my
>>> PO translation workflow and had found this:
>>> https://github.com/zcribe/translate-po
>>
>> Apart from the feasability I would like to see the legal situation
>> discussed.
+1, similarly for code
>>
>> I guess we can safely assume at this point that the language model vendors
>> do not care about copyright when training their models. But do we already
>> know what they think about using the result of their models?
> 
> Things like https://libretranslate.com/ that is based on 
>   https://github.com/argosopentech/argos-translate
> should be copyright neutral given they have been trained on data that "has no 
> copyright" (If i'm not mistaken).
> 
Probably traceability is more important.  Public domain training
material that is not very good will unlikely deliver good translations.
Garbage in, garbage out.  Such traceability information can likely be
put in a separate file as it will likely be large.
> Cheers,
>   Albert
> 
>>
>> Cheers,
>> Frederik