Yes, this would do, thanks. We are going to use it to store legal docs as well, as there is a lot of official material we can get from the EU, to start with. So a proper sequence here is relevant, as we present a single listitem for translation, but also offer it in the context of the full document for a better understanding of the original context.<div>
<br></div><div>We need something more complex than @en, as we store information on the script and the orthographic convention as well, so I guess it will have to be properties for that. This is necessary to treat languages that mix scripts (like Serbian and Japanese) and/or languages that underwent mutations, like Turkish (used to scripted in Arabic, now in Latin) or that may use different orthographic standards, like Nederlands or German. These features are relevant to correctors, that may "fish" in the reps to get "proper data" according to a given standard, and even alphabetic filters that will allow only certain chars in a string (if any app needs this).</div>
<div><br></div><div>We currently have a thing called "region" that is probably better called "context" in nepomuk, so that an end user may decide what contexts are relevant for him, plus he can filter by language/script etc, so that people host only what they can use. So there would be a metacontext, storing languages, scripts etc plus "meanings", where "meaning" is simply an attribute (actually a uuid) by which we can group "things that mean the same thing", and strings are actually of two types: expression vs definition, to obtain a dictionary like structure. </div>
<div><br></div><div>The rest is about being able to define "user templates", which is basically the same as we already can do by adding tags in Dolphin. The only difference is that we need this tags to be a Class property (where class is an element of a taxonomy), so some tags may be free text, other may require you choose a "meaning" or a number etc. This is basically like wiki templates, only you get automagically by tagging a resource or a string in a given category.</div>
<div><br></div><div>We currently have some 2 million "meanings" in an experimental db, just by assembling a couple of wiktionaries and free stuff from FAO, but we aim to reach one billion once the storage is stable. So by thye time this gets to be a desktop app we will eventually need to enable sections of the metalevel, not to force users to dnload one billion of uuids just to manage the 100K strings set they want. Most people are going to use this as a "brainstorming notepad" to connect concepts, rather than to make translations.</div>
<div><br></div><div>The last candy on the pie is that we would LOVE not to need any centralized store, and simply run it as a distributed rep, into which people can write and share what they feel to share. This would solve the problem of those who have seldom connections (like people traveling in far places to study rare languages) as they can basically work offline AND would take a large expense chapter off the foundation, since we don't need to feed a hosting company... not sure all of this networking can be done, but this was the general strategy, the night we had too much to dream :))))))</div>
<div><br></div><div>Bèrto<br><br><div class="gmail_quote">On 14 September 2010 15:51, Roman Evstifeev <span dir="ltr"><<a href="mailto:someuniquename@gmail.com">someuniquename@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
oh, something got messed up - this is correct:<br>
<div class="im"><br>
<doc1> a <xxx:Document> .<br>
<doc1> <xxx:hasContents> <listitem1> .<br>
<br>
<listitem1> a <RDF:List><br>
<listitem2> a <RDF:List><br>
<listitem3> a <RDF:List><br>
<br>
<listitem1> <RDF:first> <sen1> .<br>
<listitem1> <RDF:rest> <listitem2> .<br>
<br>
</div> <listitem2> <RDF:first> <sen2> .<br>
<div class="im"> <listitem2> <RDF:rest> <listitem3> .<br>
<br>
<listitem3> <RDF:first> <sen3> .<br>
<listitem3> <RDF:rest> <rdf:nil> . # end of list<br>
<br>
<sen1> <xxx:hasText> "foo bar hello"@en .<br>
<sen2> <xxx:hasText> "The world is mine"@en .<br>
<sen3> <xxx:hasText> "good bye"@en .<br>
<br>
<br>
</div>2010/9/14 Roman Evstifeev <<a href="mailto:someuniquename@gmail.com">someuniquename@gmail.com</a>>:<br>
<div><div></div><div class="h5">> maybe RDF Collections can be used here to store sequences of resources?<br>
><br>
> <doc1> a <xxx:Document> .<br>
> <doc1> <xxx:hasContents> <listitem1> .<br>
><br>
> <listitem1> a <RDF:List><br>
> <listitem2> a <RDF:List><br>
> <listitem3> a <RDF:List><br>
><br>
> <listitem1> <RDF:first> <sen1> .<br>
> <listitem1> <RDF:rest> <listitem2> .tem2> <RDF:first> <sen2> .<br>
> <listitem2> <RDF:rest> <listitem3> .<br>
><br>
> <listitem3> <RDF:first> <sen3> .<br>
> <listitem3> <RDF:rest> <rdf:nil> . # end of list<br>
><br>
> <sen1> <xxx:hasText> "foo bar hello"@en .<br>
> <sen2> <xxx:hasText> "The world is mine"@en .<br>
> <sen3> <xxx:hasText> "good bye"@en .<br>
><br>
><br>
><br>
> 2010/9/14 Sebastian Trüg <<a href="mailto:trueg@kde.org">trueg@kde.org</a>>:<br>
>> If I understand correctly you want to store sentences as RDF literals.<br>
>> Thus, something along the lines of:<br>
>><br>
>> <res> <xxx:hasSentence> "foo bar hello"@en<br>
>> <res> <xxx:hasSentence> "The world is mine"@en<br>
>> ...<br>
>><br>
>> And you want to order them. IMHO this needs to be done by introducing<br>
>> the necessary ontology entities. One could think of something like:<br>
>><br>
>> <res> <xxx:hasDocument> <doc1> .<br>
>> <doc1> a xxx:Document .<br>
>> <doc1> <xxx:hasSentence> <sen1> .<br>
>> <doc1> <xxx:hasSentence> <sen2> .<br>
>> <sen1> <xxx:hasText> "foo bar hello"@en .<br>
>> <sen1> <xxx:sentenceIndex> 0 .<br>
>> <sen2> <xxx:hasText> "The world is mine"@en .<br>
>> <sen2> <xxx:sentenceIndex> 1 .<br>
>><br>
>> It is of course complex - maybe someone else can come up with a less<br>
>> complicated approach?<br>
>> BTW: I am pretty sure someone already wrote an ontology for documents.<br>
>> So all we have to do is look for that and make is Nepomukish. :)<br>
>><br>
>> Another approach is to store both: the sentences and the full doc.<br>
>><br>
>> Cheers,<br>
>> Sebastian<br>
>><br>
>> On 09/14/2010 10:16 AM, Bèrto ëd Sèra wrote:<br>
>>> Hi!<br>
>>><br>
>>> I just saw the "Excerpts for Query Results" thing and I really love it.<br>
>>> Now there is a last thing we will need for ambaradan: sequences. We<br>
>>> store free text in a translatable format, that is, we break it in<br>
>>> sentences. So we need to keep an ordered sequence to "rebuild" the doc.<br>
>>> Other graphs do not depend on a particular order (apart from hierarchy),<br>
>>> as taxonomy should be sorted according to the end user's language. Is is<br>
>>> possible/easy to do this? It should be something like a "document"<br>
>>> class, that is possibly a graph of chapters, who in turn are an ordered<br>
>>> list of sentences.<br>
>>><br>
>>> Everything we need else appears to be there already.<br>
>>><br>
>>> Bèrto<br>
>>><br>
>>> --<br>
>>> ==============================<br>
>>> Constitution du 24 juin 1793 - Article 35. - Quand le gouvernement viole<br>
>>> les droits du peuple, l'insurrection est, pour le peuple et pour chaque<br>
>>> portion du peuple, le plus sacré des droits et le plus indispensable des<br>
>>> devoirs.<br>
>>><br>
>>><br>
>>><br>
>>> _______________________________________________<br>
>>> Nepomuk mailing list<br>
>>> <a href="mailto:Nepomuk@kde.org">Nepomuk@kde.org</a><br>
>>> <a href="https://mail.kde.org/mailman/listinfo/nepomuk" target="_blank">https://mail.kde.org/mailman/listinfo/nepomuk</a><br>
>> _______________________________________________<br>
>> Nepomuk mailing list<br>
>> <a href="mailto:Nepomuk@kde.org">Nepomuk@kde.org</a><br>
>> <a href="https://mail.kde.org/mailman/listinfo/nepomuk" target="_blank">https://mail.kde.org/mailman/listinfo/nepomuk</a><br>
>><br>
><br>
><br>
><br>
_______________________________________________<br>
Nepomuk mailing list<br>
<a href="mailto:Nepomuk@kde.org">Nepomuk@kde.org</a><br>
<a href="https://mail.kde.org/mailman/listinfo/nepomuk" target="_blank">https://mail.kde.org/mailman/listinfo/nepomuk</a><br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>==============================<br>Constitution du 24 juin 1793 - Article 35. - Quand le gouvernement viole les droits du peuple, l'insurrection est, pour le peuple et pour chaque portion du peuple, le plus sacré des droits et le plus indispensable des devoirs.<br>
</div>