Wikidata, structured Wiktionary, MediaWiki Content Translation and planning for a WUaS Universal Translator in all 8k languages
*
Creating careers, Academic Press
WUaS seeks to be in 8k languages https://t.co/YWks7sQ9mU >wiki https://t.co/STxgaNkoYM w machine translation— WorldUnivandSch (@WorldUnivAndSch) July 30, 2016
*
— John Palfrey (@jpalfrey) July 30, 2016
*
Can WUaS's Universal Translator https://t.co/VKtNf0ftkZ in @wikidata_de become the industry standard too for 204 countries' Langs, & poetry?— WorldUnivandSch (@WorldUnivAndSch) July 29, 2016
*
BitCoin vending machine to India Law Sch https://t.co/6hv5tr87Cc & 203 others in countries' langs & Banking & Money https://t.co/JO9I14GqSb?— WorldUnivandSch (@WorldUnivAndSch) July 29, 2016
*
Bitcoin vending machine -
https://twitter.com/EthanZ/status/758485225588531200
*
[Wikidata] Grammatical display of units
The discussion about how to do this is happening in
https://phabricator.wikimedia.org/T86528 The basic problem is that we
do use items for the units. I think this is the right thing to do but
it does make this particular part a bit tricky.
Cheers
Lydia
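To make the "items for the units" point concrete, here is a minimal sketch, not Wikidata's actual rendering code. It assumes a quantity value that points at a unit item by URI (Q11573, the item for the metre) and a small hand-written label table, UNIT_LABELS, standing in for the labels stored on that item. The function name render_quantity is hypothetical.

```python
# Hypothetical label table; on Wikidata these labels live on the unit's item.
UNIT_LABELS = {
    "http://www.wikidata.org/entity/Q11573": {"en": "metre", "de": "Meter"},
}


def render_quantity(value, lang):
    """Render a quantity by appending the plain label of its unit item."""
    amount = value["amount"].lstrip("+")
    label = UNIT_LABELS[value["unit"]].get(lang)
    if label is None:
        # No label in this language: fall back to the bare number.
        return amount
    return f"{amount} {label}"


# A quantity shaped like a Wikibase quantity value: the unit is an item URI.
quantity = {"amount": "+2", "unit": "http://www.wikidata.org/entity/Q11573"}
print(render_quantity(quantity, "en"))  # prints "2 metre"
```

Because an item label is a single string per language, the naive rendering gives "2 metre" rather than "2 metres", which is the tricky part the thread is about.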
*
How best to anticipate and plan here for ever more accurate translation between Wikipedia / Wikidata languages with full STEM precision? What's the road map? How might this "Unit Localization" Phabricator RFC https://phabricator.wikimedia.org/T86528 fit into a series of Phabricator RFCs in a longer term plan for great Wikidata translation? Can we further begin to lay out this "road map" at this stage for all of Wikipedia's 358 languages (and anticipate even all 7,943 language entries in Glottolog)?
Would it be possible to dovetail this with developing Wiktionary with Content Translation as Phabricator RFCs?
(WUaS, which donated CC WUaS to CC Wikidata last autumn, would like to help develop such translation, for example for CC MIT OCW in 7 languages and CC Yale OYC, in addition to MediaWiki Content Translation.)
Scott
info@worlduniversityandschool.org
*
Lydia and Wikidatans,
If "the basic problem is that we do use items for the units," after developing a workaround for this, could Wikidata make an item out of the parts and sounds of every word (in Wiktionary) in every (Wikipedia) language, to begin, as part of a further translator plan?
Scott
worlduniversityandschool@gmail.com
*
On 28.07.2016 20:41, Stas Malyshev wrote:
Hi!
Good point. Could we not just have a monolingual text string property that gives the preferred writing of the unit when used after a number? I don't think the plural/singular issue is very problematic, since you would have plural almost everywhere, even for "1.0 metres".
We have code to deal with that - note that "1 reference" and "2 references" are displayed properly. It's a matter of applying that code and having it provided with proper configs.
You mean the MediaWiki message processing code? This would probably be powerful enough for units as well, but it works based on message strings that look a bit like MW template calls. Someone has to enter such strings for all units (and languages). This would be doable but the added power comes at the price of more difficult editing of such message strings instead of plain labels.
As far as I know, the message parsing is available through the MW API, so external consumers could take advantage of the same system if the message strings were part of the data (we would like to have grammatical units in SQID as well).
So maybe we just need one alternative label for most languages? Or are there languages with more complex grammar rules for units?
Oh yes :) Russian is one, but I'm sure there are others.
Forgive my ignorance; I was not able to read the example you gave there.
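The Russian case can be sketched in a few lines. This is an illustration under stated assumptions, not the MediaWiki message code discussed above: the rules are simplified to whole numbers only, and the names plural_category, UNIT_FORMS and format_unit are hypothetical. It shows why a single alternative label is enough for English but not for Russian, which needs three forms of the unit.

```python
def plural_category(n, lang):
    """Return a plural category for a whole number n (simplified rules)."""
    if lang == "en":
        return "one" if n == 1 else "other"
    if lang == "ru":
        if n % 10 == 1 and n % 100 != 11:
            return "one"
        if 2 <= n % 10 <= 4 and not 12 <= n % 100 <= 14:
            return "few"
        return "many"
    raise ValueError(f"no plural rule for language {lang!r}")


# Hypothetical per-language unit forms, keyed by plural category.
UNIT_FORMS = {
    "en": {"one": "metre", "other": "metres"},
    "ru": {"one": "метр", "few": "метра", "many": "метров"},
}


def format_unit(n, lang):
    """Pick the unit form that agrees with the number."""
    return f"{n} {UNIT_FORMS[lang][plural_category(n, lang)]}"


for n in (1, 2, 5, 11, 21):
    print(format_unit(n, "en"), "|", format_unit(n, "ru"))
```

English needs two strings per unit, Russian three, and the choice depends on the last digits of the number, so the form cannot be fixed once per unit.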
*
Stas, Thomas, John, Markus, Lydia and Wikidatans,
What happens when one develops structured Wiktionary (https://www.wiktionary.org/) as linked open data for every part of every word and their sounds, perhaps as Qitems in Wikidata, in Wikipedia's 358 languages, planning for all 7,943 language entries in Glottolog, and combines this with MediaWiki Content Translation (https://www.mediawiki.org/wiki/Content_translation) - all re Wikidata/Wikibase? Re this grammatical display of units thread, how to plan for the genders of words, which only some languages have? And is this, or could this be, partly a structured Wiktionary question, when one spells out the numerals/units in letters and words?
Scott
*
How can https://t.co/VKtNf0ftkZ planned for 8k Langs w machine translation become industry standard in 204 Lands & nuanced for poetry trans?— WorldUnivandSch (@WorldUnivAndSch) July 30, 2016
*
How can https://t.co/Im731rLazp planned for 8k Langs w machine translation become industry standard in 204 Lands & nuanced for POETRY trans?— scottmacleod (@scottmacleod) July 30, 2016
*
...