Tag Archives: Corsican language

Further reflexions on the status of “I love you” in Corsican language

Let us briefly recall the problem: translating ‘I love you’ might sound trivial, but it’s not. In fact, ‘ti amu‘ is not the best translation. The best translation is ‘ti tengu caru‘ when addressed to a male person, or ‘ti tengu cara‘ when addressed to a female person. Hence the proposed preliminary translation ‘ti tengu […]

A specific kind of superlative

Let us consider a specific kind of superlative. Such form specific to Corsican language is notably mentioned by grammarian and author Santu Casta, in his  Punteghju, who recommends the following translation of “C’était le village le plus riche du canton” (It was the richest village of the canton):  Era u più paese riccu di stu cantone (pages 26 […]

Anagrams in Corsican language

Here are some anagrams in Corsican language: Corscia è Corsica Marta è Matra accanitu è uccitana acciliratu, ricciulata è riciculata accirtà è traccià accirtatu, catarticu è tracciatu adriatica è cadariati anacrunisimu è cunsumariani aprarà è pararà arba è bara attaccu è tuccata aumintà è umanità armunizà è rumanizà basatu è sabatu battariu è urbitata calculatu […]

Rough typology of remaining errors (updated march 2018)

French to Corsican: performing on French wikipedia sample test currently amounts to 94% on average. Below is a rough typology of remaining errors (presumably an average scoring of 95% on the open test should be attainable on the basis of correction of ‘easy’ tagged errors): unknown vocabulary: 40% (easy) basic disambiguation: 25%  (easy or medium difficulty) […]

A Special Case of Anaphora Resolution

Anaphora resolution usually refers to pronouns. But we face here a special case of anaphora resolution that relates to an adjective. The following sentence: ‘un vase de Chine authentique’ (an authentic vase of China) is translated erroneously as un vasu di China autentica, due to erroneous anaphora resolution. In this sample, the adjective ‘authentique’ refers […]

Solving fivefold ambiguity: translation for French ‘poste’

French word ‘poste’ has (at least) fivefold ambiguity. For it can designate: ‘poste’ (masculine singular noun) : postu, masculine singular noun (set, i.e. television set) ‘poste’ (masculine singular noun): posta, feminine singular noun (position): erroneously translated as postu in the present case ; it should read a so posta ‘poste’ (feminine singular noun) : posta, feminine singular noun […]

Chemistry: translating acid names

Translating this series of acid names is not as easy as it could seem at first glance. In effect, each acid name is composed of three consecutive ambiguous names: ‘l’ is ambiguous between the masculine (u, the) or feminine (a, the) definite article ‘acide’ is ambiguous betwwen acidu (acid, masculine singular noun), acitu (acid, masculine […]

Interesting case of first name disambiguation

Here is an interesting case of first name disambiguation for machine translation. Consider the following first name ‘Camille’. It can apply to both genders. In Corsican (taravese or sartinese variants) it translates either into Cameddu (masculine) or Camedda (feminine). In some cases, the corresponding disambiguation relies on mere grammatical grounds. For example, ‘Camille était beau’ […]

Writing differences between Corsican and Gallurese

Here are some writing differences between Corsican and Sardinian gallurese, that result from historical writing habits. These writing differences prevail, even when the words are the same: ghj is replaced by gghj: acciaghju (corsu), acciagghju (gallurese) , steel chj is replaced by cchj: finochju (corsu), finocchju (gallurese), fennel tonic accent is marked systematically in gallurese […]

How rule-based and statistical machine translation can help each other

Here are a few suggestions on how rule-based and statistical machine translation  can help each other: To begin with, rule-based and statistical machine translation are often contrasted and compared: it would be oversimplifying to conclude that one is better than the other. From a more objective standpoint, let us consider that each method has its […]

French ‘fin’ followed by a year number: fixed

Tagger improvement: fixed this issue. French ‘l’Empire allemand’ now translates properly into l’Imperu alimanu (the German Empire). French word ‘fin’ is now identified as a preposition when followed by a year number. The above excerpt is translated into the ‘sartinesu’ variant of Corsican language. This issue relates to the more general problem of the grammatical […]

Translation of preposition ‘à’ followed by noun phrase denoting a location

‘au stade de Wembley’ (at the Wembley Stadium) should translate in u stadiu di Wembley. We face the issue of the translation of preposition ‘à’ since ‘au’ is short for ‘à le’ (to the), in particular when ‘à’ is followed by a noun phrase denoting a location. This occurs in the disambiguation of French ‘à’ […]

Accordance of past participe

Now scoring 1 – 2/129 = 98.44%. The issue of past participe’s accordance again: ‘une session du parlement tenue à Nuremberg’ (a session of the Parliament held in Nuremberg) should translate into una sessione di u parlamentu tenuta in Nuremberg. Past participe tenuta should accord with sessione (feminine, session) and not with parlamentu (masculine, Parliament). […]

Past participe or present simple: the disambiguation of French ‘construit’

In the present case, it should read, custruitu à u seculu XII (built in the 12th century). The error relates to the disambiguation of French ‘construit’. It can translate into: custruitu (built): past participe, masculine, singular custruisce (builds): present simple, third person MT should (i) find the proper reference of ‘construit’, i.e. ‘clocher’ (church tower), […]

Can translation help teaching an endangered language?

Can translation help self-teaching and endangered language? It seems yes, it the translation is accurate. Let us check with the verb parlà (to speak). In this case, the translation is 100% accurate, so it can help (but we need to check other verb categories and other tenses). Other verbs of the same group are verbs […]