Apertium

Write a script to add glosses to a monolingual dictionary from a bilingual dictionary

Write a script that matches bilingual dictionary entries (in dix format) to monolingual dictionary entries in one of the languages (in lexc format) and adds glosses from the other side of the bilingual dictionary if not already there. The script should combine glosses into one when there's more than one in the bilingual dictionary. Some level of user control might be justified, from simply defaulting to a dry run unless otherwise specified, to controls for adding to versus replacing versus leaving alone existing glosses, and the like. A prototype of this script is available in SVN, though it's buggy and doesn't fully work—so this task may just end up being to debug it and make it work as intended. A good test case might be the English-Kazakh bilingual dictionary and the Kazakh monolingual dictionary .

Read more

Task tags

  • python
  • lexc
  • dix
  • xml

Students who completed this task

Ryan A. Chi

Task type

  • code Code
  • done_all Quality Assurance
close

2017