A North Saami to South Saami Machine Translation Prototype

Authors

  • Lene Antonsen, Department of Language and Culture, The Arctic University of Norway
  • Trond Trosterud, Department of Language and Culture, The Arctic University of Norway
  • Francis M. Tyers, Department of Language and Culture, The Arctic University of Norway

DOI:

https://doi.org/10.3384/nejlt.2000-1533.1642

Abstract

The paper describes a rule-based machine translation (MT) system from North Saami to South Saami. The system is designed for a workflow in which North Saami functions as a pivot language in translation from Norwegian or Swedish: we envisage manual translation from Norwegian or Swedish to North Saami, followed by MT to South Saami. The system targets a single domain, texts for use in school administration. We evaluated the system by the quality of its translations as input for post-editing. Two of the three professional Norwegian-to-South Saami translators found the system's output useful. The evaluation shows that a functioning rule-based system with a small transfer lexicon and a small number of rules can achieve results that are useful for a restricted domain, even when there are substantial differences between the languages.

Published

2016-03-13