Catalog record

Title: A parallel corpus for Amharic–English machine translation / Andargachew Mekonnen Gezmu, Andreas Nürnberger, Tesfaye Bayu Bati (Data and Knowledge Engineering Group)
Authors: Gezmu, Andargachew Mekonnen ; Nürnberger, Andreas ; Bati, Tesfaye Bayu
Corporate body: Otto-von-Guericke-Universität Magdeburg
Published: Magdeburg : Fakultät für Informatik, Otto-von-Guericke-Universität Magdeburg, [2018]
Extent: 1 online resource (5 pages, 0.5 MB)
Language: English
Series: Technical report ; 2018, 04
URN: urn:nbn:de:gbv:3:2-101264
Access restriction: The document is freely available
Abstract
This paper describes the acquisition, preprocessing, segmentation, and alignment of an Amharic-English parallel corpus. In doing so, we addressed language-specific issues such as normalization and end-of-sentence disambiguation. The corpus consists of 145,820 Amharic-English parallel sentences (segments) from various sources, which makes it larger than previously compiled corpora. It is released for research purposes and can be used to train or support Amharic-English machine translation systems.