Show simple record

dc.contributor.authorMohaghegh, Mahsa
dc.contributor.authorSarrafzadeh, Hossein
dc.date.accessioned2013-06-28T02:07:19Z
dc.date.available2013-06-28T02:07:19Z
dc.date.issued2011
dc.identifier.urihttps://hdl.handle.net/10652/2227
dc.description.abstractThis paper documents recent work carried out for PeEn-SMT, our Statistical Machine Translation system for translation between the English-Persian language pair. We give details of our previous SMT system, and present our current development of significantly larger corpora. We explain how recent tests using much larger corpora helped to evaluate problems in parallel corpus alignment, corpus content, and how matching the domains of PeEn-SMT’s components affect translation outcome. We then focus on combining corpora and approaches to improve test data, showing details of experimental setup, together with a number of experiment results and comparisons between them. We show how one combination of corpora gave us a metric score outperforming Google Translate for the English-to-Persian translation. Finally, we outline areas of our intended future work, and how we plan to improve the performance of our system to achieve higher metric scores, and ultimately to provide accurate, reliable language translation.en_NZ
dc.language.isoenen_NZ
dc.subjectStatistical Machine translation- English-Persianen_NZ
dc.subjectLanguage Modelen_NZ
dc.titleAn Overview of the Challenges and Progress in PeEn-SMT: First Large Scale Persian-English SMT Systemen_NZ
dc.typeConference Contribution - Paper in Published Proceedingsen_NZ
dc.rights.holderAuthoren_NZ
dc.subject.marsden089999 Information and Computing Sciences not elsewhere classifieden_NZ
dc.identifier.bibliographicCitationMohaghegh, M., and Sarrafzadeh, A. (2011). An Overview of the Challenges and Progress in PeEn-SMT: First Large Scale Persian-English SMT System. Seventh International Conference on Innovations in Information Technology , Abu Dhabi, UAE.en_NZ
unitec.institutionUnitec Institute of Technologyen_NZ
unitec.conference.titleSeventh International Conference on Innovations in Information Technologyen_NZ
unitec.peerreviewedyesen_NZ
unitec.identifier.roms52006


Files in this item

Thumbnail

This item appears in

Show simple record