THE TRANSLATION QUALITY PROBLEMS OF MACHINE TRANSLATION SYSTEMS FOR THE KAZAKH LANGUAGE

Authors

DOI:

https://doi.org/10.26577/JMMCS.2021.v111.i3.11

Keywords:

Machine translation, systems of machine translation, RBMT, SMT, NMT, Kazakh language, quality of translation

Abstract

Kazakh is a language with complex morphology and syntax. Today, most machine translation systems, such as Google, Yandex, and Prompt, support the Kazakh language. This article describes the errors, shortcomings, and problems of machine translation (MT) into the Kazakh language. To analyze these errors, the most popular electronic translation programs were selected. When translating from Russian and English into Kazakh (and vice versa), various errors may occur, since the Kazakh language is different from other languages and has its own special characteristics. To compare the results, an empirical method was used, namely monitoring and testing the output of machine translation systems. In addition to the results of statistical methods, rule-based and neural-network-based methods of machine translation were also analyzed. The practical significance of the study lies in the development of recommendations for identifying and eliminating errors when editing MT output. The scientific significance of the study lies in the fact that, for the first time, errors and inaccuracies arising in machine translation of the Kazakh language have been systematized. An assessment of MT quality is also presented. The results of this research will be used to address the post-editing problem in machine translation.

Published

2021-10-09