Development of an unified meta language of the turkic languages morphology

Authors

  • A. Sharipbay, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University
  • A. Gatiatullin, Institute of Applied Semiotics of the Academy of Sciences of the Republic of Tatarstan
  • B. Yergesh, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University
  • D. Kazhymukhan, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University

DOI:

https://doi.org/10.26577/JMMCS-2018-4-557

Keywords:

morphology, Turkic languages, metalanguage, thesaurus

Abstract

The sharp increase in the amount of natural-language information on the Internet and social networks has made research and development in computational linguistics extremely relevant. Computational linguistics is a relatively new scientific field within computer science, and it includes Natural Language Processing (NLP). Creating a unified metalanguage for Turkic languages (UniTurk) is an important task for processing these languages. A unified metalanguage will make it possible to unify tags, make them easier to understand, share common software, and conduct comparative linguistic-statistical studies across the Turkic languages. The article presents some of the results of an international project to create a multilingual ontology and a unified metalanguage of Turkic-language morphology. The morphological rules of the Turkic languages (Kazakh, Kyrgyz, Tatar, Turkish, and Uzbek) are formalized using ontological models. The results can be used in NLP applications such as corpus tagging, knowledge extraction, information retrieval, and machine translation.

Author Biographies

  • A. Sharipbay, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University

    Doctor of Technical Sciences, professor of the Department of Informatics and Information Security, director of the Scientific-Research Institute «Artificial intelligence»

  • A. Gatiatullin, Institute of Applied Semiotics of the Academy of Sciences of the Republic of Tatarstan

    Candidate of Technical Sciences, head of the Department of Intellectual Information Systems of the Institute of Applied Semiotics of the Academy of Sciences of the Republic of Tatarstan

  • B. Yergesh, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University

    Senior lecturer of the Department of Informatics and Information Security, researcher of the Scientific-Research Institute «Artificial intelligence»

  • D. Kazhymukhan, Scientific-Research Institute "Artificial intelligence", L. N. Gumilyov Eurasian National University

    Master's student in the specialty 5M060100 Computer Science

Published

2019-01-24

How to Cite

Development of an unified meta language of the turkic languages morphology. (2019). Journal of Mathematics, Mechanics and Computer Science, 100(4), 78-87. https://doi.org/10.26577/JMMCS-2018-4-557