Modelling of resource-intensive problems in the field of bioinformatics.

Authors

  • A Yu Pyrkova Al-Farabi Kazakh National University

Keywords:

математическая модель, алгоритм, Java MPI, выравнивание, нуклеотидные и аминокислотные последовательности, дендограммы

Abstract

In presented article the problems of multiple alignment of nucleotide sequences and dendrogram construction are considered. During the conducted research by the author the following results were received: • the mathematical model of multiple alignment of nucleotide and amino-acid sequences is developed; • the algorithm of multiple alignment, constructed on the basis of algorithm of Needleman- Wunsch which was modified for processing of big data files with help of parallelization of treatment process by means of MPJ (Java MPI), is developed and analyzed; • the algorithm of dendrogram construction, representing modification of algorithms of UPGMA (Unweighted Pair Group Method with Arithmetic Mean) and NJ (Neighbour Joining) with possibility of parallelization of data processing, is developed; • program realization of algorithm of multiple alignment and dendrogram construction in the Java language with use of means of MPI is executed; • results of work of the program were tested on data on the nucleotide sequences provided by staff of the biotechnology department of Kazakh NU named al-Farabi.

References

[1] Lesk Arthur M. Introduction to Bioinformatics. - Oxford: Oxford University Press, 2002. - 255 p.

[2] Ройтберг М.А. Алгоритмы сравнительного анализа первичных структур биополимеров: автореферат диссертации на соискание ученой степени доктора физико-математических наук: 03.00.28. - М.: Издательство РАН, 2009. - 43 с.

[3] Jones Neil C., Pevzner Pavel A. An Introduction to Bioinformatics Algorithms. -Massachusetts: Massachusetts Institute of Technology Press, 2004. - 435 p.

[4] Пыркова А.Ю. Множественное выравнивание нуклеотидных последовательностей и построение дендограмм с использованием средств Java MPI // Материалы IХ международной научно-практической конференции "Перспективы развития информационных технологий". - Новосибирск: Издательство НГТУ, 2012. - С. 20-25.

[5] Пыркова А.Ю. Кластерный анализ больших массивов молекулярно-генетических данных с использование программного интерфейса MPJ // Материалы международной научно-практической конференции "Актуальные проблемы информатики и процессов управления". - Алматы: Институт проблем информатики и управления, 2012. C. 221-225.

[6] Jonathan M. Keith Methods in Molecular Biology. Bioinformatics: in 2 vols. - New York: Humana Press, 2008. - V. 2. - 502 p.

[7] Bioinformatics and Biological Computing [Electronic resource]. - 2012. - URL: http : ==bip:weizmann:ac:il=toolbox=overview=software_avail:html (дата обращения: 07.09.2012)

Downloads

Issue

Section

Computational Mathematics and mathematical modeling