Information Systems Technology and its Applications, International Conference ISTA'2003, June 19-21, 2003, Kharkiv, Ukraine, Proceedings. P-30, 177-190 (2003).

Mikhail Godlevsky (ed.), Stephen W. Liddle (ed.), Heinrich C. Mayr (ed.)

Tools for generation of natural inflected language processors

Nadiya Mishchenko and Anatoly Doroshenko


Support for multiple languages and natural language processing is of high importance in information systems. This paper discusses software tools for generating language processors (LPs) for natural inflected languages. The tools are implemented in the LP generator DUAL, which allows formal specification and reuse of developed components. The declarative language Dual is used to specify words, idioms, and their processing. The paper describes the automatic generation of dictionaries from their specifications in the Dual language and the reusability of software components, which facilitates fast construction of user-oriented software systems for processing natural inflected languages. The generated LPs are intended for two tasks: word-for-word translation of domain-specific texts in inflected languages, and generation of frequency lists of words and phrases for statistical analysis of texts in inflected and analytical languages that use Cyrillic or Latin alphabets.
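
To make the notion of a frequency list of words and phrases concrete, here is a minimal sketch in Python. It is not the DUAL generator and does not use the Dual language; the function names `tokenize` and `frequency_list` are hypothetical, and treating a phrase as a run of `n` adjacent words is an assumption made only for this illustration. It also does no inflection handling, so different forms of one word are counted separately.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase words.

    The pattern matches runs of letters only; Python's `re` is
    Unicode-aware for str patterns, so Cyrillic and Latin both match.
    """
    return re.findall(r"[^\W\d_]+", text.lower())


def frequency_list(text, n=1):
    """Return (phrase, count) pairs for all n-word sequences in text.

    Assumption for this sketch: a "phrase" is n adjacent words.
    Results are sorted by descending count, then alphabetically,
    so the output order is deterministic.
    """
    tokens = tokenize(text)
    phrases = [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    counts = Counter(phrases)
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    sample = "Мова і мова. Language, language!"
    print(frequency_list(sample))        # single words
    print(frequency_list(sample, n=2))   # two-word phrases
```

A processor for an inflected language would additionally need to map word forms to a common dictionary entry before counting, which is the part the paper's dictionary specifications address and which this sketch leaves out.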

