Language Identification for Multilingual Machine Translation

Authors

  • A. Naga Raju, Assistant Professor, Master of Computer Applications, DNR College, Bhimavaram, Andhra Pradesh
  • Mudunuri Ajay Varma, PG Scholar, Department of MCA, DNR College, Bhimavaram, Andhra Pradesh

Abstract

Machine translation is the process of translating text from one natural language into another using a computer system. Translating a document written in a single source language is comparatively straightforward, but when the source document is multilingual, the languages it contains must be identified before translation. Language identification is the natural language processing task of automatically determining the language in which the content of a given document is written. It is a fundamental and crucial step in many NLP applications. In this paper, n-gram based and machine learning based language identifiers are trained and used to identify three Indian languages, Hindi, Marathi and Tamil, present in a document submitted for machine translation. Including a language identification component in the machine translation pipeline improved the quality of translation. Google Translate is then used to translate the text in each identified language into English.
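
As a rough illustration of the n-gram based approach mentioned above, the following minimal sketch builds character-trigram profiles and picks the language whose profile is most similar (by cosine similarity) to the input. It is not the authors' implementation: the training sentences, the trigram size, the similarity measure, and the names `SAMPLES`, `char_ngrams`, `cosine`, and `identify` are all assumptions chosen for illustration, and a usable identifier would need far more training text per language.

```python
from collections import Counter
import math

# Toy training sentences (illustrative assumptions, not the paper's data).
SAMPLES = {
    "hindi": "यह एक किताब है और मैं इसे पढ़ रहा हूँ",
    "marathi": "हे एक पुस्तक आहे आणि मी ते वाचत आहे",
    "tamil": "இது ஒரு புத்தகம் நான் அதை படிக்கிறேன்",
}


def char_ngrams(text, n=3):
    """Count overlapping character n-grams, padding with spaces at both ends."""
    padded = f" {text.strip()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# One trigram profile per language, built from the toy samples above.
PROFILES = {lang: char_ngrams(text) for lang, text in SAMPLES.items()}


def identify(text):
    """Return the language whose profile is most similar to the input text."""
    grams = char_ngrams(text)
    return max(PROFILES, key=lambda lang: cosine(grams, PROFILES[lang]))


if __name__ == "__main__":
    for segment in ["मैं इसे पढ़ रहा हूँ", "मी ते वाचत आहे", "நான் படிக்கிறேன்"]:
        print(segment, "->", identify(segment))
```

In a multilingual document, a function like `identify` could be applied to each sentence or paragraph so that every segment is routed to the translator with the correct source language. Hindi and Marathi share the Devanagari script, so separating them depends on the n-gram statistics rather than on the character set alone, whereas Tamil is already distinguishable by script.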

Published

2025-04-25

How to Cite

Language Identification for Multilingual Machine Translation. (2025). INTERNATIONAL JOURNAL OF MANAGEMENT RESEARCH AND REVIEW, 15(2s), 143-148. https://ijmrr.com/index.php/ijmrr/article/view/42
