Hindi to Punjabi Machine Translation System
The Hindi to Punjabi machine translation system, developed at Punjabi University, Patiala by Gurpreet Singh Lehal and Dr. Vishal Goyal, translates Hindi text into Punjabi text. It is based on the direct approach to machine translation. The system comprises three modules: preprocessing (text normalization, replacing collocations, replacing proper nouns), a translation engine (identifying surnames, identifying titles, lexicon lookup, word sense disambiguation, inflection analysis, transliteration), and postprocessing. The system is available online. It achieves an accuracy of about 94% on an intelligibility test, and the developers continue to work on improving it.
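
The direct approach can be illustrated with a minimal sketch. This is not the developers' implementation: the lexicon entries, the function names, and the transliteration rule are all assumptions made for illustration. Each word is looked up in a bilingual lexicon, and words not found there fall back to a crude script conversion that shifts Devanagari code points into the Gurmukhi Unicode block (the two blocks share a similar layout, but the correspondence is not exact, so a real system would need a proper rule set).

```python
# Toy Hindi -> Punjabi lexicon (illustrative entries only, not the system's data).
LEXICON = {
    "पानी": "ਪਾਣੀ",  # water
    "और": "ਅਤੇ",     # and
}

# Unicode ranges: Devanagari is U+0900..U+097F, Gurmukhi starts at U+0A00.
DEVANAGARI_START, DEVANAGARI_END = 0x0900, 0x097F
GURMUKHI_OFFSET = 0x0A00 - 0x0900


def naive_transliterate(word):
    """Crude fallback: shift each Devanagari code point into the Gurmukhi block.

    Assumption for illustration only; some shifted code points are unassigned
    in Gurmukhi, so this is not a complete transliteration scheme.
    """
    return "".join(
        chr(ord(ch) + GURMUKHI_OFFSET)
        if DEVANAGARI_START <= ord(ch) <= DEVANAGARI_END
        else ch
        for ch in word
    )


def translate(sentence):
    """Word-by-word direct translation: lexicon lookup, else transliteration."""
    tokens = sentence.split()
    return " ".join(LEXICON.get(tok, naive_transliterate(tok)) for tok in tokens)


if __name__ == "__main__":
    print(translate("पानी और घर"))
```

The lookup runs before transliteration because the two can disagree: shifting the code points of the Hindi word for "water" would not yield the Punjabi spelling stored in the lexicon. The other stages named above (collocation replacement, word sense disambiguation, inflection analysis, postprocessing) are omitted from this sketch.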

The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 