

INTERNATIONAL JOURNAL OF ENGINEERING, SCIENCE AND - Volume 7, Issue 3, March 18

Pages: 223-229

Date of Publication: 23-Mar-2018



BUILDING ENGLISH-PUNJABI PARALLEL CORPUS FOR MACHINE TRANSLATION

Author: Shishpal Jindal, Vishal Goyal, Jaskarn Singh Bhullar

Category: Engineering, Science and Mathematics

Abstract:

Objectives: A parallel corpus is the key resource for English-Punjabi machine translation, yet no English-Punjabi corpus is widely available. A parallel corpus is a primary requirement for training a statistical machine translation system. Methods/Analysis: In this paper, our work focuses on building an English-Punjabi corpus at large scale. Developing the corpus posed difficulties and required intensive labor. We elaborate on the collection of text as well as the workflow for constructing the parallel corpus. After obtaining the raw text, the corpus must be refined so that every source-language sentence has a corresponding target-language sentence. Findings: The paper attempts to explore existing tools as well as to build new ones. One of the goals is the alignment of the bilingual corpus.

Keywords: bilingual corpora, machine translation, English, Punjabi, NLP.
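
The refinement step described in the abstract, in which every source-language sentence must have a corresponding target-language sentence, can be illustrated with a minimal sketch. This is not the authors' tool. It assumes the raw text has already been split into two line-aligned lists, and the function name `filter_aligned_pairs` and the word-count ratio threshold `max_ratio` are hypothetical choices made only for illustration.

```python
from typing import Iterable, List, Tuple


def filter_aligned_pairs(
    english: Iterable[str],
    punjabi: Iterable[str],
    max_ratio: float = 3.0,  # hypothetical threshold, not taken from the paper
) -> List[Tuple[str, str]]:
    """Keep only sentence pairs in which both sides are present and plausible."""
    kept: List[Tuple[str, str]] = []
    for en, pa in zip(english, punjabi):
        en, pa = en.strip(), pa.strip()
        # Drop pairs where either the source or the target sentence is missing.
        if not en or not pa:
            continue
        en_len, pa_len = len(en.split()), len(pa.split())
        # Drop pairs whose word counts differ too much to be likely translations
        # (an assumed heuristic; real aligners use richer evidence).
        if max(en_len, pa_len) / min(en_len, pa_len) > max_ratio:
            continue
        kept.append((en, pa))
    return kept
```

A simple length-ratio filter like this only removes obviously mismatched pairs; it does not realign sentences that have drifted out of order, which would require a dedicated sentence aligner.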