NICT_LOGO.JPG KYOTO-U_LOGO.JPG

Indic Languages Multilingual Task

[HOME]

INTRODUCTION

Given the growing sizes of monolingual, parallel training data as well as good quality evaluation data for Indic languages we have decided to resume the 2018 Indic Multilingual task.

TASK DESCRIPTION

The task covers 7 Indic Languages (Bengali, Hindi, Malayalam, Tamil, Telugu, Gujarati and Marathi) and English. There are a total of 14 translation directions we will evaluate. Individually, Indic languages are resource poor which hampers translation quality but by leveraging multilingualism and abundant monolingual corpora, the translation quality can be substantially boosted. The purpose of this task is to validate the utility of MT techniques that focus on multilingualism and/or monolingual data.

Corpora

Submission Details

Back to top

CONTACT

For general questions, comments, etc. please email to "wat-organizer -at- googlegroups -dot- com". For questions related to this task contact "prajdabre -at- gmail -dot- com" or "anoop.kunchukuttan -at- gmail -dot- com".

Back to top

NICT (National Institute of Information and Communications Technology)
Last Modified: 2018-07-18