NICT_LOGO.JPG KYOTO-U_LOGO.JPG

WAT 2020

Myanmar-English Parallel Data

[HOME]

The registration of the use of UCSY data is opened (2020/07/17)

INTRODUCTION

The parallel data for Myanmar-English tanslation tasks at WAT2020 consist of two corpora, the ALT corpus and UCSY corpus.

DETAIL

The numbers of sentences are as follows:

Data Type File Name Number of Sentences
TRAIN train.ucsy.[my|en] 204,539
train.alt.[my|en] 18,088
DEV dev.alt.[my|en] 1,000
TEST test.alt.[my|en] 1,018

HOW TO OBTAIN

- The data used for WAT2020 are identical to those used in WAT2019.

Myanmar-English Parallel Data for WAT2020

Back to top

CONTACT

For questions, comments, etc. please email to "wat-organizer -at- googlegroups -dot- com".

Back to top

CHANGE LOG

2020-07-17: site open


NICT (National Institute of Information and Communications Technology)
Kyoto University
Last Modified: 2020-07-17