The goal of this task is to build a small neural machine translation system while keeping a reasonable translation quality. There is a high demand in industries to equip smart devices with translation capabilities. Though neural machine translation reaches the point that such capability is not a dream anymore, it usually needs huge resources which are not available on daily devices. The current solution is to run a translation engine on powerful servers and to arrange the device talk to them over Internet. However reliable low-latency connection is not available in the most part of the world and will not in a short term. If we can build a small system while keeping the translation capability reasonably, it has a huge impact in the application of machine translation.
Unfortunately almost all research work of neural machine translation is biased toward improving quality with little consideration to computing resource at inference time. We hope this shared task provides a common language and asset to the NLP community to open a new research field, which will have a huge impact in cross-language communication of our society.
The participants are given pre-processed Japanese-English parallel data and requested to build a neural machine translation. The participants are required to report the followings in the system description paper:
The participants can additionaly use the other dataset other than that is provided by the organizers although using the provided dataset is mandatory.
Data for the Small-NMT Task can be downloaded from here (164MB, tar.bz2 file). This data is created from ASPEC.
Detail of the Data:
There is no registration needed. Just download the data and try to minimize the NMT models. Please describe your method and report the results as the system description paper.
The deadline of the small NMT task is "September 5, 2017", which is the same to the deadline of the system description paper.
For questions, comments, etc. please email to "wat -at- nlp -dot- ist -dot- i -dot- kyoto -hyphen- u -dot- ac -dot- jp".
JST (Japan Science and Technology Agency)
NICT (National Institute of Information and Communications Technology)
Last Modified: 2017-08-02