NICT_LOGO.JPG KYOTO-U_LOGO.JPG

WAT

The Workshop on Asian Translation
Evaluation Results

[EVALUATION RESULTS TOP] | [BLEU] | [RIBES] | [AMFM] | [HUMAN (WAT2022)] | [HUMAN (WAT2021)] | [HUMAN (WAT2020)] | [HUMAN (WAT2019)] | [HUMAN (WAT2018)] | [HUMAN (WAT2017)] | [HUMAN (WAT2016)] | [HUMAN (WAT2015)] | [HUMAN (WAT2014)] | [EVALUATION RESULTS USAGE POLICY]

BLEU


# Team Task Date/Time DataID BLEU
Method
Other
Resources
System
Description
juman kytea mecab moses-
tokenizer
stanford-
segmenter-
ctb
stanford-
segmenter-
pku
indic-
tokenizer
unuse myseg kmseg
1HwTscSUALT20en-id2022/07/11 12:06:056737---42.40------NMTNoXX to XX transformer model finetune on the baseline trained on IT domain data
2sakuraALT20en-id2021/04/29 12:20:025798---41.57------NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
3NICT-2ALT20en-id2021/05/01 13:31:175918---41.15------NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
4NICT-5ALT20en-id2020/09/18 21:52:044013---32.94------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
5NICT-5ALT20en-id2020/09/18 19:15:393956---32.88------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
6NICT-2ALT20en-id2021/05/01 13:18:165910---31.77------NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
7ORGANIZERALT20en-id2020/09/01 16:01:543612---31.49------NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.

Notice:

Back to top

RIBES


# Team Task Date/Time DataID RIBES
Method
Other
Resources
System
Description
juman kytea mecab moses-
tokenizer
stanford-
segmenter-
ctb
stanford-
segmenter-
pku
indic-
tokenizer
unuse myseg kmseg
1HwTscSUALT20en-id2022/07/11 12:06:056737---0.905156------NMTNoXX to XX transformer model finetune on the baseline trained on IT domain data
2sakuraALT20en-id2021/04/29 12:20:025798---0.901977------NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
3NICT-2ALT20en-id2021/05/01 13:31:175918---0.901974------NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
4NICT-5ALT20en-id2020/09/18 21:52:044013---0.878897------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
5NICT-5ALT20en-id2020/09/18 19:15:393956---0.874827------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
6NICT-2ALT20en-id2021/05/01 13:18:165910---0.871564------NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
7ORGANIZERALT20en-id2020/09/01 16:01:543612---0.869369------NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.

Notice:

Back to top

AMFM


# Team Task Date/Time DataID AMFM
Method
Other
Resources
System
Description
unuse unuse unuse unuse unuse unuse unuse unuse unuse unuse
1sakuraALT20en-id2021/04/29 12:20:025798---0.868025------NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2NICT-2ALT20en-id2021/05/01 13:31:175918---0.867678------NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
3NICT-2ALT20en-id2021/05/01 13:18:165910---0.821204------NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
4ORGANIZERALT20en-id2020/09/01 16:01:543612---0.809985------NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
5NICT-5ALT20en-id2020/09/18 19:15:393956---0.000000------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
6NICT-5ALT20en-id2020/09/18 21:52:044013---0.000000------NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
7HwTscSUALT20en-id2022/07/11 12:06:056737---0.000000------NMTNoXX to XX transformer model finetune on the baseline trained on IT domain data

Notice:

Back to top

HUMAN (WAT2022)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2021)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description
1sakuraALT20en-id2021/04/29 12:20:025798UnderwayNMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2NICT-2ALT20en-id2021/05/01 13:31:175918UnderwayNMTYesThe extended mBART model, mixed domain training with domain fine-tuning.

Notice:
Back to top

HUMAN (WAT2020)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description
1NICT-5ALT20en-id2020/09/18 19:15:393956UnderwayNMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
2NICT-5ALT20en-id2020/09/18 21:52:044013UnderwayNMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.

Notice:
Back to top

HUMAN (WAT2019)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2018)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2017)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2016)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2015)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2014)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

EVALUATION RESULTS USAGE POLICY

When you use the WAT evaluation results for any purpose such as:
- writing technical papers,
- making presentations about your system,
- advertising your MT system to the customers,
you can use the information about translation directions, scores (including both automatic and human evaluations) and ranks of your system among others. You can also use the scores of the other systems, but you MUST anonymize the other system's names. In addition, you can show the links (URLs) to the WAT evaluation result pages.

NICT (National Institute of Information and Communications Technology)
Kyoto University
Last Modified: 2018-08-02