
WAT

The Workshop on Asian Translation
Evaluation Results


BLEU


# | Team | Task | Date/Time | DataID | BLEU (moses-tokenizer) | Method | Other Resources | System Description
1 | sakura | ALT20ms-en | 2021/04/29 13:46:42 | 5821 | 45.70 | NMT | No | Multilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2 | NICT-2 | ALT20ms-en | 2021/05/01 13:35:25 | 5921 | 44.53 | NMT | Yes | The extended mBART model, mixed domain training with domain fine-tuning.
3 | HwTscSU | ALT20ms-en | 2022/07/11 18:27:45 | 6775 | 38.90 | NMT | No | XX to XX transformer model finetune on the baseline trained on IT domain data
4 | NICT-5 | ALT20ms-en | 2020/09/18 21:53:32 | 4016 | 22.02 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
5 | NICT-2 | ALT20ms-en | 2021/05/01 13:21:03 | 5913 | 21.61 | NMT | No | Transformer base model, multilingual + mixed domain training with domain fine-tuning.
6 | ORGANIZER | ALT20ms-en | 2020/09/01 15:52:35 | 3606 | 18.64 | NMT | No | Baseline MLNMT XX to En model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
7 | NICT-5 | ALT20ms-en | 2020/09/18 19:17:10 | 3959 | 18.03 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.

BLEU scores appear only in the moses-tokenizer column. The juman, kytea, mecab, stanford-segmenter-ctb, stanford-segmenter-pku, indic-tokenizer, unuse, myseg and kmseg columns are empty ("-") for every entry.


RIBES


# | Team | Task | Date/Time | DataID | RIBES (moses-tokenizer) | Method | Other Resources | System Description
1 | NICT-2 | ALT20ms-en | 2021/05/01 13:35:25 | 5921 | 0.904478 | NMT | Yes | The extended mBART model, mixed domain training with domain fine-tuning.
2 | sakura | ALT20ms-en | 2021/04/29 13:46:42 | 5821 | 0.901696 | NMT | No | Multilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
3 | HwTscSU | ALT20ms-en | 2022/07/11 18:27:45 | 6775 | 0.887680 | NMT | No | XX to XX transformer model finetune on the baseline trained on IT domain data
4 | NICT-2 | ALT20ms-en | 2021/05/01 13:21:03 | 5913 | 0.802684 | NMT | No | Transformer base model, multilingual + mixed domain training with domain fine-tuning.
5 | NICT-5 | ALT20ms-en | 2020/09/18 21:53:32 | 4016 | 0.800446 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
6 | ORGANIZER | ALT20ms-en | 2020/09/01 15:52:35 | 3606 | 0.768274 | NMT | No | Baseline MLNMT XX to En model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
7 | NICT-5 | ALT20ms-en | 2020/09/18 19:17:10 | 3959 | 0.753260 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.

RIBES scores appear only in the moses-tokenizer column. The juman, kytea, mecab, stanford-segmenter-ctb, stanford-segmenter-pku, indic-tokenizer, unuse, myseg and kmseg columns are empty ("-") for every entry.


AMFM


# | Team | Task | Date/Time | DataID | AMFM | Method | Other Resources | System Description
1 | sakura | ALT20ms-en | 2021/04/29 13:46:42 | 5821 | 0.851471 | NMT | No | Multilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2 | NICT-2 | ALT20ms-en | 2021/05/01 13:35:25 | 5921 | 0.841632 | NMT | Yes | The extended mBART model, mixed domain training with domain fine-tuning.
3 | NICT-2 | ALT20ms-en | 2021/05/01 13:21:03 | 5913 | 0.697618 | NMT | No | Transformer base model, multilingual + mixed domain training with domain fine-tuning.
4 | ORGANIZER | ALT20ms-en | 2020/09/01 15:52:35 | 3606 | 0.655865 | NMT | No | Baseline MLNMT XX to En model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
5 | NICT-5 | ALT20ms-en | 2020/09/18 19:17:10 | 3959 | 0.000000 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
6 | NICT-5 | ALT20ms-en | 2020/09/18 21:53:32 | 4016 | 0.000000 | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
7 | HwTscSU | ALT20ms-en | 2022/07/11 18:27:45 | 6775 | 0.000000 | NMT | No | XX to XX transformer model finetune on the baseline trained on IT domain data

All ten segmentation sub-columns of the AMFM table are labeled "unuse"; each entry has a single score and "-" in the remaining sub-columns.


HUMAN (WAT2022)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2021)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description
1 | sakura | ALT20ms-en | 2021/04/29 13:46:42 | 5821 | Underway | NMT | No | Multilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2 | NICT-2 | ALT20ms-en | 2021/05/01 13:35:25 | 5921 | Underway | NMT | Yes | The extended mBART model, mixed domain training with domain fine-tuning.


HUMAN (WAT2020)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description
1 | NICT-5 | ALT20ms-en | 2020/09/18 19:17:10 | 3959 | Underway | NMT | No | XX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.


HUMAN (WAT2019)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2018)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2017)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2016)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2015)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


HUMAN (WAT2014)


# | Team | Task | Date/Time | DataID | HUMAN | Method | Other Resources | System Description


EVALUATION RESULTS USAGE POLICY

When you use the WAT evaluation results for any purpose, such as:
- writing technical papers,
- making presentations about your system,
- advertising your MT system to customers,
you may use the translation directions, the scores (both automatic and human evaluation), and the rank of your own system among the others. You may also use the scores of the other systems, but you MUST anonymize the other systems' names. In addition, you may show the links (URLs) to the WAT evaluation result pages.
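
As an illustration of the anonymization requirement, here is a minimal Python sketch. It is not part of the WAT policy or tooling: the function name `anonymize_teams`, the "System N" placeholder scheme, and the choice of NICT-5 as the reporting team are all assumptions made for the example. It keeps the reporting team's name and replaces every other team name with a placeholder, using the BLEU scores listed on this page.

```python
def anonymize_teams(rows, own_team):
    """Return (label, score) pairs with every team except own_team renamed.

    Each other team gets one placeholder ("System 1", "System 2", ...),
    reused for all of its submissions. This naming scheme is an assumption
    for illustration, not something the policy prescribes.
    """
    aliases = {}  # real team name -> placeholder
    anonymized = []
    for team, score in rows:
        if team == own_team:
            label = team
        else:
            if team not in aliases:
                aliases[team] = f"System {len(aliases) + 1}"
            label = aliases[team]
        anonymized.append((label, score))
    return anonymized


# BLEU scores copied from the table on this page, in rank order.
bleu_rows = [
    ("sakura", 45.70),
    ("NICT-2", 44.53),
    ("HwTscSU", 38.90),
    ("NICT-5", 22.02),
    ("NICT-2", 21.61),
    ("ORGANIZER", 18.64),
    ("NICT-5", 18.03),
]

# Hypothetical scenario: the report is written by team NICT-5.
for label, score in anonymize_teams(bleu_rows, own_team="NICT-5"):
    print(f"{label}\t{score:.2f}")
```

Reusing one placeholder per team keeps several submissions from the same team linked. If that linkage is undesirable, a fresh placeholder per row would work just as well.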

NICT (National Institute of Information and Communications Technology)
Kyoto University
Last Modified: 2018-08-02