NICT_LOGO.JPG KYOTO-U_LOGO.JPG

WAT

The Workshop on Asian Translation
Evaluation Results

[EVALUATION RESULTS TOP] | [BLEU] | [RIBES] | [AMFM] | [HUMAN (WAT2022)] | [HUMAN (WAT2021)] | [HUMAN (WAT2020)] | [HUMAN (WAT2019)] | [HUMAN (WAT2018)] | [HUMAN (WAT2017)] | [HUMAN (WAT2016)] | [HUMAN (WAT2015)] | [HUMAN (WAT2014)] | [EVALUATION RESULTS USAGE POLICY]

BLEU


# Team Task Date/Time DataID BLEU
Method
Other
Resources
System
Description
juman kytea mecab moses-
tokenizer
stanford-
segmenter-
ctb
stanford-
segmenter-
pku
indic-
tokenizer
unuse myseg kmseg
1ORGANIZERSOFTWAREen-hi2020/09/01 15:58:443608------13.73---NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
2NICT-5SOFTWAREen-hi2020/09/18 19:05:093944------14.03---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
3NICT-5SOFTWAREen-hi2020/09/18 19:12:513955------14.02---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
4Bering LabSOFTWAREen-hi2021/04/17 09:52:425190------37.23---NMTNoTransformer trained on OPUS with GNOME,KDE4,Ubuntu weighted
5jyjySOFTWAREen-hi2021/04/23 20:03:065440------ 9.46---NMTNo
6sakuraSOFTWAREen-hi2021/04/29 11:59:475792------28.50---NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
7NICT-2SOFTWAREen-hi2021/05/01 11:54:055892------19.77---NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
8NICT-2SOFTWAREen-hi2021/05/01 12:07:335900------29.05---NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
9JBJBJBSOFTWAREen-hi2021/05/02 22:57:235975------29.73---NMTYesMBart50 fairseq
10HwTscSUSOFTWAREen-hi2022/07/11 13:16:176747------41.70---NMTNoXX to XX transformer model trained on GNOME,KDE4,Ubuntu as well as other data from OPUS and finetune on dev set

Notice:

Back to top

RIBES


# Team Task Date/Time DataID RIBES
Method
Other
Resources
System
Description
juman kytea mecab moses-
tokenizer
stanford-
segmenter-
ctb
stanford-
segmenter-
pku
indic-
tokenizer
unuse myseg kmseg
1ORGANIZERSOFTWAREen-hi2020/09/01 15:58:443608------0.514530---NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
2NICT-5SOFTWAREen-hi2020/09/18 19:05:093944------0.508329---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
3NICT-5SOFTWAREen-hi2020/09/18 19:12:513955------0.522150---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
4Bering LabSOFTWAREen-hi2021/04/17 09:52:425190------0.688563---NMTNoTransformer trained on OPUS with GNOME,KDE4,Ubuntu weighted
5jyjySOFTWAREen-hi2021/04/23 20:03:065440------0.525166---NMTNo
6sakuraSOFTWAREen-hi2021/04/29 11:59:475792------0.663932---NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
7NICT-2SOFTWAREen-hi2021/05/01 11:54:055892------0.596807---NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
8NICT-2SOFTWAREen-hi2021/05/01 12:07:335900------0.651775---NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
9JBJBJBSOFTWAREen-hi2021/05/02 22:57:235975------0.689604---NMTYesMBart50 fairseq
10HwTscSUSOFTWAREen-hi2022/07/11 13:16:176747------0.741615---NMTNoXX to XX transformer model trained on GNOME,KDE4,Ubuntu as well as other data from OPUS and finetune on dev set

Notice:

Back to top

AMFM


# Team Task Date/Time DataID AMFM
Method
Other
Resources
System
Description
unuse unuse unuse unuse unuse unuse unuse unuse unuse unuse
1ORGANIZERSOFTWAREen-hi2020/09/01 15:58:443608------0.713394---NMTNoBaseline MLNMT En to XX model using ALT, Ubuntu, GNOME and KDE4 data from opus. Transformer big model. Default settings.
2NICT-5SOFTWAREen-hi2020/09/18 19:05:093944------0.000000---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.
3NICT-5SOFTWAREen-hi2020/09/18 19:12:513955------0.000000---NMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size unbalanced.
4Bering LabSOFTWAREen-hi2021/04/17 09:52:425190------0.810847---NMTNoTransformer trained on OPUS with GNOME,KDE4,Ubuntu weighted
5jyjySOFTWAREen-hi2021/04/23 20:03:065440------0.728430---NMTNo
6sakuraSOFTWAREen-hi2021/04/29 11:59:475792------0.826771---NMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
7NICT-2SOFTWAREen-hi2021/05/01 11:54:055892------0.777014---NMTNoTransformer base model, multilingual + mixed domain training with domain fine-tuning.
8NICT-2SOFTWAREen-hi2021/05/01 12:07:335900------0.821077---NMTYesThe extended mBART model, mixed domain training with domain fine-tuning.
9JBJBJBSOFTWAREen-hi2021/05/02 22:57:235975------0.827720---NMTYesMBart50 fairseq
10HwTscSUSOFTWAREen-hi2022/07/11 13:16:176747------0.000000---NMTNoXX to XX transformer model trained on GNOME,KDE4,Ubuntu as well as other data from OPUS and finetune on dev set

Notice:

Back to top

HUMAN (WAT2022)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2021)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description
1sakuraSOFTWAREen-hi2021/04/29 11:59:475792UnderwayNMTNoMultilingual finetuning of mBART50 finetuned many-to-many model, ensemble of 3
2NICT-2SOFTWAREen-hi2021/05/01 12:07:335900UnderwayNMTYesThe extended mBART model, mixed domain training with domain fine-tuning.

Notice:
Back to top

HUMAN (WAT2020)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description
1NICT-5SOFTWAREen-hi2020/09/18 19:05:093944UnderwayNMTNoXX to XX transformer model trained on ALT as well as KDE, GNOME and Ubuntu data from OPUS. Corpora were size balanced.

Notice:
Back to top

HUMAN (WAT2019)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2018)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2017)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2016)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2015)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

HUMAN (WAT2014)


# Team Task Date/Time DataID HUMAN
Method
Other
Resources
System
Description

Notice:
Back to top

EVALUATION RESULTS USAGE POLICY

When you use the WAT evaluation results for any purpose such as:
- writing technical papers,
- making presentations about your system,
- advertising your MT system to the customers,
you can use the information about translation directions, scores (including both automatic and human evaluations) and ranks of your system among others. You can also use the scores of the other systems, but you MUST anonymize the other system's names. In addition, you can show the links (URLs) to the WAT evaluation result pages.

NICT (National Institute of Information and Communications Technology)
Kyoto University
Last Modified: 2018-08-02