| # |
Team |
Task |
Date/Time |
DataID |
AMFM |
Method
|
Other Resources
|
System Description |
| unuse |
unuse |
unuse |
unuse |
unuse |
unuse |
unuse |
unuse |
unuse |
unuse |
|
| 1 | CNLP-NITS-PP | MMCHMM24en-bn | 2022/07/11 12:54:09 | 6744 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | Transliteration-based phrase pairs augmentation and visual features in training using BRNN encoder and doubly-attentive-rnn decoder. |
| 2 | SILO_NLP | MMCHMM24en-bn | 2022/07/14 21:41:04 | 6940 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | Object Tags (Image) + Finetune mBART |
| 3 | ODIAGEN | MMCHMM24en-bn | 2023/07/06 04:05:40 | 7108 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | Image features extracted as Object tags appended with text and MBART fine-tuning |
| 4 | BITS-P | MMCHMM24en-bn | 2023/07/08 13:38:18 | 7122 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | Yes | NLLB model finetuned on captions + object tags of original & synthetic images using DETR model |
| 5 | 1128 | MMCHMM24en-bn | 2024/07/23 15:47:13 | 7138 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | initial model |
| 6 | 00-7 | MMCHMM24en-bn | 2024/08/05 15:16:58 | 7192 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | Yes | Test |
| 7 | v036 | MMCHMM24en-bn | 2024/08/11 12:35:30 | 7318 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | NMT based system using both image descriptors and text description. A multistage LLM pipeline used for extracting image data descriptions and translation. Fine tuning done in few cases
Models Used:
|
| 8 | 239233 | MMCHMM24en-bn | 2024/08/13 05:00:55 | 7375 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | Yes | One-shot prompt for synthetic QA description from captions; translate QA using IndicTrans2; generate caption from QA as context |
| 9 | v036 | MMCHMM24en-bn | 2024/08/15 16:27:53 | 7401 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | |
| 10 | v036 | MMCHMM24en-bn | 2024/08/15 19:41:50 | 7414 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | No | |
| 11 | IITP-AI-NLP-ML | MMCHMM24en-bn | 2025/10/22 21:56:31 | 7462 | - | - | - | - | - | - | 0.000000 | - | - | - | NMT | Yes | Used Selective Attention Architecture with IndicTrans as the base model and CLIP ViT-B/16 model to extract image features. We extract a) Full image feats, and b) cropped image feats and pick the one w |