Workshop on Video Large Language Models (VidLLMs)

Challenge Leaderboards

Composed Video Retrieval (CoVR) Challenge

Team Name | meanR4 | R1 | R10 | R5 | R50 | meanR3
plum | 94.7000 | 82.5000 | 98.7000 | 97.6000 | 100.0000 | 92.9300
Purratonin | 90.1700 | 71.9000 | 96.4000 | 93.2000 | 99.2000 | 87.1700
cvpr2025_unist | 82.8800 | 59.3000 | 90.5000 | 84.0000 | 97.7000 | 77.9300
Host_52483_Team | 73.9500 | 46.1000 | 82.3000 | 73.2000 | 94.2000 | 67.2000
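
The page does not define meanR3 and meanR4. The rows above are consistent with meanR3 being the average of R1, R5, and R10, and meanR4 additionally including R50. The following is a minimal sketch of that reading, not the organizers' scoring code; the helper name `mean_recall` and the `rows` dictionary are illustrative assumptions, and the recall values are copied from the leaderboard.

```python
# Recall@k values (percent) copied from the CoVR leaderboard rows.
rows = {
    "plum":            {"R1": 82.5, "R5": 97.6, "R10": 98.7, "R50": 100.0},
    "Purratonin":      {"R1": 71.9, "R5": 93.2, "R10": 96.4, "R50": 99.2},
    "cvpr2025_unist":  {"R1": 59.3, "R5": 84.0, "R10": 90.5, "R50": 97.7},
    "Host_52483_Team": {"R1": 46.1, "R5": 73.2, "R10": 82.3, "R50": 94.2},
}


def mean_recall(recalls, ks):
    """Average Recall@k over the given cutoffs (hypothetical helper)."""
    return sum(recalls[f"R{k}"] for k in ks) / len(ks)


for team, recalls in rows.items():
    # ASSUMPTION: meanR3 averages k = 1, 5, 10; meanR4 also includes k = 50.
    mean_r3 = mean_recall(recalls, (1, 5, 10))
    mean_r4 = mean_recall(recalls, (1, 5, 10, 50))
    print(f"{team}: meanR3={mean_r3:.2f} meanR4={mean_r4:.2f}")
```

Under this assumption the computed values match the published meanR3 and meanR4 columns to within rounding of the second decimal place.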

Complex Video Reasoning & Robustness Evaluation (CVRR) Challenge

Team Name | overall accuracy | continuity and object instance count | fine-grained action understanding | interpretation of social context | multiple actions in a single video | non-existent actions with existent scene depictions | non-existent actions with non-existent scene depictions | partial actions | time order understanding | understanding of emotional context | unusual and physically anomalous activities | visual context
DIVE | 0.8144 | 0.7200 | 0.7636 | 0.8800 | 0.5600 | 0.9800 | 0.9800 | 0.6800 | 0.6600 | 0.9600 | 0.8800 | 0.9000
NJUST__KMG | 0.7802 | 0.6000 | 0.6545 | 0.8200 | 0.6200 | 0.9600 | 0.9800 | 0.7200 | 0.5200 | 0.9200 | 0.9200 | 0.8800
love_liang | 0.7784 | 0.5600 | 0.6364 | 0.9000 | 0.5200 | 0.9200 | 0.9800 | 0.6400 | 0.7000 | 0.9600 | 0.8800 | 0.8800
PCIE | 0.7532 | 0.5200 | 0.6000 | 0.9200 | 0.4200 | 0.9000 | 1.0000 | 0.6000 | 0.6400 | 0.9600 | 0.9000 | 0.8400
PCIEgogogo | 0.7495 | 0.5200 | 0.6000 | 0.9200 | 0.4000 | 0.9000 | 1.0000 | 0.6000 | 0.6200 | 0.9600 | 0.9000 | 0.8400
PCIEgo | 0.7369 | 0.4800 | 0.5636 | 0.9000 | 0.4200 | 0.9200 | 0.9600 | 0.6200 | 0.6400 | 0.9600 | 0.8800 | 0.7800
aaa_vlm | 0.6523 | 0.4600 | 0.4727 | 0.8400 | 0.2000 | 0.9400 | 1.0000 | 0.5200 | 0.5400 | 0.5800 | 0.8000 | 0.8400
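
The overall accuracy column is slightly different from the plain average of the eleven category scores, which suggests it is computed over all questions rather than over categories. The sketch below illustrates one reading that reproduces the published numbers. The question counts are an assumption inferred from the granularity of the scores (multiples of 0.02 suggest 50 questions; values such as 0.7636 suggest 55 for fine-grained action understanding) and are not stated on the page. The names `QUESTION_COUNTS` and `overall_accuracy` are hypothetical.

```python
# Category accuracies in leaderboard column order (after "overall accuracy").
dive = [0.7200, 0.7636, 0.8800, 0.5600, 0.9800, 0.9800,
        0.6800, 0.6600, 0.9600, 0.8800, 0.9000]
njust_kmg = [0.6000, 0.6545, 0.8200, 0.6200, 0.9600, 0.9800,
             0.7200, 0.5200, 0.9200, 0.9200, 0.8800]

# ASSUMPTION: 50 questions per category, except 55 for the second
# category (fine-grained action understanding). Inferred, not official.
QUESTION_COUNTS = [50, 55, 50, 50, 50, 50, 50, 50, 50, 50, 50]


def overall_accuracy(accuracies, counts):
    """Question-weighted accuracy: total correct answers / total questions."""
    correct = sum(round(acc * n) for acc, n in zip(accuracies, counts))
    return correct / sum(counts)


print(f"DIVE: {overall_accuracy(dive, QUESTION_COUNTS):.4f}")
print(f"NJUST__KMG: {overall_accuracy(njust_kmg, QUESTION_COUNTS):.4f}")
```

If the assumed counts are right, both rows reproduce the published overall accuracy to four decimal places, whereas an unweighted category mean does not.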

Multilingual Video Reasoning Evaluation Challenge

Team Name | AVG Acc | MCQ Acc | OE Acc
ai_dreamers | 80.5386 | 87.0469 | 74.0303
wangzhiyu918 | 47.2354 | 52.2876 | 42.1833
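
The page does not expand the column names. Assuming MCQ Acc and OE Acc are accuracies on two question types (presumably multiple-choice and open-ended), the AVG Acc values above are consistent with an unweighted mean of the two. A minimal sketch of that check, with the hypothetical helper `average_accuracy`:

```python
# (MCQ Acc, OE Acc) pairs copied from the leaderboard rows.
scores = {
    "ai_dreamers": (87.0469, 74.0303),
    "wangzhiyu918": (52.2876, 42.1833),
}


def average_accuracy(mcq_acc, oe_acc):
    """ASSUMPTION: AVG Acc is the unweighted mean of the two accuracies."""
    return (mcq_acc + oe_acc) / 2


for team, (mcq, oe) in scores.items():
    print(f"{team}: AVG Acc ~ {average_accuracy(mcq, oe):.3f}")
```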