𝗪𝗼𝗿𝗹𝗱𝗕𝗲𝗻𝗰𝗵: 𝗧𝗼𝗽 𝗠𝗟𝗟𝗠 𝗦𝗰𝗼𝗿𝗲𝘀 𝟲𝟰%

📅1 week ago⏱1 min read

MIT researchers released a new test called WorldBench. It checks how AI models understand images.

They tested 15 multimodal models. The top model scored 64%. Some models performed near chance level.

Most tests focus on tasks like reading charts or text. WorldBench focuses on visual diversity. It uses thousands of concepts. This includes living things and landscapes.

Key facts:

Released June 4 on arXiv.
Top model scored 64%.
Tests visual breadth over task depth.
Exposes gaps in visual understanding.

This tells you visual diversity is the main problem. Models need better vision encoders. They need more diverse training data.

The researchers did not release the code or data yet. You are unable to replicate the results now.

Source: https://dev.to/gentic_news/worldbench-top-mllm-scores-64-on-visually-diverse-benchmark-3h0g Optional learning community: https://t.me/GyaanSetuAi

𝗪𝗼𝗿𝗹𝗱𝗕𝗲𝗻𝗰𝗵: 𝗧𝗼𝗽 𝗠𝗟𝗟𝗠 𝗦𝗰𝗼𝗿𝗲𝘀 𝟲𝟰%

Continue reading

𝗧𝗵𝗲 𝗠𝘆𝘁𝗵 𝗢𝗳 𝗧𝗵𝗲 𝗦𝘁𝗿𝗼𝗻𝗴𝗲𝘀𝘁 𝗠𝗼𝗱𝗲𝗹

𝗟𝗼𝗰𝗮𝗹 𝗟𝗟𝗠𝘀 𝗶𝗻 𝟮𝟬𝟮𝟲 𝗯𝘂𝘁 𝗗𝗲𝘃 𝗘𝘅𝗽𝗲𝗿𝗶𝗲𝗻𝗰𝗲 𝗶𝗻 𝟮𝟬𝟭𝟬

𝗔𝗜 𝗖𝗢𝗠𝗣𝗔𝗡𝗜𝗘𝗦 𝗣𝗔𝗬 𝗠𝗜𝗟𝗟𝗜𝗢𝗡𝗦 𝗙𝗢𝗥 𝗬𝗢𝗨𝗥 𝗢𝗟𝗗 𝗣𝗢𝗦𝗧𝗦

𝗔𝗜 𝗙𝗮𝗶𝗹𝘀 𝗔𝘁 𝗜𝗧 𝗧𝗮𝘀𝗸𝘀

𝗦𝗩𝗼𝗧 𝗜𝗻𝗰𝗿𝗲𝗮𝘀𝗲𝘀 𝗦𝗽𝗮𝘁𝗶𝗮𝗹 𝗥𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 𝗕𝘆 𝟲𝟱%