๐—ช๐—ผ๐—ฟ๐—น๐—ฑ๐—•๐—ฒ๐—ป๐—ฐ๐—ต: ๐—ง๐—ผ๐—ฝ ๐— ๐—Ÿ๐—Ÿ๐—  ๐—ฆ๐—ฐ๐—ผ๐—ฟ๐—ฒ๐˜€ ๐Ÿฒ๐Ÿฐ%

MIT researchers released a new test called WorldBench. It checks how AI models understand images.

They tested 15 multimodal models. The top model scored 64%. Some models performed near chance level.

Most tests focus on tasks like reading charts or text. WorldBench focuses on visual diversity. It uses thousands of concepts. This includes living things and landscapes.

Key facts:

This tells you visual diversity is the main problem. Models need better vision encoders. They need more diverse training data.

The researchers did not release the code or data yet. You are unable to replicate the results now.

Source: https://dev.to/gentic_news/worldbench-top-mllm-scores-64-on-visually-diverse-benchmark-3h0g Optional learning community: https://t.me/GyaanSetuAi