MME-CC is a comprehensive multi-modal evaluation benchmark designed to assess cognitive capacity across diverse reasoning tasks. The benchmark comprises 11 carefully designed tasks with 1,173 samples, spanning three critical dimensions: Spatial Reasoning (SR), General Reasoning (GR), and Visual Knowledge Reasoning (VKR). Each task is crafted to challenge multi-modal models' ability to understand and reason across visual and textual information, providing insights into their cognitive capabilities.
Spatial Reasoning
Avg Input Tokens: 8,198
Avg Output Tokens: 4,076
Geometric Reasoning
Avg Input Tokens: 1,549
Avg Output Tokens: 6,204
Visual Knowledge Reasoning
Avg Input Tokens: 2,751
Avg Output Tokens: 1,329