54.6%45.4%Conversation83.1% (206 : 248)57%43%Detailed Description 75.3% (231 : 174)57%43%Complex Reasoning 96.5% (244 : 236)54%46%All: 85.1%(723 : 616)
GPT-4LLaVA