Beyond Understanding: Evaluating the Pragmatic Gap in LLMs' Cultural Processing of Figurative Language

Afri-MCQA: Multimodal Cultural Question Answering for African Languages

MoMentS: A Comprehensive Multimodal Benchmark for Theory of Mind

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset

All languages matter: Evaluating lmms on culturally diverse 100 languages

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages

Cvqa: Culturally-diverse multilingual visual question answering benchmark