The Spanish EXIST 2025 corpus comprises three types of data: text (tweets), images (memes), and video (TikToks). Covering several media makes it possible to identify trends and patterns of sexism across formats and user interactions, contributing to a deeper understanding of social dynamics. In addition, the approaches submitted for all tasks will be evaluated jointly to assess their ability to detect sexism in multimodal sources.
Language(s)
Spanish
English
Dataset description link
Year
2025
Domain
Social
Format
JSON
Annotation guide link
Data access
Registration form
Data link
NLP Topic