MAHED 2025 is a shared task at ArabicNLP 2025 Co-located with EMNLP 2025, focusing on the detection of hate speech, hope speech, and emotional expression in Arabic content across both textual and multimodal formats. This task aims to advance Arabic Natural Language Processing (NLP) through multi-task model and multimodal analysis. Participants may choose to participate in one or more of the following three subtasks:
Social media in the Arabic-speaking world exhibits a dynamic interplay between hateful and hopeful expressions. However, challenges like linguistic diversity, dialect variation, limited datasets, and the emergence of multimodal content (e.g., memes) make automatic detection complex.
MAHED 2025 addresses this by:
The shared task will use the following annotated datasets:
Attribute | Subtask 1 | Subtask 2 | Subtask 3 |
---|---|---|---|
Size | 9,843 | 8,515 | 4,500 |
Labels |
|
|
|
Source | Sub-Task 1 source | Sub-Task 2 source | Sub-Task 3 source |
Annotation: All content was collected from public social media, anonymized, and annotated by native speakers. Annotation agreement (Cohen's Kappa) is > 0.85.
Data Split: Training, development, and test sets provided with evaluation scripts.
Ethics: The dataset complies with ethical data-sharing standards.
MAHED 2025 consists of three interconnected subtasks:
Goal: Classify Arabic text as hate, hope, or not_applicable
Examples:
Goal: Identify the emotion, whether the text is offensive, and if offensive, whether it contains hate content.
Text | Emotion | Offensive | Hate |
---|---|---|---|
كل المهاجرين لصوص ويجب طردهم | anger | yes | hate |
يا حمار ليش نسيت المفاتيح؟ | anger | yes | not_hate |
أشعر بالحزن لأنني خسرت وظيفتي | sadness | no | — |
Goal: Detect whether a meme (text + image) is hateful or not.
Examples:
Participants may engage in any or all sub-tasks. The evaluation uses F1-score, precision, and recall, with macro-averaged F1 as the primary metric.
Evaluation will be performed using:
A pilot study with 2,000 text instances and 500 memes yielded:
Feedback refined the dataset; details will be shared upon acceptance.
Date | Event |
---|---|
June 1, 2025 | Release of training, dev data and evaluation scripts |
July 20, 2025 | Final registration deadline and test set release |
July 25, 2025 | Test submission deadline via CodaLab |
July 30, 2025 | Final results released to participants |
August 15, 2025 | System description papers due |
November 5-9, 2025 | ArabicNLP 2025 Workshop in Suzhou, China |
Particpants can submit their System Paper in openreview submission
.Reference:
Wajdi Zaghouani and Md. Rafiul Biswas. 2025. EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English and Arabic. arXiv preprint arXiv:2505.11959 [cs.CL]
Zaghouani, Wajdi, Hamdy Mubarak, and Md. Rafiul Biswas. 2024. So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 15044–15055, Torino, Italy
Alam, Firoj, et al. "Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-agent LLMs." International Conference on Web Information Systems Engineering. Singapore: Springer Nature Singapore, 2024.