Loughborough University
Browse

An adaptive multimodal semantic knowledge enhanced framework for sarcasm detection

journal contribution
posted on 2025-09-22, 14:18 authored by Jing Dong, Yu Sui, Qiang Zhang, Hui FangHui Fang, Gerald SchaeferGerald Schaefer, Rui Liu, Pengfei Yi, Xiaoyong Fang
<p dir="ltr">Multimodal sarcasm detection (MSD) has become an important research topic for understanding sentiments on social media, while various recent MSD approaches extract high-level semantic knowledge from images to improve performance. However, some key semantic information, such as emotions expressed in images, is still neglected, limiting reliable sentiment understanding. To address this issue, we propose an adaptive multimodal semantic knowledge enhanced framework for sarcasm detection. We first design an adaptive processing pipeline to extract emotion-aware visual semantics as an auxiliary modality to enhance multimodal feature representations. Enabled by two attention mechanisms, bidirectional cross-modal attention and graph attention, interactions between modalities are analysed to improve MSD performance. Extensive experiments are conducted on two public multimodal sarcasm detection datasets, MSD and MMSD 2.0, comprising approximately 19,000 tweet samples. Our proposed approach achieves consistent improvements in both sarcasm detection accuracy and F1-score compared to strong baseline models such as DIP and KnowleNet. Built upon a ViT-based architecture, the fine-tuned model offers competitive performance with lower computational overhead, highlighting its potential for practical deployment.</p>

Funding

Dalian Major Projects of Basic Research [2023JJ11CG002]

111 Center [D23006]

National Foreign Expert Project of China [D20240244]

Scientific Research Funds of Education Department of Liaoning Province [JYTMS20230379].

Interdisciplinary Research Project of Dalian University [DLUXK-2024-YB-007]

History

School

  • Science

Department

  • Computer Science

Published in

Expert Systems With Applications

Volume

298

Issue

Part C

Article number

129773

Publisher

Elsevier

Version

  • AM (Accepted Manuscript)

Rights holder

© Elsevier Ltd.

Acceptance date

2025-09-15

Publication date

2025-09-18

Copyright date

2025

ISSN

0957-4174

eISSN

1873-6793

Language

  • en

Depositor

Dr Hui Fang. Deposit date: 16 September 2025