Volume 12: Challenges and Perspectives of Hate Speech Research


The Right Kind of Explanation

Validity in automated hate speech detection

Laura Laugwitz

Berlin, 2023
DOI 10.48541/dcr.v12.23 (SSOAR)

Abstract: To quickly identify hate speech online, communication research offers a useful tool in the form of automatic content analysis. However, the combined methods of standardized manual content analysis and supervised text classification demand different quality criteria. This chapter shows that a more substantial examination of validity is necessary, since models often learn from spurious correlations or biases, and researchers run the risk of drawing wrong inferences. To investigate the overlap of theoretical concepts with their technological operationalization, explainability methods are evaluated for their ability to reveal what a model has learned. These methods prove to be of limited use for testing the validity of a model when the generated explanations aim at sense-making rather than faithfulness to the model. The chapter ends with recommendations for the further interdisciplinary development of automatic content analysis.


Laura Laugwitz is a PhD candidate at the Institute for Journalism and Communication Studies at Universität Hamburg, Germany.

Laugwitz, L. (2023). The right kind of explanation: Validity in automated hate speech detection. In C. Strippel, S. Paasch-Colberg, M. Emmer, & J. Trebbe (Eds.), Challenges and perspectives of hate speech research (pp. 383–402). Digital Communication Research. https://doi.org/10.48541/dcr.v12.23

This book is published open access and licensed under Creative Commons Attribution 4.0 (CC-BY 4.0).
The persistent long-term archiving of this book is carried out with the help of the Social Science Open Access Repository and the university library of Freie Universität Berlin (Refubium).