The Telegram chronicles of online harm

Harmful language is frequent in social media, particularly in spaces that are considered anonymous and/or allow free participation. In this paper, we analyze the language in a Telegram channel populated by followers of former US President Donald Trump. We seek to identify the ways in which harmful language is used to create a specific narrative in a group of mostly like-minded discussants. Our research has several aims. First, we create an extended taxonomy of potentially harmful language that includes not only hate speech and direct insults (which have been the focus of existing computational methods), but also other forms of harmful speech discussed in the literature. We manually apply this taxonomy to a large portion of the corpus, including the period leading up to the January 2021 US Capitol riot and its aftermath. Our data give empirical evidence for forms of harmful speech, such as in/out-group divisive language and the use of codes within certain communities, that have not often been investigated before. Second, we compare our manual annotations of harmful speech to several automatic methods for classifying hate speech and offensive language, namely list-based and machine-learning-based approaches. We find that the Telegram data sets still pose particular challenges for these automatic methods. Finally, we argue for the value of studying such naturally-occurring, coherent data sets for research on online harm and how to address it in linguistics and philosophy.

Metadata
Author:	Tatjana Scheffler, Veronika Solopova, Mihaela Popa-Wyatt
URN:	urn:nbn:de:hbz:294-89000
DOI:	https://doi.org/10.5334/johd.31
Parent Title (English):	Journal of open humanities data
Publisher:	Ubiquity Press
Place of publication:	London
Document Type:	Article
Language:	English
Date of Publication (online):	2022/05/13
Date of first Publication:	2021/07/05
Publishing Institution:	Ruhr-Universität Bochum, Universitätsbibliothek
Tag:	Open Access Fonds; Social Media Telegram; corpus linguistics; hate speech; offensive language detection; online harm
Volume:	7
Issue:	8
First Page:	1
Last Page:	15
Note:	Article Processing Charge funded by the Open Access Publication Fund of Ruhr-Universität Bochum.
Institutes/Facilities:	Germanistisches Institut
open_access (DINI-Set):	open_access
faculties:	Fakultät für Philologie
Licence (English):	Creative Commons - CC BY 4.0 - Attribution 4.0 International
