Text snippets, similar to tweets, in German language, partly offensive

Short texts, similar to tweets, with a classification of offensive or not. Two classifications are provided. `c1` classifies as hateful or not, whereas `c2` classifies the type of offensive content: abuse, insult, profanity or other (not offensive).

Usage

data(offensive)

Format

A data frame containing 3 variable and approx. 5009 rows

text: Character. Tweet-like text
c1: Character. Offensive content or not?
c2: Character. Type of offensive content

Source

Wiegand, Michael. 2019b. GermEval-2018 Corpus (DE). heiDATA. https://doi.org/10.11588/DATA/0B5VML., https://heidata.uni-heidelberg.de/dataset.xhtml?persistentId=doi:10.11588/data/0B5VML. Licenced under CC-By-4.0 Int.