
Data sets for practical data analysis

Contents

The package provides the following data sets:

  • precip_temp_DWD: temperature and precipitation in Germany, csv
  • ames_de_train: Ames housing train set, shortened version, csv
  • ames_de_test: Ames housing test set, shortened version, csv
  • elec_results: German elections 2017, csv
  • wellbeing: OECD wellbeing 2016, csv
  • extra: survey on extraversion 2015-2016, csv
  • cult_values: Schwartz cultural values 1988-2000, csv
  • socec: socio-economic data of all German 2017 electoral districts, csv
  • wahlkreise_shp: geomap of German electoral districts 2017, shape-file
  • wo_men: toy data set on sex, height, and shoe size, and their differences between women and men, csv
  • dating: online dating platform experiment 2015, csv
  • germanlex: German sentiment lexicon, csv
  • sentiws: German sentiment lexicon, csv
  • stats_test: predictors and score of statistics test, csv
  • countries: names of world countries, csv
  • afd: party platform of the AfD party as of 2016 (“Alternative fuer Deutschland”), csv
  • afd_2022: party platform of the AfD party as of 2022 (“Alternative fuer Deutschland”), csv
  • schimpfwoerter: list of profane words (Schimpfwörter) in German, csv
  • offensive: list of potentially offensive short texts in German, csv
  • wild_emojis: several emojis frequently used to express discontent or hate, csv
  • corr_data: simulated correlation data, csv
  • germeval_train and germeval_test: short text snippets similar to tweets for hate-speech classification, based on the GermEval2018 data, csv (train and test)

Note that some (most) of these data sets were not published by the author of this package. The original authors and the respective licences can be found in the description of each data set.

Installation

devtools::install_github("sebastiansauer/pradadata")

The package devtools needs to be installed first.

Getting started

In R (or RStudio), after having installed the package, load it with:

library("pradadata")
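Once the package is installed, a single data set can be loaded and inspected. The following is a minimal sketch, assuming that the names in the list above (here wo_men) are also the object names shipped with the package; the guard around the call is only there so the snippet does not fail when pradadata is missing.

```r
# Check whether pradadata is available before trying to load data from it.
pkg_available <- requireNamespace("pradadata", quietly = TRUE)

if (pkg_available) {
  # Assumption: "wo_men" is the object name of the toy data set listed above.
  data("wo_men", package = "pradadata")  # load the data set into the workspace
  str(wo_men)                            # show columns and their types
  print(head(wo_men))                    # show the first rows
} else {
  message("pradadata is not installed; see the Installation section.")
}
```

The help page of a data set (for example ?wo_men after loading the package) should list the original authors and the licence mentioned above.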

Access the data without package

Use the following template to access the datasets as csv files:

https://raw.githubusercontent.com/sebastiansauer/pradadata/master/data-raw/<NAME-OF-DATASET>.csv
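The template can be filled in programmatically. The sketch below is illustrative only: dataset_url() and read_pradadata() are hypothetical helper names, not functions of the package, and it assumes the csv file in data-raw carries the same name as the data set. Reading the file needs an internet connection.

```r
# Hypothetical helper: build the raw-csv URL for a data set name
# by filling in the template given above.
dataset_url <- function(name) {
  base <- "https://raw.githubusercontent.com/sebastiansauer/pradadata/master/data-raw/"
  paste0(base, name, ".csv")
}

# Hypothetical helper: download and parse the csv into a data frame.
# Assumption: the file is a plain comma-separated csv readable by read.csv().
read_pradadata <- function(name) {
  read.csv(dataset_url(name), stringsAsFactors = FALSE)
}

# Build the URL for the toy data set (no download happens here).
wo_men_url <- dataset_url("wo_men")
print(wo_men_url)
```

Calling read_pradadata("wo_men") would then return the data as a data frame. This applies to the csv data sets only; wahlkreise_shp is a shapefile and is not covered by the csv template.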

Requirements

R

Licence

Package-License

DOI
