Data sets for practical data analysis
Contents
Data sets featuring data from
-
precip_temp_DWD
: temperature and precipitation in Germany, csv -
ames_de_train
: Ames housing train set, shortened version, csv -
ames_de_test
: Ames housind test set, shortened version, csv -
elec_results
: German elections 2017, csv -
wellbeing
: OECD wellbeing 2016, csv -
extra
: survey on extraversion 2015-2016, csv -
cult_values
: Schwartz cultural values 1988-2000, csv -
socec
: sociol economic data of all German 2017 electoral districts, csv -
wahlkreise_shp
: geomap of German electoral districts 2017, shape-file -
wo_men
: toy data set on sex, height, and shoe size, and their differences between women and men, csv -
dating
: online dating platform experiment 2015, csv -
germanlex
: German sentiment lexicon, csv -
sentiws
: German sentiment lexicon, csv -
stats_test
: predictors and score of statistics test, csv -
countries
: names of world countries, csv -
afd
: party platform of the AfD party as of 2016 (“Alternative fuer Deutschland”), csv -
afd_2022
: party platform of the AfD party as of 2022 (“Alternative fuer Deutschland”), csv -
schimpfwoerter
: List of profane words (Schimpfwörter) in German csv -
offensive
: List of potentially offensive short texts in German csv -
wild_emojis
: Several emojis frequently used to express discontent or hate csv -
corr_data
: Simulated correlation data csv -
germeval_train
andgermeval_test
: short text snippets similar to tweets for hate-speech classification based on the GermEval2018 data train csv, test-csv
Notes that some (most) of these data sets were not published by the author of this package. The original authors and the respective licences can be found in the desription of each data set.
Installation
devtools::install_github("sebastiansauer/pradadata")
The package devtools
need to be installed.
Getting started
In R (or RStudio) type (after having installed the package) library("pradadata")
Access the data without package
Use the following template to access the datasets as csv files:
https://raw.githubusercontent.com/sebastiansauer/pradadata/master/data-raw/<NAME-OF-DATASET>.csv
.