Gaussmodelle: Schätzen einer metrischen Variablen

# Gaussmodelle: Schätzen einer metrischen Variablen
## Kapitel 4

---

<style>

.center2 {
  margin: 0;
  position: absolute;
  top: 50%;
  left: 50%;
  -ms-transform: translate(-50%, -50%);
  transform: translate(-50%, -50%);
}

</style>

## Gliederung

</br>
</br>
</br>

2. [Teil 2: Gauss-Modelle: Wie groß sind die !Kung San?](#gaussmodelle)

3. [Hinweise](#hinweise)
]

---

## Software

Für dieses Thema benötigen Sie einige R-Pakete, die Sie wie folgt installieren können:

```r
pakete <- c("tidyverse", "rstan", "rstanarm", "bayesplot")

install.packages(pakete)
```

Für `rstan` wird [weitere Software](https://github.com/stan-dev/rstan/wiki/RStan-Getting-Started) benötigt.

---

# Verteilungen

---

## Häufigkeitsverteilung

.pull-left[
  
Die Verteilung eines *diskreten* Merkmals `$X$` mit `$k$` Ausprägungen zeigt, wie häufig die einzelnen Ausprägungen sind.

```r
  data(mtcars)
  mtcars %>% 
    count(cyl)
```

```
##   cyl  n
## 1   4 11
## 2   6  7
## 3   8 14
```
  
]

.pull-right[
  
Ein *stetiges* Merkmal lässt sich durch Klassenbildung diskretisieren:
    
<img src="Kapitel_4_chunk-img/Normalverteilung-2-1.png" width="100%" style="display: block; margin: auto;" />
  
]

---

## Wahrscheinlichkeitsverteilung

Eine *diskrete* Wahrscheinlichkeitsverteilung des Merkmals `$X$` ordnet jeder der `$k$` Ausprägungen `$X=x$` eine Wahrscheinlichkeit `$p$` zu. So hat die Variable *Geschlecht eines Babies* die beiden Ausprägungen *Mädchen* und *Junge* mit den Wahrscheinlichkeiten `$p_M = 51.2\%$` bzw. `$p_J = 48.8\%$` 
<a name=cite-gelman_regression_2021></a>([Gelman, Hill, and Vehtari, 2021](#bib-gelman_regression_2021)).

Bei *stetigen* Merkmalen `$X$` geht man von unendlich vielen Ausprägungen aus; die Wahrscheinlichkeit einer bestimmten Ausprägung ist (praktisch) Null: `$p(X=x_j)=0, \quad j=1,...,k$`. Daher gibt man stattdessen die *Dichte* der Wahrscheinlichkeit an: Das ist die Wahrscheinlichkeit(smasse) pro eine Einheit von `$X$`.

---

## Beispiele für  Wahrscheinlichkeitsdichte

Bei `$X=0$` hat eine Einheit von `$X$` die Wahrscheinlichkeitsmasse von 40%.

In Summe liegen 100% der Wahrscheinlichkeitsmasse unter der Kurve.

]

<img src="Kapitel_4_chunk-img/Normalverteilung-4-1.png" width="100%" style="display: block; margin: auto;" /><img src="Kapitel_4_chunk-img/Normalverteilung-4-2.png" width="100%" style="display: block; margin: auto;" />
Bei `$X=0$` hat eine Einheit von `$X$` die Wahrscheinlichkeitsmasse von 50% bzw. 33%.

]

---

## Quantile und Verteilungsfunktion

- *Quantile* teilen eine Verteilung so ein, dass ein Anteil `$p$` kleiner und der andere Teil `$1-p$` größer oder gleich dem Quantil `$q$` ist.
    - *Beispiel*: "50%-Quantil = 100" meint, dass 50% der Werte der Verteilung einen Wert kleiner als 100 haben.
    
- Die *Verteilungsfunktion F* für ein Quantil `$q$` gibt den Anteil der Verteilung an, der nur  Werte höchstens so groß wie `$q$` beinhaltet. Sie zeigt also die kumulierte Wahrscheinlichkeit `$[-\infty, q)$`.
    - *Beispiel*: "F(100) = 50%" meint, dass der Anteil der Verteilung für Werte nicht größer als 100 50% beträgt.

---

## Gesetz der großen Zahl

Zieht man (zufällig) immer mehr Werte aus einer Verteilung (mit endlichem Mittelwert), nähert sich der Mittelwert der Stichprobe immer mehr mit dem Mittelwert (oft als Erwartungswert bezeichnet) der Verteilung an 
<a name=cite-taleb2019technical></a>([Taleb, 2019](https://nassimtaleb.org/2020/01/final-version-fat-tails/))

---

## Normal auf dem Fußballfeld

Sie und 1000 Ihrer besten Freunde stehen auf der Mittellinie eines Fußballfelds (eng). Auf Kommando werfen alle jeweils eine Münze; bei Kopf geht man einen Schritt nach links, bei Zahl nach rechts. Das wird 16 Mal wiederholt. Wie wird die Verteilung der Positionen wohl aussehen?

.footnote[<a name=cite-mcelreath_statistical_2020></a>([McElreath, 2020](#bib-mcelreath_statistical_2020))]

---

## Normal durch Addieren

Die Summe vieler (gleich starker) Zufallswerte (aus der gleichen Verteilung) erzeugt eine Normalverteilung; egal aus welcher Verteilung die Zufallswerte kommen (Zentraler Grenzwertsatz).

---

## Nicht verwechseln

<a name=cite-freeman_visual_2006></a>([Freeman, 2006](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2465539/))

---

## Normalverteilung vs. randlastige Verteilungen

Bei randlastigen Verteilungen ("fat tails") kommen Extremereignisse viel häufiger vor als bei Normalverteilungen. Deshalb ist es wichtig sein, zu wissen, ob eine Normalverteilung oder eine randlastige Verteilung vorliegt. Viele statistische Methoden sind nicht zuverlässig bei (stark) randlastigen Methoden 
([Taleb, 2019](https://nassimtaleb.org/2020/01/final-version-fat-tails/))

---

## Beispiele für Normal- und randlastige Verteilungen

### Normal verteilt

- Größe 
- Münzwürfe
- Gewicht
- IQ
- Blutdruck
- Ausschuss einer Maschine
]

### Randlastig verteilt

- Vermögen
- Verkaufte Bücher
- Ruhm
- Aktienkurse
- Erdbeben
- Pandemien
- Kriege
- Erfolg auf Tinder
- Meteroritengröße
- Stadtgrößen
]

---

## Formel der Normalverteilung

Vereinfacht ausgedrückt lässt die Normalverteilung `$\mathcal{N}$` durch Exponenzieren einer Quadratfunktion beschreiben:

`$$\mathcal{N} \propto e^{-x^2}$$`

mit `$e=2.71...$`, der Eulerschen Zahl.

```r
d <-
  tibble(
    x = seq(-3, 3, 
            length.out = 100),
    y = exp(-x^2)
  )

d %>% 
  ggplot() +
  aes(x = x, y = y) +
  geom_line()
```

]

]

Die Normalverteilung wird auch *[Gauss](https://de.wikipedia.org/wiki/Carl_Friedrich_Gau%C3%9F)-Verteilung* oder *Glockenkurve* genannt.

---

## IQ-Verteilung: Quantile

`$IQ \sim \mathcal{N}(100,15)$`

- Wie schlau muss man sein, um zu den unteren 75%, 50%, 25%, 5%, 1% zu gehören?
- Anders gesagt: Welcher IQ-Wert wird von 75%, 50%, ... der Leute nicht überschritten?

Ziehen wir Stichproben aus `$\mathcal{N}(100,15)$`:

```r
d <-tibble(
  iq = rnorm(1e4, 
             mean = 100, 
             sd = 15))

probs <- c(0.75,.5,.25,.05,.01)

d_summary <- d %>% 
  summarise(
    p = probs,
    q = quantile(iq, probs))
```

]

<div id="hojlmvrpxn" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#hojlmvrpxn .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#hojlmvrpxn .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#hojlmvrpxn .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#hojlmvrpxn .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#hojlmvrpxn .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#hojlmvrpxn .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#hojlmvrpxn .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#hojlmvrpxn .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#hojlmvrpxn .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#hojlmvrpxn .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#hojlmvrpxn .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#hojlmvrpxn .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#hojlmvrpxn .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#hojlmvrpxn .gt_from_md > :first-child {
  margin-top: 0;
}

#hojlmvrpxn .gt_from_md > :last-child {
  margin-bottom: 0;
}

#hojlmvrpxn .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#hojlmvrpxn .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#hojlmvrpxn .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#hojlmvrpxn .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#hojlmvrpxn .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#hojlmvrpxn .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#hojlmvrpxn .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#hojlmvrpxn .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#hojlmvrpxn .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#hojlmvrpxn .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#hojlmvrpxn .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#hojlmvrpxn .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#hojlmvrpxn .gt_left {
  text-align: left;
}

#hojlmvrpxn .gt_center {
  text-align: center;
}

#hojlmvrpxn .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#hojlmvrpxn .gt_font_normal {
  font-weight: normal;
}

#hojlmvrpxn .gt_font_bold {
  font-weight: bold;
}

#hojlmvrpxn .gt_font_italic {
  font-style: italic;
}

#hojlmvrpxn .gt_super {
  font-size: 65%;
}

#hojlmvrpxn .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">p</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">q</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_right">0.75</td>
<td class="gt_row gt_right">110</td></tr>
    <tr><td class="gt_row gt_right">0.50</td>
<td class="gt_row gt_right">100</td></tr>
    <tr><td class="gt_row gt_right">0.25</td>
<td class="gt_row gt_right">90</td></tr>
    <tr><td class="gt_row gt_right">0.05</td>
<td class="gt_row gt_right">75</td></tr>
    <tr><td class="gt_row gt_right">0.01</td>
<td class="gt_row gt_right">65</td></tr>
  </tbody>
  
  
</table>
</div>

Das *Quantil* `$q$` zur kumulierten Wahrscheinlichkeit `$p=75$` ist 110, etc. 
]

---

## IQ-Verteilung: Anteile

`$IQ \sim \mathcal{N}(100,15)$`

- Welcher Anteil `$p$` gehört zu den IQ-Werten 75, 100, 115, 130?
- Anders gesagt: Welcher Anteil der Wahrscheinlichkeitsmasse der Verteilung liegt unter IQ=75, IQ=100, etc.?

```r
d <-
  tibble(
    iq = rnorm(1e4, 
               mean = 100, 
               sd = 15)) %>% 
  mutate(iq = round(iq))

qs <- c(75,100,115,130)

d %>% 
  count(p_100 = iq < 100) %>% 
  mutate(prop = n / sum(n)) 
```

]

<div id="opxzqcnrwb" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#opxzqcnrwb .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#opxzqcnrwb .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#opxzqcnrwb .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#opxzqcnrwb .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#opxzqcnrwb .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#opxzqcnrwb .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#opxzqcnrwb .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#opxzqcnrwb .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#opxzqcnrwb .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#opxzqcnrwb .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#opxzqcnrwb .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#opxzqcnrwb .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#opxzqcnrwb .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#opxzqcnrwb .gt_from_md > :first-child {
  margin-top: 0;
}

#opxzqcnrwb .gt_from_md > :last-child {
  margin-bottom: 0;
}

#opxzqcnrwb .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#opxzqcnrwb .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#opxzqcnrwb .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#opxzqcnrwb .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#opxzqcnrwb .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#opxzqcnrwb .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#opxzqcnrwb .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#opxzqcnrwb .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#opxzqcnrwb .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#opxzqcnrwb .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#opxzqcnrwb .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#opxzqcnrwb .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#opxzqcnrwb .gt_left {
  text-align: left;
}

#opxzqcnrwb .gt_center {
  text-align: center;
}

#opxzqcnrwb .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#opxzqcnrwb .gt_font_normal {
  font-weight: normal;
}

#opxzqcnrwb .gt_font_bold {
  font-weight: bold;
}

#opxzqcnrwb .gt_font_italic {
  font-style: italic;
}

#opxzqcnrwb .gt_super {
  font-size: 65%;
}

#opxzqcnrwb .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_center" rowspan="1" colspan="1">p_100</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">n</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">prop</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_center">FALSE</td>
<td class="gt_row gt_right">5090</td>
<td class="gt_row gt_right">0.51</td></tr>
    <tr><td class="gt_row gt_center">TRUE</td>
<td class="gt_row gt_right">4910</td>
<td class="gt_row gt_right">0.49</td></tr>
  </tbody>
  
  
</table>
</div>

Anstelle von `iq < 100` kann man `iq < 115` einsetzen, etc.

Die *Verteilungsfunktion* (der Anteil der Wahrscheinlichkeitsmasse), `p`, für IQ-Werte nicht größer als 100, d.h. zum Quantil `$q=100$`, ist 50%, etc.

]

---

## Quantile der Normalverteilung visualisiert

`$IQ \sim \mathcal{N}(100, 15)$`

```r
qnorm(.50, mean = 100, sd = 15)  # 50%-Quantil
pnorm(100, mean = 100, sd = 15)  # Verteilungsfunktion für IQ=100
```

---

## Normalverteilung als konservative Wahl

### Ontologische Begründung

- Wirken viele, gleichstarke Einflüsse additiv zusammen, entsteht eine Normalverteilung  ([McElreath, 2020](#bib-mcelreath_statistical_2020)), Kap. 4.1.4.

### Epistemologische Begründung

- Wenn wir nur wissen, dass eine Variable über einen endlichen Mittelwert und eine endliche Varianz verfügt und wir keine weiteren Annahmen treffen bzw. über kein weiteres Vorwissen verfügen, dann ist die Normalverteilung die plausibelste Verteilung (maximale Entropie) ([McElreath, 2020](#bib-mcelreath_statistical_2020)), Kap. 7 und 10.

]

---

## Zweidimensionale Normalverteilung, unkorreliert

.tiny[[Quelle](https://tex.stackexchange.com/questions/31708/draw-a-bivariate-normal-distribution-in-tikz)]

---

## 2D-Normalverteilung mit R, unkorreliert

`$r(X,Y) = 0$`

```r
d1 <- 
  tibble( 
    x=rnorm(1e4), 
    y=rnorm(1e4)
  )

ggplot(d1) +
  aes(x, y) +
  geom_hex()

ggplot(d1) +
  aes(x, y) +
  geom_density2d()
```

.tiny[[ggplot-Referenz](https://ggplot2.tidyverse.org/reference/geom_density_2d.html), [Quellcode](https://www.r-graph-gallery.com/2d-density-plot-with-ggplot2.html)]

Mit `scale_fill_continuous(type = "viridis")`kann man die Farbpalette der Füllfarbe ändern.
]

]

---

## 2D-Normalverteilung mit R, korreliert, r=0.7

Die ersten paar Zeilen der Daten:

<div id="gnofaowtxy" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#gnofaowtxy .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#gnofaowtxy .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#gnofaowtxy .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#gnofaowtxy .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#gnofaowtxy .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#gnofaowtxy .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#gnofaowtxy .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#gnofaowtxy .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#gnofaowtxy .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#gnofaowtxy .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#gnofaowtxy .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#gnofaowtxy .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#gnofaowtxy .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#gnofaowtxy .gt_from_md > :first-child {
  margin-top: 0;
}

#gnofaowtxy .gt_from_md > :last-child {
  margin-bottom: 0;
}

#gnofaowtxy .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#gnofaowtxy .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#gnofaowtxy .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#gnofaowtxy .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#gnofaowtxy .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#gnofaowtxy .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#gnofaowtxy .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#gnofaowtxy .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#gnofaowtxy .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#gnofaowtxy .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#gnofaowtxy .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#gnofaowtxy .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#gnofaowtxy .gt_left {
  text-align: left;
}

#gnofaowtxy .gt_center {
  text-align: center;
}

#gnofaowtxy .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#gnofaowtxy .gt_font_normal {
  font-weight: normal;
}

#gnofaowtxy .gt_font_bold {
  font-weight: bold;
}

#gnofaowtxy .gt_font_italic {
  font-style: italic;
}

#gnofaowtxy .gt_super {
  font-size: 65%;
}

#gnofaowtxy .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">X1</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">X2</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_right">1.07</td>
<td class="gt_row gt_right">1.16</td></tr>
    <tr><td class="gt_row gt_right">−0.15</td>
<td class="gt_row gt_right">−0.82</td></tr>
    <tr><td class="gt_row gt_right">1.47</td>
<td class="gt_row gt_right">0.11</td></tr>
  </tbody>
  
  
</table>
</div>

Berechnen wir die Korrelation `r`:

```r
d2 %>% 
  summarise(
    r = cor(X1,X2),
    n = n()
  )
```

<div id="udiubmmmah" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#udiubmmmah .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#udiubmmmah .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#udiubmmmah .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#udiubmmmah .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#udiubmmmah .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#udiubmmmah .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#udiubmmmah .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#udiubmmmah .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#udiubmmmah .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#udiubmmmah .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#udiubmmmah .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#udiubmmmah .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#udiubmmmah .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#udiubmmmah .gt_from_md > :first-child {
  margin-top: 0;
}

#udiubmmmah .gt_from_md > :last-child {
  margin-bottom: 0;
}

#udiubmmmah .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#udiubmmmah .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#udiubmmmah .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#udiubmmmah .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#udiubmmmah .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#udiubmmmah .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#udiubmmmah .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#udiubmmmah .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#udiubmmmah .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#udiubmmmah .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#udiubmmmah .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#udiubmmmah .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#udiubmmmah .gt_left {
  text-align: left;
}

#udiubmmmah .gt_center {
  text-align: center;
}

#udiubmmmah .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#udiubmmmah .gt_font_normal {
  font-weight: normal;
}

#udiubmmmah .gt_font_bold {
  font-weight: bold;
}

#udiubmmmah .gt_font_italic {
  font-style: italic;
}

#udiubmmmah .gt_super {
  font-size: 65%;
}

#udiubmmmah .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">r</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">n</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_right">0.70</td>
<td class="gt_row gt_right">10,000.00</td></tr>
  </tbody>
  
  
</table>
</div>

]

]

---

## Die Mensch-ärgere-dich-nicht-Verteilung

- Wie lange muss man warten, bis man bei Mensch-ärgere-dich-nicht raus darf?
- Wieviel Vitamine sind nach einer Woche noch in meiner Möhre?
- Wie lange hält eine Glühbirne, bevor sie den Geist aufgibt?
- Wie weit rollt ein Apfel vom Stamm?
- Wie weit liegt eine Expertin mit ihrer Schätzung daneben?
- ...

]

.pull-right[
Solche Fragen kann man mit dieser Verteilung darstellen:
<img src="Kapitel_4_chunk-img/unnamed-chunk-17-1.png" width="100%" style="display: block; margin: auto;" />

]

</br>

---

## Darf ich vorstellen: Die Exponential-Verteilung

`$$X \sim \operatorname{Exp}(1)$$`

- Eine *Exp*onentialverteilung ist nur für positive Werte, `$x>0$`, definiert.
- Steigt X um eine Einheit, so ändert sich Y um einen konstanten Faktor.
- Sie hat nur einen Parameter, genannt *Rate* oder `$\lambda$` ("lambda").
- `$\frac{1}{\lambda}$`  gibt gleichzeitig Mittelwert und Streuung ("Gestrecktheit") der Verteilung an.
- Je größer die Rate `$\lambda$`, desto *kleiner* die Streuung und der Mittelwert der Verteilung.
- Je größer  `$1/\lambda$`, desto *größer* die Streuung und der Mittelwert der Verteilung.

---

## Exponentialverteilung berechnen

```r
d <- tibble(
  x = seq(0, 5, by = 0.01),
  y = 2^(-x),
  y2 = 2.71^(-x))  # e=2.71...

d %>% 
  ggplot(aes(x)) +
  geom_line(aes(y=y)) +
  geom_line(aes(y=y2), 
            color ="blue") +
  labs(title = paste0("blau: e hoch x,","schwarz: 2 hoch x"))
```

]

.pull-right[
<img src="Kapitel_4_chunk-img/exp1-plot-1.png" width="100%" style="display: block; margin: auto;" />

]

---

## Exponentialverteilung mit R

Für `$e^x$` -- Exponenziern mit `$e$` (Eulersche Zahl) als Basis -- gibt's in R die Funktion `exp()`. Mit `dexp()` bekommt man die zugehörige Wahrscheinlichkeitsdichte.

```r
d <-
  tibble(
    x = seq(0, 5,.1),
    y1 = dexp(x, rate = 1),
    y2 = dexp(x, rate = 0.5)
  )

d %>% 
  ggplot(aes(x)) +
  geom_line(aes(y = y1)) +
  geom_line(aes(y = y2), 
            color = "blue")
```

]

`$$\beta \sim \operatorname{Exp}(1)$$`

`$$\color{blue}\beta \color{blue}\sim \color{blue}{\operatorname{Exp}(0.5)}$$`

]

Je kleiner die Rate `$\lambda$`, desto *größer* die Streuung der Verteilung.

---

## Quantile der Exponentialverteilung

... Wenn du nicht mehr weiter weißt, ziehe ein Stichprobe.

Wie weit fällt ein Apfel 🍎 vom Stamm 🌳, wenn wir `$\text{Apfel} \sim \mathcal{E}(1)$` annehmen?

```r
d <- tibble(apfel = rexp(n = 1e4, rate = 1))

d %>% ggplot(aes(x = apfel)) + geom_histogram()
```

---

# Gaussmodelle

## Wie groß sind die !Kung San?

---

## !Kung San

.tiny[
[Quelle](https://upload.wikimedia.org/wikipedia/commons/b/b5/Wandering_hunters_%28Masarwa_bushmen%29%2C_North_Kalahari_Desert.jpg) Internet Archive Book Images, No restrictions, via Wikimedia Commons]

]

.tiny[By Andrewwik.0 - Own work, CC BY-SA 4.0, [Quelle](https://commons.wikimedia.org/w/index.php?curid=79801340)]

]

---

background-image: url("https://upload.wikimedia.org/wikipedia/commons/e/e6/Kalahari_PICT0036.JPG")
background-position: center
background-size: contain

.footnote[Winfried Bruenken (Amrum), CC BY-SA 2.5 <https://creativecommons.org/licenses/by-sa/2.5>, via Wikimedia Commons]

---

## !Kung Data

[Datenquelle](https://raw.githubusercontent.com/sebastiansauer/2021-wise/main/Data/Howell1a.csv)

```r
library(tidyverse)
Kung_path <-  # Datenquelle s.o.
  "https://tinyurl.com/jr7ckxxj"

d <- read_csv(Kung_path)

d2 <- 
  d %>% 
  filter(age > 18)
```

]
]

```r
d2 <- d %>% 
  filter(age >= 18)
```

```r
library(rstatix)
get_summary_stats(d2)
```

]

</br>

<div id="lklujgjmri" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#lklujgjmri .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#lklujgjmri .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#lklujgjmri .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#lklujgjmri .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#lklujgjmri .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#lklujgjmri .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#lklujgjmri .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#lklujgjmri .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#lklujgjmri .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#lklujgjmri .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#lklujgjmri .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#lklujgjmri .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#lklujgjmri .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#lklujgjmri .gt_from_md > :first-child {
  margin-top: 0;
}

#lklujgjmri .gt_from_md > :last-child {
  margin-bottom: 0;
}

#lklujgjmri .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#lklujgjmri .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#lklujgjmri .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#lklujgjmri .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#lklujgjmri .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#lklujgjmri .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#lklujgjmri .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#lklujgjmri .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#lklujgjmri .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#lklujgjmri .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#lklujgjmri .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#lklujgjmri .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#lklujgjmri .gt_left {
  text-align: left;
}

#lklujgjmri .gt_center {
  text-align: center;
}

#lklujgjmri .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#lklujgjmri .gt_font_normal {
  font-weight: normal;
}

#lklujgjmri .gt_font_bold {
  font-weight: bold;
}

#lklujgjmri .gt_font_italic {
  font-style: italic;
}

#lklujgjmri .gt_super {
  font-size: 65%;
}

#lklujgjmri .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_left" rowspan="1" colspan="1">variable</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">n</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">min</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">max</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">median</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">q1</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">q3</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">iqr</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">mad</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">mean</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">sd</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">se</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">ci</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_left">age</td>
<td class="gt_row gt_right">346</td>
<td class="gt_row gt_right">19.00</td>
<td class="gt_row gt_right">88.00</td>
<td class="gt_row gt_right">40.00</td>
<td class="gt_row gt_right">29.00</td>
<td class="gt_row gt_right">51.00</td>
<td class="gt_row gt_right">22.00</td>
<td class="gt_row gt_right">16.31</td>
<td class="gt_row gt_right">41.54</td>
<td class="gt_row gt_right">15.81</td>
<td class="gt_row gt_right">0.85</td>
<td class="gt_row gt_right">1.67</td></tr>
    <tr><td class="gt_row gt_left">height</td>
<td class="gt_row gt_right">346</td>
<td class="gt_row gt_right">136.53</td>
<td class="gt_row gt_right">179.07</td>
<td class="gt_row gt_right">154.31</td>
<td class="gt_row gt_right">148.59</td>
<td class="gt_row gt_right">160.66</td>
<td class="gt_row gt_right">12.06</td>
<td class="gt_row gt_right">8.47</td>
<td class="gt_row gt_right">154.64</td>
<td class="gt_row gt_right">7.77</td>
<td class="gt_row gt_right">0.42</td>
<td class="gt_row gt_right">0.82</td></tr>
    <tr><td class="gt_row gt_left">male</td>
<td class="gt_row gt_right">346</td>
<td class="gt_row gt_right">0.00</td>
<td class="gt_row gt_right">1.00</td>
<td class="gt_row gt_right">0.00</td>
<td class="gt_row gt_right">0.00</td>
<td class="gt_row gt_right">1.00</td>
<td class="gt_row gt_right">1.00</td>
<td class="gt_row gt_right">0.00</td>
<td class="gt_row gt_right">0.47</td>
<td class="gt_row gt_right">0.50</td>
<td class="gt_row gt_right">0.03</td>
<td class="gt_row gt_right">0.05</td></tr>
    <tr><td class="gt_row gt_left">weight</td>
<td class="gt_row gt_right">346</td>
<td class="gt_row gt_right">31.52</td>
<td class="gt_row gt_right">62.99</td>
<td class="gt_row gt_right">45.01</td>
<td class="gt_row gt_right">40.33</td>
<td class="gt_row gt_right">49.38</td>
<td class="gt_row gt_right">9.04</td>
<td class="gt_row gt_right">6.72</td>
<td class="gt_row gt_right">45.05</td>
<td class="gt_row gt_right">6.46</td>
<td class="gt_row gt_right">0.35</td>
<td class="gt_row gt_right">0.68</td></tr>
  </tbody>
  
  
</table>
</div>

---

## Wir gehen apriori von normalverteilter Größe aus

</br>
</br>
<img src="https://upload.wikimedia.org/wikipedia/commons/8/83/SVG_Human_Silhouette.svg" width="50%" style="display: block; margin: auto;" />

.footnote[.tiny[Own alterations andFile:SVG_Human_With_All_Organs.svg by Madhero88, CC BY-SA 3.0 <https://creativecommons.org/licenses/by-sa/3.0>, via Wikimedia Commons]
]

]

`$$\mu \sim \mathcal{N}(178, 20)$$`

]

---

## Unser Gauss-Modell der !Kung

Wir nehmen an, dass `$\mu$` und `$h_i$` normalverteilt sind und `$\sigma$` exponentialverteilt (da notwendig positiv) ist:

Prior für `$\mu$`:
`$\mu \sim \mathcal{N}(178, 20)$`

Prior für `$\sigma$`:
`$\sigma \sim \mathcal{E}(0, 0.1)$`

</br>
</br>

`$95\%KI( \mu):$`

`$178 \pm 40$`
]

]

---

## Der Likelihood wird von den Prioris gespeist

### Likelihood

Die einzelnen Größen `$h_i$` sind normalverteilt mit Mittelwert `$\mu$` und Streuung `$\sigma$`:

`$$h_i \sim \mathcal{N}(\color{blue}{\mu},\color{green}{\sigma})$$`
### Prioris

Mittelwert der Größe ist normalverteilt mit `$\mu=178$` und `$\sigma=20$`:

`$$\color{blue}{\mu \sim \mathcal{N}(178, 20)}$$`

Die Streuung `$\sigma$` der Größen ist exponentialverteil mit `$\lambda = 0.1$`.

`$$\color{green}{\sigma \sim \mathcal{E}(0.1)}$$`

---

## Welche Beobachtungen sind auf Basis unseres Modells zu erwarten?

```r
n <- 1e4

sim <- tibble(sample_mu  = 
      rnorm(n, 
            mean = 178, 
            sd   = 20),
    sample_sigma = 
      rexp(n, 
            rate = 0.1)) %>% 
  mutate(height  = 
      rnorm(n, 
            mean = sample_mu, 
            sd   = sample_sigma))

height_sim_sd <- 
  sd(sim$height) %>% round()
height_sim_mean <- 
  mean(sim$height) %>% round()
```
]

🦾 Ziehen wir mal ein paar Stichproben auf Basis des Modells. Voilà:

]

.footnote[.tiny[[Quellcode](https://bookdown.org/content/4857/geocentric-models.html#a-gaussian-model-of-height)]]

---

## Priori-Werte prüfen mit der Priori-Prädiktiv-Verteilung

- Die Priori-Prädiktiv-Verteilung (`sim`) simuliert Beobachtungen (nur) auf Basis der Priori-Annahmen: 
`$h_i \sim \mathcal{N}(\mu, \sigma),$`
`$\mu \sim \mathcal{N}(178, 20),$`
`$\sigma \sim \mathcal{E}(0.1)$`
- So können wir prüfen, ob die Priori-Werte vernünftig sind.

Die Priori-Prädiktiv-Verteilung zeigt, dass unsere Priori-Werte ziemlich vage sind, also einen zu breiten Bereich an Größenwerten zulassen:

]

Anteil `$h_i > 200$`:

```r
anteil_großer_kung <- 
sim %>% 
  count( height > 200) %>% 
  mutate(prop = n/sum(n))
anteil_großer_kung
```

```
## # A tibble: 2 × 3
##   `height > 200`     n  prop
##   <lgl>          <int> <dbl>
## 1 FALSE           8328 0.833
## 2 TRUE            1672 0.167
```

🤔 Sehr große Buschleute? 17 Prozent sind größer als 2 Meter. Das ist diskutabel, muss aber kein schlechter Prior sein.
]

---

##  Vorhersagen der Priori-Werte

---

## Extrem vage Priori-Verteilung für die Streuung?

`$$\sigma \sim \mathcal{E}(\lambda=0.01)$$`

]

Die Streuung der Größen ist weit:

]

🤔 Das Modell geht apriori von ein paar Prozent Menschen mit *negativer* Größe aus. Ein Haufen Riesen 👹 werden auch erwartet.

🤯   Vage (flache, informationsarme, "neutrale", "objektive") Priori-Werte machen oft keinen Sinn.

---

## Zufällige Motivationsseite

---

## Posteriori-Verteilung des Größen-Modells, `m41`

]

- Wir bekommen eine Wahrscheinlichkeitsverteilung für `$\mu$` und eine für `$\sigma$` (bzw. eine zweidimensionale Verteilung, für die `$\mu,\sigma$`-Paare).

- Trotz des eher vagen Priors ist die Streuung Posteriori-Werte für `$\mu$` und `$\sigma$` klein: Die große Stichprobe hat die Priori-Werte überstimmt.

- Ziehen wir Stichproben aus der Posteriori-Verteilung, so können wir interessante Fragen stellen.

]

---

## Hallo, Posteriori-Verteilung

... wir hätten da mal ein paar Fragen an Sie. 🕵

- Mit welcher Wahrscheinlichkeit ist die mittlere !Kung-Person größer als 1,55m?
- Welche mittlere Körpergröße wird mit 95% Wahrscheinlichkeit nicht überschritten, laut dem Modell?
- In welchem 90%-PI liegt `$\mu$` vermutlich?
- Mit welcher Unsicherheit `$\sigma$` ist die Schätzung der mittleren Körpergröße behaftet?
- Welcher Wert der mittleren Körpergröße hat die höchste Wahrscheinlichkeit?

]

]

---

## Posteriori-Stichproben mit `stan_glm()` berechnen

- Mit `stan_glm()` können wir komfortabel die Posteriori-Verteilung berechnen. 
- Die Gittermethode wird nicht verwendet, aber die Ergebnisse sind - in bestimmten Situationen - ähnlich.
- Es werden aber auch viele Stichproben simuliert (sog. MCMC-Methode).
- Gibt man keine Priori-Werte an, so greift die Funktion auf Standardwerte zurück.

```r
library(rstanarm)
# berechnet Post.-Vert.:

stan_glm(
  # modelldefinition:
  AV ~ UV,
, # Datensatz:
  data = meine_daten
)
```

]

`$h_i \sim \mathcal{N}(\mu, \sigma)$`, Likelihood

`$\mu \sim \mathcal{N}(155, 19)$`, Prior Größenmittelwert

`$\sigma \sim \mathcal{E}(0.13)$`, Prior Streuung der Größen

]

---

## Ausgabe von `stan_glm()`

```r
m41 <- stan_glm(height ~ 1, data = d2)

print(m41)
```

```
## stan_glm
##  family:       gaussian [identity]
##  formula:      height ~ 1
##  observations: 346
##  predictors:   1
## ------
##             Median MAD_SD
## (Intercept) 154.6    0.4 
## 
## Auxiliary parameter(s):
##       Median MAD_SD
## sigma 7.8    0.3   
## 
## ------
## * For help interpreting the printed output see ?print.stanreg
## * For info on the priors used see ?prior_summary.stanreg
```

---

## Wie tickt `stan_glm()`?

.left-column[
</br>
</br>
</br>
<img src="https://mc-stan.org/rstanarm/reference/figures/stanlogo.png" width="100%" style="display: block; margin: auto;" />

]

.right-column[
- *Stan* ist eine Software zur Berechnung von Bayesmodellen; das Paket `rstanarm` stellt Stan für uns bereit.
- `stan_glm()` ist für die Berechnung von Regressionsmodellen ausgelegt.
- Will man nur die Verteilung einer Variablen (wie `heights`) schätzen, so hat man man ... eine Regression ohne Prädiktor.
- Eine Regression ohne Prädiktor schreibt man auf Errisch so: `y ~ 1`. Die `1` steht also für die nicht vorhandene UV; `y` meint die AV (`height`).
- `MAD_SD` ist eine robuste Version der Streuung, mit inhaltlich gleicher Aussage
- `(Intercept)` (Achsenabschnitt) gibt den Mittelwert an.

]

[Dokumentation RstanARM](https://mc-stan.org/rstanarm/)

---

## Stichproben aus der Posteriori-Verteilung ziehen

```r
post_m41 <- as_tibble(m41)
print(post_m41)
```

<div id="jtfyegelcu" style="overflow-x:auto;overflow-y:auto;width:auto;height:auto;">
<style>html {
  font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Helvetica Neue', 'Fira Sans', 'Droid Sans', Arial, sans-serif;
}

#jtfyegelcu .gt_table {
  display: table;
  border-collapse: collapse;
  margin-left: auto;
  margin-right: auto;
  color: #333333;
  font-size: 16px;
  font-weight: normal;
  font-style: normal;
  background-color: #FFFFFF;
  width: auto;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #A8A8A8;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #A8A8A8;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
}

#jtfyegelcu .gt_heading {
  background-color: #FFFFFF;
  text-align: center;
  border-bottom-color: #FFFFFF;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#jtfyegelcu .gt_title {
  color: #333333;
  font-size: 125%;
  font-weight: initial;
  padding-top: 4px;
  padding-bottom: 4px;
  border-bottom-color: #FFFFFF;
  border-bottom-width: 0;
}

#jtfyegelcu .gt_subtitle {
  color: #333333;
  font-size: 85%;
  font-weight: initial;
  padding-top: 0;
  padding-bottom: 6px;
  border-top-color: #FFFFFF;
  border-top-width: 0;
}

#jtfyegelcu .gt_bottom_border {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#jtfyegelcu .gt_col_headings {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
}

#jtfyegelcu .gt_col_heading {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 6px;
  padding-left: 5px;
  padding-right: 5px;
  overflow-x: hidden;
}

#jtfyegelcu .gt_column_spanner_outer {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: normal;
  text-transform: inherit;
  padding-top: 0;
  padding-bottom: 0;
  padding-left: 4px;
  padding-right: 4px;
}

#jtfyegelcu .gt_column_spanner_outer:first-child {
  padding-left: 0;
}

#jtfyegelcu .gt_column_spanner_outer:last-child {
  padding-right: 0;
}

#jtfyegelcu .gt_column_spanner {
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: bottom;
  padding-top: 5px;
  padding-bottom: 5px;
  overflow-x: hidden;
  display: inline-block;
  width: 100%;
}

#jtfyegelcu .gt_group_heading {
  padding: 8px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
}

#jtfyegelcu .gt_empty_group_heading {
  padding: 0.5px;
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  vertical-align: middle;
}

#jtfyegelcu .gt_from_md > :first-child {
  margin-top: 0;
}

#jtfyegelcu .gt_from_md > :last-child {
  margin-bottom: 0;
}

#jtfyegelcu .gt_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  margin: 10px;
  border-top-style: solid;
  border-top-width: 1px;
  border-top-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 1px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 1px;
  border-right-color: #D3D3D3;
  vertical-align: middle;
  overflow-x: hidden;
}

#jtfyegelcu .gt_stub {
  color: #333333;
  background-color: #FFFFFF;
  font-size: 100%;
  font-weight: initial;
  text-transform: inherit;
  border-right-style: solid;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
  padding-left: 12px;
}

#jtfyegelcu .gt_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#jtfyegelcu .gt_first_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
}

#jtfyegelcu .gt_grand_summary_row {
  color: #333333;
  background-color: #FFFFFF;
  text-transform: inherit;
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
}

#jtfyegelcu .gt_first_grand_summary_row {
  padding-top: 8px;
  padding-bottom: 8px;
  padding-left: 5px;
  padding-right: 5px;
  border-top-style: double;
  border-top-width: 6px;
  border-top-color: #D3D3D3;
}

#jtfyegelcu .gt_striped {
  background-color: rgba(128, 128, 128, 0.05);
}

#jtfyegelcu .gt_table_body {
  border-top-style: solid;
  border-top-width: 2px;
  border-top-color: #D3D3D3;
  border-bottom-style: solid;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
}

#jtfyegelcu .gt_footnotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#jtfyegelcu .gt_footnote {
  margin: 0px;
  font-size: 90%;
  padding: 4px;
}

#jtfyegelcu .gt_sourcenotes {
  color: #333333;
  background-color: #FFFFFF;
  border-bottom-style: none;
  border-bottom-width: 2px;
  border-bottom-color: #D3D3D3;
  border-left-style: none;
  border-left-width: 2px;
  border-left-color: #D3D3D3;
  border-right-style: none;
  border-right-width: 2px;
  border-right-color: #D3D3D3;
}

#jtfyegelcu .gt_sourcenote {
  font-size: 90%;
  padding: 4px;
}

#jtfyegelcu .gt_left {
  text-align: left;
}

#jtfyegelcu .gt_center {
  text-align: center;
}

#jtfyegelcu .gt_right {
  text-align: right;
  font-variant-numeric: tabular-nums;
}

#jtfyegelcu .gt_font_normal {
  font-weight: normal;
}

#jtfyegelcu .gt_font_bold {
  font-weight: bold;
}

#jtfyegelcu .gt_font_italic {
  font-style: italic;
}

#jtfyegelcu .gt_super {
  font-size: 65%;
}

#jtfyegelcu .gt_footnote_marks {
  font-style: italic;
  font-weight: normal;
  font-size: 65%;
}
</style>
<table class="gt_table">
  
  <thead class="gt_col_headings">
    <tr>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">(Intercept)</th>
      <th class="gt_col_heading gt_columns_bottom_border gt_right" rowspan="1" colspan="1">sigma</th>
    </tr>
  </thead>
  <tbody class="gt_table_body">
    <tr><td class="gt_row gt_right">154.9014</td>
<td class="gt_row gt_right">7.951013</td></tr>
    <tr><td class="gt_row gt_right">154.9072</td>
<td class="gt_row gt_right">8.072041</td></tr>
    <tr><td class="gt_row gt_right">155.0016</td>
<td class="gt_row gt_right">7.523712</td></tr>
    <tr><td class="gt_row gt_right">154.1477</td>
<td class="gt_row gt_right">7.599040</td></tr>
    <tr><td class="gt_row gt_right">154.3553</td>
<td class="gt_row gt_right">7.690318</td></tr>
    <tr><td class="gt_row gt_right">154.7677</td>
<td class="gt_row gt_right">7.683235</td></tr>
  </tbody>
  
  
</table>
</div>

]

Mit welcher Wahrscheinlichkeit ist `$\mu>155$`?

```r
names(post_m41) <- 
  c("mu", "sigma")

post_m41 %>% 
  count(mu > 155) %>% 
  mutate(prop = n/sum(n))
```

```
## # A tibble: 2 × 3
##   `mu > 155`     n  prop
##   <lgl>      <int> <dbl>
## 1 FALSE       3224 0.806
## 2 TRUE         776 0.194
```

]

---

## Antworten von der Posteriori-Verteilung

.pull-left[
Welche mittlere Körpergröße wird mit 95% Wahrscheinlichkeit nicht überschritten, laut dem Modell `m41`?

```r
post_m41 %>% 
  summarise(
    q95 = 
      quantile(mu, .95))
```

```
## # A tibble: 1 × 1
##     q95
##   <dbl>
## 1  155.
```

]

```r
post_m41 %>% 
  summarise(
    pi_90 =
      quantile(mu, c(0.05,
                     0.95)))
```

```
## # A tibble: 2 × 1
##   pi_90
##   <dbl>
## 1  154.
## 2  155.
```

]

</br>
</br>
</br>

🏋️ Ähnliche Fragen bleiben als Übung für die Leseris 🤓.

---

## Standard-Prioriwerte bei `stan_glm()` 1/3

```r
prior_summary(m41)
```

```
## Priors for model 'm41' 
## ------
## Intercept (after predictors centered)
##   Specified prior:
##     ~ normal(location = 155, scale = 2.5)
##   Adjusted prior:
##     ~ normal(location = 155, scale = 19)
## 
## Auxiliary (sigma)
##   Specified prior:
##     ~ exponential(rate = 1)
##   Adjusted prior:
##     ~ exponential(rate = 0.13)
## ------
## See help('prior_summary.stanreg') for more details
```

---

## Standard-Prioriwerte bei `stan_glm()` 2/3

- `stan_glm()` verwendet (in der Voreinstellung) *schwach informative* Priori-Werte, die nur wenig Vorabwissen in das Modell geben.
- Es werden dafür die Stichproben-Daten als Priori-Daten verwendet.
- Man sollte diese Standardwerte als Minimalvorschlag sehen. Kennt man sich im Sachgebiet aus, kann man meist bessere Prioris finden.
- Die Voreinstellung hat keinen tiefen Hintergrund; andere Werte wären auch denkbar.

- `Intercept`: `$\mu$`, der Mittelwert der Verteilung `$X$`
    - `$\mu \sim \mathcal{N}(\bar{X}, sd(X)\cdot 2.5)$`
    - als Streuung von `$\mu$` wird die 2.5-fache Streuung der Stichprobe angenommen.

- `Auxiliary (sigma)`: `$\sigma$`, die Streuung der Verteilung `$X$`
    - `$\sigma \sim \mathcal{E}(\lambda=1/sd(X))$`
    - als Streuung von `$h_i$` wird 7.8 angenommen.
    
    
    
---

## Visualisierung verschiedener Exponentialverteilungen

---

## Modell `m42`: unsere Priori-Werte

```r
m42 <- 
  stan_glm(height ~ 1, 
           prior_intercept = normal(178, 20),  # mu
           prior_aux = exponential(0.1),  # sigma
           refresh = FALSE,  # bitte nicht so viel Ausgabe drucken
           data = d2)
print(m42)
```

```
## stan_glm
##  family:       gaussian [identity]
##  formula:      height ~ 1
##  observations: 346
##  predictors:   1
## ------
##             Median MAD_SD
## (Intercept) 154.7    0.4 
## 
## Auxiliary parameter(s):
##       Median MAD_SD
## sigma 7.8    0.3   
## 
## ------
## * For help interpreting the printed output see ?print.stanreg
## * For info on the priors used see ?prior_summary.stanreg
```

---

## Posteriori-Verteilung plotten

```r
library(bayesplot)
plot(m42,
     pars = "(Intercept)")

plot(m42,
     pars = "sigma")

#plot(m42)
```

Im Standard werden Mediane und 50%- sowie 90%-Perzentilintervalle gezeigt, [s. Doku](https://mc-stan.org/rstanarm/reference/plot.stanreg.html).
]

.pull-right[
<img src="Kapitel_4_chunk-img/unnamed-chunk-36-1.png" width="100%" style="display: block; margin: auto;" /><img src="Kapitel_4_chunk-img/unnamed-chunk-36-2.png" width="100%" style="display: block; margin: auto;" />

]

---

## Fazit
.pull-left[
- Wir haben die Posteriori-Verteilung für ein Gauss-Modell  berechnet.
- Dabei hatten wir ein einfaches Modell mit metrischer Zielvariablen, ohne Prädiktoren, betrachtet.
- Die Zielvariable, Körpergröße (`height`), haben wir als normalverteilt mit den Parametern `$\mu$` und `$\sigma$` angenommen.
- Für `$\mu$` und `$\sigma$` haben wir jeweils keinen einzelnen (fixen) Wert angenommen, sondern eine Wahrscheinlichkeitsverteilung, der mit der Priori-Verteilung für `$\mu$` bzw. `$\sigma$` festgelegt ist.
]

]

---

# Hinweise

---

## Zu diesem Skript

- Dieses Skript bezieht sich auf folgende [Lehrbücher](#literatur): 
  - Statistical Rethinking, Kapitel 4.1 - 4.3

- Dieses Skript wurde erstellt am 2021-11-08 14:08:29

- Lizenz: [CC-BY](https://creativecommons.org/licenses/by/4.0/)

- Autor ist Sebastian Sauer.

- Um diese HTML-Folien korrekt darzustellen, ist eine Internet-Verbindung nötig.

- Mit der Taste `?` bekommt man eine Hilfe über Shortcuts.

- Wenn Sie die Endung `.html` in der URL mit `.pdf` ersetzen, bekommen Sie die PDF-Version der Datei. Wenn Sie mit `.Rmd` ersetzen, den Quellcode.

- Eine PDF-Version kann erzeugt werden, indem man im Chrome-Browser druckt (Drucken als PDF).

---

## Literatur

<a
name=bib-freeman_visual_2006></a>[Freeman,
M.](#cite-freeman_visual_2006) (2006). "A
visual comparison of normal and paranormal
distributions". In: _Journal of
Epidemiology and Community Health_ 60.1,
p. 6.

<a
name=bib-gelman_regression_2021></a>[Gelman,
A., J. Hill, and A.
Vehtari](#cite-gelman_regression_2021)
(2021). _Regression and other stories_.
Analytical methods for social research.
Cambridge University Press.

<a
name=bib-mcelreath_statistical_2020></a>[McElreath,
R.](#cite-mcelreath_statistical_2020)
(2020). _Statistical rethinking: a
Bayesian course with examples in R and
Stan_. 2nd ed. CRC texts in statistical
science. Taylor and Francis, CRC Press.

<a name=bib-taleb2019technical></a>[Taleb,
N. N.](#cite-taleb2019technical) (2019).
_The statistical consequences of fat
tails, papers and commentaries_.

<a name=bib-stan></a>[Team, S.
D.](#cite-stan) (2021). _Stan Modeling
Language Users Guide and Reference Manual
Version 2.28_.