added comparison without ökologie

2026-02-28 16:12:09 +01:00
parent 9e50db2da6
commit b1039e6a7f
48 changed files with 977 additions and 2 deletions


@@ -286,6 +286,101 @@ Scatter plots of each Big Five trait against tutoring score gain, with regressio
---
## G. Effect Analysis
> Generated by `generate_plots_effects.py` → `Data/plots_effects/`
> Statistical exports → `Data/stats/effects_*.csv`, `Data/stats/outlier_influence.csv`
---
### G-F. Effect Without Ökologie (vs. With)
Ökologie has markedly higher pre-tutoring baselines (ceiling effects), which compresses gains for that topic. This section quantifies how much those ceiling effects suppress the observed effect sizes, and presents a full side-by-side comparison of all mediums with and without Ökologie included.
#### GF1 Cohen's d Comparison by Medium
![Cohen's d comparison](Data/plots_effects/F1_cohens_d_comparison.png)
Grouped bar chart of Cohen's d per medium under two conditions: All Topics and Excl. Ökologie. Each bar is annotated with the raw mean gain and significance stars. Reference lines mark the conventional small (0.2), medium (0.5), and large (0.8) effect size thresholds. Removing Ökologie consistently raises effect sizes for all mediums, with VR showing the largest absolute shift.
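The two conditions in this chart can be illustrated with a short sketch. This is not the code in `generate_plots_effects.py`: it assumes the paired-samples form of Cohen's d (mean gain divided by the sample SD of the gains), which the text does not specify, and the records and field names (`medium`, `topic`, `gain`) are hypothetical.

```python
from statistics import mean, stdev

# Hypothetical records for illustration only; not data from the study.
records = [
    {"medium": "VR", "topic": "Mendel", "gain": 30.0},
    {"medium": "VR", "topic": "DNA-Replikation", "gain": 40.0},
    {"medium": "VR", "topic": "Ökologie", "gain": 5.0},
    {"medium": "VR", "topic": "Mendel", "gain": 20.0},
    {"medium": "VR", "topic": "Ökologie", "gain": 0.0},
    {"medium": "VR", "topic": "DNA-Replikation", "gain": 35.0},
]


def cohens_d(gains):
    """Paired-samples d: mean gain / sample SD of the gains (assumed variant)."""
    return mean(gains) / stdev(gains)


def d_by_condition(records, medium, excluded_topic="Ökologie"):
    """Return Cohen's d for one medium, with and without the excluded topic."""
    all_gains = [r["gain"] for r in records if r["medium"] == medium]
    kept_gains = [
        r["gain"]
        for r in records
        if r["medium"] == medium and r["topic"] != excluded_topic
    ]
    return {
        "All Topics": cohens_d(all_gains),
        "Excl. Ökologie": cohens_d(kept_gains),
    }


result = d_by_condition(records, "VR")
```

In this toy data the near-zero Ökologie gains both lower the mean and widen the spread, so dropping them raises d. That is the pattern the chart describes, shown here with invented numbers.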
#### GF2 Mean Score Gain Comparison
![Mean score gain comparison](Data/plots_effects/F2_mean_gain_comparison.png)
Bar chart of the raw mean tutoring score gain per medium with 95% CI error bars, both conditions overlaid. Shows the absolute gain shift when Ökologie is excluded. VR and Chat benefit most from exclusion; Video's gain changes least, indicating that Video's Ökologie sessions were less strongly ceiling-affected.
#### GF3 Paired Slopes: All Topics vs. Excl. Ökologie
![Paired slopes comparison](Data/plots_effects/F3_paired_slopes_comparison.png)
A 2×3 grid (rows: All Topics / Excl. Ökologie; columns: Chat / Video / VR). Each panel shows individual Pre→Post-Tutoring lines colored by topic, the medium mean trajectory (thick diamond marker), and annotated t-test / Cohen's d / p-value. The bottom row directly reveals the cleaner separation in trajectories once the near-zero Ökologie gains are removed.
#### GF4 Gain Distribution Comparison
![Gain distribution comparison](Data/plots_effects/F4_gain_distribution_comparison.png)
Side-by-side violin + box plots, two per medium (All Topics / Excl. Ökologie). Shows the shift in median, spread, and the location of extreme values. For VR in particular, removing Ökologie tightens the distribution and raises the median, confirming Ökologie's pull toward zero.
#### GF5 Descriptive Statistics Table
![Stats table](Data/plots_effects/F5_stats_table.png)
Rendered table summarizing N, mean gain, SD, Cohen's d, t-statistic, and p-value for all 6 conditions (3 mediums × 2 topic sets) in one view.
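One row of such a table could be computed as sketched below. The sketch assumes a paired (one-sample) t-test on the gains and the paired-samples form of Cohen's d, neither of which the text states explicitly. The function name and the dictionary keys are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def describe_gains(gains):
    """Summarize one condition: N, mean gain, SD, Cohen's d, t-statistic."""
    n = len(gains)
    m = mean(gains)
    sd = stdev(gains)          # sample SD (n - 1 in the denominator)
    d = m / sd                 # assumed paired-samples Cohen's d
    t = m / (sd / sqrt(n))     # one-sample t on the gains; equals d * sqrt(n)
    return {"N": n, "mean_gain": m, "sd": sd, "cohens_d": d, "t": t}


# Toy gains for illustration only; not data from the study.
row = describe_gains([10.0, 20.0, 30.0])
```

Under these assumptions t and d differ only by a factor of √N, so a large d can still miss significance when N is small. The p-value would come from a t distribution with N − 1 degrees of freedom and is left out here to keep the sketch free of third-party dependencies.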
---
### G-G. Effect Per Topic
Full effect-size breakdown for each of the three topics independently, across all mediums combined.
#### GG1 Effect Per Topic (Gain + Cohen's d)
![Effect per topic](Data/plots_effects/G1_effect_per_topic.png)
Left panel: mean tutoring score gain with 95% CI error bars per topic, annotated with N and significance. Right panel: Cohen's d per topic with threshold reference lines. DNA-Replikation yields the largest effect (high starting deficit → large gain), Mendel is intermediate, and Ökologie is smallest (ceiling effects).
#### GG2 Paired Slopes per Topic
![Paired slopes per topic](Data/plots_effects/G2_slopes_per_topic.png)
Three-panel slope plot (one per topic), with lines colored by medium. Medium mean trajectories are drawn as thick diamond markers and labeled with per-medium gains. The overall t-test / d / p annotation summarizes the within-topic effect. Ökologie clearly shows compressed trajectories compared to DNA-Replikation.
---
### G-H. All Medium × Topic Combinations
#### GH1 3×3 Slope Grid (Medium × Topic)
![3x3 slope grid](Data/plots_effects/H1_medium_topic_grid.png)
A 3×3 grid with rows = mediums (Chat, Video, VR) and columns = topics (Mendel, DNA-Replikation, Ökologie). Each of the 9 cells shows individual participant Pre→Post-Tutoring slope lines (colored by topic), the medium mean (thick line), and the annotated effect size (d, p, n). This is the most granular view: VR × DNA-Replikation shows the largest gains while all mediums × Ökologie show compressed or near-zero gains.
---
### G-I. Outlier Influence Analysis
Outliers are defined using the 1.5×IQR rule applied per medium on tutoring `Score_Gain`.
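As an illustration of the rule, the sketch below flags gains that fall outside Q1 − 1.5×IQR or Q3 + 1.5×IQR, separately for each medium. The quartile convention (`method="inclusive"`) is an assumption, since the text does not say which one the script uses. The function name and the sample values are hypothetical.

```python
from statistics import quantiles


def iqr_outlier_flags(gains, k=1.5):
    """Return (fences, flags) for one medium using the k×IQR rule."""
    # Quartile convention is an assumption; other methods shift the fences slightly.
    q1, _, q3 = quantiles(gains, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    flags = [g < lower or g > upper for g in gains]
    return (lower, upper), flags


# Toy Score_Gain values per medium, for illustration only.
gains_by_medium = {
    "VR": [10, 12, 14, 16, 18, 90],
    "Chat": [8, 9, 10, 11, 12, 13],
}

# Apply the rule per medium, as described above.
outliers = {m: iqr_outlier_flags(g) for m, g in gains_by_medium.items()}
```

Because the fences are computed per medium, a gain that is extreme for one medium may be unremarkable in another with a wider spread.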
#### GI1 Score Gain Scatter with Outlier Flags
![Outlier scatter](Data/plots_effects/I1_outlier_scatter.png)
Jittered scatter of individual score gains per medium. IQR fence lines (1.5×IQR beyond the quartiles) are marked in red. Outlier points are highlighted in red and labeled with participant ID and topic name. P3/Ökologie (VR, +86.7 pp) is the most extreme single data point.
#### GI2 Outlier Influence on Effect Sizes
![Outlier effect comparison](Data/plots_effects/I2_outlier_effect_comparison.png)
Left: grouped bar chart of Cohen's d with All Data vs. Outliers Removed, annotated with raw gains and significance. Right: Δd bar chart showing the change in effect size after outlier removal per medium. A positive Δd means the outlier(s) were suppressing the true effect; a negative Δd means they were inflating it.
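The Δd quantity can be sketched as follows. The sign convention (d after removal minus d on all data) is inferred from the description above. The paired-samples form of Cohen's d and the inclusive quartile method are assumptions, and all names and values are hypothetical.

```python
from statistics import mean, quantiles, stdev


def cohens_d(gains):
    """Assumed paired-samples d: mean gain / sample SD of the gains."""
    return mean(gains) / stdev(gains)


def remove_iqr_outliers(gains, k=1.5):
    """Drop gains outside the k×IQR fences (quartile method is an assumption)."""
    q1, _, q3 = quantiles(gains, n=4, method="inclusive")
    iqr = q3 - q1
    return [g for g in gains if q1 - k * iqr <= g <= q3 + k * iqr]


def delta_d(gains):
    """Δd = d(outliers removed) − d(all data); positive means suppression."""
    return cohens_d(remove_iqr_outliers(gains)) - cohens_d(gains)


# Toy gains with one extreme high value, for illustration only.
gains = [10, 12, 14, 16, 18, 90]
change = delta_d(gains)
```

In this toy case Δd is positive even though the outlier is a large gain: it raises the mean, but it inflates the standard deviation far more, which shrinks d.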
#### GI3 Outlier Heatmap (Participant × Topic per Medium)
![Outlier heatmap](Data/plots_effects/I3_outlier_heatmap.png)
Heatmap of tutoring score gain for each participant × topic cell, one panel per medium. Color encodes gain magnitude (red→yellow→green). Cells with a red border are IQR outliers within that medium's distribution. Allows immediate identification of which participant-topic combinations drive extreme results.
---
## F. Questionnaire Analysis
> Questionnaires were administered at multiple phases: Pre-Reading, Post-Reading, Pre-Tutoring, and Post-Tutoring.