diff --git a/paper/src/chapters/04-results.tex b/paper/src/chapters/04-results.tex index f661bd3..2dce74b 100644 --- a/paper/src/chapters/04-results.tex +++ b/paper/src/chapters/04-results.tex @@ -92,19 +92,7 @@ In our complete training runs we logged $\approx 180$ days of net compute time. \label{fig:final_focus_coi_preservation_grid} \end{figure} -\begin{figure}[ht] - \centering - \input{chapters/figures/results/includes/final_focus_revenue_delta.tex} - \caption{Defended-minus-baseline revenue delta over contamination for the final cohort. The strongest high-contamination deviation begins at $\alpha=0.7$, followed by recovery toward near parity by $\alpha=1.0$.} - \label{fig:final_focus_revenue_delta} -\end{figure} -\begin{figure}[ht] - \centering - \input{chapters/figures/results/includes/final_focus_risk_deltas.tex} - \caption{Defended-minus-baseline leakage and volatility deltas for the final cohort. Leakage remains lower for the defended policy across the full contamination range.} - \label{fig:final_focus_risk_deltas} -\end{figure} \subsection{Interpretation and Insights} The Mann-Whitney result ($p<0.001$) confirms that per-session divergence gaps distinguish the two actor classes with near-zero overlap in rank ordering. This is the condition required for distinguishability to act as a useful control signal in the pricing loop rather than just an auxiliary classifier score.