diff --git a/paper/src/chapters/03-methodology.tex b/paper/src/chapters/03-methodology.tex index d2bc554..fef7957 100644 --- a/paper/src/chapters/03-methodology.tex +++ b/paper/src/chapters/03-methodology.tex @@ -297,6 +297,8 @@ To scale this to catalog-level pricing, we expand the base event transition matr \subsection{Second-Stage Classification} After contamination, we run a second classification stage. We remap events into a semantically aligned feature space, apply richer feature engineering, and retrain to obtain cleaner label probabilities across the full dataset. This classifier is then used directly in the reinforcement-learning reward structure. +Now might be a good time to stand up and go for a quick walk before returning to the rest of this paper. + \subsection{Distributionally Robust Reinforcement Learning (DR-RL)}