reintroducing our note :)

2026-07-15 17:43:36 +00:00 · 2026-02-14 21:49:40 +01:00
parent e8229ac313
commit d7657db287
1 changed files with 2 additions and 0 deletions
--- a/paper/src/chapters/03-methodology.tex
+++ b/paper/src/chapters/03-methodology.tex
@@ -297,6 +297,8 @@ To scale this to catalog-level pricing, we expand the base event transition matr
 \subsection{Second-Stage Classification}
 After contamination, we run a second classification stage. We remap events into a semantically aligned feature space, apply richer feature engineering, and retrain to obtain cleaner label probabilities across the full dataset. This classifier is then used directly in the reinforcement-learning reward structure.
 Now might be a good time to stand up and go for a quick walk before returning to the rest of this paper.
 \subsection{Distributionally Robust Reinforcement Learning (DR-RL)}