From d7657db2871090b7c03502ad4740d531ab9dc7f6 Mon Sep 17 00:00:00 2001
From: Daniel Rosel
Date: Sat, 14 Feb 2026 21:49:40 +0100
Subject: [PATCH] reintroducing our note :)

---
 paper/src/chapters/03-methodology.tex | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/paper/src/chapters/03-methodology.tex b/paper/src/chapters/03-methodology.tex
index d2bc554..fef7957 100644
--- a/paper/src/chapters/03-methodology.tex
+++ b/paper/src/chapters/03-methodology.tex
@@ -297,6 +297,8 @@ To scale this to catalog-level pricing, we expand the base event transition matr
 \subsection{Second-Stage Classification}
 After contamination, we run a second classification stage. We remap events into a semantically aligned feature space, apply richer feature engineering, and retrain to obtain cleaner label probabilities across the full dataset. This classifier is then used directly in the reinforcement-learning reward structure.
 
+Now might be a good time to stand up and go for a quick walk before returning to the rest of this paper.
+
 \subsection{Distributionally Robust Reinforcement Learning (DR-RL)}