mirror of
https://github.com/velocitatem/PHANTOM.git
synced 2026-05-31 08:33:36 +00:00
reintroducing our note :)
@@ -297,6 +297,8 @@ To scale this to catalog-level pricing, we expand the base event transition matr
\subsection{Second-Stage Classification}
After contamination, we run a second classification stage. We remap events into a semantically aligned feature space, apply richer feature engineering, and retrain to obtain cleaner label probabilities across the full dataset. This classifier is then used directly in the reinforcement-learning reward structure.
Now might be a good time to stand up and go for a quick walk before returning to the rest of this paper.
\subsection{Distributionally Robust Reinforcement Learning (DR-RL)}