mirror of
https://github.com/velocitatem/PHANTOM.git
synced 2026-05-31 16:43:36 +00:00
yapping
This commit is contained in:
@@ -1,5 +1,8 @@
|
|||||||
\section{Methodology}
|
\section{Methodology}
|
||||||
|
|
||||||
|
% Extra notes and clarifications: we observed some humans and get their transition probabilities between event types
|
||||||
|
% We modify behavioral profiles of transition matrices with price elasticity matrices generated by sample valuations of a distributing.
|
||||||
|
|
||||||
This section details the theoretical and practical framework developed to address dynamic pricing under the influence of non-human actors. We begin by formalizing the problem environment and the nature of the actors. We then derive the \textit{Cost of Information} (COI) theorem, proving the erosion of pricing power in the limit of agent saturation. Following this, we outline our generative contamination strategy using GOFAI-driven separability and transition probability learning. Finally, we formulate the robust control problem as a Stackelberg game solved via Distributionally Robust Reinforcement Learning (DR-RL) with constructed ambiguity sets.
|
This section details the theoretical and practical framework developed to address dynamic pricing under the influence of non-human actors. We begin by formalizing the problem environment and the nature of the actors. We then derive the \textit{Cost of Information} (COI) theorem, proving the erosion of pricing power in the limit of agent saturation. Following this, we outline our generative contamination strategy using GOFAI-driven separability and transition probability learning. Finally, we formulate the robust control problem as a Stackelberg game solved via Distributionally Robust Reinforcement Learning (DR-RL) with constructed ambiguity sets.
|
||||||
|
|
||||||
\subsection{Problem Formalization}
|
\subsection{Problem Formalization}
|
||||||
@@ -36,15 +39,17 @@ where $\alpha \in [0, 1]$ represents the contamination parameter (proportion of
|
|||||||
|
|
||||||
\subsection{Cost of Information (COI) Framework}
|
\subsection{Cost of Information (COI) Framework}
|
||||||
|
|
||||||
The \textit{Cost of Information} (COI) represents the markup a pricing policy $\pi$ attempts to extract from the market by leveraging demand signals. We define COI as the expected premium over the minimum viable price $\underline{p}$ (or marginal cost). This also speaks to the financial urgency as a consequence of information asymmetry between the platform and the actors.
|
The platform's pricing power comes from information asymmetry: users who express strong interest signals pay more than the base price. We quantify this markup as the \textit{Cost of Information} (COI), which represents the average premium extracted above marginal cost. COI measures the revenue at risk when information asymmetry collapses.
|
||||||
|
|
||||||
\begin{definition}[Cost of Information]
|
\begin{definition}[Cost of Information]
|
||||||
Let $\pi(\tau)$ be a pricing policy mapping interaction histories to prices. The COI is defined as:
|
Let $\pi(\tau)$ be a pricing policy mapping interaction histories to prices. The COI is defined as:
|
||||||
\begin{align}
|
\begin{equation}
|
||||||
\text{COI} &= \mathbb{E}[P] - \underline{p} \\
|
\text{COI} = \mathbb{E}[P] - \underline{p}
|
||||||
&= \int_{\underline{p}}^{\bar{p}} (1 - F_\pi(p)) \, dp
|
\end{equation}
|
||||||
\end{align}
|
where $\mathbb{E}[P]$ is the expected price charged by the policy and $\underline{p}$ is the minimum viable price (marginal cost).
|
||||||
where $F_\pi(p)$ is the cumulative distribution function of prices generated by $\pi$ under standard operating conditions.
|
% Alternative survival function representation (used in proof):
|
||||||
|
% COI = \int_{\underline{p}}^{\bar{p}} (1 - F_\pi(p)) \, dp
|
||||||
|
% where F_\pi(p) is the CDF of prices generated by \pi
|
||||||
\end{definition}
|
\end{definition}
|
||||||
|
|
||||||
\begin{figure}[ht]
|
\begin{figure}[ht]
|
||||||
@@ -86,37 +91,24 @@ Let $N$ be the number of independent, utility-maximizing agents querying the pla
|
|||||||
\end{theorem}
|
\end{theorem}
|
||||||
|
|
||||||
\begin{proof}
|
\begin{proof}
|
||||||
Let $p_1, \ldots, p_N$ be independent and identically distributed (i.i.d.) price samples drawn from the policy's distribution $F(p)$ with support $[\underline{p}, \bar{p}]$. The realizable price for an optimal searching agent is the first order statistic $p_{(1)} = \min(p_1, \ldots, p_N)$.
|
Consider $N$ independent agents querying the platform, each receiving a price sample $p_i$ drawn from the pricing policy's distribution $F(p)$ with support $[\underline{p}, \bar{p}]$. A strategic agent conducting reconnaissance will select the minimum observed price: $p_{(1)} = \min(p_1, \ldots, p_N)$.
|
||||||
|
|
||||||
The survival function (or reliability function) of the minimum price is given by:
|
The probability that the minimum price exceeds some threshold $t$ is:
|
||||||
\begin{equation}
|
\begin{equation}
|
||||||
S_{p_{(1)}}(t) = P(p_{(1)} > t) = [1 - F(t)]^N
|
P(p_{(1)} > t) = P(\text{all } p_i > t) = [1 - F(t)]^N
|
||||||
\end{equation}
|
\end{equation}
|
||||||
|
|
||||||
To determine the expected value $\mathbb{E}[p_{(1)}]$, we recall the property that for any continuous random variable $X$ with support $[A, B]$, the expectation can be expressed as the lower bound plus the integral of the survival function:
|
For any price $t > \underline{p}$, the CDF satisfies $F(t) > 0$, so $1 - F(t) < 1$. As $N$ grows, this probability decays exponentially: $[1 - F(t)]^N \to 0$.
|
||||||
|
|
||||||
|
The expected minimum price can be written as:
|
||||||
\begin{equation}
|
\begin{equation}
|
||||||
\mathbb{E}[X] = A + \int_{A}^{B} P(X > t) \, dt
|
\mathbb{E}[p_{(1)}] = \underline{p} + \int_{\underline{p}}^{\bar{p}} [1 - F(t)]^N \, dt
|
||||||
\end{equation}
|
\end{equation}
|
||||||
|
|
||||||
Applying this to our pricing statistic where the lower bound is $\underline{p}$:
|
Since the integrand vanishes as $N \to \infty$ for all $t > \underline{p}$, the integral converges to zero. Therefore:
|
||||||
\begin{align}
|
|
||||||
\mathbb{E}[p_{(1)}] &= \underline{p} + \int_{\underline{p}}^{\bar{p}} P(p_{(1)} > t) \, dt \\
|
|
||||||
&= \underline{p} + \int_{\underline{p}}^{\bar{p}} [1 - F(t)]^N \, dt
|
|
||||||
\end{align}
|
|
||||||
|
|
||||||
Since $F(t)$ is a valid CDF, for any $t > \underline{p}$, we have strict inequality $F(t) > 0$, implying $0 \le 1 - F(t) < 1$. By the properties of limits, as $N \to \infty$, the term $[1 - F(t)]^N$ converges to 0 pointwise for all $t > \underline{p}$.
|
|
||||||
|
|
||||||
Applying the Lebesgue Dominated Convergence Theorem (noting that the integrand is bounded by 1 on the finite interval $[\underline{p}, \bar{p}]$):
|
|
||||||
\begin{equation}
|
\begin{equation}
|
||||||
\lim_{N \to \infty} \int_{\underline{p}}^{\bar{p}} [1 - F(t)]^N \, dt = \int_{\underline{p}}^{\bar{p}} 0 \, dt = 0
|
\lim_{N \to \infty} \text{COI} = \lim_{N \to \infty} (\mathbb{E}[p_{(1)}] - \underline{p}) = 0
|
||||||
\end{equation}
|
\end{equation}
|
||||||
|
|
||||||
Substituting this back into the expression for COI:
|
|
||||||
\begin{align}
|
|
||||||
\lim_{N \to \infty} \text{COI} &= \lim_{N \to \infty} (\mathbb{E}[p_{(1)}] - \underline{p}) \\
|
|
||||||
&= \lim_{N \to \infty} \left( (\underline{p} + 0) - \underline{p} \right) \\
|
|
||||||
&= 0
|
|
||||||
\end{align}
|
|
||||||
\end{proof}
|
\end{proof}
|
||||||
|
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user