mirror of
https://github.com/velocitatem/PHANTOM.git
synced 2026-05-31 16:43:36 +00:00
banner addition
This commit is contained in:
@@ -316,6 +316,7 @@ This yields two centroid-like heuristics that guide contamination estimation at
|
||||
In implementation, we maintain an alternating game-history stack (our \textit{Limbo} stack) and execute it explicitly every epoch with exactly two transitions: first the platform publishes a price vector (leader move), then the market responds with trajectory-derived demand (follower move).
|
||||
|
||||
% Mention discretized action space and the clipping and over shotting in continuous action spaces
|
||||
% Also talk about catastrophic economics, we add termination on bankrupcy or zero demand so market collaps
|
||||
|
||||
\subsubsection{Ambiguity Set Construction}
|
||||
We define an ambiguity set $\mathcal{U}_\epsilon(\hat{P}_N)$ centered around our empirical reference distribution $\hat{P}_N$ (derived from the generator $\mathcal{G}$). We utilize the Wasserstein distance metric to define the set of plausible demand distributions the agent might face:
|
||||
|
||||
BIN
paper/src/graphics/banner.png
Normal file
BIN
paper/src/graphics/banner.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 41 KiB |
14
paper/src/graphics/banner.py
Normal file
14
paper/src/graphics/banner.py
Normal file
@@ -0,0 +1,14 @@
|
||||
from PIL import Image, ImageDraw, ImageFont
|
||||
|
||||
text = open("banner.txt", "r", encoding="utf-8").read()
|
||||
font = ImageFont.truetype("DejaVuSansMono.ttf", 18)
|
||||
|
||||
dummy = Image.new("RGB", (1, 1))
|
||||
d = ImageDraw.Draw(dummy)
|
||||
bbox = d.multiline_textbbox((0,0), text, font=font)
|
||||
w, h = bbox[2]-bbox[0], bbox[3]-bbox[1]
|
||||
|
||||
img = Image.new("RGB", (w+20, h+20), "white")
|
||||
d = ImageDraw.Draw(img)
|
||||
d.multiline_text((10,10), text, font=font, fill="black")
|
||||
img.save("banner.png")
|
||||
23
paper/src/graphics/banner.txt
Normal file
23
paper/src/graphics/banner.txt
Normal file
@@ -0,0 +1,23 @@
|
||||
Actors Trajectories
|
||||
■════■ interact ┌────────────┐ ┌──┐
|
||||
║Agent──────┬──────▻Web Platform├──┐ │τ1│ ┌▻Q (demand estimate)─┐
|
||||
║Human──────┘ └──────△─────┘ └──▻..│──┘ │
|
||||
╚════■ │ │τK│ │
|
||||
△ │ └──┘ │
|
||||
│motivate │ │
|
||||
└────────┐ │Setting ┌──┐ Pricing Engine │
|
||||
▲ ┌──┐│ │Prices │p1│ ┌──────────────┐ │
|
||||
│ ┌─┘ ││ └───────────┤..│◅────│▒▒▒▒▒▒▒▒▒▒▒▒▒▒│◅──┘
|
||||
│ │ └──┐ │pN│ └─────┬──┬─────┘
|
||||
│ ┌─┘ │ └──┘ │ │
|
||||
└─┴─────────┴─▶ │ │
|
||||
Private Valuations │ │
|
||||
│ │
|
||||
╔═══════════════════════════════════════════════════╧══╧════════╗
|
||||
║ Training Loop / SAC PPO DQN A2C ║
|
||||
║ ■═════════════════════════════■ ║
|
||||
║ Q̂_t,i = Σ_s Σ_k ω(a_s,k) · 1[i_s,k = i] │ ║
|
||||
║ f(τ') from KL( T' || T_H ) and KL( T' || T_A ) │ ║
|
||||
║ α* = argmin_{α ∈ Aε(α0)} [ Revenue(p, Q^α) - λ·COI_leak ] │ ║
|
||||
║ r_t = Revenue - λ·f(τ') | a* ▽ ║
|
||||
╚═══════════════════════════════════════════════════════════════╝
|
||||
Reference in New Issue
Block a user