|
|
5912062dc0
|
new trainer image
|
2026-02-19 13:03:25 +01:00 |
|
|
|
843564eeb0
|
TPU startup scripts
|
2026-02-19 13:03:03 +01:00 |
|
|
|
9acc998cc9
|
fixing models for gcp
|
2026-02-17 16:54:55 +01:00 |
|
|
|
802f31b4a1
|
adding naive jax and libraries and make adjustments
|
2026-02-17 14:48:18 +01:00 |
|
|
|
66c4a0cd1d
|
chore: fix chips used
|
2026-02-17 14:46:43 +01:00 |
|
|
|
244af9ac09
|
citing compute
|
2026-02-17 14:46:34 +01:00 |
|
|
|
76c31a2abd
|
citing marc and
|
2026-02-17 09:40:20 +01:00 |
|
|
|
64ee7e6d9b
|
forcing light mode
|
2026-02-16 11:30:18 +01:00 |
|
|
|
1e04a928aa
|
migrated new banner
|
2026-02-15 17:31:31 +01:00 |
|
|
|
9b133cddfd
|
introduce penalized sessions to episodes
|
2026-02-15 17:15:25 +01:00 |
|
|
|
ded7290935
|
hidef banner rendering
|
2026-02-15 17:12:12 +01:00 |
|
|
|
8e4dd59f90
|
banner rendering
|
2026-02-15 17:10:16 +01:00 |
|
|
|
024f6d4132
|
banner addition
|
2026-02-15 17:10:13 +01:00 |
|
|
|
2b47c3499a
|
chore: fixing discretization of actions
|
2026-02-15 15:45:46 +01:00 |
|
|
|
ef1d1f6557
|
fixing assumption definition
|
2026-02-14 21:54:42 +01:00 |
|
|
|
d7657db287
|
reintroducing our note :)
|
2026-02-14 21:49:40 +01:00 |
|
|
|
e8229ac313
|
updating methodology with better refelction
|
2026-02-14 15:20:38 +01:00 |
|
|
|
bc6c481d03
|
minor refactors to codebase to implement DRO
|
2026-02-14 14:53:30 +01:00 |
|
|
|
895eea5674
|
imporving methodology and adding onto it
|
2026-02-14 14:28:18 +01:00 |
|
|
|
fba2a9739e
|
updating paper details
|
2026-02-14 13:13:00 +01:00 |
|
|
|
d1aa13360f
|
cleaning refactors
|
2026-02-13 21:03:02 +01:00 |
|
|
|
f6f9729424
|
improving expression of ideas from dump
|
2026-02-10 18:12:49 +01:00 |
|
|
|
29a13340b9
|
hotfix: updating pricing provider to better read data
|
2026-02-06 12:01:12 +01:00 |
|
|
|
e22286371f
|
feat: proportiona lrevenu
|
2026-02-06 11:54:23 +01:00 |
|
|
|
e44feb7da0
|
updaing coi definition
|
2026-02-05 12:47:13 +01:00 |
|
|
|
ebd2378859
|
yapping
|
2026-02-05 12:28:26 +01:00 |
|
|
|
c4d82b2ecc
|
rescaling the graph
|
2026-02-02 16:55:06 +01:00 |
|
|
|
a9e2e7cbf3
|
improving on the methodlology
|
2026-02-02 16:52:50 +01:00 |
|
|
|
e0b074161b
|
fix: typo
|
2026-02-02 12:08:24 +01:00 |
|
|
|
08c0afb55a
|
chore: add chart of supra competive pricing
|
2026-02-02 12:03:30 +01:00 |
|
|
|
c4fd1352c9
|
naoice COI implementation
|
2026-02-02 11:18:37 +01:00 |
|
|
|
4abef97bf7
|
chore: adding simulation logging with wandb
|
2026-01-31 16:21:10 +01:00 |
|
|
|
33cb0d7e95
|
feature: refactored demand splitting and implementation
|
2026-01-31 12:56:48 +01:00 |
|
|
|
e8ef850089
|
feat: introduced simple COI proxy
|
2026-01-31 12:06:48 +01:00 |
|
|
|
e7cb48e9cd
|
chore: updating paper
|
2026-01-31 10:47:12 +01:00 |
|
Daniel Alves Rösel
|
dba8f3fafa
|
Merge pull request #44 from velocitatem/agent-behavior-loader-developemen
Agent behavior loader developement + rl loop definition and e2e tests.
|
2026-01-31 10:21:54 +01:00 |
|
Daniel Alves Rösel
|
9843c5deab
|
Merge pull request #51 from velocitatem/feat-strong-learning-implementation-with-data-contamination
Feat strong learning implementation with data contamination
|
2026-01-31 10:15:09 +01:00 |
|
|
|
13959e4b28
|
chore: bug fixes
|
2026-01-31 10:13:07 +01:00 |
|
Daniel Alves Rösel
|
2f481bd94b
|
Merge branch 'agent-behavior-loader-developemen' into feat-strong-learning-implementation-with-data-contamination
|
2026-01-31 10:08:59 +01:00 |
|
|
|
72877439ca
|
feat: contaminator and training
|
2026-01-31 09:48:20 +01:00 |
|
|
|
0f5f8affab
|
chore: make lib backwards compatible
|
2026-01-31 09:48:20 +01:00 |
|
|
|
ee70f02a1f
|
chore: export repeated methods into lib
|
2026-01-31 09:48:20 +01:00 |
|
|
|
22a2c255bd
|
chore: remove boilerplate
|
2026-01-31 09:48:20 +01:00 |
|
|
|
ccc19f3493
|
acapting some architectures
|
2026-01-31 09:48:20 +01:00 |
|
|
|
00e3eff2fa
|
migrating weak learning
|
2026-01-31 09:48:20 +01:00 |
|
|
|
440371dba4
|
feat: initial feature engineering of trajectories
|
2026-01-31 09:48:20 +01:00 |
|
|
|
b05b510f70
|
strong dataset gathering
|
2026-01-31 09:48:20 +01:00 |
|
|
|
04907df393
|
feat: weak train scaffold
|
2026-01-31 09:48:20 +01:00 |
|
|
|
b2f0746c01
|
chore: extra commenting
|
2026-01-31 09:48:20 +01:00 |
|
|
|
7b2d80ac4c
|
feat: wip contaminator
|
2026-01-31 09:48:20 +01:00 |
|