v4.0 launched this morning. By evening, the paper bot had completed 11 cycles, scanned 180 markets, and found 9 quoteable opportunities. The arbitrage thesis fired faster than I expected — and not where I expected. Three things changed today. Writing them down before they feel obvious.
v4.0's job, in one sentence: scan Polymarket every hour for binaries where the YES bid + NO bid sum to less than $1.00, and post bids on both sides so that a filled YES + NO pair redeems for $1.00 at maturity. "Quoteable" means the spread is wide enough that we'd post. "Filled" is a separate question; that's what the replay layer answers.
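A minimal sketch of that quoteable check, to make the definition concrete. The names `TopOfBook`, `bid_room`, and `is_quoteable` are hypothetical, and the 0.02 threshold is a placeholder, not the bot's actual `MIN_BID_ROOM` value or semantics.

```python
from dataclasses import dataclass

PAYOUT = 1.00  # a matched YES + NO pair redeems for $1.00


@dataclass(frozen=True)
class TopOfBook:
    """Best bids for one binary market, in dollars per share (hypothetical type)."""
    yes_bid: float
    no_bid: float


def bid_room(book: TopOfBook) -> float:
    """Dollars left over per pair if both bids fill and the pair redeems."""
    return PAYOUT - (book.yes_bid + book.no_bid)


def is_quoteable(book: TopOfBook, min_room: float = 0.02) -> bool:
    """True when the room clears the threshold (0.02 is a placeholder value)."""
    return bid_room(book) >= min_room


wide = TopOfBook(yes_bid=0.45, no_bid=0.47)    # 8 cents of room per pair
tight = TopOfBook(yes_bid=0.50, no_bid=0.495)  # half a cent of room per pair
print(is_quoteable(wide), is_quoteable(tight))
```

This only answers "would we post?". Whether both sides fill is the replay layer's question and is not modeled here.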
First-hour numbers (paper, all hypothetical, no real orders):
| Asset | Quotes | Capital (theoretical) | Proj P&L | Edge % |
|---|---|---|---|---|
| DOGE | 3 | $30 | +$5.94 | 19.8% |
| XRP | 3 | $30 | +$3.54 | 11.8% |
| BTC | 1 | $10 | +$0.58 | 5.8% |
| Total | 7 | $70 | +$10.06 | 14.4% |
Those are projection numbers — "if both sides fill at our hypothetical price." Realistic conversion will be lower, possibly much lower. The dashboard's replay panel will show that gap as data accumulates.
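For reference, the Edge % column is projected P&L divided by theoretical capital. A short sketch that recomputes the table from its own figures (the `ROWS` layout and `edge_pct` helper are just for illustration):

```python
# (quotes, theoretical capital in $, projected P&L in $), copied from the table
ROWS = {
    "DOGE": (3, 30.00, 5.94),
    "XRP": (3, 30.00, 3.54),
    "BTC": (1, 10.00, 0.58),
}


def edge_pct(capital: float, pnl: float) -> float:
    """Projected P&L as a percentage of the capital posted."""
    return 100.0 * pnl / capital


total_quotes = sum(q for q, _, _ in ROWS.values())
total_capital = sum(c for _, c, _ in ROWS.values())
total_pnl = sum(p for _, _, p in ROWS.values())

for asset, (quotes, capital, pnl) in ROWS.items():
    print(f"{asset}: {quotes} quotes, {edge_pct(capital, pnl):.1f}% edge")
print(f"Total: {total_quotes} quotes, ${total_capital:.0f}, "
      f"+${total_pnl:.2f}, {edge_pct(total_capital, total_pnl):.1f}% edge")
```

These remain projections under the "both sides fill" assumption. Nothing here estimates the fill rate.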
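I framed v4.0 as a "BTC binaries" experiment. The pivot writeup (entry #001) assumed Bitcoin Up/Down hourly markets would be the bread-and-butter. They're not: in the first-hour table, DOGE and XRP account for 6 of the 7 quotes and $9.48 of the $10.06 projected P&L.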
That changes the product framing. The thing being traded isn't "BTC" — it's Polymarket's hourly directional series, which exists for many assets, each with its own liquidity profile. Bitcoin is the most-shopped and the tightest. Alts are the loosest. Both are tradeable; the alts will produce more edge.
Practical change: I added DOGE and XRP to the explicit watchlist and reordered the keyword filter so
"up or down" leads. The per-asset keywords (BTC/ETH/SOL) stay as insurance against Polymarket
renaming the product or shipping new SKUs. Costs nothing to keep them.
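A rough sketch of what an ordered keyword filter like this could look like. The `KEYWORDS` list, the `first_match` helper, and the sample market titles are all hypothetical; the real filter and Polymarket's actual titles may differ.

```python
import re
from typing import Optional

# Series phrase first, per-asset tickers after it as a fallback.
KEYWORDS = ["up or down", "btc", "eth", "sol"]


def first_match(title: str) -> Optional[str]:
    """Return the first keyword found in the title, in priority order."""
    for kw in KEYWORDS:
        # Word boundaries stop "eth" from matching inside words like "whether".
        if re.search(rf"\b{re.escape(kw)}\b", title, re.IGNORECASE):
            return kw
    return None


# Hypothetical titles, for illustration only.
print(first_match("Dogecoin Up or Down - 3PM ET"))
print(first_match("Will BTC close higher this hour?"))
print(first_match("Whether it rains tomorrow"))
```

With the series phrase leading, an asset that is not on the ticker list still matches as long as the market title carries the "up or down" wording.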
The original STAKE_PER_SIDE was $5 — basically toy size. The math said $0.30–$1/day at full
saturation. That's fine for "does the strategy work at all" but it's too small to learn from.
Noise dominates signal. A 14% projected edge on a $5 stake is $0.70. Could be skill, could be a single market
tick.
So the stake goes from $5 to $50 per side: 10× the stake, 10× the dollar magnitude, same percentage. Now we're talking about $3–10/day projected, on $500 max-cycle theoretical capital. Still tiny vs the $9,400 paper bankroll, still inside the "science project" guardrail, but loud enough that the daily numbers actually mean something when we read them.
| Knob | Before | After |
|---|---|---|
| Stake per side | $5 | $50 |
| Per-pair capital | $10 | $100 |
| Max-cycle exposure | ~$50 | ~$500 |
| Phase B gate (replay $/day) | $0.30/day | $3/day |
The Phase B decision gate scales with stake — same percentage threshold, just larger dollar values. If the replay layer shows hypothetical realized P&L crossing $3/day over 48–72 hours, we move forward with live orders. Below that, we keep iterating in paper.
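The knob scaling and the gate can be sketched as below. This is an illustration under assumptions: the `Knobs` class, `scale`, and `passes_phase_b` are hypothetical names, the 5-pairs-per-cycle cap is inferred from "~$50 exposure at $10 per pair", and the gate is read as average replay P&L per day over a window of at least 48 hours.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Knobs:
    stake_per_side: float         # dollars posted on each of YES and NO
    gate_per_day: float           # Phase B threshold on replay P&L, dollars/day
    max_pairs_per_cycle: int = 5  # assumption: ~$50 exposure / $10 per pair

    @property
    def per_pair_capital(self) -> float:
        return 2 * self.stake_per_side

    @property
    def max_cycle_exposure(self) -> float:
        return self.per_pair_capital * self.max_pairs_per_cycle


def scale(knobs: Knobs, factor: float) -> Knobs:
    """Scale the dollar knobs together so the percentage threshold is unchanged."""
    return replace(
        knobs,
        stake_per_side=knobs.stake_per_side * factor,
        gate_per_day=knobs.gate_per_day * factor,
    )


def passes_phase_b(replay_pnl: float, hours: float, knobs: Knobs,
                   min_hours: float = 48.0) -> bool:
    """Average replay P&L per day must reach the gate over a long enough window."""
    if hours < min_hours:
        return False
    return replay_pnl / (hours / 24.0) >= knobs.gate_per_day


before = Knobs(stake_per_side=5.0, gate_per_day=0.30)
after = scale(before, 10)
print(after.per_pair_capital, after.max_cycle_exposure, after.gate_per_day)
```

Scaling the gate with the stake keeps the decision the same in percentage terms. Only the dollar values read off the dashboard get larger.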
MIN_BID_ROOM.
Tomorrow morning (2026-05-17) the first full overnight will be in. Three possible outcomes:
MIN_BID_ROOM threshold, narrow universe.
Either way, the next entry has the answer. The whole point of building this in public is to publish what the data says, not what I hoped it would say.