For Funds: What this project does
Market Synthesis is a research prototype that uses structured AI debate to identify potential macro regime shifts before they are fully priced.
In plain terms: it is an βearly warning + reasoningβ layer for portfolio managers and risk teams.
Soros-style market logic
The project follows a Soros-style lens: look for moments where market belief and structural reality diverge.
Typical setup:
is stable"] --> B["2. Policy + structural
constraints shift"] B --> C["3. Market positioning
anchored to old story"] C --> D["4. Credibility breaks"] D --> E["π₯ Fast, nonlinear
repricing"] MS["π Market Synthesis
detects the gap HERE"] -.->|"early warning"| C style A fill:#eef4ff,stroke:#4f7ee8,color:#0f2b57 style B fill:#fef3c7,stroke:#d97706,color:#78350f style C fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style D fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style E fill:#7f1d1d,stroke:#450a0a,color:#ffffff style MS fill:#dcfce7,stroke:#16a34a,color:#14532d
Market Synthesis flags the tension before credibility breaks
This is especially relevant in policy-sensitive markets such as FX, rates, sovereign risk, and macro-sensitive equities.
The problem it targets
Most big losses in macro portfolios come from regime breaks that are obvious only in hindsight:
- policy pivots
- liquidity shocks
- currency dislocations
- cross-asset narrative flips
Traditional models often detect these late because they focus on price history.
This system adds a narrative layer from news and macro context, then forces competing views to argue before producing a final score.
How it helps investment teams
- Pre-mortem signal: highlights events that deserve immediate PM attention.
- Gap characterization: describes what the market is getting wrong, affected assets, key tension, time horizon, and example actions β not specific trade instructions.
- Auditability: shows why the model reached the conclusion, including the full Prosecutor/Defender/Expert debate.
- Workflow fit: designed for weekly committee cadence, not high-frequency trading.
Why this approach is different
Instead of one model giving one opinion, the system uses 8 specialized agents with domain routing:
prompts to all 3"| PROS CLASS --> DEF CLASS --> EXPERT PROS["βοΈ Prosecutor
'This IS a real dislocation'"] DEF["π‘οΈ Defender
'This may be temporary
or mistimed'"] EXPERT["π¬ Domain Expert
Category-specific
specialist analysis"] PROS --> ADJ DEF --> ADJ EXPERT --> ADJ ADJ["βοΈ Adjudicator
Direction + horizon +
actionable score"] ADJ --> GAP["π Gap Characterization
Describes gap + affected
assets + example actions"] ADJ --> CAL["π Calibration Agent
Adjusts confidence
for reliability"] GAP --> OUT["π Final Output
Gap summary Β· Affected assets Β· Confidence"] CAL --> OUT OUT --> PM["π€ Portfolio Manager
uses as decision support"] style EVENT fill:#1d4ed8,stroke:#153ca8,color:#ffffff style CLASS fill:#f0f9ff,stroke:#0284c7,color:#0c4a6e style PROS fill:#dcfce7,stroke:#16a34a,color:#14532d style DEF fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style EXPERT fill:#fef3c7,stroke:#d97706,color:#78350f style ADJ fill:#e0e7ff,stroke:#4f46e5,color:#312e81 style GAP fill:#fef3c7,stroke:#d97706,color:#78350f style CAL fill:#f0f9ff,stroke:#0284c7,color:#0c4a6e style OUT fill:#1d4ed8,stroke:#153ca8,color:#ffffff style PM fill:#f8fafc,stroke:#94a3b8,color:#0f2b57
8-agent orchestration with domain-routed experts β every thesis faces a strong rebuttal before verdict
This structure reduces one-sided model bias and improves interpretability for humans.
Domain specialization
The system routes each event to 1 of 7 specialized experts:
| Domain | Expert Focus |
|---|---|
| Currency Peg | FX reserves, peg mechanics, carry trades |
| Commodity | Supply chains, physical markets, storage |
| Central Bank | Monetary policy, rate cycles, plumbing |
| Geopolitical | Sanctions, trade wars, regime changes |
| Market Structure | Leverage, short squeezes, margin cascades |
| Sovereign Debt | Debt sustainability, contagion, restructuring |
| Real Estate | Bubbles, valuations, reflexivity |
Each expert can be tuned independently β a currency peg specialist knows about reserve adequacy and intervention capacity, while a market structure specialist knows about hidden leverage and margin mechanics.
Why not just use one agent?
A single agent tends to produce a coherent story, but not a rigorous internal challenge.
In markets, that is dangerous: one plausible narrative can hide the opposite trade.
The adversarial setup improves decision quality because:
- every thesis is forced to face a strong rebuttal
- domain expertise is matched to the event type
- confidence is produced after contention, not before
- output is easier to defend in investment committee discussions
- failure modes become visible (timing uncertainty vs structural disagreement)
Why this matters in controversial macro episodes
In many high-profile market controversies, price is driven by a fight between:
- policy intent vs policy capacity
- official narrative vs private positioning
- short-term defense vs long-term sustainability
The orchestration is explicitly designed to model that conflict instead of hiding it behind one βfinal answer.β
Evidence: Feed-Driven Backtest with Adversarial Exit (Real News Data)
The latest validation uses only real news feeds β no hand-crafted events. The system discovers events from cached headlines the same way it would in production, then manages positions with an LLM-based exit system.
| Metric | Value |
|---|---|
| Coverage | 2014-01-01 β 2026-05-01 (12.3 years) |
| Headlines indexed | 626,245 (Guardian + NYT + GDELT + FOMC + EDGAR) |
| Signals in store | 635 |
| Signals backtested | 303 (those with price data) |
| Mean P&L per trade | +7.98% |
| Alpha over hold-to-time | +3.11% |
| Worst drawdown | -19.3% (vs -57.3% mechanical) |
| Estimated annual return | ~22% p.a. (10% allocation) |
| Recall on 29 curated crises | 0.759 |
What these numbers mean for a PM:
- +7.98% average per trade, ~22% annualized. For each signal, the system enters the category-appropriate instrument and uses an adversarial LLM exit check (βis the gap still open?β) to decide when to close. These are real P&L numbers from yfinance historical prices β verifiable by re-running the backtest.
- Tail risk dramatically reduced. The adversarial exit cuts worst-case drawdown from -57.3% to -19.3% β it recognizes when a thesis has failed and exits before catastrophic loss.
- ~3 in 4 major crises caught. The system misses 7 of 29 hand-curated events (Swiss Franc 2015, Rusal 2018, Repo 2019, Archegos 2021, Terra-Luna 2022, Red Sea 2023, Yen Carry 2024). Itβs a decision-support layer, not a replacement for manual coverage.
- Commodity and currency peg are the strongest categories β classic Soros-style trades with +32.8% and +16.8% mean returns respectively.
- Direction, instrument, and exit logic attached to every signal β ready for review, not just a score.
P&L by Category
| Category | N | Mean P&L | Alpha vs Hold | Best Instruments |
|---|---|---|---|---|
| Commodity | 45 | +32.8% | +11.0% | CL=F, BZ=F, NG=F |
| Currency Peg | 24 | +16.8% | +13.9% | USDRUB=X, USDTRY=X |
| Sovereign Debt | 111 | +3.2% | +1.2% | TLT, DX-Y.NYB, EMB |
| Geopolitical | 35 | +3.2% | -0.3% | GC=F, country ETFs |
| Central Bank | 67 | +0.9% | -0.4% | FX pairs |
| Real Estate | 21 | +0.6% | +1.1% | VNQ, GBPUSD=X |
The system validates against category-specific instruments (FX pairs, commodity futures, bond ETFs, country funds) β not a blanket proxy like VIX.
Practical User Guide for Fund Managers
What You Get
Each signal from Market Synthesis provides:
- Event name and category β what the system detected
- Direction β long or short
- Confidence score (0-100) β how strongly the adversarial debate agreed
- Entry recommendation β when to enter (immediate, wait, or confirm)
- Position sizing β based on score and category R:R
- Stop-loss β category-specific initial stop
- Hold period β expected time to peak profit
- Exit triggers β headline signals for partial profit-taking
How to Act on a Signal
Step 1: Receive signal (score >= 70)
Check the category and apply entry rule:
| Category | Action |
|---|---|
| Central Bank / Geopolitical / Currency Peg / Real Estate | Enter at next market open |
| Sovereign Debt | Wait for 2 consecutive days of favorable price action |
| Commodity | Wait 3 days for initial volatility to settle |
Step 2: Size the position
| Score | Allocation |
|---|---|
| 90-100 | Full size (adjusted for category R:R) |
| 80-89 | 75% of full size |
| 70-79 | 50% of full size |
Reduce sizing for weak R:R categories (real_estate, currency_peg) by additional 40%.
Step 3: Set initial stop-loss
Sovereign: -6%, Commodity: -7%, Central Bank: -7%, Geopolitical: -5%, Real Estate: -8%, Currency Peg: -6%.
Step 4: Manage the position
- Day 5: If profitable, move stop to breakeven
- First headline exit signal (policy response, exhaustion, containment): Reduce 30%
- Signal cluster (2+ signals in 3 days): Reduce another 30%
- Category peak day: Close remaining
Step 5: Watch for resolution signals
If subsequent headlines indicate crisis resolution (bailout approved, intervention), this is a separate long opportunity β but requires full system conviction, not just a headline scan.
What Not to Do
- Do not enter against the system direction on a high-score signal
- Do not ignore stop-losses (max drawdown averages -6% but can reach -10%+)
- Do not hold past the category peak day without a strong reason
- Do not treat exit signals as full-exit triggers β they mean βreduce,β not βcloseβ
Expected Use Inside a Fund
This is best used as a decision support module, not autonomous execution:
- morning/weekly macro risk review
- watchlist prioritization
- scenario discussion for portfolio hedges
- challenge function against house view
What a pilot can look like
Shadow Mode
No capital impact
Log signal quality"] --> P2["π Phase 2
Controlled Adoption
Watchlist ranking
Hedge discussion only"] --> P3["π Phase 3
Integrated Workflow
IC/risk dashboard
Governance controls"] style P1 fill:#eef4ff,stroke:#4f7ee8,color:#0f2b57 style P2 fill:#fef3c7,stroke:#d97706,color:#78350f style P3 fill:#dcfce7,stroke:#16a34a,color:#14532d
Low-risk adoption path β prove value in shadow mode before capital impact
Commercialization path
A practical pilot model:
- Integrate the signal feed into your existing research/risk workflow.
- Run in shadow mode against your current process.
- Measure uplift in hit rate, drawdown reduction, and decision speed.
- Move to production only after governance sign-off.
Important note
This is research software and does not provide investment advice.
Any live deployment should include risk limits, human oversight, and model governance.