For Funds: What this project does

Market Synthesis is a research prototype that uses structured AI debate to identify potential macro regime shifts before they are fully priced.

In plain terms: it is an β€œearly warning + reasoning” layer for portfolio managers and risk teams.

Soros-style market logic

The project follows a Soros-style lens: look for moments where market belief and structural reality diverge.

Typical setup:

graph TD A["1. Consensus narrative
is stable"] --> B["2. Policy + structural
constraints shift"] B --> C["3. Market positioning
anchored to old story"] C --> D["4. Credibility breaks"] D --> E["πŸ’₯ Fast, nonlinear
repricing"] MS["πŸ” Market Synthesis
detects the gap HERE"] -.->|"early warning"| C style A fill:#eef4ff,stroke:#4f7ee8,color:#0f2b57 style B fill:#fef3c7,stroke:#d97706,color:#78350f style C fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style D fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style E fill:#7f1d1d,stroke:#450a0a,color:#ffffff style MS fill:#dcfce7,stroke:#16a34a,color:#14532d

Market Synthesis flags the tension before credibility breaks

This is especially relevant in policy-sensitive markets such as FX, rates, sovereign risk, and macro-sensitive equities.

The problem it targets

Most big losses in macro portfolios come from regime breaks that are obvious only in hindsight:

Traditional models often detect these late because they focus on price history.
This system adds a narrative layer from news and macro context, then forces competing views to argue before producing a final score.

How it helps investment teams

  1. Pre-mortem signal: highlights events that deserve immediate PM attention.
  2. Gap characterization: describes what the market is getting wrong, affected assets, key tension, time horizon, and example actions β€” not specific trade instructions.
  3. Auditability: shows why the model reached the conclusion, including the full Prosecutor/Defender/Expert debate.
  4. Workflow fit: designed for weekly committee cadence, not high-frequency trading.

Why this approach is different

Instead of one model giving one opinion, the system uses 8 specialized agents with domain routing:

graph TD EVENT["πŸ“° Market Event"] --> CLASS["πŸ“‹ Classifier"] CLASS -->|"routes category-specific
prompts to all 3"| PROS CLASS --> DEF CLASS --> EXPERT PROS["βš”οΈ Prosecutor
'This IS a real dislocation'"] DEF["πŸ›‘οΈ Defender
'This may be temporary
or mistimed'
"] EXPERT["πŸ”¬ Domain Expert
Category-specific
specialist analysis
"] PROS --> ADJ DEF --> ADJ EXPERT --> ADJ ADJ["βš–οΈ Adjudicator
Direction + horizon +
actionable score
"] ADJ --> GAP["πŸ” Gap Characterization
Describes gap + affected
assets + example actions
"] ADJ --> CAL["πŸ“Š Calibration Agent
Adjusts confidence
for reliability
"] GAP --> OUT["πŸ“Š Final Output
Gap summary Β· Affected assets Β· Confidence"] CAL --> OUT OUT --> PM["πŸ‘€ Portfolio Manager
uses as decision support"] style EVENT fill:#1d4ed8,stroke:#153ca8,color:#ffffff style CLASS fill:#f0f9ff,stroke:#0284c7,color:#0c4a6e style PROS fill:#dcfce7,stroke:#16a34a,color:#14532d style DEF fill:#fee2e2,stroke:#dc2626,color:#7f1d1d style EXPERT fill:#fef3c7,stroke:#d97706,color:#78350f style ADJ fill:#e0e7ff,stroke:#4f46e5,color:#312e81 style GAP fill:#fef3c7,stroke:#d97706,color:#78350f style CAL fill:#f0f9ff,stroke:#0284c7,color:#0c4a6e style OUT fill:#1d4ed8,stroke:#153ca8,color:#ffffff style PM fill:#f8fafc,stroke:#94a3b8,color:#0f2b57

8-agent orchestration with domain-routed experts β€” every thesis faces a strong rebuttal before verdict

This structure reduces one-sided model bias and improves interpretability for humans.

Domain specialization

The system routes each event to 1 of 7 specialized experts:

Domain Expert Focus
Currency Peg FX reserves, peg mechanics, carry trades
Commodity Supply chains, physical markets, storage
Central Bank Monetary policy, rate cycles, plumbing
Geopolitical Sanctions, trade wars, regime changes
Market Structure Leverage, short squeezes, margin cascades
Sovereign Debt Debt sustainability, contagion, restructuring
Real Estate Bubbles, valuations, reflexivity

Each expert can be tuned independently β€” a currency peg specialist knows about reserve adequacy and intervention capacity, while a market structure specialist knows about hidden leverage and margin mechanics.

Why not just use one agent?

A single agent tends to produce a coherent story, but not a rigorous internal challenge.
In markets, that is dangerous: one plausible narrative can hide the opposite trade.

The adversarial setup improves decision quality because:

  1. every thesis is forced to face a strong rebuttal
  2. domain expertise is matched to the event type
  3. confidence is produced after contention, not before
  4. output is easier to defend in investment committee discussions
  5. failure modes become visible (timing uncertainty vs structural disagreement)

Why this matters in controversial macro episodes

In many high-profile market controversies, price is driven by a fight between:

The orchestration is explicitly designed to model that conflict instead of hiding it behind one β€œfinal answer.”

Evidence: Feed-Driven Backtest with Adversarial Exit (Real News Data)

The latest validation uses only real news feeds β€” no hand-crafted events. The system discovers events from cached headlines the same way it would in production, then manages positions with an LLM-based exit system.

Metric Value
Coverage 2014-01-01 β†’ 2026-05-01 (12.3 years)
Headlines indexed 626,245 (Guardian + NYT + GDELT + FOMC + EDGAR)
Signals in store 635
Signals backtested 303 (those with price data)
Mean P&L per trade +7.98%
Alpha over hold-to-time +3.11%
Worst drawdown -19.3% (vs -57.3% mechanical)
Estimated annual return ~22% p.a. (10% allocation)
Recall on 29 curated crises 0.759

What these numbers mean for a PM:

P&L by Category

Category N Mean P&L Alpha vs Hold Best Instruments
Commodity 45 +32.8% +11.0% CL=F, BZ=F, NG=F
Currency Peg 24 +16.8% +13.9% USDRUB=X, USDTRY=X
Sovereign Debt 111 +3.2% +1.2% TLT, DX-Y.NYB, EMB
Geopolitical 35 +3.2% -0.3% GC=F, country ETFs
Central Bank 67 +0.9% -0.4% FX pairs
Real Estate 21 +0.6% +1.1% VNQ, GBPUSD=X

The system validates against category-specific instruments (FX pairs, commodity futures, bond ETFs, country funds) β€” not a blanket proxy like VIX.

Practical User Guide for Fund Managers

What You Get

Each signal from Market Synthesis provides:

  1. Event name and category β€” what the system detected
  2. Direction β€” long or short
  3. Confidence score (0-100) β€” how strongly the adversarial debate agreed
  4. Entry recommendation β€” when to enter (immediate, wait, or confirm)
  5. Position sizing β€” based on score and category R:R
  6. Stop-loss β€” category-specific initial stop
  7. Hold period β€” expected time to peak profit
  8. Exit triggers β€” headline signals for partial profit-taking

How to Act on a Signal

Step 1: Receive signal (score >= 70)

Check the category and apply entry rule:

Category Action
Central Bank / Geopolitical / Currency Peg / Real Estate Enter at next market open
Sovereign Debt Wait for 2 consecutive days of favorable price action
Commodity Wait 3 days for initial volatility to settle

Step 2: Size the position

Score Allocation
90-100 Full size (adjusted for category R:R)
80-89 75% of full size
70-79 50% of full size

Reduce sizing for weak R:R categories (real_estate, currency_peg) by additional 40%.

Step 3: Set initial stop-loss

Sovereign: -6%, Commodity: -7%, Central Bank: -7%, Geopolitical: -5%, Real Estate: -8%, Currency Peg: -6%.

Step 4: Manage the position

Step 5: Watch for resolution signals

If subsequent headlines indicate crisis resolution (bailout approved, intervention), this is a separate long opportunity β€” but requires full system conviction, not just a headline scan.

What Not to Do

Expected Use Inside a Fund

This is best used as a decision support module, not autonomous execution:

What a pilot can look like

graph LR P1["πŸ”‡ Phase 1
Shadow Mode
No capital impact
Log signal quality"] --> P2["πŸ“‹ Phase 2
Controlled Adoption
Watchlist ranking
Hedge discussion only"] --> P3["πŸ”— Phase 3
Integrated Workflow
IC/risk dashboard
Governance controls"] style P1 fill:#eef4ff,stroke:#4f7ee8,color:#0f2b57 style P2 fill:#fef3c7,stroke:#d97706,color:#78350f style P3 fill:#dcfce7,stroke:#16a34a,color:#14532d

Low-risk adoption path β€” prove value in shadow mode before capital impact

Commercialization path

A practical pilot model:

  1. Integrate the signal feed into your existing research/risk workflow.
  2. Run in shadow mode against your current process.
  3. Measure uplift in hit rate, drawdown reduction, and decision speed.
  4. Move to production only after governance sign-off.

Important note

This is research software and does not provide investment advice.
Any live deployment should include risk limits, human oversight, and model governance.