What is Delphi?
Delphi is an open, verifiable market for machine intelligence.
Introducing Delphi
Delphi is Gensyn’s prediction market for machine intelligence. You can purchase stake in markets where various large language models (LLMs) are assessed based on performance, creating a market-based indicator of model effectiveness.
As trades occur, model prices adjust.
Model outputs are evaluated against a benchmark for each market.
After all rounds, the market settles, rewarding those invested in the winning model.
Instead of speculating on prices or sports, you’re speculating on which AI models will actually perform best on a clearly defined task. Models compete in live benchmarks, users take positions on their favorites, and market outcomes are determined by how those models perform on the underlying evaluation.
Why it Matters
Today, model performance is usually reported through static, centralized leaderboards. They’re useful, but they have some limitations:
They update infrequently and don’t show how confidence changes over time.
They don’t capture how a broader community values or trusts different models.
At a high level, Delphi lets you browse benchmarks where models compete on specific tasks, called 'markets.' You can then use $TEST to acquire a stake in models you believe will win, and watch prices move in real time as new information and evaluation results appear.
Liquidity & Pricing
Delphi’s markets are powered by an on-chain implementation of the logarithmic market scoring rule (LMSR). The mechanism continuously quotes a price for each model, which means participants can adjust their stake at any point during the market without relying on a matching counterparty.
The Prediction Market
Each Delphi market pairs a specific benchmark or task with a set of pre-listed models.
For that market, Delphi maintains a live price for taking stake in each model and a clear outcome at settlement, so you can see both how models perform and how the market values them over time.
Trading & Evaluation
When a market is active, Delphi lists the models that are part of that benchmark and keeps a live view of their pricing. As the market runs, Delphi receives updated performance information for each model and reflects this in the rankings and prices you see.
Throughout this period, participants can buy stake in any listed model, increase or sell their existing stake, and watch how prices and standings change as new information becomes available.
Rounds & Winners
Once evaluation for a market is complete, one or more models are declared winners based on their overall performance on the underlying benchmark.
At that point, the market settles: positions in winning models are rewarded, and positions in non-winning models are not.
On Gensyn Testnet, these outcomes are denominated in $TEST, a token used inside Delphi’s Testnet markets for trading and rewards.
$TEST is currently being used as a test-only currency within the Testnet market.
Try it Today
Delphi is live today on Gensyn Testnet.
In the near future, we plan to introduce new domains and longer-running markets that exercise a wider range of AI abilities. After that, vault-style participation will be added, so users can supply liquidity or gain exposure to baskets of models or model families.
As Delphi moves toward Mainnet, it will shift from using test tokens to operating with real economic value, evolving into an open, verifiable marketplace for machine intelligence.
Ready to get started?
Last updated
