Back to home

Prediction market data
via API, SQL & Parquet

Multiple ways to access clean, cross-platform data — query it in ClickHouse SQL, fetch via REST, or export Parquet — including the historical order book no real-time feed keeps.

Get Started
REST API

Simple, powerful API

RESTful endpoints for markets, trades, orderbooks, and historical data. Consistent response format across all platforms. Paginated, filterable, and fast.

  • Unified schema — same format across all platforms
  • Filter by platform, category, date range, and more
  • Bearer-token auth — one key across every endpoint
GET/api/v1/markets
200 OK
{
  "data": [
    {
      "id": "poly_0x1234...",
      "platform": "polymarket",
      "title": "BTC above $100k?",
      "status": "active",
      "outcomes": ["Yes", "No"],
      "close_date": "2026-12-31"
    }
  ],
  "meta": { "total": 12847, "page": 1 }
}
Orderbook Data · New

Tick-level historical order book

Full bid/ask depth persisted at 1ms — replay the exact state of any market at any past moment. Most providers only stream the book live; we keep the history. Query via SQL or export Parquet.

  • Tick-level full-depth book — replay any past millisecond
  • 1ms LOCF interpolation — no gaps in your data
  • Up to 900M+ snapshots/day across Polymarket and Kalshi
Granularity Comparison
ProbalyticsUp to 900M+ / day
Competitors2-3M / day
Bulk Data Access · Coming Soon

Direct S3 access

For large-scale analysis and ML pipelines, access complete historical datasets directly from S3. Parquet files partitioned by date and platform for efficient queries.

  • Parquet format — works with Spark, DuckDB, Pandas
  • Weekly fills and monthly market exports
  • Direct S3 bucket access for automation
s3://probalytics-data/
markets/
trades/
trades/polymarket/2026/
2026-03-01.parquet
847 MB
2026-03-02.parquet
912 MB
2026-03-03.parquet
891 MB
SQL Access

Query with SQL

Connect directly to our ClickHouse cluster from your browser or any SQL client. Run complex analytical queries on billions of rows in milliseconds.

  • In-browser SQL console — no setup required
  • Connect via HTTPS or native ClickHouse protocol
  • Sub-second queries across all historical data
DBeaverDataGripMetabaseGrafana
SQL Console
ClickHouse
SELECT
  market_id,
  platform,
  AVG(price) as avg_price,
  COUNT(*) as trade_count
FROM fills
WHERE timestamp > now() - INTERVAL 1 HOUR
GROUP BY market_id, platform
ORDER BY trade_count DESC
LIMIT 10
10 rows · 23ms Run Query

FAQ

Frequently asked questions

Which prediction markets does Probalytics cover?

Polymarket and Kalshi are live — full markets, trades, and order book history in one unified schema. Metaculus and Manifold are coming soon.

How do I access the data?

Three ways from the same warehouse: ClickHouse SQL (in-browser console or any SQL client), a REST API, and Parquet exports.

Do you have historical order book data?

Yes. We persist full bid/ask depth at 1ms resolution, so you can replay the exact state of any market at any past moment — something real-time-only feeds can't offer.

How far back does the data go?

Market and trade history goes back to 2020. The 1ms historical order book is captured from November 2025 onward.

Can I query Polymarket and Kalshi together?

Yes — one normalized schema lets you join and compare both venues in a single SQL query, with no per-venue ETL.

What export formats are supported?

Query live in SQL, fetch JSON via the REST API, or download Parquet for DuckDB, Polars, Spark, and pandas.

Is there a free trial?

Yes — every paid plan includes a 3-day free trial so you can explore the full dataset before committing.

Ready to get started?

Start building with unified prediction market data today.

Sign up now