Earnings Beat Accuracy Analysis – Prediction Markets Data

Executive Summary

Overview

This paper analyzes the accuracy of Polymarket prediction markets in forecasting whether public companies will beat or miss their quarterly earnings estimates. The dataset covers 909 earnings events spanning June 2025 through May 2026, with daily pricing data tracking the market's evolving probability assessment for each event.

Unlike the Fed rate decision markets analyzed previously — where markets consistently priced outcomes weeks in advance with high certainty — earnings markets face a structurally different challenge: most earnings seasons compress pricing activity into a narrow 1–2 week window, leaving limited time for probability convergence. Despite this constraint, prediction markets demonstrate meaningful predictive value, particularly for high-conviction signals.

Key Findings

79.7% overall directional accuracy at D-1 (the day before earnings) across all 909 events
When markets priced a >80% or <20% probability (high-conviction signals), accuracy rose to 87.8%
63% of events had already reached high-conviction pricing (>80% or <20%) by D-1
The naive baseline (always predicting "Beat") yields only 72.6% — markets add meaningful lift beyond this floor
Miss detection is significantly harder: markets correctly flagged only 45.2% of eventual misses at D-1, while identifying 92.7% of eventual beats
Higher-volume markets showed considerably better accuracy (87.6%) vs. lower-volume markets (71.8%)
Total market volume across 909 events: $11.3 million

Section 1

Background: Earnings Markets and Their Structure

1.1 What the markets measure

Each Polymarket earnings market poses a simple binary question: "Will [Company] beat quarterly earnings?" Contracts trade between $0 and $1. A price of $0.72 implies a 72% market-assigned probability of an earnings beat. Resolution occurs at the closing price: contracts near $1.00 (≥$0.98) resolve as "Beat," while contracts near $0.00 (≤$0.02) resolve as "Miss."

1.2 Structural differences from Fed rate markets

Fed rate decision markets typically have 4–8 weeks of active trading before resolution. Earnings markets are structurally different: most companies report quarterly, and prediction market trading tends to concentrate in the final 7–14 days before the earnings announcement. The median event in this dataset had only 11 days of pricing data, and 25% had 8 or fewer data points.

This means the signal-building process documented in the Fed analysis (confidence rising steadily from 30+ days out) is largely absent for earnings. The market dynamic is more abrupt: traders enter a concentrated window, form views, and the market resolves quickly.

1.3 Dataset overview

This analysis covers 909 completed earnings beat/miss markets traded on Polymarket from June 2025 through May 2026. The dataset captures a broad cross-section of U.S. publicly traded companies — from mega-cap technology names (NVIDIA, Apple, Alphabet, Meta, Microsoft) to mid-cap industrials, consumer companies, and speculative growth stocks. Total traded dollar volume across all markets was $11.3 million.

Dataset

909

Unique earnings beat/miss markets from Jun 2025 – May 2026

Pricing Window

11 days

Median days of pricing data per event; 75th percentile was 14 days

Total Volume

$11.3M

Total dollar volume traded across all 909 earnings markets

Section 2

Accuracy Analysis

2.1 The base rate problem

Any assessment of prediction market accuracy for earnings must account for the "base rate" effect. Over the Jun 2025 – May 2026 period, 72.6% of companies beat estimates — a figure consistent with the historical tendency for companies to manage guidance conservatively. A trivial "always predict Beat" model would therefore achieve 72.6% accuracy with zero effort.

The meaningful question is whether prediction markets deliver lift over this baseline — and whether they correctly identify the minority of companies that will miss. The answer is nuanced: the markets do add lift, but unevenly.

72.6%

Naive Baseline

Always predict "Beat"

79.7%

D-1 PM Accuracy

+7.1 percentage points of lift

2.2 Accuracy by time horizon

The table below shows directional accuracy and average implied confidence at each time horizon before earnings. Unlike Fed rate markets — where confidence built steadily over 30+ days — earnings markets show relatively flat accuracy from D-7 to D-14, with a meaningful jump at D-1 as final pricing converges.

Horizon	Events	Directional Accuracy	Avg. Confidence	High-Conf. Accuracy
D-30	10	60.0%	63.1%	86.7%
D-21	29	86.2%	73.4%	86.7%
D-14	193	76.2%	68.7%	84.3%
D-10	512	74.6%	66.9%	82.3%
D-7	696	75.9%	67.0%	87.3%
D-5	841	77.1%	67.9%	86.5%
D-3	892	76.9%	68.0%	86.3%
D-2	903	78.3%	69.0%	86.7%
D-1	907	79.7%	72.4%	87.8%

High-conf. accuracy = accuracy for events priced >80% or <20% at that horizon. D-30/D-21 sample sizes are small due to most markets not opening that early.

2.3 The high-confidence signal

The most actionable finding in this data is the divergence between all events and high-confidence events. When the market priced a beat probability above 80% or below 20%, accuracy at D-1 jumped to 87.8%. Roughly 63% of all events fell in this category by D-1 — meaning a majority of markets were giving clean, high-confidence signals.

D-1 Accuracy by Confidence Tier

Very High
>80% or <20%

87.8%

High
60–80%

72.9%

Medium
40–60%

57.8%

Low
20–40%

64.0%

Very Low
0–20%

85.9%

Note: "Very Low" tier = events where the market priced <20% beat probability (i.e., high confidence in a Miss). High accuracy in both tails demonstrates the market identifies confident calls well in both directions.

Section 3

Miss Detection: Signal Strength, Sector & Size Patterns

3.1 The beat/miss asymmetry

The most important structural feature of earnings prediction markets is the skew: companies beat estimates far more often than they miss. Over this dataset, 72.6% of companies beat. This base rate creates a systematic bias — if uncertain about an outcome, the rational prior is "probably a Beat." The markets reflect this: the average D-1 beat probability across all events was approximately 70%.

The performance profile is dramatically different for beats vs. misses. Markets were highly effective at identifying eventual Beats (92.7% recall) but caught only 45.2% of Misses at D-1 using the 50% threshold. In other words, when a company ultimately missed, the market had it above 50% — predicting a beat — nearly 55% of the time.

611

True Beat

48

Missed Beat

136

False Beat

112

True Miss

3.2 The sub-60% signal: precision vs. recall tradeoff

Rather than using the 50% threshold mechanically, a more actionable framing asks: when the market prices a company below 60% to beat, how informative is that? The answer depends on how far below 60% — and which sector the company is in.

Threshold	Events Flagged	Misses Caught	% of All Misses	Precision
< 20% beat prob	52	46	18.5%	88.5%
< 40% beat prob	130	97	39.1%	74.6%
< 50% beat prob	160	112	45.2%	70.0%
< 60% beat prob	209	133	53.6%	63.6%

The sub-20% bucket is the cleanest signal: 88.5% precision with only 6 false alarms in 52 flagged events. But it catches fewer than 1-in-5 actual misses. The uncomfortable reality is that 46.4% of all misses were priced above 60% at D-1 — the market simply did not anticipate them. These are genuine surprise misses.

The price trajectory also matters: for events that ultimately missed, the average D-7 beat probability was 0.60, falling to 0.51 by D-1. About 58% of eventual misses saw their price fall between D-7 and D-1 — a declining trend that itself is a secondary signal, independent of the absolute level.

3.3 Miss rates by sector: where the signal is strongest

Miss rates vary dramatically across sectors, and so does the quality of the sub-60% warning signal. The data reveals three distinct tiers:

Sector	Miss Rate	Sub-60% Precision	Sub-60% Recall	Surprise Misses
Real Estate	71.4%	90.0%	90.0%	10.0%
Communication Services	41.2%	71.4%	71.4%	28.6%
Healthcare	29.4%	78.6%	73.3%	26.7%
Consumer Discretionary	38.8%	58.5%	47.5%	52.5%
Financials	24.7%	69.7%	52.3%	47.7%
Energy	33.3%	58.3%	70.0%	30.0%
Consumer Staples	20.6%	42.9%	21.4%	78.6%
Industrials	21.0%	36.4%	30.8%	69.2%
Technology	9.8%	70.0%	43.8%	56.2%

Surprise Misses = % of actual misses that were priced above 60% at D-1 (the market did not anticipate them). Sub-60% Precision = of events priced below 60%, % that actually missed. Sub-60% Recall = % of all misses that were flagged by the sub-60% signal.

Three patterns stand out:

Real Estate: the most reliable miss signal in the dataset. With a 71.4% base miss rate and 90% sub-60% precision and recall, Real Estate markets appear to price in distress quite accurately — likely because deteriorating fundamentals in property-linked companies tend to be visible in advance. Only 10% of misses came as surprises.

Communication Services and Healthcare: strong signal, moderate miss rates. Both sectors showed sub-60% precision above 70% and recall above 70%, meaning when the market flagged these names as at-risk, it was usually right and it caught most of the misses. These are the sectors where the sub-60% screen is most actionable.

Technology: low miss rates but mostly surprise misses. Technology companies beat at a remarkable 90.2% rate — the highest in the dataset. When they did miss, however, 56% of those misses were priced above 60% at D-1, meaning the market was caught off guard. Tech is a sector where the prediction market is primarily useful as a beat confirmer, not a miss detector.

Consumer Staples and Industrials: the market adds least value. Both sectors had low base miss rates (around 20%) and poor sub-60% signal quality — with recall below 35% and high surprise miss rates (69–79%). These sectors may resist prediction because their earnings surprises stem from idiosyncratic factors (input cost shocks, weather, supply chain) rather than the kind of public information that prediction market participants can synthesize.

3.4 Miss rates by market cap: smaller companies are harder to predict

Market capitalization shows a clean, monotonic relationship with both miss rates and the quality of the miss detection signal. Smaller companies miss more often, and when they do miss, the sub-60% signal is sharper.

Market Cap Tier	Events	Miss Rate	Sub-60% Precision	Sub-60% Recall	Surprise Misses
Mega (>$200B)	78	14.1%	66.7%	54.5%	45.5%
Large ($10B–$200B)	373	23.2%	55.7%	45.3%	54.7%
Mid ($2B–$10B)	260	26.2%	62.5%	51.5%	48.5%
Small (<$2B)	163	45.7%	72.7%	64.9%	35.1%

Small-cap companies missed at nearly three times the rate of mega-caps (45.7% vs. 14.1%). More importantly, the sub-60% signal for small-cap names had 72.7% precision and 64.9% recall — the best of any tier — meaning the market was better at anticipating small-cap misses than large-cap ones. This is plausible: small-cap companies tend to have higher earnings volatility and weaker guidance frameworks, making genuine uncertainty more visible in market pricing.

Large-cap stocks showed the weakest miss signal precision (55.7%) with the highest surprise miss rate (54.7%). For mega-cap and large-cap names, the market tends to assume a beat until proven otherwise, and misses often arrive as genuine shocks — consistent with the idea that large companies have more sophisticated investor relations and better ability to manage analyst expectations.

Best Miss Signal

Small Cap

45.7% miss rate · 72.7% sub-60% precision · 64.9% recall

Highest Surprise Rate

Large Cap

54.7% of large-cap misses were priced above 60% at D-1

Most Reliable Sector

Real Estate

90% sub-60% precision and recall · only 10% surprise misses

3.5 Companies that missed repeatedly

Several companies appeared multiple times in the dataset and missed earnings on every event tracked. While sample sizes are small, persistent missers may represent structurally unpredictable businesses or companies in operational distress during this period.

Company	Sector	Cap	Events	Misses
Airbnb	Consumer Discretionary	Large	3	3
Warby Parker	Consumer Discretionary	Small	3	3
Rumble	Communication Services	Small	3	3
Alibaba	Technology	Mega	2	2
BuzzFeed	Communication Services	Small	2	2
Opendoor Technologies	Real Estate	Small	3	3
Capital One	Financials	Large	2	2
Blackstone	Financials	Large	2	2
DoorDash	Consumer Discretionary	Large	2	2
Chipotle Mexican Grill	Consumer Discretionary	Large	2	2

Section 4

Volume, Market Quality, and Notable Events

4.1 Volume as a quality signal

Dollar volume varies enormously across the 909 markets — from under $100 for the thinnest names to over $650,000 for the most-watched events (NVIDIA). This variation is itself informative: high-volume markets attract more sophisticated participants and are likely to be better calibrated.

Splitting the dataset at the median volume, higher-volume markets achieved 87.6% D-1 accuracy vs. only 71.8% for lower-volume markets — a 15.8 percentage point gap. This strongly suggests that volume is a useful proxy for signal quality: thinner markets should be weighted less in any investment framework.

High Volume D-1 Accuracy

87.6%

Events above median dollar volume (N=453)

Low Volume D-1 Accuracy

71.8%

Events below median dollar volume (N=454)

Volume Gap

+15.8pp

Higher-volume markets are meaningfully more accurate

4.2 Highest-volume markets

The table below shows the 15 most actively traded earnings markets in the dataset. These are the events where prediction market data is likely most reliable. NVIDIA's earnings events alone accounted for nearly $900,000 in combined volume.

Company	Outcome	Total Volume	Final Price
NVIDIA	BEAT	$651,846	0.999
Tesla	MISS	$335,312	0.001
Rocket Lab	BEAT	$276,354	0.999
Papa John's International	BEAT	$252,402	0.999
NVIDIA	BEAT	$232,237	0.999
Hilton Grand Vacations	MISS	$225,884	0.001
Domino's Pizza	MISS	$202,765	0.001
Amer Sports	BEAT	$199,888	0.999
Hims & Hers Health	BEAT	$196,605	0.999
Costco Wholesale	MISS	$164,438	0.004
Alphabet	BEAT	$132,856	0.999
Cintas	MISS	$99,607	0.001
Robinhood Markets	BEAT	$96,729	0.999
Alphabet	BEAT	$95,325	0.999
Meta Platforms	BEAT	$80,776	0.999

4.3 Notable prediction errors

The most notable high-volume mispredictions at D-1 reveal a pattern: they cluster around events with genuine uncertainty — where both beats and misses were plausible given the public information at the time. The market was not blindly wrong; in many cases it was expressing real uncertainty that happened to resolve the other way.

Company	Actual Outcome	D-1 Price	Volume	Error Type
Tesla	MISS	0.830	$335,312	Predicted Beat, was Miss
Papa John's	BEAT	0.330	$252,402	Predicted Miss, was Beat
Domino's Pizza	MISS	0.660	$202,765	Predicted Beat, was Miss
Tesla	BEAT	0.290	$67,558	Predicted Miss, was Beat
Beyond Meat	MISS	0.740	$67,108	Predicted Beat, was Miss
Amazon	MISS	0.954	$44,525	High-conf Beat, was Miss

Tesla appears multiple times in this table — including once with 83% confidence in a Beat that turned into a Miss, and once with 71% confidence in a Miss that turned out to be a Beat. Tesla earnings have long been idiosyncratic and difficult to forecast, and this data confirms the prediction market also struggled with them.

Section 5

Quarterly Beat Rates and Market Calibration

5.1 Beat rate variation by earnings season

The underlying beat rate varies across reporting periods. The dataset spans three major earnings seasons (Q3 2025, Q4 2025, and Q1 2026), each with a slightly different beat rate. Markets must implicitly calibrate to these seasonal patterns.

Period	Events	Beat Rate
Q3 2025	9	88.9%
Q4 2025	370	75.1%
Q1 2026	295	68.8%
Q2 2026 (partial)	234	72.6%

The Q1 2026 reporting season showed a notably lower beat rate (68.8%) compared to Q4 2025 (75.1%), reflecting broader macro uncertainty in early 2026. Prediction markets tracking these events in real time would have needed to implicitly adjust for changing base rates — a sophisticated task that underlines why even modest lift over the naive baseline is meaningful.

5.2 Calibration quality

The cleanest test of calibration is this: do events priced at 80%+ actually beat at an ~80%+ rate? In this dataset, events at D-1 split almost entirely into two buckets — <10% (all misses) and >90% (all beats) — with very few intermediate prices at resolution. This binary polarization is expected: markets settle into near-certainty by the time of resolution. The more interesting calibration signal is in the intermediate prices 7–14 days out, where the market is still forming its view.

Section 6

Investment Applications

6.1 Strategies informed by earnings prediction market data

Long/Short Earnings Positioning

High-confidence signals (>80% or <20%) at D-7 were accurate 87%+ of the time. For equity managers running concentrated positions into earnings, this provides an evidence-based framework for sizing and hedging decisions.

Options Volatility Strategies

Prediction market probabilities can be compared against implied volatility in equity options ahead of earnings. When a high-volume market is pricing a 90%+ beat probability, selling downside protection may be mispriced relative to the PM signal.

Volume as a Quality Filter

The 15.8pp accuracy gap between high- and low-volume markets is actionable: filter out thin markets and weight the remaining signals by volume. This converts a 79.7% average signal into a consistently higher-quality one.

Miss Probability Monitoring

While overall miss detection is weak (45.2% recall), when a high-volume market prices a stock below 20% beat probability with high conviction, that is a materially useful warning signal. Track these specific "high-conviction miss" signals.

6.2 Comparison to Fed rate market performance

The contrast with Fed rate decision markets is instructive. Fed markets achieved 95.7% directional accuracy on decision day — but that was in a structured, calendar-driven environment with extensive Fed communication, FOMC transcripts, and blackout periods that give markets a clear cadence to work with.

Earnings markets operate in a noisier environment: management teams control the information flow, guidance is strategic, and macro surprises can invalidate even well-researched forecasts. An 87.8% accuracy rate for high-confidence earnings signals is therefore arguably as impressive as 95.7% for Fed decisions — the underlying task is harder.

6.3 Limitations and caveats

Several important caveats apply to this analysis:

Definition of "Beat": Resolution here uses the final prediction market price, which itself reflects consensus on whether the company beat or missed. This is self-consistent but may not always align with every analyst's definition.
Limited pricing history: Most events have only 7–14 days of data. With more lead time, signal quality might improve — or the market might remain uncertain until very close to the announcement.
Adverse selection: The fact that a prediction market exists for an event may itself be endogenous — markets are more likely to be created for earnings events perceived as interesting or uncertain.
Short time series: This analysis covers approximately one year (Jun 2025 – May 2026). A longer dataset would allow for more robust calibration analysis and seasonal pattern identification.

Section 7

Conclusion

Prediction markets for corporate earnings provide a genuine — if limited — informational edge. Across 909 events, Polymarket earnings markets achieved 79.7% directional accuracy at D-1, representing a 7.1 percentage-point lift over the naive base rate. For high-confidence signals in well-traded markets, accuracy reached 87.8%.

The key insight for practitioners is selectivity: not all signals are equal. High-volume markets significantly outperform thin ones (87.6% vs. 71.8%). High-conviction signals (priced above 80% or below 20%) dramatically outperform uncertain signals. Filtering to these high-quality signals converts a moderately useful dataset into an actionable investment tool.

The structural asymmetry — strong beat detection, weak miss detection — is a fundamental feature of the market, not a bug. Companies miss estimates less than 30% of the time, and when they do, it is often for reasons that were not foreseeable from public information. Sophisticated use of this data means understanding what it can and cannot do: it is a high-precision Beat-detector, a moderate Miss-alerter, and a near-useless tool in the uncertain 20–50% pricing band.

For equity investors managing exposure around earnings announcements, this represents a low-cost, high-frequency, real-money-weighted probability signal that adds meaningful information to the earnings event risk management process.

Prediction Markets & Corporate Earnings:
An Accuracy Analysis

Evidence from Earnings Beat / Miss Prediction Markets

Key Findings

Background: Earnings Markets and Their Structure

1.1 What the markets measure

1.2 Structural differences from Fed rate markets

1.3 Dataset overview

Accuracy Analysis

2.1 The base rate problem

2.2 Accuracy by time horizon

2.3 The high-confidence signal

Miss Detection: Signal Strength, Sector & Size Patterns

3.1 The beat/miss asymmetry

3.2 The sub-60% signal: precision vs. recall tradeoff

3.3 Miss rates by sector: where the signal is strongest

3.4 Miss rates by market cap: smaller companies are harder to predict

3.5 Companies that missed repeatedly

Volume, Market Quality, and Notable Events

4.1 Volume as a quality signal

4.2 Highest-volume markets

4.3 Notable prediction errors

Quarterly Beat Rates and Market Calibration

5.1 Beat rate variation by earnings season

5.2 Calibration quality

Investment Applications

6.1 Strategies informed by earnings prediction market data

6.2 Comparison to Fed rate market performance

6.3 Limitations and caveats

Conclusion

Prediction Markets & Corporate Earnings:An Accuracy Analysis

Evidence from Earnings Beat / Miss Prediction Markets

Key Findings

Background: Earnings Markets and Their Structure

1.1 What the markets measure

1.2 Structural differences from Fed rate markets

1.3 Dataset overview

Accuracy Analysis

2.1 The base rate problem

2.2 Accuracy by time horizon

2.3 The high-confidence signal

Miss Detection: Signal Strength, Sector & Size Patterns

3.1 The beat/miss asymmetry

3.2 The sub-60% signal: precision vs. recall tradeoff

3.3 Miss rates by sector: where the signal is strongest

3.4 Miss rates by market cap: smaller companies are harder to predict

3.5 Companies that missed repeatedly

Volume, Market Quality, and Notable Events

4.1 Volume as a quality signal

4.2 Highest-volume markets

4.3 Notable prediction errors

Quarterly Beat Rates and Market Calibration

5.1 Beat rate variation by earnings season

5.2 Calibration quality

Investment Applications

6.1 Strategies informed by earnings prediction market data

6.2 Comparison to Fed rate market performance

6.3 Limitations and caveats

Conclusion

Prediction Markets & Corporate Earnings:
An Accuracy Analysis