Overview
This paper analyzes the accuracy of Polymarket prediction markets in forecasting whether public companies will beat or miss their quarterly earnings estimates. The dataset covers 909 earnings events spanning June 2025 through May 2026, with daily pricing data tracking the market's evolving probability assessment for each event.
Unlike the Fed rate decision markets analyzed previously — where markets consistently priced outcomes weeks in advance with high certainty — earnings markets face a structurally different challenge: most earnings seasons compress pricing activity into a narrow 1–2 week window, leaving limited time for probability convergence. Despite this constraint, prediction markets demonstrate meaningful predictive value, particularly for high-conviction signals.
Key Findings
- 79.7% overall directional accuracy at D-1 (the day before earnings) across all 909 events
- When markets priced a >80% or <20% probability (high-conviction signals), accuracy rose to 87.8%
- 63% of events had already reached high-conviction pricing (>80% or <20%) by D-1
- The naive baseline (always predicting "Beat") yields only 72.6% — markets add meaningful lift beyond this floor
- Miss detection is significantly harder: markets correctly flagged only 45.2% of eventual misses at D-1, while identifying 92.7% of eventual beats
- Higher-volume markets showed considerably better accuracy (87.6%) vs. lower-volume markets (71.8%)
- Total market volume across 909 events: $11.3 million
Background: Earnings Markets and Their Structure
1.1 What the markets measure
Each Polymarket earnings market poses a simple binary question: "Will [Company] beat quarterly earnings?" Contracts trade between $0 and $1. A price of $0.72 implies a 72% market-assigned probability of an earnings beat. Resolution occurs at the closing price: contracts near $1.00 (≥$0.98) resolve as "Beat," while contracts near $0.00 (≤$0.02) resolve as "Miss."
1.2 Structural differences from Fed rate markets
Fed rate decision markets typically have 4–8 weeks of active trading before resolution. Earnings markets are structurally different: most companies report quarterly, and prediction market trading tends to concentrate in the final 7–14 days before the earnings announcement. The median event in this dataset had only 11 days of pricing data, and 25% had 8 or fewer data points.
This means the signal-building process documented in the Fed analysis (confidence rising steadily from 30+ days out) is largely absent for earnings. The market dynamic is more abrupt: traders enter a concentrated window, form views, and the market resolves quickly.
1.3 Dataset overview
This analysis covers 909 completed earnings beat/miss markets traded on Polymarket from June 2025 through May 2026. The dataset captures a broad cross-section of U.S. publicly traded companies — from mega-cap technology names (NVIDIA, Apple, Alphabet, Meta, Microsoft) to mid-cap industrials, consumer companies, and speculative growth stocks. Total traded dollar volume across all markets was $11.3 million.
Unique earnings beat/miss markets from Jun 2025 – May 2026
Median days of pricing data per event; 75th percentile was 14 days
Total dollar volume traded across all 909 earnings markets
Accuracy Analysis
2.1 The base rate problem
Any assessment of prediction market accuracy for earnings must account for the "base rate" effect. Over the Jun 2025 – May 2026 period, 72.6% of companies beat estimates — a figure consistent with the historical tendency for companies to manage guidance conservatively. A trivial "always predict Beat" model would therefore achieve 72.6% accuracy with zero effort.
The meaningful question is whether prediction markets deliver lift over this baseline — and whether they correctly identify the minority of companies that will miss. The answer is nuanced: the markets do add lift, but unevenly.
2.2 Accuracy by time horizon
The table below shows directional accuracy and average implied confidence at each time horizon before earnings. Unlike Fed rate markets — where confidence built steadily over 30+ days — earnings markets show relatively flat accuracy from D-7 to D-14, with a meaningful jump at D-1 as final pricing converges.
| Horizon | Events | Directional Accuracy | Avg. Confidence | High-Conf. Accuracy |
|---|---|---|---|---|
| D-30 | 10 | 60.0% | 63.1% | 86.7% |
| D-21 | 29 | 86.2% | 73.4% | 86.7% |
| D-14 | 193 | 76.2% | 68.7% | 84.3% |
| D-10 | 512 | 74.6% | 66.9% | 82.3% |
| D-7 | 696 | 75.9% | 67.0% | 87.3% |
| D-5 | 841 | 77.1% | 67.9% | 86.5% |
| D-3 | 892 | 76.9% | 68.0% | 86.3% |
| D-2 | 903 | 78.3% | 69.0% | 86.7% |
| D-1 | 907 | 79.7% | 72.4% | 87.8% |
High-conf. accuracy = accuracy for events priced >80% or <20% at that horizon. D-30/D-21 sample sizes are small due to most markets not opening that early.
2.3 The high-confidence signal
The most actionable finding in this data is the divergence between all events and high-confidence events. When the market priced a beat probability above 80% or below 20%, accuracy at D-1 jumped to 87.8%. Roughly 63% of all events fell in this category by D-1 — meaning a majority of markets were giving clean, high-confidence signals.
Note: "Very Low" tier = events where the market priced <20% beat probability (i.e., high confidence in a Miss). High accuracy in both tails demonstrates the market identifies confident calls well in both directions.
Miss Detection: Signal Strength, Sector & Size Patterns
3.1 The beat/miss asymmetry
The most important structural feature of earnings prediction markets is the skew: companies beat estimates far more often than they miss. Over this dataset, 72.6% of companies beat. This base rate creates a systematic bias — if uncertain about an outcome, the rational prior is "probably a Beat." The markets reflect this: the average D-1 beat probability across all events was approximately 70%.
The performance profile is dramatically different for beats vs. misses. Markets were highly effective at identifying eventual Beats (92.7% recall) but caught only 45.2% of Misses at D-1 using the 50% threshold. In other words, when a company ultimately missed, the market had it above 50% — predicting a beat — nearly 55% of the time.
3.2 The sub-60% signal: precision vs. recall tradeoff
Rather than using the 50% threshold mechanically, a more actionable framing asks: when the market prices a company below 60% to beat, how informative is that? The answer depends on how far below 60% — and which sector the company is in.
| Threshold | Events Flagged | Misses Caught | % of All Misses | Precision |
|---|---|---|---|---|
| < 20% beat prob | 52 | 46 | 18.5% | 88.5% |
| < 40% beat prob | 130 | 97 | 39.1% | 74.6% |
| < 50% beat prob | 160 | 112 | 45.2% | 70.0% |
| < 60% beat prob | 209 | 133 | 53.6% | 63.6% |
The sub-20% bucket is the cleanest signal: 88.5% precision with only 6 false alarms in 52 flagged events. But it catches fewer than 1-in-5 actual misses. The uncomfortable reality is that 46.4% of all misses were priced above 60% at D-1 — the market simply did not anticipate them. These are genuine surprise misses.
The price trajectory also matters: for events that ultimately missed, the average D-7 beat probability was 0.60, falling to 0.51 by D-1. About 58% of eventual misses saw their price fall between D-7 and D-1 — a declining trend that itself is a secondary signal, independent of the absolute level.
3.3 Miss rates by sector: where the signal is strongest
Miss rates vary dramatically across sectors, and so does the quality of the sub-60% warning signal. The data reveals three distinct tiers:
| Sector | Miss Rate | Sub-60% Precision | Sub-60% Recall | Surprise Misses |
|---|---|---|---|---|
| Real Estate | 71.4% | 90.0% | 90.0% | 10.0% |
| Communication Services | 41.2% | 71.4% | 71.4% | 28.6% |
| Healthcare | 29.4% | 78.6% | 73.3% | 26.7% |
| Consumer Discretionary | 38.8% | 58.5% | 47.5% | 52.5% |
| Financials | 24.7% | 69.7% | 52.3% | 47.7% |
| Energy | 33.3% | 58.3% | 70.0% | 30.0% |
| Consumer Staples | 20.6% | 42.9% | 21.4% | 78.6% |
| Industrials | 21.0% | 36.4% | 30.8% | 69.2% |
| Technology | 9.8% | 70.0% | 43.8% | 56.2% |
Surprise Misses = % of actual misses that were priced above 60% at D-1 (the market did not anticipate them). Sub-60% Precision = of events priced below 60%, % that actually missed. Sub-60% Recall = % of all misses that were flagged by the sub-60% signal.
Three patterns stand out:
Real Estate: the most reliable miss signal in the dataset. With a 71.4% base miss rate and 90% sub-60% precision and recall, Real Estate markets appear to price in distress quite accurately — likely because deteriorating fundamentals in property-linked companies tend to be visible in advance. Only 10% of misses came as surprises.
Communication Services and Healthcare: strong signal, moderate miss rates. Both sectors showed sub-60% precision above 70% and recall above 70%, meaning when the market flagged these names as at-risk, it was usually right and it caught most of the misses. These are the sectors where the sub-60% screen is most actionable.
Technology: low miss rates but mostly surprise misses. Technology companies beat at a remarkable 90.2% rate — the highest in the dataset. When they did miss, however, 56% of those misses were priced above 60% at D-1, meaning the market was caught off guard. Tech is a sector where the prediction market is primarily useful as a beat confirmer, not a miss detector.
Consumer Staples and Industrials: the market adds least value. Both sectors had low base miss rates (around 20%) and poor sub-60% signal quality — with recall below 35% and high surprise miss rates (69–79%). These sectors may resist prediction because their earnings surprises stem from idiosyncratic factors (input cost shocks, weather, supply chain) rather than the kind of public information that prediction market participants can synthesize.
3.4 Miss rates by market cap: smaller companies are harder to predict
Market capitalization shows a clean, monotonic relationship with both miss rates and the quality of the miss detection signal. Smaller companies miss more often, and when they do miss, the sub-60% signal is sharper.
| Market Cap Tier | Events | Miss Rate | Sub-60% Precision | Sub-60% Recall | Surprise Misses |
|---|---|---|---|---|---|
| Mega (>$200B) | 78 | 14.1% | 66.7% | 54.5% | 45.5% |
| Large ($10B–$200B) | 373 | 23.2% | 55.7% | 45.3% | 54.7% |
| Mid ($2B–$10B) | 260 | 26.2% | 62.5% | 51.5% | 48.5% |
| Small (<$2B) | 163 | 45.7% | 72.7% | 64.9% | 35.1% |
Small-cap companies missed at nearly three times the rate of mega-caps (45.7% vs. 14.1%). More importantly, the sub-60% signal for small-cap names had 72.7% precision and 64.9% recall — the best of any tier — meaning the market was better at anticipating small-cap misses than large-cap ones. This is plausible: small-cap companies tend to have higher earnings volatility and weaker guidance frameworks, making genuine uncertainty more visible in market pricing.
Large-cap stocks showed the weakest miss signal precision (55.7%) with the highest surprise miss rate (54.7%). For mega-cap and large-cap names, the market tends to assume a beat until proven otherwise, and misses often arrive as genuine shocks — consistent with the idea that large companies have more sophisticated investor relations and better ability to manage analyst expectations.
45.7% miss rate · 72.7% sub-60% precision · 64.9% recall
54.7% of large-cap misses were priced above 60% at D-1
90% sub-60% precision and recall · only 10% surprise misses
3.5 Companies that missed repeatedly
Several companies appeared multiple times in the dataset and missed earnings on every event tracked. While sample sizes are small, persistent missers may represent structurally unpredictable businesses or companies in operational distress during this period.
| Company | Sector | Cap | Events | Misses |
|---|---|---|---|---|
| Airbnb | Consumer Discretionary | Large | 3 | 3 |
| Warby Parker | Consumer Discretionary | Small | 3 | 3 |
| Rumble | Communication Services | Small | 3 | 3 |
| Alibaba | Technology | Mega | 2 | 2 |
| BuzzFeed | Communication Services | Small | 2 | 2 |
| Opendoor Technologies | Real Estate | Small | 3 | 3 |
| Capital One | Financials | Large | 2 | 2 |
| Blackstone | Financials | Large | 2 | 2 |
| DoorDash | Consumer Discretionary | Large | 2 | 2 |
| Chipotle Mexican Grill | Consumer Discretionary | Large | 2 | 2 |
Volume, Market Quality, and Notable Events
4.1 Volume as a quality signal
Dollar volume varies enormously across the 909 markets — from under $100 for the thinnest names to over $650,000 for the most-watched events (NVIDIA). This variation is itself informative: high-volume markets attract more sophisticated participants and are likely to be better calibrated.
Splitting the dataset at the median volume, higher-volume markets achieved 87.6% D-1 accuracy vs. only 71.8% for lower-volume markets — a 15.8 percentage point gap. This strongly suggests that volume is a useful proxy for signal quality: thinner markets should be weighted less in any investment framework.
Events above median dollar volume (N=453)
Events below median dollar volume (N=454)
Higher-volume markets are meaningfully more accurate
4.2 Highest-volume markets
The table below shows the 15 most actively traded earnings markets in the dataset. These are the events where prediction market data is likely most reliable. NVIDIA's earnings events alone accounted for nearly $900,000 in combined volume.
| Company | Outcome | Total Volume | Final Price |
|---|---|---|---|
| NVIDIA | BEAT | $651,846 | 0.999 |
| Tesla | MISS | $335,312 | 0.001 |
| Rocket Lab | BEAT | $276,354 | 0.999 |
| Papa John's International | BEAT | $252,402 | 0.999 |
| NVIDIA | BEAT | $232,237 | 0.999 |
| Hilton Grand Vacations | MISS | $225,884 | 0.001 |
| Domino's Pizza | MISS | $202,765 | 0.001 |
| Amer Sports | BEAT | $199,888 | 0.999 |
| Hims & Hers Health | BEAT | $196,605 | 0.999 |
| Costco Wholesale | MISS | $164,438 | 0.004 |
| Alphabet | BEAT | $132,856 | 0.999 |
| Cintas | MISS | $99,607 | 0.001 |
| Robinhood Markets | BEAT | $96,729 | 0.999 |
| Alphabet | BEAT | $95,325 | 0.999 |
| Meta Platforms | BEAT | $80,776 | 0.999 |
4.3 Notable prediction errors
The most notable high-volume mispredictions at D-1 reveal a pattern: they cluster around events with genuine uncertainty — where both beats and misses were plausible given the public information at the time. The market was not blindly wrong; in many cases it was expressing real uncertainty that happened to resolve the other way.
| Company | Actual Outcome | D-1 Price | Volume | Error Type |
|---|---|---|---|---|
| Tesla | MISS | 0.830 | $335,312 | Predicted Beat, was Miss |
| Papa John's | BEAT | 0.330 | $252,402 | Predicted Miss, was Beat |
| Domino's Pizza | MISS | 0.660 | $202,765 | Predicted Beat, was Miss |
| Tesla | BEAT | 0.290 | $67,558 | Predicted Miss, was Beat |
| Beyond Meat | MISS | 0.740 | $67,108 | Predicted Beat, was Miss |
| Amazon | MISS | 0.954 | $44,525 | High-conf Beat, was Miss |
Tesla appears multiple times in this table — including once with 83% confidence in a Beat that turned into a Miss, and once with 71% confidence in a Miss that turned out to be a Beat. Tesla earnings have long been idiosyncratic and difficult to forecast, and this data confirms the prediction market also struggled with them.
Quarterly Beat Rates and Market Calibration
5.1 Beat rate variation by earnings season
The underlying beat rate varies across reporting periods. The dataset spans three major earnings seasons (Q3 2025, Q4 2025, and Q1 2026), each with a slightly different beat rate. Markets must implicitly calibrate to these seasonal patterns.
| Period | Events | Beat Rate |
|---|---|---|
| Q3 2025 | 9 | 88.9% |
| Q4 2025 | 370 | 75.1% |
| Q1 2026 | 295 | 68.8% |
| Q2 2026 (partial) | 234 | 72.6% |
The Q1 2026 reporting season showed a notably lower beat rate (68.8%) compared to Q4 2025 (75.1%), reflecting broader macro uncertainty in early 2026. Prediction markets tracking these events in real time would have needed to implicitly adjust for changing base rates — a sophisticated task that underlines why even modest lift over the naive baseline is meaningful.
5.2 Calibration quality
The cleanest test of calibration is this: do events priced at 80%+ actually beat at an ~80%+ rate? In this dataset, events at D-1 split almost entirely into two buckets — <10% (all misses) and >90% (all beats) — with very few intermediate prices at resolution. This binary polarization is expected: markets settle into near-certainty by the time of resolution. The more interesting calibration signal is in the intermediate prices 7–14 days out, where the market is still forming its view.
Investment Applications
6.1 Strategies informed by earnings prediction market data
High-confidence signals (>80% or <20%) at D-7 were accurate 87%+ of the time. For equity managers running concentrated positions into earnings, this provides an evidence-based framework for sizing and hedging decisions.
Prediction market probabilities can be compared against implied volatility in equity options ahead of earnings. When a high-volume market is pricing a 90%+ beat probability, selling downside protection may be mispriced relative to the PM signal.
The 15.8pp accuracy gap between high- and low-volume markets is actionable: filter out thin markets and weight the remaining signals by volume. This converts a 79.7% average signal into a consistently higher-quality one.
While overall miss detection is weak (45.2% recall), when a high-volume market prices a stock below 20% beat probability with high conviction, that is a materially useful warning signal. Track these specific "high-conviction miss" signals.
6.2 Comparison to Fed rate market performance
The contrast with Fed rate decision markets is instructive. Fed markets achieved 95.7% directional accuracy on decision day — but that was in a structured, calendar-driven environment with extensive Fed communication, FOMC transcripts, and blackout periods that give markets a clear cadence to work with.
Earnings markets operate in a noisier environment: management teams control the information flow, guidance is strategic, and macro surprises can invalidate even well-researched forecasts. An 87.8% accuracy rate for high-confidence earnings signals is therefore arguably as impressive as 95.7% for Fed decisions — the underlying task is harder.
6.3 Limitations and caveats
Several important caveats apply to this analysis:
- Definition of "Beat": Resolution here uses the final prediction market price, which itself reflects consensus on whether the company beat or missed. This is self-consistent but may not always align with every analyst's definition.
- Limited pricing history: Most events have only 7–14 days of data. With more lead time, signal quality might improve — or the market might remain uncertain until very close to the announcement.
- Adverse selection: The fact that a prediction market exists for an event may itself be endogenous — markets are more likely to be created for earnings events perceived as interesting or uncertain.
- Short time series: This analysis covers approximately one year (Jun 2025 – May 2026). A longer dataset would allow for more robust calibration analysis and seasonal pattern identification.
Conclusion
Prediction markets for corporate earnings provide a genuine — if limited — informational edge. Across 909 events, Polymarket earnings markets achieved 79.7% directional accuracy at D-1, representing a 7.1 percentage-point lift over the naive base rate. For high-confidence signals in well-traded markets, accuracy reached 87.8%.
The key insight for practitioners is selectivity: not all signals are equal. High-volume markets significantly outperform thin ones (87.6% vs. 71.8%). High-conviction signals (priced above 80% or below 20%) dramatically outperform uncertain signals. Filtering to these high-quality signals converts a moderately useful dataset into an actionable investment tool.
The structural asymmetry — strong beat detection, weak miss detection — is a fundamental feature of the market, not a bug. Companies miss estimates less than 30% of the time, and when they do, it is often for reasons that were not foreseeable from public information. Sophisticated use of this data means understanding what it can and cannot do: it is a high-precision Beat-detector, a moderate Miss-alerter, and a near-useless tool in the uncertain 20–50% pricing band.
For equity investors managing exposure around earnings announcements, this represents a low-cost, high-frequency, real-money-weighted probability signal that adds meaningful information to the earnings event risk management process.