← Back to home

How Prediction Engine Works

Machine learning models, expected value, and bankroll strategy — everything you need to make informed betting decisions.

24+
ML Models
69.1%
ML Baseline
90.7%
Live K Accuracy
2
Sports Covered

What is Prediction Engine?

Prediction Engine is a quantitative sports analysis platform that generates daily MLB and NBA predictions using machine learning. Instead of relying on gut feelings, talking heads, or social media hype, every pick on the platform is produced by trained statistical models that analyze hundreds of features from historical data.

The platform covers team-level predictions (moneyline, spread/run line, totals), player props (hits, runs, RBIs, strikeouts, points, assists, rebounds, double-doubles, triple-doubles), and live in-game trackers that update predictions every 30-60 seconds as games unfold.

Every prediction includes a confidence level, the model's edge over Vegas implied probability, and historical backtest accuracy — so you always know exactly how much conviction the model has and how it has performed historically.

How Our Models Work

All models are built on XGBoost (gradient-boosted decision trees) and Random Forest classifiers/regressors — the same algorithms used across finance, healthcare, and competitive machine learning. We don't use deep learning or black boxes. Every model is interpretable and auditable.

MLB Models (3 Team + 6 Player Props + 6 Live)

Our MLB pipeline uses three separate team models — one each for moneyline (win/loss classification), run line (margin regression), and totals (over/under regression). Each model is trained on 42-44 engineered features including pitching matchups, bullpen usage, park factors, umpire tendencies, and rolling performance windows.

Player prop models cover five batter statistics (hits, runs, RBIs, home runs, combined HRR) and pitcher strikeouts. The pitcher K model identifies starters who are likely to hit their strikeout over/under, with 86.6% accuracy at the 80%+ confidence tier.

The Live K Tracker runs four XGBoost checkpoint models (innings 2 through 5) that predict a pitcher's final strikeout total from in-game data — pace, pitch count, K rate, and game context. By inning 5, the model achieves 90.7% over/under accuracy. The Live Hits Tracker uses 59 features including park factors, platoon splits, umpire tendencies, and weather to predict remaining hits for batters mid-game.

NBA Models (5 Team + 5 Player Props + 9 Live)

NBA team models use Random Forest for moneyline and spread prediction, and XGBoost for totals. The moneyline model reaches 78.4% accuracy at the 70%+ confidence filter. Spread predictions use regression to estimate winning margin, and the totals model trains daily on rolling data from three seasons.

Player props cover points, assists, rebounds, double-doubles, and triple-doubles. The points and assists models use XGBoost regression combined with CDF (cumulative distribution function) probability to generate over/under confidence for each prop line.

The Live Player Stats Tracker runs nine XGBoost models — three stats (points, rebounds, assists) across three quarterly checkpoints (Q1, Q2, Q3). By Q3, the assists model hits 94% over/under accuracy. These models compare real-time projections against sportsbook prop lines to generate live over/under signals during games.

The Self-Training Loop

Models improve over time through a self-training feedback loop. Each day, the pipeline grades yesterday's predictions against actual results, collects new game data, and feeds it back into the training set. Monthly retrains incorporate the latest data to keep models calibrated against the current season. This isn't a static system — it learns and adapts.

What is +EV Betting?

Expected Value (+EV) is the mathematical foundation of profitable betting. A bet is +EV when its true probability of winning is higher than what the odds imply. Over a large enough sample, +EV bets make money regardless of any individual outcome.

How It Works

Every set of odds implies a probability. When a sportsbook offers +150 on a team, they're implying that team has about a 40% chance of winning. If your model calculates a 52% chance, that 12-point gap is your edge — and that bet is +EV.

Here's the math: if you bet $100 at +150 odds with a true 52% win probability, your expected value per bet is:

EV = (0.52 × $150) − (0.48 × $100) = $78 − $48 = +$30 per bet

You won't win every bet. But over hundreds of bets, the math plays out. This is the same principle that makes casinos profitable — except here, the edge is on your side.

Edge vs. Confidence

Prediction Engine shows both confidence (model's probability a pick wins) and edge (the gap between model probability and Vegas implied probability). A pick can be high-confidence but low-edge if the sportsbook already agrees. The most valuable picks have both high confidence AND significant edge — meaning the model sees something the market hasn't priced in.

Why Most Bettors Lose

Sportsbooks build a margin (the “vig” or “juice”) into every line. Standard -110/-110 odds imply both sides have a 52.4% chance of winning — totaling 104.8%, not 100%. That extra 4.8% is the house edge. To beat the vig, you need to be right more than 52.4% of the time on standard bets. Casual bettors who pick based on fandom or narratives rarely clear that bar consistently.

Betting Strategies & Bankroll Management

Having an edge means nothing if you blow your bankroll on a bad run. The strategy you use to size your bets is just as important as the picks themselves. Here are the most common approaches, from simplest to most sophisticated.

Flat Unit Betting

The simplest and most popular approach. You bet the same fixed amount — one “unit” — on every play regardless of confidence. A unit is typically 1-3% of your total bankroll.

Example: $1,000 bankroll → 1 unit = $10-$30

Pros: Simple, limits variance, easy to track

Cons: Doesn't capitalize on high-confidence spots

Best for: Beginners, anyone who wants steady growth without overthinking bet sizing

Tiered Unit Betting

A step up from flat betting. You assign 1-3 units based on the model's confidence level. Prediction Engine labels picks by tier — a high-confidence play with large edge gets 3 units, a standard play gets 1.

Example: 70%+ confidence = 3u, 60-69% = 2u, 55-59% = 1u

Pros: Allocates more capital to your best spots

Cons: Slightly more variance than flat

Best for: Intermediate bettors comfortable with the model's confidence tiers

Kelly Criterion

The Kelly Criterion is a mathematical formula that calculates the optimal bet size to maximize long-term bankroll growth. It accounts for both the probability of winning and the odds being offered.

Kelly % = (bp − q) / b
where b = decimal odds − 1, p = win probability, q = 1 − p

Example: Model says 60% win probability at +120 odds (decimal 2.2). Kelly = (1.2 × 0.60 − 0.40) / 1.2 = 0.267, or 26.7% of bankroll. In practice, full Kelly is extremely aggressive.

Pros: Mathematically optimal for growth, proven in finance and gambling theory

Cons: Full Kelly is volatile — a bad streak can draw down 50%+ of bankroll

Best for: Advanced bettors who trust their model's probability estimates

Fractional Kelly (Recommended)

Most professionals use half Kelly or quarter Kelly — you calculate the full Kelly percentage then bet 25-50% of it. This dramatically reduces variance while capturing most of the long-term growth.

Example: Full Kelly says 26.7% → Half Kelly = 13.3% → Quarter Kelly = 6.7%

Pros: Best risk/reward balance, still mathematically grounded

Cons: Slower growth than full Kelly (but you stay in the game)

Best for: Anyone who wants optimal sizing without the rollercoaster

Key Bankroll Rules

1. Never bet more than 5% of your bankroll on a single play. Even the highest-confidence pick can lose. Preservation is the game.

2. Track everything. Use the My Dashboard page to log bets, see your win rate by market, and identify which play types work best for you.

3. The model is not a guarantee. A 69% baseline means roughly 31 out of every 100 picks lose. Variance is real — trust the process over a large sample, not any single day.

4. Set a stop-loss. If you lose 10% of your bankroll in a single day, stop. The model produces fresh picks every day — there's always tomorrow.

Pricing & Plans

Prediction Engine offers a free trial so you can see the models in action before committing. After the trial, choose the plan that fits your style.

Weekly
$5/week
  • All daily predictions (MLB + NBA)
  • Live in-game trackers
  • Player props & team models
  • Performance analytics
  • Cancel anytime
Best Value
Monthly
$15/month

Save 25% vs. weekly

  • Everything in Weekly
  • Priority access to new features
  • Cancel anytime

Frequently Asked Questions

How accurate are the predictions?

Accuracy varies by market. Our MLB moneyline model has a 69.1% backtest baseline at the 60%+ confidence filter. NBA moneyline reaches 78.4% at 70%+ confidence. Live tracker models (K predictions, hits, player stats) achieve 82-94% over/under accuracy by the later checkpoints. All accuracy numbers are from backtesting on held-out test data — not cherry-picked.

What data do the models use?

Team models use game-level statistics from API-Sports (team records, odds, scores, head-to-head history) combined with engineered features like rolling averages, rest days, travel distance, and pitching matchups. Player prop models use individual box score data from the free MLB Stats API and nba_api. Live trackers use real-time play-by-play and box score feeds updated every 30-60 seconds.

How are “Top Picks” selected?

Top Picks are the highest-confidence predictions from each market that meet minimum thresholds. For example, the top 2 moneyline picks above 60% confidence, the top spread pick with a projected margin of 5+ points, or the top player prop with 65%+ probability. These curated picks are what the Performance page tracks and grades daily.

Can I see the model's track record?

Yes. The Performance page shows live win/loss records for every market, updated daily. It displays both the backtest baseline (historical accuracy) and live season accuracy, so you can see exactly how the model is performing in real time. Every graded pick is tracked with full transparency.

What sports do you cover?

Currently MLB and NBA. NFL and NHL are planned for their respective 2026 seasons. Each sport gets its own set of dedicated models — we don't reuse a generic model across sports.

What are the live trackers?

Live trackers are in-game prediction models that update in real time. The MLB Live K Tracker predicts a pitcher's final strikeout total as the game progresses. The MLB Live Hits Tracker predicts remaining hits for batters. The NBA Live Player Stats Tracker predicts final points, rebounds, and assists totals from quarterly checkpoints. Each tracker compares its prediction to sportsbook lines and generates over/under signals.

Do I need a sportsbook account?

Prediction Engine provides analysis and predictions — it is not a sportsbook and does not process bets. You would use the insights on whatever legal sportsbook is available in your jurisdiction. Users must be 21+ and comply with local gambling laws.

How do I cancel?

You can cancel your subscription at any time from your account dashboard or by contacting us. There are no long-term contracts or cancellation fees. If you cancel, you retain access until the end of your current billing period.

Is this guaranteed to make money?

No. No prediction model, no matter how sophisticated, can guarantee profit. Sports have inherent randomness, and even a 69% model loses 31% of the time. What Prediction Engine provides is a statistically validated edge over time. Responsible bankroll management and realistic expectations are essential. Never bet money you can't afford to lose.

Ready to get started?

Create a free account and see today's predictions.

Sign Up Free