Menu

Less chance. More data.

Statistics, news, analysis and guidance for informed sports decisions.

Statistics & Analytics

Power Rating

A numerical value that objectively rates the relative strength of a team, independent of its record. Used by professional bettors and bookmakers to predict game outcomes and set point spreads.

What Exactly Is a Power Rating in Sports?

A power rating is a numerical value that objectively measures the relative strength of a team, independent of its win-loss record. Rather than simply ranking teams by how many games they've won, power ratings attempt to quantify how good a team truly is based on the quality of their performance and competition faced. These ratings form the foundation of modern sports betting, handicapping, and sports analytics.

The simplest way to understand a power rating is to think of it as an answer to the question: "How many points better or worse than an average team is this team?" If an average team in the league is assigned a rating of zero, a strong team might be rated at +7 (seven points better than average), while a weak team might be rated at -5 (five points worse than average).

Team Power Rating Interpretation Implied Strength
Kansas City +8 8 points better than average Elite
Buffalo +5 5 points better than average Strong
Average Team 0 Exactly average Neutral
Houston -3 3 points worse than average Below Average
New England -6 6 points worse than average Weak

How Power Ratings Differ From Win-Loss Records

This distinction is crucial for understanding why power ratings matter. A team with a 10-6 record might actually be stronger than a team with a 12-4 record if the 10-6 team faced tougher competition. Power ratings account for strength of schedule—the quality of opponents a team has faced—which win-loss records do not.

Consider a hypothetical scenario: Team A goes 12-4 by dominating weak opponents, while Team B goes 10-6 by playing a brutal schedule of elite teams. Their records differ by two wins, but their actual strength might be nearly identical. A power rating system would capture this nuance; a simple record would not. This is why professional bettors and bookmakers can't rely on records alone—they need power ratings to find true value.


Where Did Power Ratings Come From and How Have They Evolved?

Historical Origins

Power ratings didn't emerge overnight. The concept evolved gradually from the earliest days of organized sports betting. In the mid-20th century, bookmakers and professional handicappers relied heavily on subjective judgment—what they called "the eye test." They would watch games, read statistics, and form opinions about which teams were truly strong. These subjective assessments formed the basis of the opening lines they offered.

However, subjective rankings had obvious flaws. They were inconsistent, subject to bias, and difficult to defend with data. As sports became more quantified and betting markets grew more sophisticated, there was a clear need for a more objective, systematic approach. This need gave birth to the modern power rating system.

From Manual Rankings to Modern Algorithms

The real turning point came with the emergence of data-driven methodologies in the 1980s and 1990s. Jeff Sagarin, a legendary sports statistician, developed one of the most influential power rating systems using an approach called the iterative linkages model. Sagarin's system became so respected that the NCAA used it to help determine the BCS (Bowl Championship Series) rankings for college football.

The evolution followed a clear trajectory:

  1. Old School (Pre-1990s): Expert handicappers using intuition and basic statistics
  2. Transitional (1990s-2000s): Hybrid systems combining expert judgment with statistical formulas
  3. Modern Era (2000s-Present): Sophisticated algorithms using advanced mathematics, machine learning, and comprehensive statistical databases

Today's power rating systems range from relatively simple formulas to incredibly complex algorithms. Some professionals still lean on "feel" and the eye test combined with data, while others rely entirely on automated systems that process thousands of data points per game. The common thread: all serious power rating systems aim for objectivity and measurability.


How Are Power Ratings Actually Calculated?

The Basic Methodology

The foundation of any power rating system is establishing a baseline. Most systems assign a power rating of 0 to an average team—typically the 16th or 17th best team in a 32-team league like the NFL. From there, every other team receives a rating that indicates how many points better or worse they are than this average.

The simplest formula is straightforward:

Difference in Power Ratings = Predicted Point Spread

If the Kansas City Chiefs have a power rating of +7 and the Houston Texans have a power rating of -3, the difference is 10 points. On a neutral field, the Chiefs would be predicted to win by 10. In a real game with home field advantage (typically 2.5-3 points), the home team's advantage is added to their power rating.

Component Definition Example
Baseline Team Average team in the league 16th-ranked NFL team
Baseline Rating Power rating of average team 0 points
Strong Team Rating Points better than average Chiefs at +7
Weak Team Rating Points worse than average Texans at -3
Spread Calculation Difference between two ratings +7 minus (-3) = 10-point spread
Home Field Advantage Added to home team's rating Typically +2.5 to +3

Advanced Statistical Approaches

While the basic concept is simple, professional power rating systems are far more sophisticated. The most respected approach is the iterative linkages model, which works like this:

  1. Start with initial ratings based on preseason assessments
  2. Play through the season using the ratings to predict each game's outcome
  3. Compare predictions to actual results and adjust ratings accordingly
  4. Repeat the entire season with adjusted ratings, making smaller adjustments each iteration
  5. Continue until ratings stabilize, meaning further iterations produce minimal changes

This method captures how teams actually perform while accounting for strength of schedule naturally. If a team's power rating predicts they should win by 5 but they actually win by 12, their rating increases. If they're predicted to lose by 3 but lose by 10, their rating decreases.

Advanced systems also break down power ratings into offensive and defensive components. A team might have an overall power rating of +6, but that could be composed of +8 for offense and -2 for defense. This granularity allows bettors to identify specific matchup advantages. Some systems even assign credit to defenses for defensive scores, ensuring that defensive touchdowns don't artificially inflate offensive ratings.

The Role of Strength of Schedule

Strength of schedule (SOS) adjustments are what separate sophisticated power rating systems from crude ones. Without SOS adjustments, a team that goes 10-0 against terrible opponents would appear stronger than a 9-1 team that played elite competition—obviously incorrect.

The adjustment works by evaluating every opponent's strength and incorporating that into the rating calculation. A win against a +5 team (elite) is worth more than a win against a -5 team (weak). The algorithm continuously recalibrates, recognizing that a team's strength can only be truly assessed relative to the quality of teams they've faced.


How Do Professional Bettors Actually Use Power Ratings?

Finding Value in Point Spreads

This is the core application. A professional bettor creates or obtains power ratings, then compares them to the bookmaker's published point spread. When there's a difference, there's potential value.

Example: Your power ratings say Team A should be favored by 7 points. The bookmaker has them favored by only 5 points. You found value—you can bet Team A at +5 when you believe they should be -7. This is called getting closing line value (CLV).

Over hundreds of bets, consistently getting CLV is how professional bettors profit. They might win only 52-53% of their bets, but if they're consistently getting +110 odds (standard -110 vig) on bets they believe have 54% win probability, they'll profit long-term.

Setting Opening Lines

Bookmakers use power ratings as their starting point for setting opening lines. When the NFL schedule is released, sportsbooks don't wait for public opinion—they immediately calculate power ratings for all 32 teams and generate point spreads for every game. These opening lines are typically available days or weeks before the games are played.

A bookmaker might use power ratings like this:

  • Chiefs power rating: +7
  • Texans power rating: -3
  • Difference: 10 points
  • Chiefs at home (add 2.5 points): Chiefs -12.5 opening line

The bookmaker then monitors how the public bets and adjusts the line to balance their risk. But that opening line came directly from power ratings.

Season-Long Adjustments

Power ratings aren't static. They're continuously updated based on actual performance, injuries, trades, and other factors. A team that starts the season with a +5 rating might be adjusted to +8 after winning its first four games convincingly, or down to +2 after key injuries.

The frequency and magnitude of adjustments vary by system. Some professionals make adjustments after every game, while others wait for patterns to emerge. The key is balancing responsiveness (reacting to new information) with stability (not overreacting to small sample sizes).


How Do Power Ratings Compare to Other Rating Systems?

Power Ratings vs. Elo Ratings

The Elo rating system, originally developed for chess, is sometimes used in sports as an alternative to power ratings. While both systems attempt to measure relative strength, they differ significantly in methodology and application.

Aspect Power Ratings Elo Ratings
Purpose Predict game margins and point spreads Predict win probability between competitors
Output Points better/worse than average Numerical rating (typically 1000-3000 range)
Calculation Margin of victory, strength of schedule Win/loss only, K-factor determines adjustment size
Home Field Often incorporated as separate adjustment Can be incorporated into rating change
Strength of Schedule Explicitly adjusted for Naturally incorporated through head-to-head results
Typical Use Sports betting, handicapping Chess, esports, some sports analytics
Adjustment Speed Varies by system, often flexible Fixed K-factor determines change magnitude
Best For Predicting margins and finding value Head-to-head matchup probability

When to use each: Power ratings are superior for sports betting because they predict point spreads directly. Elo ratings are better for head-to-head competition where you only care about win/loss probability, not margins.

Other Rating Alternatives

Beyond power ratings and Elo systems, several other approaches exist:

  • Expert Rankings: Subjective rankings by knowledgeable analysts. Strength: human judgment and context. Weakness: inconsistent and biased.
  • Simple Formulas: Basic equations like (Wins × 2) + (Point Differential) = Rating. Strength: transparent and easy to calculate. Weakness: ignores strength of schedule and context.
  • Machine Learning Models: Neural networks trained on historical data. Strength: can capture complex patterns. Weakness: black-box nature makes them hard to understand or adjust.

Professional bettors often use multiple systems and compare their outputs to identify consensus and outliers.


Can You Build Your Own Power Rating System?

Setting Your Baseline

Start by identifying what an "average" team looks like in your sport. In the NFL, this is typically the 16th-ranked team by consensus—roughly the middle of the league. Assign this team a power rating of 0.

Then, assign initial ratings to all other teams. For preseason ratings, you might use:

  • Previous season's final ratings as a starting point
  • Offseason changes (draft, free agency, coaching changes)
  • General consensus about team strength
  • Preseason expert rankings

Initial ratings typically range from about -5 to +5, since no team has played yet and there's maximum uncertainty. As the season progresses and teams play games, the range typically expands to -10 to +10 or wider.

Choosing Your Key Statistics

Different sports require different statistics. For NFL power ratings, professional bettors typically focus on:

  • Offensive yards per play: More predictive than total yards
  • Defensive yards per play allowed: Mirror of offensive metric
  • Turnover differential: Wins and losses, heavily weighted
  • Third-down conversion rate: Efficiency indicator
  • Red zone efficiency: Scoring when in position to score
  • Point differential: Final arbiter of performance

Some systems also factor in:

  • Strength of schedule (opponents' power ratings)
  • Home/away splits
  • Coaching changes or key injuries
  • Recent form (last 4-6 games weighted more heavily)

The key is consistency: decide on your metrics upfront and apply them uniformly to all teams.

Updating Ratings Throughout the Season

After each week of games, update your ratings based on actual results versus predicted results. If Team A was predicted to win by 5 but won by 12, increase their rating slightly. If they were predicted to win by 5 but lost by 3, decrease it.

The magnitude of adjustment matters. Most professionals use smaller adjustments early in the season (when sample size is small) and allow larger adjustments later (when trends are clearer). A common approach: adjust ratings by 0.1 to 0.5 points per game in Week 1, increasing to 0.5 to 1.5 points by Week 10.

Some professionals completely overhaul their ratings at the midseason or season-end point, recognizing that preseason assessments become obsolete.


What Are the Common Misconceptions About Power Ratings?

Misconception 1: Power Ratings Predict Winners Perfectly

This is the most dangerous myth. Power ratings are probabilistic, not deterministic. A power rating that predicts Team A to win by 7 doesn't mean Team A will win by exactly 7 or even that they'll definitely win.

Instead, it means: "There's a 50% chance Team A wins by more than 7 points, and a 50% chance they win by less than 7 or lose entirely." Variance exists in sports—upsets happen, injuries occur, and randomness plays a role.

Professional power rating systems like those used by ESPN and The Power Rank achieve accuracy rates of around 60-62% in predicting game winners, which is actually excellent but far from perfect. They're tools for finding value, not crystal balls.

Misconception 2: All Power Rating Systems Are Equal

They absolutely are not. The difference between a sophisticated system and a crude one is enormous. A system that accounts for strength of schedule, home field advantage, and recent form will vastly outperform one that simply divides total points by games played.

Similarly, expertise matters. A power rating system built by a professional bettor with 20 years of experience will typically outperform one built by someone new to sports analytics. The formulas might look similar, but the judgment calls about weighting, adjustments, and special circumstances differ substantially.

Misconception 3: You Can Ignore Recent Performance

Some bettors treat power ratings as static, unchanging measures. In reality, recent performance is crucial. A team that was rated +3 at the season's start but has won six straight games and just acquired a star player should have a significantly higher rating.

Conversely, a team that started strong but has lost four of its last five games and suffered key injuries should be downgraded. The best power rating systems are dynamic, reflecting the current reality of team strength, not historical averages.


What Are the Limitations of Power Ratings?

Incomplete Information

Power ratings are built on historical game results and statistics, but they can't capture everything that matters. A team's best player getting injured the day before a game won't be reflected in the rating until after the game is played. Coaching changes, trades, and other roster moves take time to influence ratings.

Some systems attempt to incorporate injury information, but it's inherently difficult. How much does losing a star wide receiver impact a team's strength? It depends on the replacement, the team's depth, the coach's system, and many other factors that resist quantification.

Market Efficiency

Here's the uncomfortable truth for power rating enthusiasts: professional bookmakers and other bettors are using similar systems. The NFL point spread market is one of the most efficient markets in all of sports. This means that by the time you've built a power rating system and identified value, the market may have already adjusted.

This is why many professional bettors focus on less-efficient markets (college sports, international leagues, live betting) where power ratings can provide more edge.

Sport-Specific Challenges

Some sports are easier to rate than others. The NFL, with 32 teams playing 17 games each, provides a large sample size and relatively stable environments. College football, with 130+ teams, is much harder to rate because teams play fewer games and strength of schedule varies wildly.

Individual sports (tennis, golf) are even more challenging because power ratings for one player might not account for surface preferences, recent injuries, or psychological factors.


Where Are Power Ratings Heading in the Future?

Advanced Machine Learning and AI

The next frontier in power ratings is machine learning. Rather than manually specifying which statistics matter and how much, algorithms can learn optimal weighting from historical data. Neural networks can identify non-linear relationships that human-designed formulas might miss.

Real-time adjustment is becoming more feasible too. Rather than updating power ratings once per week after games conclude, advanced systems could incorporate live data—in-game performance, injury reports, weather conditions—to continuously refine predictions.

Integration With Player Metrics

Future power rating systems will likely move beyond team-level statistics to individual player metrics. A power rating might account for:

  • Individual player strength ratings
  • Salary cap impact and roster construction efficiency
  • Injury probability for key players
  • Coaching system fit and player role clarity

This granularity would allow for more precise predictions, especially in situations where one or two players are crucial to team success.


Frequently Asked Questions About Power Ratings

What is the difference between a power rating and a power ranking?

Power ratings are numerical values (e.g., +7, -3) representing how many points better or worse than average a team is. Power rankings are ordinal lists ranking teams from best to worst. A power ranking might say "Team A is #1," while a power rating would say "Team A is +8." Power ratings are more useful for betting because they predict margins; rankings only show order.

How often should I update my power ratings?

Most professionals update after every game, but the magnitude of adjustment varies. Early in the season, make smaller adjustments because sample sizes are small. Later in the season, adjust more aggressively as patterns become clearer. Some bettors also make mid-season overhauls, completely recalculating ratings based on what they've learned.

Can power ratings be applied to individual sports like tennis or golf?

Yes, but they're less reliable. Individual sports have smaller sample sizes (one player plays maybe 20-30 matches per year), more variability, and more factors outside the player's control (surface, weather, opponent matchups). Some analysts use Elo ratings for individual sports instead, as they're more stable with smaller sample sizes.

What's the most important statistic for building power ratings?

Point differential (points scored minus points allowed) is typically the most predictive single statistic. However, strength of schedule adjustments and context matter enormously. A team with +5 point differential against weak opponents is weaker than a team with +3 point differential against elite opponents. The best systems use multiple statistics and adjust for schedule strength.

How accurate are professional power rating systems?

The best systems achieve about 60-62% accuracy in predicting game winners, which is significantly better than random chance (50%) but far from perfect. Accuracy in predicting exact margins is lower. These systems are designed to find value in point spreads, not to predict winners perfectly.

Should I use power ratings from an established service or build my own?

Established services (Jeff Sagarin, Ken Pomeroy, ESPN Power Ratings) have the advantage of expertise and resources. Building your own allows customization and control. Many professional bettors do both: use established ratings as a baseline and overlay their own adjustments based on factors they believe the public systems miss.

How do power ratings account for home field advantage?

Most systems add 2.5-3 points to the home team's power rating when calculating the predicted spread. Some systems calculate home field advantage separately for each team, recognizing that some teams play much better at home than others. Advanced systems also account for travel distance, altitude, and other environmental factors.

Can power ratings predict upsets?

Power ratings can identify when an upset is likely (when the underdog is much closer in power rating than the spread suggests), but they can't predict which games will be upsets. A 7-point underdog with a power rating only 4 points worse than the favorite is a good value bet, but they'll still lose most of the time. Upsets are inherent to sports; power ratings help you find value when they occur.


Related Terms