Compare the most common survey rating scales - Likert, NPS, star ratings, semantic differential, CSAT and CES - and learn which scale fits which measurement goal.
Rating scales turn opinions into numbers, and the scale you choose quietly shapes everything that follows - how respondents interpret the question, how you analyze the data, and how you benchmark over time. A five-point Likert scale, an eleven-point Net Promoter question, and a five-star rating are not interchangeable; each carries assumptions and trade-offs. This guide compares the most common rating scales so you can match the right one to your measurement goal.
Rating Scale Basics
A few design choices apply to every scale. Number of points: more points capture finer distinctions but increase cognitive load; five and seven points are popular sweet spots. Odd vs even: an odd number of points offers a neutral midpoint, while an even number forces a lean in one direction by removing the middle. Labeling: fully labeling each point improves clarity and consistency, whereas labeling only the endpoints gives respondents more interpretive freedom. Balance: positive and negative points should be symmetric so the scale does not tilt respondents one way. Getting these fundamentals right matters more than which named scale you pick.
The Likert Scale
The Likert scale measures agreement with a statement, classically on five points from "strongly disagree" to "strongly agree," sometimes extended to seven. It is the workhorse of attitude measurement - flexible, familiar, and easy to analyze. A true Likert scale combines several related Likert items into a composite score that measures one underlying attitude more reliably than any single item.
Strengths: respondents understand it instantly, and the data supports rich analysis (means, distributions, and reliability checks across items). Weaknesses: it is vulnerable to acquiescence bias (the tendency to agree) and to central-tendency responding (clustering on the neutral midpoint). To counter these, write some items in reverse, use item-specific wording where possible, and consider whether a neutral midpoint helps or just lets respondents avoid committing.
Net Promoter Score (NPS)
NPS asks a single question - "How likely are you to recommend us to a friend or colleague?" - on an eleven-point scale from 0 to 10. Respondents are grouped into Promoters (9-10), Passives (7-8), and Detractors (0-6). The score is the percentage of Promoters minus the percentage of Passives... minus Detractors: specifically, %Promoters minus %Detractors, producing a number from -100 to +100. Passives count toward the total but not the score.
NPS is popular because it is simple, universally understood, and widely benchmarked, making it ideal for tracking loyalty over time and comparing against industry norms. Its limitations are real: a single number hides the reasons behind it, the category cutoffs are somewhat arbitrary, and the same NPS can come from very different distributions. Always pair the score with an open follow-up asking why. If you want to deploy it properly, start from a dedicated NPS survey rather than rebuilding the scale by hand. Loyalty tracking is especially common among SaaS startups watching retention closely.
CSAT and Customer Effort Score
CSAT (Customer Satisfaction Score) asks how satisfied a customer is with a specific interaction, usually on a 1-to-5 (or 1-to-7) scale from "very dissatisfied" to "very satisfied." The score is often reported as the percentage of respondents choosing the top one or two boxes. CSAT is best for measuring satisfaction with a discrete touchpoint - a support ticket, a purchase, an onboarding step - immediately after it happens.
CES (Customer Effort Score) measures how much effort a customer had to expend, typically by rating agreement with a statement like "The company made it easy to handle my issue" on a 1-to-7 scale. CES is a strong predictor of future loyalty for service interactions, on the logic that reducing friction retains customers more reliably than delighting them. Use CSAT to gauge satisfaction and CES to diagnose friction; they complement each other.
Star Ratings and Semantic Differential
Star ratings (commonly 1 to 5 stars) are intuitive and familiar from e-commerce and app stores, which makes them excellent for public-facing reviews and quick product feedback. Their weakness is interpretation drift - one person's four stars is another's three - and a well-documented skew toward extreme ratings, so averages can be misleading without volume.
Semantic differential scales place opposite adjectives at each end (for example, "unreliable" to "reliable" or "cheap" to "premium") and ask respondents to mark where their impression falls, typically across five to seven points. They are powerful for brand and perception research because they capture the connotations of a product or company along multiple dimensions at once. They require careful selection of truly bipolar adjective pairs to work well, which makes them better suited to a considered market research survey than to a quick pulse check.
Choosing the Right Scale
Start from the decision, not the scale. To measure loyalty and benchmark it, use NPS. To measure satisfaction with a specific interaction, use CSAT. To diagnose friction in a process, use CES. To measure attitudes and beliefs across several items, use a Likert scale. For public reviews, use stars. For brand perception, use semantic differential. Then keep the scale consistent over time - changing the number of points or the labels mid-stream breaks your ability to compare trends. Finally, pilot any scale with a few real respondents to confirm the labels mean what you intend.
A subtle but important point concerns how you analyze scale data. Technically, most rating scales are ordinal: the points are ordered, but the gaps between them are not guaranteed to be equal, so the distance from "agree" to "strongly agree" may not match the distance from "neutral" to "agree." Strictly, that argues for reporting medians and frequency distributions rather than means. In practice many teams report averages of Likert and similar scales because they are easy to communicate and compare - which is acceptable as long as you also look at the full distribution. A 5-point scale averaging 3.0 could be a tidy cluster on the midpoint or a polarized split between 1s and 5s, and those two situations demand very different responses. Always plot the distribution, not just the average.
Mobile rendering deserves a thought too. An eleven-point NPS row can wrap awkwardly on a narrow screen, and a seven-point fully labeled Likert item can become cramped. Test every scale on a phone before launching, since a large share of survey responses now arrive on mobile devices, and a scale that looks fine on a desktop but breaks on mobile will quietly depress both completion and data quality. The best scale on paper is worthless if respondents misread it on the device they actually use.
Frequently Asked Questions
How is the NPS score calculated? Group respondents on the 0-10 scale into Promoters (9-10), Passives (7-8), and Detractors (0-6), then subtract the percentage of Detractors from the percentage of Promoters. The result ranges from -100 to +100; Passives are counted in the base but do not directly add to the score.
Should a rating scale have a neutral midpoint? It depends. An odd number of points gives respondents a genuine neutral option, which is honest when neutrality is a real stance. An even number forces a lean, which is useful when you specifically want to push respondents off the fence - but it can frustrate truly neutral respondents.
How many points should a Likert scale have? Five and seven points are the most common and well-tested choices. Five is simpler and faster; seven offers finer discrimination. Going beyond seven rarely improves reliability and increases respondent effort.
What is the difference between CSAT and NPS? CSAT measures satisfaction with a specific interaction immediately after it occurs, while NPS measures overall loyalty and willingness to recommend. CSAT is transactional and short-term; NPS is relational and used for long-term benchmarking.
Add any rating scale - Likert, NPS, CSAT, stars - to your survey in a few clicks with SurveyMaker.