Free Depression Test PHQ-9: How It Compares to Beck, CES-D, DASS-21, and Zung

Emily Mitchell, Senior Writer · Updated March 28, 2026

Searching for a free depression screener typically returns the same five options - and they are not interchangeable. The PHQ-9, Beck Depression Inventory-II, CES-D, DASS-21, and Zung Self-Rating Scale all measure depression, but they score on different scales, suit different populations, and carry different clinical weight.

Whether you're a busy professional, a parent, an HR coordinator, or someone tracking treatment progress, the right tool matters. Defaulting to whichever test loads first isn't a strategy. Understanding your options is - so here's a side-by-side breakdown of all five, with enough detail to make that choice confidently.

According to the National Institute of Mental Health (NIMH), self-assessment tools have real limits. They are screening aids, not diagnoses. But choosing a well-validated tool - and reading your score correctly - makes a meaningful difference in how useful the result will be.


Quick Comparison: Five Free Depression Screeners at a Glance

Test Questions Time to Complete Age Suitability Score Range Clinical Validation Best Use Case
PHQ-9 9 2-3 minutes Adults (18+) 0-27 Very High - 1,000+ peer-reviewed studies Primary care screening, telehealth
Beck Depression Inventory-II (BDI-II) 21 5-10 minutes Adults (13+) 0-63 High - widely used in psychiatric settings Outpatient psychiatric intake
CES-D 20 5-7 minutes Adults, older adults 0-60 High - especially validated for older populations Research studies, geriatric screening
DASS-21 21 5-10 minutes Adults (18+) 0-42 (per subscale) High - measures depression, anxiety, and stress Chronic illness, bipolar screening context
Zung Self-Rating Scale 20 5-7 minutes Adults 20-80 Moderate - older instrument, less current validation Historical baseline, general population surveys

Detailed Breakdown: What Makes Each Test Different

PHQ-9 - The Primary Care Standard

The PHQ-9 (Patient Health Questionnaire-9) was developed by Pfizer Inc., which holds the copyright and offers free licensing for both clinical and research use. That open licensing is the main reason it spread so widely across healthcare systems.

The PHQ-9's biggest advantage is direct alignment with DSM-5 diagnostic criteria for major depressive disorder. Each of the nine questions maps to one of the nine core diagnostic symptoms. When you bring a printed score to your doctor, they can read it immediately. No translation required.

This makes the PHQ-9 the default tool in primary care, most telehealth platforms, and employer wellness programs. Scores run from 0 to 27. Cutoff thresholds are well-established:

According to Pfizer, the PHQ-9 can be reproduced and used without permission for clinical, educational, or research purposes. That matters if you're an HR coordinator or clinician building a program around it.

Where the PHQ-9 Falls Short

The PHQ-9 has a specific blind spot. In people with chronic physical illness, somatic symptoms - fatigue and sleep disruption in particular - can inflate scores. These symptoms may stem from the illness itself rather than depression, which creates false positives in that population.

The PHQ-9 also lacks specificity for bipolar depression. Someone in a depressive episode of bipolar disorder may score high, but the PHQ-9 won't flag the hypomanic or manic history that changes the clinical picture. In those cases, a dedicated screening tool - such as the Mood Disorder Questionnaire (MDQ) - is a better first step. The DASS-21 is often a stronger choice when chronic illness or mixed symptom presentations are involved.

Beck Depression Inventory-II (BDI-II)

The BDI-II is more detailed than the PHQ-9. Its 21 items cover cognitive symptoms of depression in depth - hopelessness, self-criticism, indecisiveness. It takes longer to complete, but it captures dimensions the PHQ-9 can miss.

The BDI-II is the preferred tool in outpatient psychiatric settings - if you're seeing a psychiatrist or therapist, they're more likely to use or recognize this score than the PHQ-9. Scores run from 0 to 63. Cutoffs fall at roughly 14 (mild), 20 (moderate), and 29 (severe). These thresholds differ significantly from PHQ-9 cutoffs, and mixing up the two scales is a documented source of self-assessment errors.

CES-D - Center for Epidemiologic Studies Depression Scale

The CES-D was designed primarily for research and population-level studies. It performs particularly well for older adults. Its 20 questions ask how often symptoms occurred in the past week, scored from 0 to 3 per item. Total scores range from 0 to 60, with a commonly used cutoff of 16 or higher indicating significant depressive symptoms.

The CES-D is less likely to appear in a doctor's office but is widely used in academic studies. If you're participating in a research study or tracking symptoms over time in a structured way, the CES-D is a solid choice.

DASS-21 - Depression Anxiety Stress Scales

The DASS-21 measures three things at once: depression, anxiety, and stress. Each subscale has 7 items, and scores are doubled to compare against full DASS-42 norms. This makes it especially useful when someone isn't sure whether their symptoms are primarily depressive or anxiety-driven - a common situation.

For people with chronic illness, the DASS-21 outperforms the PHQ-9 because it separates psychological distress from somatic symptoms more cleanly. It's also useful as a first screen when bipolar disorder is a possibility, since anxiety and stress patterns can inform that conversation.

Zung Self-Rating Depression Scale

The Zung scale is one of the oldest standardized depression instruments. Its 20 items produce a score between 20 and 80. A score above 50 is generally considered a marker of depressive symptoms. The scale is still used in some clinical and survey contexts, but it has received less recent validation research than the other tools on this list.

Be especially careful about score confusion here. The Zung's 20-80 range looks nothing like the PHQ-9's 0-27 or the CES-D's 0-60, and misreading one scale as another can lead to serious misinterpretation. The American Psychological Association publishes psychometric standards governing how self-report scales like these should be validated and interpreted. Which scale you're using is not a detail to skim past.


Scoring Confusion: A Real Risk

Score misinterpretation is one of the most common - and least discussed - problems with self-administered depression tests. Each tool uses a different scale and different cutoff thresholds. A score of 16 means "minimal" on the BDI-II but signals "significant depression" on the CES-D. A score of 50 is meaningless on the PHQ-9 but indicates depressive symptoms on the Zung scale.

The National Institute of Mental Health (NIMH) notes that self-assessment tools should always be interpreted alongside professional guidance. If you take multiple tests and try to compare raw numbers, you risk drawing completely wrong conclusions.

Always note which test you took and use its specific scoring key - not a generic "depression score" framework.


Practical Verdict: Who Should Use Which Test

The Busy Professional - 2-Minute Screen

Use the PHQ-9. Nine questions, under three minutes, and a score your doctor will immediately understand if you decide to follow up. It's the right tool for a fast, clinically meaningful check-in.

A Teenager - Parent-Initiated Conversation

The BDI-II is validated for ages 13 and up and captures cognitive symptoms that matter in adolescence - such as hopelessness and self-worth. The PHQ-9 is primarily validated for adults. For teens, the BDI-II gives a more nuanced picture, and a mental health professional should always be involved.

HR Coordinator - Workplace Wellness Program

Use the PHQ-9. Pfizer's free licensing means it's legally clear to use without fees. Its brevity encourages participation. Its DSM-5 alignment means any employee who seeks professional follow-up arrives at the doctor with a usable score. Consider pairing it with a DASS-21 if the program includes stress and anxiety components.

Someone Tracking Treatment Progress

Use whichever tool your provider uses - consistency matters more than which test is "best." If you're tracking on your own, the PHQ-9 is the easiest to administer repeatedly and compare across time. The PHQ-9 is also embedded in many digital health apps for exactly this purpose.

Chronic Illness or Possible Bipolar History

Use the DASS-21 as a first screen. Its separation of depression, anxiety, and stress symptoms reduces the somatic inflation problem. Then discuss bipolar-specific screening tools with a psychiatrist before drawing conclusions about the depression subscale alone.

Research or Study Participation

Use the CES-D. It was designed for population research, performs well across age groups, and produces scores that are directly comparable to existing published literature. The PHQ-9 is increasingly common in newer research, but the CES-D remains a gold standard for longitudinal work.

Want to take the PHQ-9 right now? Start the free PHQ-9 depression test here. Or learn more about how these tests are used and what the results mean.

Most Homeowners Skip 9 of These 12 Tasks

Gutters in November. HVAC filter every 90 days. Water heater flush in spring. This one-page calendar has every maintenance task by month - just print it and follow along.

No matter which tool you choose, a self-administered depression screener is a starting point - not a diagnosis. Use your results to start a conversation with a qualified professional, not to end one.


Frequently Asked Questions

Is the PHQ-9 actually more accurate than other free depression tests, or just more popular?

Both. Popularity and validation are closely linked here. The PHQ-9 has more than 1,000 peer-reviewed studies behind it - more than any other free screener. But "accuracy" depends on the population. In general adult primary care settings, the PHQ-9's sensitivity and specificity are strong. For older adults, the CES-D may perform better. For mixed anxiety-depression presentations, the DASS-21 captures more nuance. The PHQ-9 is the right default for most people, but it's not universally superior across every group. (Source: American Psychological Association psychometric standards.)

Can I use my PHQ-9 score to compare results across different websites or apps?

Yes - with an important caveat. The PHQ-9 scoring is standardized, so a 12 is a 12 regardless of where you take it. However, some online implementations truncate questions, reword items, or change the response time frame. These modifications can shift your score in ways that make comparisons unreliable. For a score you can confidently compare over time or bring to a provider, use the official Pfizer-licensed version. Pfizer offers free licensing and makes the original questionnaire available for clinical and research use. Anything that looks significantly different from the nine core DSM-5 symptom questions should be treated with skepticism.

Which depression test do therapists and psychiatrists actually prefer you bring in - PHQ-9, Beck, or something else?

It depends on the setting. Primary care physicians and telehealth providers almost universally use the PHQ-9. Its brevity and DSM-5 alignment make it the default first-contact tool. In outpatient psychiatric and psychological settings, the Beck Depression Inventory-II (BDI-II) is more common because it captures cognitive symptoms in greater depth. If you're seeing a psychiatrist for the first time, a PHQ-9 printout is useful but a BDI-II score may carry more weight in that specific conversation. When in doubt, bring the PHQ-9 - it's the most universally recognized score across all provider types. (Source: National Institute of Mental Health.)

Is one test better for men vs. women?

Depression can present differently across sexes and genders. Some research suggests men are more likely to report somatic or irritability-based symptoms rather than sadness - and the PHQ-9 may undercount depression in some male populations as a result. The DASS-21's stress subscale sometimes captures male presentations more completely. That said, the PHQ-9 remains the recommended starting point for both men and women in primary care settings. If you feel your symptoms aren't captured well by the score, discuss that directly with a provider.

Can these tests diagnose depression on their own?

No. All five tools covered here are screeners, not diagnostic instruments. According to the National Institute of Mental Health (NIMH), a high score means your symptoms may warrant further evaluation - not that you definitively have major depressive disorder. A formal diagnosis requires a clinical interview, a review of symptom duration and functional impact, and ruling out other medical or psychiatric causes. Use your score to start a conversation with a healthcare provider, not as a self-diagnosis. A score of 10 or higher on the PHQ-9 is a signal to seek professional input.

How often should I retake a depression screener?

If you are in active treatment, your provider may administer the PHQ-9 at each appointment to track progress - typically every two to four weeks. For self-monitoring, retaking the test once per month is a reasonable interval for most people. Taking it daily can amplify anxiety and produce noisy results that are hard to interpret. If your score increases significantly between sessions, or if you are having thoughts of self-harm, contact a mental health professional promptly rather than waiting for your next scheduled appointment.

About this article

Researched and written by Emily Mitchell at depression tests. Our editorial team reviews depression tests to help readers make informed decisions. About our editorial process.