- Research article
- Open Access
Validation of the Patient Health Questionnaire-9 (PHQ-9) and PHQ-2 in patients with migraine
The Journal of Headache and Painvolume 16, Article number: 65 (2015)
Psychiatric problems have been commonly reported in patients with migraine. This study investigated the reliability and validity of the Patient Health Questionnaire-9 (PHQ-9) and Patient Health Questionnaire-9 (PHQ-2) in patients with migraine.
Patients with migraine (with or without aura) were consecutively recruited from our headache clinic. They completed several instruments, including the Mini International Neuropsychiatric Interview-Plus Version 5.0.0 (MINI), the PHQ-9, the Beck Depression Inventory-II (BDI-II), the Migraine Disability Assessment Scale (MIDAS), the Headache Impact Test-6 (HIT-6), and the Migraine-Specific Quality of Life (MSQoL).
Among 132 participants, 39 patients (29.5 %) had a major depressive disorder (MDD) as determined by the MINI. Cronbach’s α coefficients for the PHQ-9 and PHQ-2 were 0.894 and 0.747, respectively. At a cutoff score of 7, the PHQ-9 had a sensitivity of 79.5 %, a specificity of 81.7 %, a positive predictive value (PPV) of 64.6 %, and a negative predictive value (NPV) of 90.5 %. At a cutoff score of 2, the PHQ-2 had a sensitivity of 66.7 %, a specificity of 90.3 %, a PPV of 74.3 %, and a NPV of 86.6 %. The scores of the PHQ-9 and PHQ-2 well correlated with the BDI-II score, the MIDAS score, the HIT-6 score, and the MSQoL score.
The PHQ-9 and PHQ-2 are both reliable and valid screening instruments for MDD in patients with migraine.
Approximately 10–15 % of the general population is affected by migraines, which are characterized by recurrent attacks of severe pulsating headaches lasting 4–72 h . Migraine is the sixth highest cause of disability worldwide . Patients with migraine are more likely to develop depression than those without migraine. In a review of the literature, the prevalence of depression varied from 8.6 to 47.9 % in patients with migraine . The overall risk of developing depression was 2.2 times higher in patients with migraine .
Comorbidity with psychiatric disorders raises the global burden of migraine. Disability and health related quality of life (HRQoL) impairment in patients with migraine is greater when migraine is associated with either depression or anxiety [4, 5]. Individuals with migraine and comorbid psychiatric disorders use more health resource than those with migraine alone . In addition, the presence of psychiatric problems is a risk factor for transformation to chronic form of migraine  and seem to play a role in the evolution of migraine to medication overuse headache (MOH) . Therefore, the early diagnosis and treatment of depression is important for the proper management of patients with migraine. For these purposes, a simple, rapid screening instrument to detect depression is a prerequisite, especially in a busy clinical setting.
The Patient Health Questionnaire-9 (PHQ-9) is a valuable screening instrument for detecting a major depressive disorder (MDD)  based on diagnostic criteria from the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) , and it is generally useful in headache studies . The Patient Health Questionnaire-2 (PHQ-2), which includes the first two items of the PHQ-9, is also a valuable instrument . Although these instruments were validated in primary care patients, their usefulness in patients with migraine is unknown. Therefore, the aim of this study was to investigate the value of the PHQ-9 and PHQ-2 as screening tools in patients with migraine.
Subjects in this study were patients with consecutive visits to the headache clinic in the Department of Neurology at Kyungpook National University Hospital between April and November of 2014. The patients, ranging from 16 to 70 years of age, all had a current diagnosis of migraine and did not take preventive medicines for migraine or other psychotropic agents. A diagnosis of migraine was based on the International Classification of Headache Disorders, 3rd edition, beta version by a trained neurologist (S.P. Park) . Patients were excluded if they were unable to cooperate in the psychiatric interview or had difficulty understanding the questionnaire because of illiteracy, mental retardation, serious medical, neurological, or psychiatric disorders, and alcohol or drug abuse. Patients with a probable migraine and those declining the interview were also excluded.
A cross-sectional study was approved by the institutional review board of Kyungpook National University Hospital, and all subjects provided written informed consent prior to the study. Patients were interviewed by S.P. Park, who also reviewed the medical charts to collect demographic, social, and clinical information for a database. Sociodemographic data included age, gender, education, employment, household income (earning more or less than three million KRW per month, equivalent to 2,800 USD per month), and marital states (married, unmarried, divorced, and bereaved). Clinical data included the type of migraine, migraine chronicity (episodic migraine [EM] or chronic migraine [CM]), MOH, age at onset, disease duration, attack frequency, attack duration, and family history. A family history of migraine was defined as an existing diagnosis of migraine in a lineal ascendant and siblings.
To measure the reliability of the PHQ-9 and PHQ-2 in eligible subjects, a neuropsychologist examined their MDD using the Mini International Neuropsychiatric Interview-Plus Version 5.0.0 (MINI) . Subsequently, patients provided several self-reported questionnaires, including the Beck Depression Inventory-II (BDI-II) , the Migraine Disability Assessment Scale (MIDAS) , the Headache Impact Test-6 (HIT-6) , and the Migraine-Specific Quality of Life (MSQoL) , to examine the validity of the PHQ-9 and PHQ-2.
Interview and questionnaires
Mini International Neuropsychiatric Interview-Plus Version 5.0.0 (MINI)
The MINI is an internationally validated brief structured interview used extensively as a diagnostic tool for psychiatric disorders from the DSM-IV and the International Classification of Diseases-10. The reliability and validity of this instrument is well established , and the Korean translation is also validated . The Kappa value of MDD was 0.71, indicating a moderate and substantial agreement between the MINI and the expert’s diagnoses.
Patient Health Questionnaire-9 (PHQ-9) and Patient Health Questionnaire-2 (PHQ-2)
The PHQ-9 and PHQ-2 were designed for use in primary care patients [8, 11]. The PHQ-9 includes nine items pertaining to the DSM-IV criteria for MDD : (1) anhedonia; (2) depressed mood; (3) trouble sleeping; (4) feeling tired; (5) change in appetite; (6) guilt, self-blame, or worthlessness; (7) trouble concentrating; (8) feeling slowed down or restless; and (9) thoughts of being better off dead or hurting oneself . Each item is rated on a 4-point scale from 0 to 3 (0 - never; 1 – several days; 2 - more than half the time; and 3 - nearly every day) during the two weeks prior to and including the day of survey completion. The overall scores ranged from 0 to 27. At a cutoff score of 9, the PHQ-9 had a sensitivity of 88 % and a specificity of 88 % for detecting MDD compared with a structured psychiatric interview . The PHQ-2 includes only the first two items in the PHQ-9, which are critical for the diagnosis of MDD . The overall scores ranged from 0 to 6. At a cutoff score of 2, the PHQ-2 had a sensitivity of 83 % and a specificity of 92 % for detecting MDD . The PHQ-9 was translated into Korean language, and was freely downloadable on the PHQ website (http://www.phqscreeners.com/) . The translated version was back translated into English by a Korean English teacher. Finally, the two versions were compared by a native English speaker who concluded that they were identical. Thereafter, we administered it to 20 Korean patients with migraine to evaluate potential problems in comprehension or cultural differences. No further adaptations were required.
Beck Depression Inventory-II (BDI-II)
The BDI-II is a commonly used self-rating scale for depression symptoms . Patients score 21 items on a scale from 0 to 3 according to how they felt during the previous 2 weeks. The total scores ranged from 0 to 63. The Korean version of the BDI-II has been validated . Cronbach’s α coefficient was 0.834 in depressive patients and 0.88 in healthy subjects. At a cutoff score of 22, the BDI-II had a sensitivity of 94 % and specificity of 98 % for detecting MDD compared with a structured psychiatric interview.
Migraine Disability Assessment Scale (MIDAS)
The Korean version of the MIDAS, a five-item questionnaire designed to evaluate disability within during the previous 3 months, was used in this study . Patients were asked to report decreased performance in the domains of work/school, household work, and family/social activities. Scores (0–27) measure the overall level of disability: Grade I (0–5), Grade II (6–10), Grade III (11–20), and Grade IV (above 21). Cronbach’s α value was 0.75.
Headache Impact Test-6 (HIT-6)
The HIT-6 was developed in the United States to measure a wider spectrum of headache-induced burden . Items in the HIT-6 cover several HRQOL domains: pain, social functioning, role functioning, vitality, cognitive functioning, and psychological distress. Each item is answered on a 5-point Likert scale (6 = never, 8 = rarely, 10 = sometimes, 11 = very often, 13 = always). The total scores ranged from 36 to 78; larger scores indicate greater impact. For interpretation, HIT-6 scores are categorized in four groups: scores of ≤49 indicate little or no impact; scores between 50 and 55 indicate some impact; scores between 56 and 59 indicate a substantial impact; and scores ≥60 indicate a severe impact . The Korean version of the HIT-6 was validated and Cronbach’s α coefficient was 0.85 .
Migraine-Specific Quality of Life (MSQoL)
The MSQoL, developed by Wagner et al., is a valid and reliable tool for clinical migraine research . A Korean translation of this 25-item questionnaire has been validated . The items are rated on a 4-point scale (1–4). The total scores ranged from 25 to 100. A lower total score indicates poorer QOL. Cronbach’s α value was 0.93.
The Statistical Package for the Social Sciences (SPSS version 21.0) was used for data analysis. The Med Calc 8.0 was used to perform receiver operating characteristic (ROC) analyses, which measure sensitivity, specificity, positive predictive values (PPVs) and negative predictive values (NPVs). ROC analyses for the PHQ-9 and PHQ-2, over a range of cutoff scores, were performed for comparison to MDD diagnoses by the MINI. Optimal cutoff scores were also computed using criteria that minimize the Euclidean distance from point (sensitivity and specificity) to point in the x-y plane. Descriptive statistics are presented as counts, percentages, means, and standard deviations. Independent t tests, Mann–Whitney U tests, and Chi-square tests were used to compare continuous or categorical variables. Cronbach’s α coefficient was computed to ascertain internal consistency and was recalculated after items were removed. Nonparametric correlations (Spearman’s ρ) were used to determine the validity of the PHQ-9 and PHQ-2. The level of statistical significance was set at p < 0.05.
Of the 185 patients who consecutively visited our headache clinic, 53 were excluded due to probable migraine (n = 21), taking preventive medicines for migraine or psychotropic agents (n = 10), illiteracy (n = 5), age older than 70 (n = 3), and refusal to take part in the study (n = 14). The 132 remaining patients were eligible for this study. Of them, 73 patients (55.3 %) had CM and 36 patients (27.3 %) exhibited MOH. According to the MINI, 39 patients (29.5 %) were diagnosed with MDD. The relationship between MDD demographic, clinical, and psychosocial characteristics are listed in Table 1. Patients with MDD were less likely to be employed and more likely to have a low household income than those without MDD. Patients with MDD had a higher risk of developing CM and phonophobia than those without MDD. Patients with MDD exhibited higher scores on the PHQ-9, the BDI-II, the MIDAS, and the HIT-6, a lower score on the MSQoL than those without MDD.
The subjects completed the PHQ-9 without any difficulties in comprehending and replying to the questions. Cronbach’s α coefficients for the PHQ-9 and PHQ-2 were 0.894 and 0.747, respectively, indicating excellent internal consistency. As shown in Table 2, all items in the PHQ-9 were significantly and positively associated with the total PHQ-9 score, and the α did not decrease if items were deleted. The ROC analyses of the PHQ-9 and PHQ-2 are shown in Table 3 and the ROC curves are illustrated in Fig. 1. ROC analysis of the PHQ-9 determined an area under the curve (AUC) of 0.882 (95 % CI = 0.818-0.947; SE = 0.033; p < 0.001). At a cutoff score of >7, the PHQ-9 sensitivity was 79.5 % and specificity was 81.7 %, with a PPV of 64.6 % and an NPV of 90.5 %. For our patients, the MDD frequency was 36.4 % using a cutoff score of 7. ROC analysis of the PHQ-2 revealed an AUC of 0.876 (95 % CI = 0.814–0.938; SE = 0.032; p < 0.001). At a cutoff score >2, the PHQ-2 sensitivity was 66.7 % with a specificity of 90.3 %, a PPV of 74.3 %, and a NPV of 86.6 %. MDD frequency was 26.5 % at a cutoff score of 2.
The validity of the PHQ-9 and PHQ-2 are shown in Table 4. The PHQ-9 score is well correlated with the BDI-II score (Spearman’s ρ = 0.754, p < 0.001), the MIDAS score (Spearman’s ρ = 0.377, p < 0.001), the HIT-6 score (Spearman’s ρ = 0.519, p < 0.001), and the MSQoL score (Spearman’s ρ = −0.538, p < 0.001). The PHQ-2 score was also well correlated with the BDI-II score (Spearman’s ρ = 0.739, p < 0.001), the MIDAS score (Spearman’s ρ = 0.396, p < 0.001), the HIT-6 score (Spearman’s ρ = 0.556, p < 0.001), and the MSQoL score (Spearman’s ρ = −0.580, p < 0.001).
To our knowledge, this is the first study investigating the usefulness of the PHQ-9 and PHQ-2 as screening instruments in patients with migraine. We found that the PHQ-9 and PHQ-2 were easily comprehended and quickly completed by the patients. Furthermore, they had excellent internal consistency reliability (Cronbach’s α =0.894 for the PHQ-9 and Cronbach’s α =0.747 for the PHQ-2). The validity of the PHQ-9 and PHQ-2 was determined by correlation with scores from the BDI-II, the MIDAS, the HIT-6, and the MSQoL. Together, these data suggest that both the PHQ-9 and PHQ-2 are useful screening instruments for the diagnosis of MDD in patients with migraine.
Although there has not yet been a study to validate the PHQ-9 in patients with migraine, many validation studies have been conducted for patients in primary care and hospital settings. The initial validation study for the PHQ-9, conducted in primary care patients, had a Cronbach’s α of 0.89, a sensitivity of 88 %, and a specificity of 88 % at a cutoff score of 9 . In a Korean study of primary care patients, Cronbach’s α was 0.852, sensitivity was 90.9 % and specificity was 87 % using a cutoff score of 8 . While the reliability in our study is consistent with these reports, the sensitivity, specificity, and cutoff scores were all lower. A 2012 meta-analyses included eighteen validation studies from primary care, specialized secondary care services (brain injury, cardiology, stroke, and nephrology), and the community . Eleven of the studies provided details about the diagnostic properties of the questionnaire and the pooled sensitivity ranged from 62 % with a cutoff score of 14 to 89 % using a cutoff score of 10. Pooled specificity results ranged from 73 % with a cutoff score of 6 to 96 % with a cutoff score of 14 . There were no substantial differences in the pooled sensitivity and specificity for cutoff scores from 7 to 10. The cutoff score, sensitivity, and specificity of the PHQ-9 determined in our study are consistent with the literature.
The PHQ-2 has not been as frequently validated as the PHQ-9. The initial PHQ-2 validation study was conducted on primary care patients and it reported a sensitivity of 83 % and specificity of 92 % at a cutoff score of 2 . A Korean study in a tertiary care hospital determined a sensitivity of 91.9 % and specificity of 100 % at a cutoff score of 2 . In a neurologic field, a validation study for patients with Parkinson’s disease documented a sensitivity of 75 % and a specificity of 89 % at a cutoff score of 2 . In comparison, our study established a lower sensitivity for the PHQ-2 at the same cutoff score. Our study also showed that the PHQ-2 had a lower sensitivity and a higher specificity than the PHQ-9. Therefore, we should be cautious interpreting results of the PHQ-2 when establishing the frequency of MDD in patients with migraine.
We reported that MDD frequency was 36.4 % when we applied a cutoff score of 7. However, the frequency was 25.8 % when we used a cutoff score of 9 in the initial validation study . Using a cutoff score of 9 excludes 10.6 % of the patients from the diagnosis of MDD. This suggests that the PHQ-9 validation should be performed for each study settings (primary care setting or hospital setting) and specific disease groups. For example, a validation study of the PHQ-9 for patients with Parkinson’s disease in the US reported that a cutoff score of 5 was appropriate for detecting MDD . If a cutoff score of 9 were applied to these patients, many would be excluded from the diagnosis of MDD. We also recommend validating the PHQ-9 for use in different languages and countries due to the linguistic characteristics of each country. For example, a rapid screening instrument for detecting MDD in people with epilepsy, the Neurological Disorders Depression Inventory for Epilepsy, had different cutoff scores when it was validated in different languages . Differences in the cutoff score may also be influenced by cross-cultural differences during validation. For example, individuals with an Asian background are more likely to express themselves conservatively, leading to the possibility that Korean patients with migraine are less likely to report depression symptoms. Given these possibilities, we should encourage clinicians to translate and validate the PHQ-9 according to specific diseases, native languages, and cultural differences.
There are several limitations to our study. First, the PHQ-9 and PHQ-2 provide only a probable diagnosis of MDD that should be investigated by further evaluation. Second, a cutoff score of 7 in the PHQ-9 had a PPV of 64.6 %, which may lead to false-positive results. Third, our study validated the Korean version of the PHQ-9 and PHQ-2 in Korean patients with migraine and their diagnostic properties may be different from those in other languages and countries.
Patients with migraine are more likely to develop depression than those without migraine . Comorbid depression in patients with migraine may have important clinical implications.
In a busy clinical setting, psychiatric interviews take a long time to conduct. Therefore, the application of the PHQ-9 and PHQ-2 could lead to a better recognition of depression in patients with migraine. Furthermore, because the PHQ-9 and PHQ-2 are quite simple and brief, they could be useful to detect the presence of depression in many neurologic disorders.
Health related quality of life
medication overuse headache
Patient Health Questionnaire
Major depressive disorder
Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition
Mini International Neuropsychiatric Interview
Beck Depression Inventory
Migraine Disability Assessment Scale
Headache Impact Test
Migraine-Specific Quality of Life
Receiver operating characteristic
Positive predictive values
Negative predictive values
Lipton RB, Bigal ME, Diamond M, Freitag F, Reed ML, Stewart WF, AMPP Advisory Group (2007) Migraine prevalence, disease burden, and the need for preventive therapy. Neurology 68:343–9
Steiner TJ, Birbeck GL, Jensen RH, Katsarava Z, Stovner LJ, Martelletti P (2015) Headache disorders are third cause of disability worldwide. J Headache Pain 16:544
Antonaci F, Nappi G, Galli F, Manzoni GC, Calabresi P, Costa A (2011) Migraine and psychiatric comorbidity: a review of clinical findings. J Headache Pain 12:115–25
Lantéri-Minet M, Radat F, Chautard MH, Lucas C (2005) Anxiety and depression associated with migraine: influence on migraine subjects’ disability and quality of life, and acute migraine management. Pain 118:319–26
Kim SY, Park SP (2014) The role of headache chronicity among predictors contributing to quality of life in patients with migraine: a hospital-based study. J Headache Pain 15:68
Lipton RB (2009) Tracing transformation: chronic migraine classification, progression, and epidemiology. Neurology 72(suppl 5):S3–S7
Innamorati M, Pompili M, Erbuto D, Ricci F, Migliorati M, Lamis DA, Amore M, Girardi P, Martelletti P (2015) Psychometric properties of the stagnation scale in medication overuse headache patients. J Headache Pain 16:1052
Kroenke K, Spitzer RL, Williams JB (2001) The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 16:606–13
American Psychiatric Association (1987) Diagnostic and Statistical Manual of Mental Disorders.Third ed. American Psychiatric Association, Washington DC [Revised]
Peng KP, Wang SJ (2012) Migraine diagnosis: screening items, instruments, and scales. Acta Anaesthesiol Taiwan 50:69–73
Kroenke K, Spitzer RL, Williams JB (2003) The patient health questionnaire-2: validity of a two-item depression screener. Med Care 41:1284–92
Headache Classification Committee of the International Headache Society (2013) The international classification of headache disorders. 3rd edition (beta version). Cephalalgia 33:629–808
Yoo SW, Kim YS, Noh JS, Oh KS, Kim CH, Namkoong K, Chae JH, Lee GC, Jeon SI, Min KJ, Oh DJ, Joo EJ, Park HJ, Choi YH, Kim SJ (2006) Validity of Korean version of the MINI-International Neuropsychiatric Interview. Anxiety Mood 2:50–5
Sung HM, Kim JB, Park YN, Bai DS, Lee SH, Ahn HN (2008) A study on the reliability and the validity of Korean version of the Beck Depression Inventory-II (BDI-II). J Korean Soc Biol Ther Psychiatry 14:201–12
Lee HS, Chung CS, Song HJ, Park HS (2000) The reliability and validity of the MIDAS (Migraine Disability Assessment) Questionnaire for Korean migraine sufferers. J Korean Neurol Assoc 18:287–91
Chu MK, Im HJ, Ju YS, Yu KH, Ma HI, Kim YJ, Kim J, Lee BC (2009) Validity and Reliability Assessment of Korean Headache Impact Test-6 (HIT-6). J Korean Neurol Assoc 27:1–6
Moon HS, Chung CS, Lee HS, Park HS, Kim SW, Woo HW (2003) The reliability and validity of the migraine-specific quality of life questionnaire for Korean migraine suffers. J Korean Neurol Assoc 21:146–55
Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, Hergueta T, Baker R, Dunbar GC (1998) The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry 59(suppl 20):S22–S33
Pfizer. Patient Health Questionnaire (PHQ) screeners. http://www.phqscreeners.com/. [accessed Sep 2013].
Beck AT, Steer RA, Brown GK (1996) BDI-II Manual. Psychological Corp, San Antonio
Kosinski M, Bayliss MS, Bjorner JB, Ware JE Jr, Garber WH, Batenhorst A, Cady R, Dahlöf CG, Dowson A, Tepper S (2003) A six-item short-form survey for measuring headache impact: the HIT-6. Qual Life Res 12:963–74
Bayliss M, Batenhorst A (2002) The HIT-6™ A User’s Guide. Quality Metric Incorporated, Lincoln, RI.
Wagner TH, Patrick DL, Galer BS, Berzon RA (1996) A new instrument to assess the long-term quality of life effects from migraine: development and psychometric testing of the MSQOL. Headache 36:484–92
Choi HS, Choi JH, Park KH, Joo KJ, Ga H, Ko HJ, Kim SR (2007) Standardization of the Korean Version of Patient Health Questionnaire-9 as a screening instrument for major depressive disorder. J Korean Acad Farm Med 28:114–9
Manea L, Gilbody S, McMillan D (2012) Optimal cut-off score for diagnosing depression with the Patient Health Questionnaire (PHQ-9): a meta-analysis. CMAJ 184:E191–6
Shin JH, Kim HC, Jung CH, Kim JB, Jung SW, Cho HJ, Jung SH (2013) The standardization of the Korean version of the Patient Health Questionnaire-2. J Korean Neuropsychiatr Assoc 52:115–21
Chagas MH, Crippa JA, Loureiro SR, Hallak JE, Cd M-G, Machado-de-Sousa JP, Rodrigues GR, Filho AS, Sanches RF, Tumas V (2011) Validity of the PHQ-2 for the screening of major depression in Parkinson’s disease: two questions and one important answer. Aging Ment Health 15:838–43
Williams JR, Hirsch ES, Anderson K, Bush AL, Goldstein SR, Grill S, Lehmann S, Little JT, Margolis RL, Palanci J, Pontone G, Weiss H, Rabins P, Marsh L (2012) A comparison of nine scales to detect depression in Parkinson disease: which scale to use? Neurology 78:998–1006
Zis P, Gatzonis S (2014) Estimating the diagnostic value of the Neurological Disorders Depression Inventory for Epilepsy in different languages. Epilepsia 55:941
The authors thank Ju-Hui Lee, a neuropsychologist, for conducting the MINI-Plus 5.0.0 and helping in the completion of self-report questionnaires.
This work was supported by Biomedical Research Institute grant, Kyungpook National University Hospital (2014).
The authors declare that they have no competing interests.
SPP took part in the design of the study, contributed to the data collection. JGS and SPP participated in writing the manuscript. JGS was responsible for data statistics. All authors agreed to accept equal responsibility for the accuracy of the content of the paper. Both authors read and approved the final manuscript.