Skip to main content

Psychometric properties of self-report measures of eating disorder cognitions: a systematic review

Abstract

Background

Although eating disorder (ED) models display some differences in theory and treatment approach, cognitive-behavioural, schema-focused, and disorder-specific models all highlight the fundamental nature of cognitions as key factors in ED development and maintenance processes. As such, it is vital that ED cognitions continue to be assessed and monitored as therapeutic targets and treatment outcomes as well as being examined as constructs in empirical research. This review aimed to systematically identify and evaluate the psychometric properties of existing self-report measures of ED cognitions.

Methods

A systematic review protocol was registered using the international prospective register of systematic reviews (PROSPERO; CRD42023440840). Included studies described the development, validation and/or the psychometric evaluation of a measure (or subscale) that was specifically developed to solely assess ED cognitions (that is thoughts, expectations, assumptions, or beliefs), in English-speaking, adult populations. The search was conducted using three electronic databases: PsycINFO, MedLine, and Embase. Two independent reviewers conducted screening, selection and evaluation of the psychometric properties of relevant measures using a standardised, well-established quality appraisal tool.

Results

Of the initial search of 7581 potential studies, 59 met inclusion criteria and described the psychometric evaluation of 31 measures (or subscales) of ED cognitions. The findings from the current review indicate that of the included measures, none currently meet all nine criteria of adequate psychometric properties. The Eating Beliefs Questionnaire (EBQ; and EBQ-18), and the Eating Disorder Inventory Body Dissatisfaction subscale (EDI [BD]) currently possess the most evidence supporting their validity, reliability, and clinical utility.

Conclusions

The findings of the current systematic review provide guidance for future researchers to focus efforts on improving evidence for the validity, reliability and utility of self-report measures of ED cognitions. Overall, the present study has provided a detailed and systematic evaluation to support researchers and clinicians in future selection of measures of ED cognitions dependent on the specific aims of their research and treatment.

Plain English summary

Theoretical and empirical research suggests that eating disorder (ED) cognitions (that is, thoughts, expectations, assumptions, and beliefs) are important factors contributing to the development and maintenance of EDs. As such, it is important to continue to assess and monitor these in research and in clinical practice. Valid, reliable and useful assessment tools are critical in this process. This systematic review aimed to identify and evaluate the psychometric properties of existing self-report measures of ED cognitions. Included studies described the development, validation and/or the psychometric evaluation of a measure specifically developed to solely assess ED cognitions. Three electronic databases were searched, and studies screened and selected by two independent reviewers. The included studies and self-report measured were evaluated using a standardised tool to evaluate their psychometric properties. Fifty-nine studies were included, that identified a total of 31 measures (or subscales) assessing ED cognitions. The findings from the current review indicate that none of the included measures currently meet all nine criteria of adequate psychometric properties. The EBQ, EBQ-18, and EDI-III (BD) currently have the most evidence in support of their psychometric properties. This study provides direction for future researchers to focus efforts on improving evidence for the validity, reliability and utility of these self-report measures. Overall, we provided information to support researchers and clinicians in future selection of measures of ED cognitions.

Introduction

The prevalence and impact of eating disorders (EDs) continues to increase worldwide [2, 15, 32, 73]. Currently, the Diagnostic and Statistical Manual of Mental Disorders [1] outlines the criteria for several disorders, including but not limited to anorexia nervosa (AN), bulimia nervosa (BN), binge eating disorder (BED), and other specified feeding and eating disorder (OSFED). The functional impact these disorders has been well established, with existing literature highlighting that EDs are associated with higher medical and psychiatric comorbidity, reduced quality of life and increased risk of mortality [2, 90, 91]. Despite the existence of various treatment models, and some evidence of increased treatment seeking [73], treatment outcomes are varied, and efficacy typically only achieves modest gains [92, 93]. As such, examining the key components implicated in both the development and maintenance of EDs remains critical.

Theoretical ED models emphasise the importance of ED relevant cognitions as both development and maintenance factors [21, 27, 94]. Cognitions implicated in the development and maintenance of EDs include thoughts, expectations, assumptions, and conditional and unconditional beliefs. From a transdiagnostic perspective, Fairburn et al.’s [27] cognitive-behavioural model of EDs highlights the role of over-evaluation of eating, weight, shape and their control, core low self-esteem, as well as emphasising the role of unconditional, often pervasive, and negative views of the self (e.g., negative, self-referent beliefs [‘I am unlovable]). Core low self-esteem and negative self-beliefs contribute to an increasingly dysfunctional schema for self-evaluation and thereby contribute to the over-evaluation of eating, weight, shape, and their control. The over-evaluation of these features and their importance is often central in the initial emergence and maintenance of ED behaviours, such as restriction, purging and binge eating. Cognitive-behavioural therapy for EDs (CBT-E), the transdiagnostic treatment derived from this theory, has been shown to be an effective and efficacious treatment option for a range of ED diagnoses, in adults and adolescents [3, 24].

The schema-focussed model of EDs [94] clearly delineates the role of unconditional core beliefs or schema level representations in the development of different eating pathology. It suggests that in AN, schema compensation occurs to prevent experiencing negative affect, resulting in restrictive eating behaviours, whereas avoidance of negative affect generated by the activation of negative core beliefs results in more bulimic-type pathology [94]. Schema content may include conditional and unconditional beliefs about the self, others, or the world across several dimensions and schema domains. For example, Waller et al. describe the relationship between unconditional beliefs regarding the self (i.e., self-referent beliefs, such as ‘I am worthless’), others (e.g., ‘Others are judgemental and harsh’), and possible maladaptive schemas relating to these beliefs (e.g., ‘abandonment’ or ‘unrelenting standards’). Narrative and systematic reviews of schemas and schema therapy outcomes have indeed highlighted associations between pronounced early maladaptive schemas (EMS) and EDs [52, 70], and indicated that schema therapy interventions show some promise for treating complex eating presentations [52].

Additionally, several disorder specific models highlight the importance of core, self-referent beliefs, as well as other types of ED cognitions. The cognitive model of bulimia nervosa suggests that maladaptive, negative self-beliefs (e.g., ‘I am a failure’; [21] act as a predisposing factor, leading to increased negative automatic thoughts (NATs), which in turn serve to further reinforce these beliefs. The model also outlines the role of permissive, positive and negative beliefs about eating in the maintenance of binge eating and purging behaviours. These beliefs can be understood as types of meta-cognitive beliefs, that is, a set of higher order beliefs reflecting understanding, awareness and interpretation of one’s own thought processes [97]. For example, a positive meta-cognitive belief about eating might be ‘eating helps me to control my emotions’ [13]. An integrated cognitive-behavioural model of binge eating similarly identifies both the critical roles of core low self-esteem (negative core beliefs about the self) and eating beliefs (that is, meta-cognitive beliefs) in the development and maintenance of binge eating [11].

Models by Fairburn et al. [27] and Cooper et al. [21] both emphasise the role of NATs that reflect the over-evaluation of eating, weight, shape and its control in the maintenance of EDs. For example, negative thoughts about food and eating may be statements such as ‘I hate that I like to binge’, versus negative thoughts about weight and shape that may include content such as ‘I’ll gain a huge amount of weight’ or ‘I think my stomach is too big’. In accordance with previously described models [11, 21, 27], these negative and often automatic thoughts serve to maintain ED cycles by increasing negative affect or impacting emotional regulation, which in turn reinforce ED behaviours. These negative thoughts also ultimately reinforce more pervasive underlying cognitions (i.e., core, self-referent beliefs).

Although the aforementioned models display some differences in their cognitive emphasis (in both content and types of cognitions), each highlights the fundamental nature of ED cognitions as factors in ED developmental and maintenance processes. As such, it is vital that ED cognitions continue to be assessed and monitored as therapeutic targets and treatment outcomes. Both theoretical and empirical research has supported the idea of measuring ED ‘thoughts’ separately to behaviours, as well as assessing ED cognitions as a continuous construct that is often identified in sub-clinical or prodromal populations [53], where there is need to be proactive in prevention and early intervention. It is vital therefore to utilise assessment tools measuring ED cognitions that possess strong psychometric properties.

Previously, Burton et al. [9] conducted a systematic review of the psychometric properties of self-report measures relating specifically to binge-eating symptoms. Further, two recent systematic reviews reported on the available measures and facets relating to body image [48, 69]. However, each of these reviews did not provide a comprehensive summary relating to all EDs and related features more broadly. Another recent review summarised the instruments utilised in the assessment of EDs in adults [77], providing an overview of commonly used and recently developed measures of ED symptomatology. However, the focus of this review was only on frequently used and recently developed assessment measures. It did not specifically report on cognitive measures, that is, those with focus on ED beliefs, expectations, assumptions and thoughts. As such, several important existing instruments that have been developed to assess ED relevant cognitions were not reviewed. Some examples include the Eating Beliefs Questionnaire-18 (EBQ-18) [10], and the Eating Disorder Core Beliefs Questionnaire (ED-CBQ) [28]. Moreover, a thorough evaluation of the psychometric properties of the identified measures using a standardised, published tool for assessing their quality was not conducted, instead, the focus of the review was informative rather than evaluative [77].

Thus, to date, there has been no comprehensive assessment of the available self-report measures of ED cognitions or a thorough assessment of their psychometric properties. The aim of this review was to systematically identify and evaluate the psychometric properties of existing self-report measures of ED cognitions. The psychometric properties of these cognitive self-report measures will be evaluated using the appraisal of adequacy tool, developed by Terwee et al. [87]. This standardised tool guides quality appraisal by using nine quality criteria, including content validity, internal consistency, criterion validity, construct validity, reproducibility (agreement), reproducibility (reliability), responsiveness, floor or ceiling effects, and interpretability. This tool has previously been utilised in Burton et al. [9] review of self-report measures of binge-eating symptoms, as well as several other systematic reviews in other domains [59, 84, 101]. Using these criteria, this study intends to systematically summarise the available findings for cognitive ED self-report measurement tools, providing a comprehensive understanding of their psychometric properties, and guidance for researchers and clinicians in evaluating, comparing and utilising these measures.

Method

Search strategy

A systematic review protocol was registered using the international prospective register of systematic reviews (PROSPERO; CRD42023440840). The search strategy followed guidelines outlined in the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA; [66]. Utilising guidelines for optimal database combinations for literature searches [8], the search was conducted using three electronic databases: PsycINFO, MedLine, and Embase. There were no limits to search based off publication period. Reference lists of all included studies were scanned to identify any additional, relevant publications. Searches were run again prior to final analysis on the 17/08/2023. To identify eligible studies, several combinations of keywords were used that related to EDs (e.g., “eating disorder”, “anorexia nervosa”, “bulimia nervosa”, “binge eating disorder”, etc.), self-report measures (e.g., “questionnaire”, “scale”, “tool”, “assessment”, “measure”, etc.), and psychometric properties (e.g., “psychometric”, “reliability”, “validity”, etc.). A comprehensive search was conducted, including search of titles, abstracts, and keywords, subject headings were mapped, and in some instances the explode function was utilised for expansion of relevant results. The full search strategy created for all three databases is a publicly available supplementary file included in the systematic review protocol registered on PROSPERO.

Inclusion and exclusion criteria

The inclusion criteria were as follows:

  1. 1.

    The study had to describe the development, validation and/or the psychometric evaluation of a self-report measure of ED cognitions.

  2. 2.

    The measure (or subscale) was specifically developed to solely assess ED cognitions (that is thoughts, expectations, assumptions, or beliefs).

  3. 3.

    The measure was developed and administered in the English language, to native English speakers, and published in the English language in a peer review journal.

  4. 4.

    Utilised an adult population (17+, clinical or general) for development, evaluation or investigation purposes.

The exclusion criteria were as follows:

  1. 1.

    Non-psychometric studies (such as literature reviews, systematic reviews, or meta-analyses).

  2. 2.

    Measure (or subscale) was designed for purposes other than assessing ED cognitions (such as, as a screening or diagnostic tool, or that assesses behaviours or emotions).

  3. 3.

    Measure not in English, administered in English, or published in a non-English speaking country.

  4. 4.

    Utilised a child or adolescent population.

  5. 5.

    Book chapters, non-peer reviewed publications, published doctoral theses.

Selection process

Articles were screened and selected by two independent reviewers (AH and SB). Using the identified databases, duplicates were identified and removed, and articles were screened by title and abstract for inclusion/exclusion by AH. During title and abstract screening process, a series of meetings were held with the four included authors, to provide consensus about the relevance of measures that were accessible during this stage. A similar process was utilised by a previous psychometric systematic review of body image [48]. Full texts of the remaining studies were obtained after the initial screening, then both reviewers analysed each text independently to establish the final texts to be included. Measures (and subscales) were also assessed for their relevance and eligibility at this stage of the screening process, if the measure was not available (at the item level) at the stage of title and abstract screening. Studies (and therefore measures) that required further scrutiny to assess whether they met inclusion criteria were included in full text screening in order to assess their relevance and eligibility at the subscale and item level. A final consensus meeting was conducted where all authors provided agreement as to the relevance of all included measures.Footnote 1 The overall agreement between the two reviewers was 96.7%, which equates to an inter-rater agreement (Kappa) of κ = 0.93.

Appraisal of quality

The psychometric properties of included studies were analysed using Terwee et al. [87] criteria of adequacy for measurement properties. This quality appraisal tool was designed to assess health status questionnaires and has been used in several previous systematic reviews [9, 59, 84, 101]. This tool assesses nine measurement properties, including (1) content validity, (2) internal consistency, (3) criterion validity, (4) construct validity, (5) reproducibility: agreement, (6) reproducibility: reliability, (7) responsiveness, (8) floor and ceiling effects, and (9) interpretability. See Table 1 for the definition and criteria of adequacy for each of the nine properties.

Table 1 Criteria of quality of psychometric properties [87]

Criteria were given the following evaluative ratings; positive (+), intermediate (?), negative (−), no information available (0). Intermediate ratings may be given if there are serious doubts about study methodology, as per the guidelines for utilising these criteria [87]. It is essential to consider the methodological quality of included studies when assigning a rating, as those with low methodological quality will have a greater likelihood of reporting biased results. Further, the criteria for internal consistency allowed studies to consider the results of past factor analyses, or item response theory (IRT) analyses, when conducting a Cronbach alpha (i.e., as each study did not necessarily need to carry out a new factor analysis). Finally, Terwee et al. [87] indicate that all measurement properties are not necessarily equally important, thus, we followed the recommendation not to provide a summary or overall score.

Further, included studies were summarised by describing if they were a development study, any factor analyses performed, the study population, sample size, mean age and standard deviation, and the sex ratio (% females). Included measures (or subscales) were summarised by describing the construct or goal being measured, the number of items, response categories and any subscales or factors. Attempts were made to obtain missing or unclear information by contacting the authors of studies assessed for eligibility. Missing or unclear information that did not affect inclusion was still recorded (as either ‘?’ or Not Applicable [N/A]). In the absence of information from authors contacted, an assumption was maintained that if participants were described to be of ‘college’ or ‘university’ age, that they were a part of an adult sample (17+).

Results

Results of search strategy

The initial search identified 7581 potential studies. After removal of 2484 duplicates, this resulted in 5097 potential studies, of which 57 were considered to have met the inclusion criteria. An additional two studies were identified by cross checking reference lists for articles of interest and searching google scholar. This resulted in a total of 59 included studies (see Table 2 for summary of included studies). The selection process is summarized in Fig. 1.

Table 2 Description of studies
Fig. 1
figure 1

PRISMA diagram of study identification, screening and selection (PRISMA [66])

A total of 31 measures (or subscales) were identified that met inclusion criteria from the included studies (see Table 3 for summary of included measures). Nine of the included measures were subscales, and nine were a short-form or revised version of an original measure. These measures were the Beliefs About Appearance Questionnaire (BASS), Bulimic Automatic Thoughts Test (BATT), Body Checking Cognitions Scale (BCCS), Bulimia Cognitive Distortions Scale (BCDS), Bulimic Thoughts Questionnaire (BTQ), Eating Beliefs Questionnaire (EBQ), Eating Beliefs Questionnaire-18 (EBQ-18), Eating Disorder Beliefs Questionnaire (EDBQ, Eating Disorder Core Beliefs Questionnaire (ED-CBQ), Eating Disorder Core Beliefs Questionnaire Revised (ED-CBQ-R), Eating Disorder Inventory Body Dissatisfaction subscale (EDI [BD]), Eating Disorder Inventory II Body Dissatisfaction subscale (EDI-II [BD]), Eating Disorder Inventory III Body Dissatisfaction subscale (EDI-III [BD]), Eating Expectancy Inventory (EEI), Functions of Binge Eating Scale (FBES), Irrational Food Beliefs Scale (IFBS), Interpersonal Outcome Expectancy for Thinness scale (IOET), Mizes Anorectic Cognitions Questionnaire (MACQ), Mizes Anorectic Cognitions Questionnaire Brief (MACQ-B), Mizes Anorectic Cognitions Questionnaire Revised (MACQ-R), Muscle Dysmorphia Inventory Drive for Size subscale (MDDI [DS]), Perceived Benefits of Thinness Scale (PBTS), Sociocultural Attitudes Towards Appearance Questionnaire 4 Internalisation Thin subscale (SATAQ-4 [IT]), Sociocultural Attitudes Towards Appearance Questionnaire 4 Revised Internalisation Thin subscale (SATAQ-4R [IT]), Stirling Eating Disorders Scale Anorexic Dietary Cognitions subscale (SEDS [ADC]), Stirling Eating Disorders Scale Bulimic Dietary Cognitions subscale (SEDS [BDC]), Stirling Eating Disorders Scale Low Self-esteem subscale (SEDS [LSE]), Testable Assumptions Questionnaire for Eating Disorders (TAQ-ED), Testable Assumptions Questionnaire for Eating Disorders Revised (TAQ-ED-R), Thoughts Questionnaire (TQ), and the Weight Influenced Self-Esteem Questionnaire (WISE-Q).

Table 3 Description of measure

The cognitive focus of each measure was described using four broad categories: (1) negative thoughts about food and eating, (2) negative thoughts about weight, shape, or body image, (3) self-referent beliefs, and (4) meta-cognitive beliefs (see Table 4). These categories were formed by examining the content of included measures and identifying the main areas of cognitive content and the types of cognitions assessed. Altogether, 15 measures assessed negative thoughts about food and eating, 20 assessed negative thoughts about weight, shape or body image, nine assessed self-referent beliefs, and nine assessed meta-cognitive beliefs. The BCDS, TAQ-ED, and TAQ-ED-R considered all four categories of cognitive focus.

Table 4 Cognitive focus of included measures

Assessment of psychometric properties

The psychometric properties of each included study were assessed using the criteria outlined by Terwee et al. [87]. This assessment was independently conducted by the same two reviewers who screened and assessed the studies for eligibility (AH and SB). Agreement between the reviewers for the criteria of adequacy was 94% (κ = 0.91). Consensus was reached to resolve discrepancies between the two assessors, and as such a third reviewer was not necessary. The summary ratings for each measure are displayed in Table 5.

Table 5 Quality analysis/ratings of psychometric properties

Content validity

Content validity refers to the extent that the items of a measure are an accurate reflection of the construct of interest [87]. To have received a positive rating, studies (measure or subscale) describing the development of the measure were required to provide a clear description of the aim of the measure, concepts measured, target population and item selection. Further, they were required to have had experts and a relevant target population involved in item selection. Seventeen measures received a positive rating for this criterion, including the BASS, BATT, BCCS, BCDS, BTQ, EDBQ, ED-CBQ, ED-CBQ-R, EEI, IFBS, IOET, MACQ-B, MACQ-R, SEDS (ADC) subscale, SEDS (BDC) subscale, and the SEDS (LSE) subscale. These findings indicated that these measures (or subscales) demonstrated adequate content validity.

Internal consistency

Internal consistency refers to the extent to which items in a measure are correlated, and thus assess the same construct [87]. To have received a positive rating, studies must have reported a Cronbach’s alpha of adequate magnitude for the measure (or subscale) and performed a factor analysis using an adequate sample size. When assessing internal consistency for subscales, factor analysis was considered if it was performed solely on the subscale or on the full scale if it tested the included subscale as a factor. Fourteen measures received a positive rating for this criterion. These findings indicated that BATT, EBQ, EBQ-18, EDBQ, ED-CBQ-R, EDI-III (BD) subscale, FBES, MACQ, MACQ-R, MDDI (DS) subscale, PBTS, SATAQ-4 (IT) subscale, SATAQ-4R (IT) subscale, and the TQ have demonstrated adequate internal consistency.

Criterion validity

According to the Terwee et al. [87] criteria, criterion validity is determined by comparison to a ‘gold-standard’ instrument, ensuring the new measure is theoretically related to a well-established measure. Given that we imposed no limits on studies and measures included by time period, and that there is no suitable or widely agreed upon gold standard for assessing ED cognitions, if the study assessed for criterion validity as per Terwee et al. [87] definition, we allowed each study and its authors to provide their own justification or a convincing argument for their definition of a gold-standard. While the term ‘criterion validity’ was indeed used in some studies, the authors were, in fact, referring to other forms of validity (e.g., content validity or construct validity). No measures received a positive rating, most received a ‘no information available’ rating. Five measures (EBQ-18, EDBQ, EDI [BD] subscale, EDI-II [BD] subscale, EDI-III [BD] subscale) received an indeterminate rating for a lack of convincing argument for the gold standard measure utilised for comparison, and one (TQ) received a negative rating as the correlation with the proposed gold standard was < 0.07.

Construct validity

Construct validity refers to the degree to which scores on a particular questionnaire relate (or do not relate) to other measures in a manner that is consistent with theoretically derived hypotheses concerning the concepts that are being measured [87]. To have received a positive rating, studies were required to provide clear predictions regarding their hypotheses with 75% of the results in the expected direction (e.g., reporting the correlation between two measures in the expected direction)’. Almost all measures received a positive rating for this criterion, indicating the vast majority possessed adequate construct validity. Only the TAQ-ED-R received an indeterminate rating, and the BTQ received a ‘no information available’ rating.

Reproducibility: agreement (test–retest)

Agreement refers to the extent to which scores on a measure remain stable over time. According to Terwee et al. [87] adequate agreement is demonstrated when the absolute measurement error is smaller than the Minimally Important Change (MIC) factor. However, MIC was not defined or utilised in any of the studies evaluated in this review. Therefore, we utilised the criterion for agreement previously used by Burton et al. [9], and Zuccala et al. [101], defining adequate agreement as a test–retest reliability of r > 0.70. To have received a positive rating for this criterion, the means and standard deviations must have been presented at both time points. Seven measures received a positive rating for this criterion, indicating the EBQ, EBQ-18, EDI-III (BD) subscale, IOET, MACQ, MDDI (DS) subscale, and the PBTS possessed adequate test–retest agreement.

Reproducibility: reliability

Reproducibility reliability refers to the extent to which individuals can be distinguished from each other [87]. To have received a positive rating measures needed to provide an intraclass correlation or weighted Kappa > 0.70 to test this. The EBQ-18 and the EDI-III (BD) subscale were the only measures found to demonstrate adequate reliability. All other measures all received a ‘no information available’ rating.

Responsiveness

Responsiveness refers to the ability of a measure to detect clinically important changes over time or following an intervention [87]. To have received a positive rating, the studies must have outlined a treatment program and the longitudinal expected changes, and 75% of results must have been in the expected direction. Alternatively, measures had to demonstrate a responsiveness ratio of > 1.96 or an area under the curve > 0.70. Only the EBQ, EBQ-18, SATAQ-4 (IT) subscale, and WISE-Q demonstrated adequate responsiveness.

Floor and ceiling effects

According to Terwee et al. [87], floor and ceiling effects have occurred when > 15% of participants achieve the highest or lowest possible score on a measure (or subscale). Only the EDI-III (BD) subscale demonstrated a positive rating for this criterion, for reporting information demonstrating a lack of floor and ceiling effects. The ED-CBQ-R demonstrated a floor effect on one of its subscales, consequently receiving a negative rating. All other measures all received a ‘no information available’ rating.

Interpretability

Interpretability refers to the extent to which qualitative meaning can be given to quantitative scores [87]. To have received a positive rating, a measure must have presented means and standard deviation scores for at least four relevant subgroups within one study (in isolation). As per Terwee et al.’s suggestion, subgroups stratified by demographic variables (e.g., age, gender) may be included as subgroups. Further, as in Zuccala et al. [101], this criterion was modified such that a minimal important change (MIC) was not required to have received a positive rating, as no studies reported a MIC. Findings indicated that 14 measures (or subscales) possessed adequate interpretability, including the BATT, EBQ, EBQ-18, ED-CBQ-R, EDI (BD) subscale, IFBS, MACQ-R, SATAQ-R (IT) subscale, SATAQ-4R (IT) subscale, SEDS (ADC) subscale, SEDS (BDC) subscale, SEDS (LSE) subscale, TQ, and the WISE-Q. As Modini et al. [59] and Zuccala et al. [101] have previously noted, it is important to note that if considering an accumulation of subgroups between studies (not within one study in isolation), it is likely that more measures would have met this criterion.

Discussion

Prioritising the assessment of ED cognitions is essential for monitoring key factors contributing to the development and maintenance of EDs. It is essential that self-report measures possess adequate psychometric properties to increase their validity, reliability and utility for clinical research and practice. This systematic review aimed to evaluate the psychometric properties of existing self-report measures of ED cognitions using the Terwee et al. [87] criteria of adequacy for measurement properties, to conduct quality assessment.

This review identified 59 studies that evaluated the psychometric properties of 31 self-report measures (or subscales) that assess ED cognitions. The type of cognitive focus examined most across these measures was negative thoughts about weight, shape or body image, followed then by negative thoughts about food and eating. Self-referent and meta-cognitive beliefs were types of cognition assessed to a lesser extent across the included measures. This suggests that the majority of self-report measures specifically developed to assess ED cognitions have primarily been focused on the ‘thought’ level, where fewer have been developed to examine longstanding or engrained beliefs sets, including conditional and unconditional assumptions and beliefs.

With respect to the psychometric properties of included measures, no measure (or subscale) received positive ratings across all categories. The criterion that received the greatest number of positive ratings across all measures was construct validity. This suggests firstly that there is a propensity for studies to examine construct validity, as compared to other psychometric properties which are less routinely considered, and also of a general trend that existing measures assessing ED cognitions show demonstrate strong evidence for construct validity. This is compared to other criterion categories, which findings demonstrate are evaluated and considered less overall. Examples include criterion validity, reproducibility, reliability, responsiveness and floor and ceiling effects, where the majority of measures had no information found on the assessment of these properties. Interestingly, in terms of content validity, most studies and measures indeed clearly described the concepts measured, the intended target population, also involving experts in item selection. However, all measures with an indeterminate rating were characterised by a lack of clarity about whether and how the target population had been involved in item selection, and those with a negative rating failed to describe any involvement of the target population in item selection. It is clearly important to have items reflecting cognitive constructs that have been acknowledged as relevant by those populations whose cognitions the measures are attempting to describe and assess.

In relation to the four broad categories of cognitive focuses identified, some specific observations and subsequent recommendations can be made. Whilst attempting to assess negative thoughts relating to food and eating, the EBQ-18 currently possesses the most evidence for its validity, reliability and utility, compared to other measures assessing similar content. Relatedly, of measures that assess negative thoughts related to weight, shape and body image, the EDI-III (BD) subscale and the SATAQ-4 (IT) subscale have the most psychometric support. However, it is important to note these measures both focus on body dissatisfaction that implies desire for a thinner body, as opposed to, for example, a more muscular body. Thus, even within the categories identified, it remains important to consider the construct of interest when selecting and utilising a cognitive ED measure. With respect to self-referent beliefs, the ED-CBQ-R currently has the most evidence for its validity, reliability and utility. The EBQ-18 has the most empirical support for its psychometric properties of measures that assess meta-cognitive beliefs related to EDs.

An overall pattern also emerged indicating that newer, revised versions of an original measure demonstrated a greater number of positive ratings, as evidenced by the EBQ-18, ED-CBQ-R, EDI-III (BD), and the MACQ-R. The results of this systematic review also suggest that the EBQ, EBQ-18, and EDI-III (BD) have the most evidence in support of their psychometric properties. However, a strict conclusion cannot be drawn that these measures are objectively superior, as not all measurement properties are necessarily equally important [87]. As previously noted [101], different measures may be utilised for different purposes, and certain psychometric properties may have varied importance under diverse circumstances. For example, for the purposes of a clinical trial, adequate responsiveness might be of greatest importance, whereas in a prevalence study aiming to identify the presence or absence of various cognitions, choosing a measure with greater breadth may be more beneficial. Moreover, conclusions cannot be made as to the superiority of a specific measure, as this review considers different types of ED cognitions. The included self-report measures of ED cognitions examine different constructs (e.g., body dissatisfaction versus meta-cognitive beliefs about eating), and types of cognitions (e.g., automatic thoughts versus unconditional beliefs). Thus, we recommend those wishing to utilise measures with more evidence for their validity, reliability and utility, should also consider the construct of interest, its intended usage, and the type of cognition to be assessed.

This systematic review had several notable strengths, including the standardised and systematic approach applied through its utilisation of a previously validated tool, widely utilised in previous systematic reviews evaluating psychometric properties of self-report measures [9, 59, 84, 101]. This review also demonstrated evidence of good interrater reliability at several stages of the review process. Additionally, the present review included all versions of measures in included studies, regardless of whether the measure was an original, revised or short version of a self-report measure. This facilitated a comprehensive summary and allowed for comparison of the state of evidence for each questionnaire version.

It is also important to note several limitations, the first of which is the stringent nature of the appraisal of quality tool utilised in the present study. The criteria used may have meant that some measures received either an intermediate or negative rating in accordance with strict standards for appraisal of adequacy, when using less exacting criteria might have resulted in a more positive rating. For example, we adhered to the strict Cronbach’s alpha cut-off in assessing internal consistency, where a few studies presented alphas that were only marginally above or below the range given in order to give a positive rating. The authors attempted to address this in several instances whilst still maintaining the intended rigor, by providing some leniency across ratings as described previously, and in alignment with carefully considered modifications that have also been employed in previous systematic reviews [9, 101]. For example, in test–retest reliability, in consenting to studies measuring reliability using more frequently utilised statistical methods. Further, authors were somewhat lenient in terms of content validity where body image measures were concerned, as the requirement for ‘involving the target population in item selection’ meant that the target population did not necessarily need to be a clinical ED population, but simply those presenting with varied body image concerns. Altogether, it was considered necessary and beneficial to continue to maintain this standard in order to recommend usage of psychometrically sound measures, and importantly to provide researchers incentive to continue to improve quality of existing and future assessment tools, and by extension, improve quality of empirical evidence more generally.

Additionally, it is important to consider the strict nature of our inclusion and exclusion criteria alongside the evidence presented. A limitation of this review is that only articles utilising an adult population and those utilising a non-English speaking population were considered for inclusion. As such, some measures or studies in support of certain psychometric properties might have been excluded in accordance with these criteria. Ultimately, studies utilising non-English speaking populations or measures not administered in English were out of the scope of this review. The authors also considered the importance of decreasing some heterogeneity in the evidence evaluated in the context of this systematic review [51]. This was relevant also to the exclusion of studies utilising child, purely adolescent, or mixed populations. As a function of differences in ED symptomatology across developmental stages [23, 50], it was important to consider potential differences in ED cognitions between adult and child or adolescent populations, and that some assessment tools may be valid and reliable in one population and not another.

Finally, the stringency of our criteria regarding measures being developed specifically and solely to measure ED cognitions, meant some more commonly utilised ED measures were not included in this systematic review. This included the EDE-Q [26] concerns subscales and versions of the Eating Attitudes Test (EAT-40 [34], EAT-40 [35]). Examination of their intention during development and careful scrutiny at the item level reflected that they captured other symptomatology, including emotional, behavioural symptoms, and items measuring functional impairment. These exclusions may impact the clinical utility of this review, due to the frequency of use of these measures for clinical and diagnostic purposes [42]. However, we also considered that the psychometric properties of these two measures have been assessed and systematically reviewed elsewhere [5, 45, 67]. Another commonly utilised cognitive measure not considered for inclusion was the Young Schema Questionnaire (YSQ-S3) [100]. Although useful in capturing core cognitive constructs, it was not developed specifically to capture ED cognitions, which was the focus of this systematic review. Finally, although this study included the EDI-III (BD), previous systematic reviews have considered the EDI-III (BD) to assess both evaluative and affective components of body image [48]. Due to this discrepancy, we recommend findings be interpreted with caution.

The results of this systematic review suggest several areas of improvement for future research. Primarily, our findings do not necessarily indicate that measures lacking adequate evidence should not be utilised, but primarily highlights where there is either an absence of psychometric support, there is clear suggestion for future researchers to focus efforts on improving evidence for the validity, reliability and utility of these measures for the populations considered. There is perhaps greater need to consider the responsiveness and clinical utility of ED cognitive measures, as well as less widely considered measurement properties, such as criterion validity and examining floor and ceiling effects.

When examining the cognitive focus, content and types of cognitions of included measures, it is clear that most tend to consider only AN, BN, BED, and assessment of body image concerns. There is certainly space to consider the development and availability of tools to assess cognitions in comparatively less well researched EDs, or those with rapidly increasing clinical focus. This includes disorders such as Avoidant Restrictive Food Intake Disorder (ARFID) and orthorexia nervosa, which currently only have symptomatic and behavioural inventories, and tools available to support diagnosis. Finally, whilst examining populations most frequently utilised in included studies, it is clear that research needs to continue to prioritise inclusion of more male and gender-diverse samples. This is particularly critical due to growing evidence outlining the increased and substantial ED risk in gender-diverse individuals [37, 39, 81].

Altogether, the present study was the first to provide a systematic review of self-report measures of ED cognitions, and valuable information about the existing, relevant evidence for their psychometric properties. This information provided a basis for the future selection of valid, reliable and clinically useful tools for measuring a variety of ED cognitions. Although no measure appeared to possess adequate evidence across all nine measurement criteria, several provided a good amount of evidence in support of their reliability, validity and utility. Ultimately, comprehensive information was provided to support future selection of measure of ED cognitions dependent on the specific aims of research and/or treatment. It is hoped that the findings of the present review assist both researchers and clinicians alike in identifying, evaluating and comparing relevant measures for use in identifying and monitoring important treatment targets in clinical, sub-clinical or prodromal ED populations.

Availability of data and materials

The data using during the current review is available from the corresponding author on reasonable request.

Notes

  1. As an example, the Eating Disorder Examination Questionnaire (EDE-Q; [26] Eating, Weight and Shape Concerns subscales were each considered for inclusion during screening and study selection stage of the review. However, closer examination at the item level revealed each subscale contained items measuring behaviours, emotions, or other symptoms (e.g., functional impairment). As each subscale was not developed solely to assess cognitions (as per the inclusion criteria), studies examining the psychometric properties of the EDE-Q subscales were ultimately excluded from this review.

Abbreviations

AN:

Anorexia Nervosa

AN (B/P):

Anorexia Nervosa binge-purge subtype

APA:

American Psychiatric Association

ARFID:

Avoidant Restrictive Food Intake Disorder

AUC:

Area under the curve

BAAS:

Beliefs About Appearance Questionnaire

BATT:

Bulimic Automatic Thoughts Test

BCCS:

Body Checking Cognitions Scale

BCDS:

Bulimia Cognitive Distortions Scale

BE:

Binge eating

BED:

Binge Eating Disorder

BN:

Bulimia Nervosa

BTQ:

Bulimic Thoughts Questionnaire

CBT-E:

Cognitive Behavioural Therapy Enhanced

CFA:

Confirmatory Factor Analysis

EAT:

Eating Attitudes Test

EBQ:

Eating Beliefs Questionnaire

EBQ-18:

Eating Beliefs Questionnaire 18

ED:

Eating Disorder

EDBQ:

Eating Disorder Beliefs Questionnaire

ED-CBQ:

Eating Disorders Core Beliefs Questionnaire

ED-CBQ-R:

Eating Disorder Core Beliefs Questionnaire Revised

EDE-Q:

Eating Disorder Examination Questionnaire

EDI (BD):

Eating Disorder Inventory Body Dissatisfaction subscale

EDI-II (BD):

Eating Disorder Inventory II Body Dissatisfaction subscale

EDI-III (BD):

Eating Disorder Inventory III Body Dissatisfaction subscale

EDNOS:

Eating disorder not otherwise specified

EEI:

Eating Expectancy Inventory

EFA:

Exploratory Factor Analysis

FBES:

Functions of Binge Eating Scale

ICC:

Intraclass correlation

IFBS:

Irrational Food Beliefs Scale

IOET:

Interpersonal Outcome Expectancy for Thinness Scale

LOA:

Limits of agreement

MACQ:

Mizes Anorectic Cognitions Questionnaire

MACQ-B:

Mizes Anorectic Cognitions Questionnaire Brief

MACQ-R:

Mizes Anorectic Cognitions Questionnaire Revised

MBDS:

Male Body Dissatisfaction Scale

MDDI (DS):

Muscle Dysmorphia Inventory Drive for Size subscale

MIC:

Minimally Important Change

N/A:

Not applicable

NAT:

Negative automatic thoughts

OSFED:

Other Specified Feeding and Eating Disorders

PBTS:

Perceived Benefits of Thinness Scale

PCA:

Principal components analysis

PRISMA:

Preferred Reporting Items for Systematic Reviews and Meta-analyses

ROC:

Receiver operating characteristic curve

RR:

Responsiveness Ratio

SATAQ-4 (IT):

Sociocultural Attitudes Towards Appearance Questionnaire 4 Internalisation Thin subscale

SATAQ-4R (IT):

Sociocultural Attitudes Towards Appearance Questionnaire 4 Revised Internalisation Thin subscale

SD:

Standard Deviation

SDC:

Smallest detectable change

SEDS (ADC):

Stirling Eating Disorders Scale Anorexic Dietary Cognitions subscale

SEDS (BDC):

Stirling Eating Disorders Scale Bulimic Dietary Cognitions subscale

SEDS (LSE):

Stirling Eating Disorders Scale Low Self-esteem subscale

TAQ-ED:

Testable Assumptions Questionnaire for Eating Disorders

TAQ-ED-R:

Testable Assumptions Questionnaire for Eating Disorders Revised

TQ:

Thoughts Questionnaire

WISE-Q:

Weight Influenced Self-Esteem Questionnaire

YSQ:

Young Schema Questionnaire

References

  1. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5th ed. Washington: American Psychiatric Association; 2013.

    Book  Google Scholar 

  2. Appolinario JC, Sichieri R, Lopes CS, Moraes CE, da Veiga GV, Freitas S, Nunes M, Wang YP, Hay P. Correlates and impact of DSM-5 binge eating disorder, bulimia nervosa and recurrent binge eating: a representative population survey in a middle-income country. Soc Psychiatry Psychiatr Epidemiol. 2022. https://doi.org/10.1007/s00127-022-02223-z.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Atwood ME, Friedman A. A systematic review of enhanced cognitive behavioral therapy (CBT-E) for eating disorders. Int J Eat Disord. 2020;53(3):311–30. https://doi.org/10.1002/eat.23206.

    Article  PubMed  Google Scholar 

  4. Belon KE, McLaughlin EA, Smith JE, Bryan AD, Witkiewitz K, Lash DN, Winn JL. Testing the measurement invariance of the eating disorder inventory in nonclinical samples of Hispanic and Caucasian women [Clinical Psychological Testing 2224 Eating Disorders 3260]. Int J Eat Disord. 2015;48(3):262–70. https://doi.org/10.1002/eat.22286.

    Article  PubMed  Google Scholar 

  5. Berg KC, Peterson CB, Frazier P, Crow SJ. Psychometric evaluation of the eating disorder examination and eating disorder examination-questionnaire: a systematic review of the literature. Int J Eat Disord. 2012;45(3):428–38. https://doi.org/10.1002/eat.20931.

    Article  PubMed  Google Scholar 

  6. Bergin J, Wade TD. Psychometric properties of the Eating Disorder Belief Questionnaire. Int J Eat Disord. 2014;47(6):640–6. https://doi.org/10.1002/eat.22267.

    Article  PubMed  Google Scholar 

  7. Bonifazi DZ, Crowther JH, Mizes JS. Validity of questionnaires for assessing dysfunctional cognitions in bulimia nervosa. Int J Eat Disord. 2000;27(4):464–70. https://doi.org/10.1002/%28SICI%291098-108X%28200005%2927:4%3C464::AID-EAT12%3E3.0.CO;2-B.

    Article  PubMed  Google Scholar 

  8. Bramer WM, Rethlefsen ML, Kleijnen J, Franco OH. Optimal database combinations for literature searches in systematic reviews: a prospective exploratory study. Syst Rev. 2017. https://doi.org/10.1186/s13643-017-0644-y.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Burton AL, Abbott MJ, Modini M, Touyz S. Psychometric evaluation of self-report measures of binge-eating symptoms and related psychopathology: a systematic review of the literature. Int J Eat Disord. 2016;49(2):123–40. https://doi.org/10.1002/eat.22453.

    Article  PubMed  Google Scholar 

  10. Burton AL, Abbott MJ. The revised short-form of the eating beliefs questionnaire: measuring positive, negative, and permissive beliefs about binge eating. J Eat Disord. 2018;6:37. https://doi.org/10.1186/s40337-018-0224-0.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Burton AL, Abbott MJ. Processes and pathways to binge eating: development of an integrated cognitive and behavioural model of binge eating. J Eat Disord. 2019;7:18. https://doi.org/10.1186/s40337-019-0248-0.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Burton AL, Hay P, Kleitman S, Smith E, Raman J, Swinbourne J, Touyz SW, Abbott MJ. Confirmatory factor analysis and examination of the psychometric properties of the eating beliefs questionnaire. BMC Psychiatry. 2017;17(1):237. https://doi.org/10.1186/s12888-017-1394-z.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Burton AL, Mitchison D, Hay P, Donnelly B, Thornton C, Russell J, Swinbourne J, Basten C, Goldstein M, Touyz S, Abbott MJ. Beliefs about binge eating: psychometric properties of the Eating Beliefs Questionnaire (EBQ-18) in eating disorder, obese, and community samples. Nutrients. 2018;10(9):1306. https://doi.org/10.3390/nu10091306.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Burton AL, Smith E, Abbott MJ. Assessing the clinical utility of the Eating Beliefs Questionnaire: results from receiver operating characteristic curve analysis with a clinical sample. Eur J Psychol Assess. 2020;36(2):421–6. https://doi.org/10.1027/1015-5759/a000496.

    Article  Google Scholar 

  15. Coffino JA, Udo T, Grilo CM. Rates of help-seeking in US adults with lifetime DSM-5 eating disorders: prevalence across diagnoses and differences by sex and ethnicity/race. Mayo Clin Proc. 2019;94(8):1415–26. https://doi.org/10.1016/j.mayocp.2019.02.030.

    Article  PubMed  Google Scholar 

  16. Compte EJ, Cattle CJ, Lavender JM, Murray SB, Brown TA, Capriotti MR, Flentje A, Lubensky ME, Obedin-Maliver J, Lunn MR, Nagata JM. Psychometric evaluation of the Muscle Dysmorphic Disorder Inventory (MDDI) among cisgender gay men and cisgender lesbian women. Body Image. 2021;38:241–50. https://doi.org/10.1016/j.bodyim.2021.04.008.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Convertino AD, Gonzales MT, MalcarneBlashill VLAJ. A psychometric investigation of the Sociocultural Attitudes Towards Appearance Questionnaire-4-revised among sexual minority adults in the U.S. Body Image. 2019;31:96–101. https://doi.org/10.1016/j.bodyim.2019.08.013.

    Article  PubMed  Google Scholar 

  18. Cooper M, Cohen-Tovee E, Todd G, Wells A, Tovee M. The eating disorder belief questionnaire: preliminary development. Behav Res Ther. 1997;35(4):381–8. https://doi.org/10.1016/S00057967%2896%2900115-5.

    Article  PubMed  Google Scholar 

  19. Cooper Z, Cooper PJ, Fairburn CG. The specificity of the eating disorder inventory. Br J Clin Psychol. 1985;24(2):129–30. https://doi.org/10.1111/j.2044-8260.1985.tb01324.x.

    Article  PubMed  Google Scholar 

  20. Cooper MJ, Todd G, Woolrich R, Somerville K, Wells A. Assessing eating disorder thoughts and behaviors: the development and preliminary evaluation of two questionnaires. Cogn Ther Res. 2006;30(5):551–70. https://doi.org/10.1007/s10608-006-9003-3.

    Article  Google Scholar 

  21. Cooper MJ, Wells A, Todd G. A cognitive model of bulimia nervosa. Br J Clin Psychol. 2004;43(1):1–16. https://doi.org/10.1348/014466504772812931.

    Article  PubMed  Google Scholar 

  22. Cordero ED, Julian AK, Murray KE. Measurement of disordered eating in Latina college women. Eat Behav. 2013;14(2):220–3. https://doi.org/10.1016/j.eatbeh.2012.12.006.

    Article  PubMed  Google Scholar 

  23. Christian C, Perko VL, Vanzhula IA, Tregarthen JP, Forbush KT, Levinson CA. Eating disorder core symptoms and symptom pathways across developmental stages: a network analysis. J Abnorm Psychol. 2020;129(2):177–90. https://doi.org/10.1037/abn0000477.

    Article  PubMed  Google Scholar 

  24. de Jong M, Schoorl M, Hoek HW. Enhanced cognitive behavioural therapy for patients with eating disorders: a systematic review. Curr Opin Psychiatry. 2018;31(6):436–44. https://doi.org/10.1097/YCO.0000000000000452.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Dhokia R, Hinrichsen H, Meyer C, Waller G. Clinical and psychometric validation of an extended version of the Testable Assumptions Questionnaire (TAQ-ED-R). Eat Behav. 2009;10(1):62–4. https://doi.org/10.1016/j.eatbeh.2008.10.005.

    Article  PubMed  Google Scholar 

  26. Fairburn CG, Beglin SJ. Eating Disorder Examination Questionnaire (EDE-Q) [Database record]. APA PsycTests. 1994. https://doi.org/10.1037/t03974-000.

    Article  Google Scholar 

  27. Fairburn CG, Cooper Z, Shafran R. Cognitive behaviour therapy for eating disorders: a ‘transdiagnostic’ theory and treatment. Behav Res Ther. 2003;41(5):509–28. https://doi.org/10.1016/s0005-7967(02)00088-8.

    Article  PubMed  Google Scholar 

  28. Fairchild H, Cooper M. A multidimensional measure of core beliefs relevant to eating disorders: preliminary development and validation. Eat Behav. 2010;11(4):239–46. https://doi.org/10.1016/j.eatbeh.2010.05.004.

    Article  PubMed  Google Scholar 

  29. Flatt RE, Karam AM, Fitzsimmons-Craft EE, Balantekin KN, Graham AK, Eichen DM, Monterubio GE, Goel NJ, Fowler LA, Sadeh-Sharvit S, Wilfley D, Mazina V, Taylor CB, Trockel M. Psychometric properties of the Perceived Benefits of Thinness Scale in college-aged women. Body Image. 2022;40:103–9. https://doi.org/10.1016/j.bodyim.2021.11.005.

    Article  PubMed  Google Scholar 

  30. Forbush KT, Hilderbrand LA, Bohrer BK, Chapa DAN. Test-retest reliability of common measures of eating disorder symptoms in men versus women. Assessment. 2019;26(3):419–31. https://doi.org/10.1177/1073191117700267.

    Article  PubMed  Google Scholar 

  31. Franko DL, Zuroff DC. The bulimic automatic thoughts test: initial reliability and validity data. J Clin Psychol. 1992;48(4):505–9. https://doi.org/10.1002/1097-4679%28199207%2948:4%3C505::AID-JCLP2270480411%3E3.0.CO;2-B.

    Article  PubMed  Google Scholar 

  32. Galmiche M, Déchelotte P, Lambert G, Tavolacci MP. Prevalence of eating disorders over the 2000–2018 period: a systematic literature review. Am J Clin Nutr. 2019;109(5):1402–13. https://doi.org/10.1093/ajcn/nqy342.

    Article  PubMed  Google Scholar 

  33. Gamble C, Bryant-Waugh R, Turner H, Jones C, Mehta R, Graves A. An investigation into the psychometric properties of the Stirling Eating Disorder Scales. Eat Behav. 2006;7(4):395–403. https://doi.org/10.1016/j.eatbeh.2005.12.005.

    Article  PubMed  Google Scholar 

  34. Garner DM, Garfinkel PE. Eating attitudes test (EAT) [Database record]. APA PsycTests. 1979. https://doi.org/10.1037/t06908-000.

    Article  Google Scholar 

  35. Garner DM, Olmsted MP, Bohr Y, Garfinkel PE. Eating attitude test–26 (EAT-26) [Database record]. APA PsycTests. 1982. https://doi.org/10.1037/t07770-000.

    Article  Google Scholar 

  36. Garner DM, Olmstead MP, Polivy J. Development and validation of a multidimensional eating disorder inventory for anorexia nervosa and bulimia. Int J Eat Disord. 1983;2(2):15–34. https://doi.org/10.1002/1098-108X%28198321%292:2%3C15::AID-EAT2260020203%3E3.0.CO;2-6.

    Article  Google Scholar 

  37. Gordon AR, Moore LB, Guss C. Eating disorders among transgender and gender non-binary people. In: Nagata JM, Brown TA, Murray SB, Lavender JM, editors. Eating disorders in boys and men. Berlin: Springer; 2021. p. 265–81. https://doi.org/10.1007/978-3-030-67127-3_18.

    Chapter  Google Scholar 

  38. Gross J, Rosen JC, Leitenberg H, Willmuth ME. Validity of the eating attitudes test and the eating disorders inventory in bulimia nervosa. J Consult Clin Psychol. 1986;54(6):875–6. https://doi.org/10.1037/0022-006X.54.6.875.

    Article  PubMed  Google Scholar 

  39. Hartman-Munick SM, Silverstein S, Guss CE, Lopez E, Calzo JP, Gordon AR. Eating disorder screening and treatment experiences in transgender and gender diverse young adults. Eat Behav. 2021;41: 101517. https://doi.org/10.1016/j.eatbeh.2021.101517.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Hatoum AH, Burton AL, Abbott MJ. Assessing negative core beliefs in eating disorders: revision of the eating disorder core beliefs Questionnaire. J Eat Disord. 2022;10(1):18. https://doi.org/10.1186/s40337-022-00542-9.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Hatoum AH, Burton AL, Abbott MJ. Validation of the revised eating disorder core beliefs questionnaire (ED-CBQ-R) in an Australian sample. Clin Psychol. 2023;27(2):133–41. https://doi.org/10.1080/13284207.2022.2144717.

    Article  Google Scholar 

  42. Hay PJ, Rankin R, Ramjan L, Conti J. Current approaches in the recognition and management of eating disorders. Med J Aust. 2023;219(3):127–34. https://doi.org/10.5694/mja2.52008.

    Article  PubMed  Google Scholar 

  43. Hildebrandt T, Langenbucher J, Schlund DG. Muscularity concerns among men: development of attitudinal and perceptual measures. Body Image. 2004;1(2):169–81. https://doi.org/10.1016/j.bodyim.2004.01.001.

    Article  PubMed  Google Scholar 

  44. Hinrichsen H, Garry J, Waller G. Development and preliminary validation of the Testable Assumptions Questionnaire-Eating Disorders (TAQ-ED). Eat Behav. 2006;7(3):275–81. https://doi.org/10.1016/j.eatbeh.2005.11.001.

    Article  PubMed  Google Scholar 

  45. Jenkins PE, Rienecke RD. Structural validity of the Eating Disorder Examination-Questionnaire: a systematic review. Int J Eat Disord. 2022;55(8):1012–30. https://doi.org/10.1002/eat.23721.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Kashubeck-West S, Coker AD, Awad GH, Stinson RD, Bledman R, Mintz L. Do measures commonly used in body image research perform adequately with African American college women? Cultur Divers Ethnic Minor Psychol. 2013;19(3):357–68. https://doi.org/10.1037/a0031905.

    Article  PubMed  Google Scholar 

  47. Klemchuk HP, Hutchinson CB, Frank RI. Body dissatisfaction and eating-related problems on the college campus: usefulness of the Eating Disorder Inventory with a nonclinical population. J Couns Psychol. 1990;37(3):297–305. https://doi.org/10.1037/0022-0167.37.3.297.

    Article  Google Scholar 

  48. Kling J, Kwakkenbos L, Diedrichs PC, Rumsey N, Frisén A, Brandão MP, Silva AG, Dooley B, Rodgers RF, Fitzgerald A. Systematic review of body image measures. Body Image. 2019;30:170–211. https://doi.org/10.1016/j.bodyim.2019.06.006.

    Article  PubMed  Google Scholar 

  49. Li M, Chang EC, Chang OD. Psychometric properties of the Interpersonal Outcome Expectancies for Thinness (IOET) Scale: evidence for validity, reliability, and utility in the study of eating disturbances in females. J Soc Clin Psychol. 2019;38(2):160–79. https://doi.org/10.1521/jscp.2019.38.2.160.

    Article  Google Scholar 

  50. Loeb KL, Jones J, Roberto CA, Sonia Gugga S, Marcus SM, Attia E, Timothy Walsh B. Adolescent-adult discrepancies on the eating disorder examination: a function of developmental stage or severity of illness? Int J Eat Disord. 2011;44(6):567–72. https://doi.org/10.1002/eat.20882.

    Article  PubMed  Google Scholar 

  51. Lorenc T, Felix L, Petticrew M, Melendez-Torres GJ, Thomas J, Thomas S, O’Mara-Eves A, Richardson M. Meta-analysis, complexity, and heterogeneity: a qualitative interview study of researchers’ methodological values and practices. Syst Rev. 2016;5(1):192. https://doi.org/10.1186/s13643-016-0366-6.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Maher A, Cason L, Huckstepp T, Stallman H, Kannis-Dymand L, Millear P, Mason J, Wood A, Allen A. Early maladaptive schemas in eating disorders: a systematic review. Eur Eat Disord Rev J Eat Disord Assoc. 2022;30(1):3–22. https://doi.org/10.1002/erv.2866.

    Article  Google Scholar 

  53. Miller JL, Vaillancourt T, Hanna SE. The measurement of “eating-disorder-thoughts” and “eating-disorder-behaviors”: implications for assessment and detection of eating disorders in epidemiological studies. Eat Behav. 2009;10(2):89–96. https://doi.org/10.1016/j.eatbeh.2009.02.002.

    Article  PubMed  Google Scholar 

  54. Mizes JS. Criterion-related validity of the Anorectic Cognitions Questionnaire. Addict Behav. 1990;15(2):153–63. https://doi.org/10.1016/03064603%2890%2990019-T.

    Article  PubMed  Google Scholar 

  55. Mizes JS. Construct validity and factor stability of the anorectic cognitions questionnaire. Addict Behav. 1991;16(1–2):89–93. https://doi.org/10.1016/0306-4603%2891%2990045-J.

    Article  PubMed  Google Scholar 

  56. Mizes JS. Validity of the Mizes Anorectic Cognitions scale: a comparison between anorectics, bulimics, and psychiatric controls. Addict Behav. 1992;17(3):283–9. https://doi.org/10.1016/0306-4603(92)90033-r.

    Article  PubMed  Google Scholar 

  57. Mizes JS, Christiano B, Madison J, Post G, Seime R, Varnado P. Development of the Mizes Anorectic Cognitions Questionnaire-Revised: psychometric properties and factor structure in a large sample of eating disorder patients. Int J Eat Disord. 2000;28(4):415–21. https://doi.org/10.1002/1098-108X%28200012%2928:4%3C415::AID-EAT9%3E3.0.CO;2-Z.

    Article  PubMed  Google Scholar 

  58. Mizes JS, Klesges RC. Validity, reliability, and factor structure of the Anorectic Cognitions Questionnaire. Addict Behav. 1989;14(5):589–94. https://doi.org/10.1016/0306-4603%2889%2990081-6.

    Article  PubMed  Google Scholar 

  59. Modini M, Abbott MJ, Hunt C. A systematic review of the psychometric properties of trait social anxiety self-report measures. J Psychopathol Behav Assess. 2015;37(4):645–62. https://doi.org/10.1007/s10862-015-9483-0.

    Article  Google Scholar 

  60. Mountford V, Haase A, Waller G. Body checking in the eating disorders: associations between cognitions and behaviors. Int J Eat Disord. 2006;39(8):708–15. https://doi.org/10.1002/eat.20279.

    Article  PubMed  Google Scholar 

  61. Nagata JM, Compte EJ, McGuire FH, Lavender JM, Murray SB, Brown TA, Capriotti MR, Flentje A, Lubensky ME, Obedin-Maliver J, Lunn MR. Psychometric validation of the Muscle Dysmorphic Disorder Inventory (MDDI) among U.S. transgender men. Body Image. 2022;42:43–9. https://doi.org/10.1016/j.bodyim.2022.05.001.

    Article  PubMed  PubMed Central  Google Scholar 

  62. O’Loghlen E, Galligan R, Grant S. The functions of binge eating scale (FBES): development and preliminary psychometric validation. Appetite. 2023;183: 106479. https://doi.org/10.1016/j.appet.2023.106479.

    Article  PubMed  Google Scholar 

  63. Openshaw C, Waller G. Psychometric properties of the Stirling Eating Disorder Scales with bulimia nervosa patients. Eat Behav. 2005;6(2):165–8. https://doi.org/10.1016/j.eatbeh.2004.08.003.

    Article  PubMed  Google Scholar 

  64. Osberg TM, Poland D, Aguayo G, MacDougall S. The Irrational Food Beliefs Scale: development and validation. Eat Behav. 2008;9(1):25–40. https://doi.org/10.1016/j.eatbeh.2007.02.001.

    Article  PubMed  Google Scholar 

  65. Osman A, Chiros CE, Gutierrez PM, Kopper BA, Barrios FX. Factor structure and psychometric properties of the Brief Mizes Anorectic Cognitions Questionnaire. J Clin Psychol. 2001;57(6):785–99. https://doi.org/10.1002/jclp.1049.

    Article  PubMed  Google Scholar 

  66. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, Chou R, Glanville J, Grimshaw JM, Hróbjartsson A, Lalu MM, Li T, Loder EW, Mayo-Wilson E, McDonald S, McGuinness LA, Stewart LA, Thomas J, Tricco AC, Welch AV, Whiting P, Moher D. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021. https://doi.org/10.1136/bmj.n71.

    Article  PubMed  PubMed Central  Google Scholar 

  67. Papini NM, Jung M, Cook A, Lopez NV, Ptomey LT, Herrmann SD, Kang M. Psychometric properties of the 26-item eating attitudes test (EAT-26): an application of Rasch analysis. J Eat Disord. 2022;10(1):62. https://doi.org/10.1186/s40337-022-00580-3.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Phelan PW. Cognitive correlates of bulimia: the Bulimic Thoughts Questionnaire. Int J Eat Disord. 1987;6(5):593–607. https://doi.org/10.1002/1098-108X%28198709%296:5%3C593::AID-EAT2260060503%3E3.0.CO;2-M.

    Article  Google Scholar 

  69. Prnjak K, Jukic I, Mitchison D, Griffiths S, Hay P. Body image as a multidimensional concept: a systematic review of body image facets in eating disorders and muscle dysmorphia. Body Image. 2022;42:347–60. https://doi.org/10.1016/j.bodyim.2022.07.006.

    Article  PubMed  Google Scholar 

  70. Pugh M. A narrative review of schemas and schema therapy outcomes in the eating disorders. Clin Psychol Rev. 2015;39:30–41. https://doi.org/10.1016/j.cpr.2015.04.003.

    Article  PubMed  Google Scholar 

  71. Raciti MC, Norcross JC. The EAT and EDI: screening, interrelationships, and psychometrics. Int J Eat Disord. 1987;6(4):579–86. https://doi.org/10.1002/1098-108X%28198707%296:4%3C579::AID-EAT2260060418%3E3.0.CO;2-C.

    Article  Google Scholar 

  72. Reilly EE, Anderson LM, Schaumberg K, Anderson DA. Gender-based differential item functioning in common measures of body dissatisfaction. Body Image. 2014;11(3):206–9. https://doi.org/10.1016/j.bodyim.2014.02.001.

    Article  PubMed  Google Scholar 

  73. Romano KA, Lipson SK, Beccia AL, Quatromoni PA, Gordon AR, Murgueitio J. Changes in the prevalence and sociodemographic correlates of eating disorder symptoms from 2013 to 2020 among a large national sample of US young adults: a repeated cross-sectional study. Int J Eat Disord. 2022;55(6):776–89. https://doi.org/10.1002/eat.23709.

    Article  PubMed  PubMed Central  Google Scholar 

  74. Rothstein LA, Sbrocco T, Carter MM. Factor analysis of EDI-3 eating disorder risk subscales among African American women. J Black Psychol. 2017;43(8):767–77. https://doi.org/10.1177/0095798417708506.

    Article  Google Scholar 

  75. Schaefer LM, Burke NL, Thompson JK, Dedrick RF, Heinberg LJ, Calogero RM, Bardone-Cone AM, Higgins MK, Frederick DA, Kelly M, Anderson DA, Schaumberg K, Nerini A, Stefanile C, Dittmar H, Clark E, Adams Z, Macwana S, Klump KL, Vercellone AC, Paxton SJ, Swami V. Development and validation of the Sociocultural Attitudes Towards Appearance Questionnaire-4 (SATAQ-4). Psychol Assess. 2015;27(1):54–67. https://doi.org/10.1037/a0037917.

    Article  PubMed  Google Scholar 

  76. Schaefer LM, Burke NL, Thompson JK. Thin-ideal internalization: how much is too much? Eat Weight Disord. 2019;24(5):933–7. https://doi.org/10.1007/s40519-018-0498-x.

    Article  PubMed  Google Scholar 

  77. Schaefer LM, Crosby RD, Machado P. A systematic review of instruments for the assessment of eating disorders among adults. Curr Opin Psychiatry. 2021;34(6):543–62. https://doi.org/10.1097/YCO.0000000000000746.

    Article  PubMed  PubMed Central  Google Scholar 

  78. Schaefer LM, Harriger JA, Heinberg LJ, Soderberg T, Kevin Thompson J. Development and validation of the sociocultural attitudes towards appearance questionnaire-4-revised (SATAQ-4R). Int J Eat Disord. 2017;50(2):104–17. https://doi.org/10.1002/eat.22590.

    Article  PubMed  Google Scholar 

  79. Schaefer WK, Maclennan RN, Yaholnitsky-Smith SA, Stover ED. Psychometric evaluation of the eating disorder inventory (EDI) in a clinical group. Psychol Health. 1998;13(5):873–81. https://doi.org/10.1080/08870449808407437.

    Article  Google Scholar 

  80. Schulman RG, Kinder BN, Powers PS, Prange M, Gleghorn A. The development of a scale to measure cognitive distortions in bulimia. J Pers Assess. 1986;50(4):630–9. https://doi.org/10.1207/s15327752jpa5004_9.

    Article  PubMed  Google Scholar 

  81. Simone M, Hazzard VM, Askew AJ, Tebbe EA, Lipson SK, Pisetsky EM. Variability in eating disorder risk and diagnosis in transgender and gender diverse college students. Ann Epidemiol. 2022;70:53–60. https://doi.org/10.1016/j.annepidem.2022.04.007.

    Article  PubMed  PubMed Central  Google Scholar 

  82. Spangler DL, Stice E. Validation of the beliefs about appearance scale. Cogn Ther Res. 2001;25(6):813–27. https://doi.org/10.1023/A:1012931709434.

    Article  Google Scholar 

  83. Spillane NS, Boerner LM, Anderson KG, Smith GT. Comparability of the Eating Disorder Inventory-2 between women and men. Assessment. 2004;11(1):85–93. https://doi.org/10.1177/1073191103260623.

    Article  PubMed  Google Scholar 

  84. Stein JE, Modini M, Hunt C, Abbott MJ. A systematic review of the psychometric properties of trait cognitive self-report measures in social anxiety. J Psychopathol Behav Assess. 2017;39(1):147–63. https://doi.org/10.1007/s10862-016-9573-7.

    Article  Google Scholar 

  85. Stein KF, Riley BB, Hoyland-Domenico L, Lee CK. Measurement of body dissatisfaction in college-enrolled Mexican American Women: a Rasch-based examination of the validity and reliability of the EDI-III. Eat Behav. 2015;19:5–8. https://doi.org/10.1016/j.eatbeh.2015.06.001.

    Article  PubMed  PubMed Central  Google Scholar 

  86. Tasca GA, Illing V, Lybanon-Daigle V, Bissada H, Balfour L. Psychometric properties of the Eating Disorders Inventory-2 among women seeking treatment for binge eating disorder. Assessment. 2003;10(3):228–36. https://doi.org/10.1177/1073191103255001.

    Article  PubMed  Google Scholar 

  87. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42. https://doi.org/10.1016/j.jclinepi.2006.03.012.

    Article  PubMed  Google Scholar 

  88. Thompson JK, Schaefer LM, Dedrick RF. On the measurement of thin-ideal internalization: implications for interpretation of risk factors and treatment outcome in eating disorders research. Int J Eat Disord. 2018;51(4):363–7. https://doi.org/10.1002/eat.22839.

    Article  PubMed  PubMed Central  Google Scholar 

  89. Trottier K, McFarlane T, Olmsted MP, McCabe RE. The Weight Influenced Self-Esteem Questionnaire (Wise-Q): factor structure and psychometric properties. Body Image. 2013;10(1):112–20. https://doi.org/10.1016/j.bodyim.2012.08.008.

    Article  PubMed  Google Scholar 

  90. Udo T, Grilo CM. Psychiatric and medical correlates of DSM-5 eating disorders in a nationally representative sample of adults in the United States. Int J Eat Disord. 2019;52(1):42–50. https://doi.org/10.1002/eat.23004.

    Article  PubMed  Google Scholar 

  91. van Eeden AE, van Hoeken D, Hoek HW. Incidence, prevalence and mortality of anorexia nervosa and bulimia nervosa. Curr Opin Psychiatry. 2021;34(6):515–24. https://doi.org/10.1097/YCO.0000000000000739.

    Article  PubMed  PubMed Central  Google Scholar 

  92. Waller G. Recent advances in psychological therapies for eating disorders. F1000Research. 2016. https://doi.org/10.12688/f1000research.7618.1.

    Article  PubMed  PubMed Central  Google Scholar 

  93. Waller G, Gray E, Hinrichsen H, Mountford V, Lawson R, Patient E. Cognitive-behavioral therapy for bulimia nervosa and atypical bulimic nervosa: effectiveness in clinical settings. Int J Eat Disord. 2014;47(1):13–7. https://doi.org/10.1002/eat.22181.

    Article  PubMed  Google Scholar 

  94. Waller G, Kennerley H, Ohanian V. Schema-focused cognitive-behavioral therapy for eating disorders. In: Riso LP, du Toit PL, Stein DJ, Young JE, editors. Cognitive schemas and core beliefs in psychological problems: a scientist–practitioner guide. Washington: American Psychological Association; 2004. p. 139–75. https://doi.org/10.1037/11561-007.

    Chapter  Google Scholar 

  95. Wear RW, Pratz O. Test-retest reliability for the Eating Disorder Inventory. Int J Eat Disord. 1987;6(6):767–9. https://doi.org/10.1002/1098-108X%28198711%296:6%3C767::AID-EAT2260060611%3E3.0.CO;2-V.

    Article  Google Scholar 

  96. Welch G, Hall A, Walkey F. The factor structure of the Eating Disorders Inventory. J Clin Psychol. 1988;44(1):51–6. https://doi.org/10.1002/1097-4679%28198801%2944:1%3C51::AID-JCLP2270440111%3E3.0.CO;2-4.

    Article  PubMed  Google Scholar 

  97. Wells A. Emotional disorders and metacognition: innovative cognitive therapy. Hoboken: Wiley; 2000.

    Google Scholar 

  98. Williams GJ, Power KG, Miller HR, Freeman CP, Yellowlees A, Dowds T, Walker M, Parry-Jones WL. Development and validation of the Stirling Eating Disorder Scales. Int J Eat Disord. 1994;16(1):35–43. https://doi.org/10.1002/1098-108X%28199407%2916:1%3C35::AID-EAT2260160103%3E3.0.CO;2-4.

    Article  PubMed  Google Scholar 

  99. Williams-Kerver GA, Schaefer LM, Hawkins MAW, Crowther JH, Duncan J. Eating expectancies before bariatric surgery: assessment and associations with weight loss trajectories. Surg Obes Relat Dis. 2019;15(10):1793–9. https://doi.org/10.1016/j.soard.2019.07.028.

    Article  PubMed  PubMed Central  Google Scholar 

  100. Young JE, Brown G. Young Schema Questionnaire-Short Form; Version 3 (YSQ-S3, YSQ) [Database record]. APA PsycTests. 2005. https://doi.org/10.1037/t67023-000.

    Article  Google Scholar 

  101. Zuccala M, Menzies RE, Hunt CJ, Abbott MJ. A systematic review of the psychometric properties of death anxiety self-report measures. Death Stud. 2022;46(2):257–79. https://doi.org/10.1080/07481187.2019.1699203.

    Article  PubMed  Google Scholar 

Download references

Funding

The authors declare that they did not receive funding for this study.

Author information

Authors and Affiliations

Authors

Contributions

AH, AB and MA were involved in the conception and initial stages of the review. AH and SB were involved in screening, selection and data extraction. All authors were involved in approving included measures. AH prepared the first draft of the manuscript, and subsequent versions were refined in collaboration with MA. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Amaani H. Hatoum.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Competing interests

The authors declare that they are authors of two instruments included in this review (the EBQ-18, and the ED-CBQ-R). There are no other relevant financial or non-financial competing interests to report or benefits that have arisen from this research.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hatoum, A.H., Burton, A.L., Berry, S.L. et al. Psychometric properties of self-report measures of eating disorder cognitions: a systematic review. J Eat Disord 11, 233 (2023). https://doi.org/10.1186/s40337-023-00947-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40337-023-00947-0

Keywords