Instability of delay classification and determination of early intervention eligibility in the first two years of life

https://doi.org/10.1016/j.ridd.2013.10.017Get rights and content

Highlights

  • We assessed development longitudinally in low- and high-risk infants using the Bayley-III from 3 to 24 months.

  • It was common for Bayley-III delay classifications to switch multiple times across the first two years.

  • The Bayley-III was effective at identifying typical development but poor at identifying infants with delays.

  • Clinical opinion and risk factors should be valued above standardized assessments for identifying early delays.

Abstract

The purpose of this study was to determine the effectiveness of the Bayley Scales of Infant Development, Third Edition (Bayley-III) to track development and classify delays in low- and high-risk infants across the first two years of life. We assessed cognitive, language, and motor development in 24 low-risk full-term and 30 high-risk preterm infants via seven assessments performed between 3 and 24 months corrected age. The Bayley-III resulted in highly unstable delay classifications, low sensitivities, and poor positive predictive values across time. The results highlight that early intervention professionals, researchers, and policy makers should: (1) emphasize clinical opinion and prevalence of risk factors rather than standardized assessment findings when classifying delays and determining eligibility for services, and (2) develop more effective developmental assessments for infants and young children.

Introduction

The purpose of this study was to determine the effectiveness of repeated measurements using a common standardized assessment tool to track development in low- and high-risk infants in the first two years of life. This type of testing mirrors the standard of practice in the U.S. for assessing development when a potential delay has been noted or when high-risk infants, such as those born preterm or with brain injuries, receive developmental monitoring (Disabilities, 2006, Romanczyk et al., 2005). Each state has its own criteria for what defines “delay” and eligibility for early intervention services hinges on these criteria. In some cases, determination of eligibility is clear. For instance, if an infant is born with a diagnosis, such as Down syndrome, associated with future delays, the infant would qualify for services in most states. In other cases where outcomes are more heterogeneous so there is risk but not a certainty of future delay, it is more difficult to determine if an infant or child should receive services (Aarnoudse-Moens, Weisglas-Kuperus, van Goudoever, & Oosterlaan, 2009). This is the case for the growing population of infants born preterm.

Infants born preterm are more likely to have future cognitive, social, communication, and motor delays (Jongbloed-Pereboom et al., 2012, Taylor et al., 2011). Therefore, their development is typically closely monitored in the first years of life through follow-up clinics and home visits provided at regular intervals, such as every 3–6 months (Janssen et al., 2011). The standard of practice at these visits is to assess development using a standardized assessment tool, such as the Bayley Scales of Infant Development, Third Edition (Bayley-III). The Bayley-III is a commonly selected tool for classification of developmental delay because it assesses development across multiple domains, is normed on young populations (1–42 months), has a long history of use in clinical and research settings, and is often considered a “gold standard” for early developmental assessment (Bayley, 2006b, Vincer et al., 2005).

The Bayley-III is a useful tool for classification of early developmental delay because it allows professionals to standardize an individual's score to quantify degree of delay. This is important because if a child does not have a diagnosis clearly associated with future delays, early intervention eligibility is typically determined by his/her percentage of delay or the number of standard deviations his/her score lies below the mean. Therefore, the policy creates a demand for professionals to quantify developmental delay using specific measures. For all of these reasons, the Bayley-III was selected as the standardized assessment tool to track development in this study. Furthermore, the criterion for classification of delay was selected to be developmental performance more than 1.5 standard deviations below the mean because this is a common criterion in many states, such as New Mexico and Pennsylvania (New Mexico Administrative Code http://nmhealth.org/ddsd/nmfit/Documents/documents/7.30.8%20NMAC%2006-29-12%20FINAL.pdf; The Pennsylvania Code http://www.pacode.com/secure/data/055/chapter4226/s4226.22.html).

The two-year follow-up schedule, assessment tool, and delay criterion used in this study modeled the standard of practice and allowed us to test whether the Bayley-III is a valid and reliable means for the critical determination of a young child's early intervention eligibility. Although the newest version of the Bayley is one of the assessments most commonly used in practice and research, this is the first study to our knowledge assessing development longitudinally with the Bayley-III in a cohort of low- and high-risk infants (Johnson & Marlow, 2006). Previous research has identified problems when using other early norm-referenced assessment tools, including the second version of the Bayley, with young populations at risk for delays. Some of these problems include low sensitivity, poor positive predictive value, large discrepancies in scores between two proximal testing points, and instability of delay classification across time (Hess et al., 2004, Horner, 1988, McGrath et al., 2004, Vohr et al., 2012). Similar longitudinal studies have not been performed with the Bayley-III. Therefore, questions remain about the validity, reliability, and utility of standardized assessment tools, and specifically the commonly used Bayley-III, for young populations.

Section snippets

Participants

Fifty-four infants and families participated in this study. Parents of all infants provided informed written consent. Twenty-four participants were born full-term without any diagnosed delays or medical risks and were recruited from the community. Thirty participants were born preterm and were recruited from a regional Level Three neonatal intensive care unit (Christiana Care Health Services, Newark, DE). All full-term infants were born between 37 and 41 weeks of gestation and all preterm

Results

Between 4 and 22% of the participants demonstrated delays in the five Bayley-III subscale domains. Table 2 shows the distribution of these delays.

Fig. 1 depicts the stability of delay classifications for participants for each subscale. On average across the subscales, the classifications were Stable 44.1 ± 8.6% of the time (range 16.7–64.8%), Relatively Stable 13.0 ± 3.4% of the time (range 1.9–22.2%), and Unstable 43.0 ± 6.7% of the time (range 27.8–66.7%; Fig. 1A). The percentage of participants in

Discussion

The results of this study highlight the limitations of relying primarily on standardized assessment tools for identification of early developmental delays (McGrath et al., 2004, Vohr et al., 2012). In this group of low- and high-risk infants, all subscales of the Bayley-III resulted in highly Unstable delay classifications, low sensitivities, and poor positive predictive values across time. Although the sample size for this study was modest and the number of infants who demonstrated fine motor

Conflicts of interest

The authors declare that there are no conflicts of interest.

Acknowledgments

This research was supported by the National Institutes of Health, Eunice Kennedy Shriver National Institute of Child Health & Human Development, 1R01HD051748 (Motor learning and coordination in high-risk infants. Principal investigator: Cole Galloway, PT, PhD, August 2007–June 2012). The authors would like to express their sincere gratitude to the families who participated in this study.

References (29)

  • N. Bayley

    Bayley scales of infant development

    (2006)
  • H. Carey et al.

    The pediatric physical therapist's role in promoting and measuring participation in children with disabilities

    Pediatric Physical Therapy

    (2012)
  • C.o.C.W Disabilities

    Identifying infants and young children with developmental disorders in the medical home: An algorithm for developmental surveillance and screening

    Pediatrics

    (2006)
  • S.L. Flagler

    Multidimensional assessment of young children through play

    (1996)
  • Cited by (34)

    • Cognitive and motor development in preterm children from 6 to 36 months of age: Trajectories, risk factors and predictability

      2022, Early Human Development
      Citation Excerpt :

      Su et al. [18] found associations of lower birth body weight, male sex, moderate to severe bronchopulmonary dysplasia (BPD), severe retinopathy of prematurity (ROP) (stage III-IV), and major brain damage with the risk of deteriorating and persistently delayed motor patterns in preterm children born VLBW in the first year of age. In contrast, Lobo et al. [8] observed no association of perinatal and socio-environmental factors with unstable developmental patterns in preterm children, and neither were the early patterns predictive of later outcome. The disparity of findings in these studies might be attributed to differences in the sample characteristics and variations in the definitions of developmental patterns.

    • The stability of the Bayley scales in early childhood and its relationship with future intellectual abilities in a low to middle income country

      2022, Early Human Development
      Citation Excerpt :

      Although an overall poor agreement taking all measurement time points into account, our findings suggest a pattern of poor correlations involving Bayley scores in infancy, and improved correlations with scores later in the period of early childhood. Overall, the variation in the coefficients in the current study is similar to previous studies [3–5]. A Polish study investigating the stability of scores on the Bayley-II taken at 12, 24 and 36 months, showed low to moderate correlations with stronger correlations between the latter time points than the first in accordance with current findings [26].

    • Bayley Scales of Infant Development

      2020, Encyclopedia of Infant and Early Childhood Development
    • Cognition–Action Trade-Offs Reflect Organization of Attention in Infancy

      2018, Advances in Child Development and Behavior
      Citation Excerpt :

      Early in life, tests for cognition are strongly motor based, making it difficult to diagnose delays in one system vs another. Standardized assessments such as the Bayley III that focus on the discrete performance of tasks do not consistently identify or classify delays in the first 2 years of life and provide poor prediction of future cognition relative to assessments that involve perceptual-motor exploration and problem solving (Lobo & Galloway, 2013; Lobo, Paul, Mackley, Maher, & Galloway, 2014). Tests for cognitive skill in infancy that are motor dependent do not help determine which system is deficient and act to perpetuate the division between motor and cognitive domains in early life (Visser, Ruiter, van der Meulen, Ruijssenaars, & Timmerman, 2014).

    View all citing articles on Scopus
    View full text