Transitions between versions of the International Classification of Diseases and chronic disease prevalence estimates from administrative health data: a population-based study

Sanusi, Ridwan A.; Yan, Lin; Hamad, Amani F.; Ayilara, Olawale F.; Vasylkiv, Viktoriya; Jozani, Mohammad Jafari; Banerji, Shantanu; Delaney, Joseph; Hu, Pingzhao; Wall-Wieler, Elizabeth; Lix, Lisa M.

doi:10.1186/s12889-022-13118-8

Research
Open access
Published: 09 April 2022

Transitions between versions of the International Classification of Diseases and chronic disease prevalence estimates from administrative health data: a population-based study

Ridwan A. Sanusi¹,
Lin Yan¹,
Amani F. Hamad¹,
Olawale F. Ayilara¹,
Viktoriya Vasylkiv¹,
Mohammad Jafari Jozani²,
Shantanu Banerji³,
Joseph Delaney^4,5,
Pingzhao Hu⁶,
Elizabeth Wall-Wieler¹ &
…
Lisa M. Lix¹

BMC Public Health volume 22, Article number: 701 (2022) Cite this article

1606 Accesses
4 Citations
5 Altmetric
Metrics details

Abstract

Background

Diagnosis codes in administrative health data are routinely used to monitor trends in disease prevalence and incidence. The International Classification of Diseases (ICD), which is used to record these diagnoses, have been updated multiple times to reflect advances in health and medical research. Our objective was to examine the impact of transitions between ICD versions on the prevalence of chronic health conditions estimated from administrative health data.

Methods

Study data (i.e., physician billing claims, hospital records) were from the province of Manitoba, Canada, which has a universal healthcare system. ICDA-8 (with adaptations), ICD-9-CM (clinical modification), and ICD-10-CA (Canadian adaptation; hospital records only) codes are captured in the data. Annual study cohorts included all individuals 18 + years of age for 45 years from 1974 to 2018. Negative binomial regression was used to estimate annual age- and sex-adjusted prevalence and model parameters (i.e., slopes and intercepts) for 16 chronic health conditions. Statistical control charts were used to assess the impact of changes in ICD version on model parameter estimates. Hotelling’s T² statistic was used to combine the parameter estimates and provide an out-of-control signal when its value was above a pre-specified control limit.

Results

The annual cohort sizes ranged from 360,341 to 824,816. Hypertension and skin cancer were among the most and least diagnosed health conditions, respectively; their prevalence per 1,000 population increased from 40.5 to 223.6 and from 0.3 to 2.1, respectively, within the study period. The average annual rate of change in prevalence ranged from -1.6% (95% confidence interval [CI]: -1.8, -1.4) for acute myocardial infarction to 14.6% (95% CI: 13.9, 15.2) for hypertension. The control chart indicated out-of-control observations when transitioning from ICDA-8 to ICD-9-CM for 75% of the investigated chronic health conditions but no out-of-control observations when transitioning from ICD-9-CM to ICD-10-CA.

Conclusions

The prevalence of most of the investigated chronic health conditions changed significantly in the transition from ICDA-8 to ICD-9-CM. These results point to the importance of considering changes in ICD coding as a factor that may influence the interpretation of trend estimates for chronic health conditions derived from administrative health data.

Peer Review reports

Introduction

International Classification of Diseases (ICD) codes were developed by the World Health Organization (WHO) as the worldwide standard for classifying the causes of injury and death [1]. These codes are captured in administrative health data, such as hospital records. ICD codes have been updated multiple times to reflect advances in health and medical science, and these changes are reflected in administrative health data. For example, US administrative health billing data changed from the 9^th revision Clinical Modification (ICD-9-CM) to the 10^th revision (ICD-10-CM) in 2015 [2]. Several countries, including Australia (ICD-10-AM), Canada (ICD-10-CA), Germany (ICD-10-GM), Korea (ICD-10-KM), and Thailand (ICD-10-TM), developed ICD-10 modifications to address country-specific needs [3].

In Canada, three ICD versions are captured in many administrative health databases: 8^th revision (ICD-8), 9^th revision (ICD-9), and 10^th revision (ICD-10), which were introduced by the WHO in 1965, 1975, and 1993, respectively [4,5,6]. Each ICD version has a greater number of codes, resulting in new diseases being added, and other diseases being recategorized, removed, or combined [1]. Though the increasing level of detail with each update of the ICD system is essential for diagnostic and administrative purposes, codes in one ICD version may not map (i.e., correspond) exactly to codes in another ICD version [7]. This introduces challenges in using ICD codes to track trends in disease prevalence. A change in the trend may be associated with changes in coding standards rather than a change in the true disease prevalence.

Heslin and Barrett [8] observed an upward shift in the number of diagnosed cases of alcohol abuse, alcohol-induced mental disorders, and intoxication after ICD-10-CM was introduced as compared to when ICD-9-CM was used in the US. The study used 2-sided t-statistics to test for a difference in the average quarterly counts of inpatient stays between ICD periods. Slavova et al. [2] used segmented regression to model injury hospitalizations to evaluate the effect of transitioning from ICD-9-CM to ICD-10-CM in the US; they reported a significant change in the slope estimate after the transition in 2015. The effects of transitions to a new ICD version help researchers to know where to expect significant and sustained changes in trends between one ICD version and another [9].

A control chart, an efficient statistical tool to monitor and signal changes in a process over time [10], can also be used to investigate the trend pattern before and after transitions to a different ICD version. The control chart was first introduced in manufacturing for monitoring product estimates, such as the number of defects [11]. The control chart has been used in population health and health services research and surveillance to monitor trend estimates, such as outcomes of pneumonia [12], proportion of live births by caesarean section [13], ratio of nurse attendance to ward workload [14], mortality rates [15], morbidity rates of patients after undergoing coronary artery bypass graft surgery [16], measurement error in vital signs [17], and patient dissatisfaction with hospitals [18]. As well, Hanslik et al. [19] implemented the control chart as an epidemiological tool to test for a significant increase in the average number of cases of communicable, environmental and societal diseases relating to mass gatherings. Coory et al. [20] used the control chart to monitor clinical indicators in administrative health data. Other control chart applications include monitoring infection rates and lengths of hospital stays [21]. Control charts could also be used to investigate changes in disease trends amongst ICD versions.

This study applied control charts for the surveillance of chronic health conditions over time. Our objective was to examine the impact of transitions between ICD versions on the prevalence of chronic health conditions estimated from administrative health data.

Methods

Data sources

Study data were from the province of Manitoba, Canada, which has a population of approximately 1.3 million according to the 2016 Statistics Canada Census [22]. Manitoba has a universal healthcare system for publicly-funded services, which include hospitalizations, prescription drug dispensations, and outpatient physician visits. More than 99% of the population is eligible to receive health insurance coverage. Details of who is captured and excluded from the Manitoba Health Insurance Registry is provided by Hamm et al. [23].

The Manitoba Population Research Data Repository housed at the Manitoba Centre for Health Policy, University of Manitoba, has a variety of administrative health databases that contain ICD codes and can be linked via a unique, anonymized personal health identifier. The specific databases used in this study were the Medical Services database, the Hospital Discharge Abstracts Database (DAD), and the Manitoba Health Insurance Registry (Table 1). The Medical Services database consists of physician billing claims, which are forwarded to the ministry of health for reimbursement of fee-for-service physicians. Each record includes the date of service and a single ICD code that corresponds to the reason for the physician visit. The diagnoses were recorded using the 8^th revision of ICD with adaptations (ICDA-8) from 1970 until 1979, when ICD-9-CM was adopted. Physician billing claims capture visits to family physicians and specialists provided in outpatient (i.e., clinic) settings. This database covers more than 80 provider categories and multiple specialist fields [24]. The DAD contains hospital records for all acute care facilities in the province; it does not capture emergency department visits. Diagnosis codes in the DAD were defined using ICDA-8 codes from 1970 to 1979. After this period, ICD-9-CM was used until 2004, when ICD-10-CA (ICD-10 with Canadian enhancement) was adopted. The Manitoba Health Insurance Registry contains records for all individuals eligible for health insurance coverage in the province [25]. It also captures the start and end dates of coverage and socio-demographic information. The study databases were used to define the annual study cohorts from 1974 to 2018, as well as to produce demographic characteristics and prevalence estimates of chronic health conditions. The accuracy and completeness of Manitoba’s administrative data have been demonstrated in multiple studies and tools for data quality assessment have been developed and are routinely applied to the data [26,27,28,29].

Table 1 Characteristics of the study data

Full size table

Study cohorts

The study cohorts were formed by including all individuals residing in Manitoba that were 18 years of age or older in each year between 1974 and 2018. We categorised age into the following groups: 18–29, 30–39, 40–44, 45–49, 50–54, 55–59, 60–64, 65–69, and 70 years and above. Individuals with less than three years of continuous health coverage were excluded.

Identifying chronic health conditions in administrative health data

We selected 16 chronic health conditions for this study that include both physical health and mental health. The chronic health conditions include rare and common conditions, those that affect a range of body systems; and conditions that can adversely affect health-related quality of life. Thus, the selected conditions provide a good representation of the variety of health conditions captured by the ICD system. The chronic health conditions were selected from multiple chronic health conditions derived using Clinical Classification Software (CCS) [7]. The CCS, developed by the Agency for Healthcare Research and Quality in the US, has been adopted in many studies to provide clinically meaningful and interpretable statistics such as disease prevalence, frequencies, and medical expenditures [30, 31]. Crosswalks of diagnoses codes to CCS categories are provided by Hamad et al. [7]. The selected chronic health conditions include mood and anxiety disorders, menstrual disorders, hypertension, osteoarthritis, anemia, diabetes, asthma, acute myocardial infarction, heart valve disorders, acute cerebrovascular disease, cataracts, breast cancer, colon cancer, lung and respiratory cancers, prostate cancer, and skin cancer.

Statistical analyses

The analyses were conducted for each year in the 45-year study period (1974 to 2018). Descriptive statistics, including means, standard deviations, and percentages, were used to describe the study cohorts’ demographic characteristics. We fit negative binomial regression models $(M1)$, a generalization of Poisson regression models $(M2)$ that loosen the restrictive assumption that the variance is equal to mean, to counts of the number of cases of each chronic health condition in each age group and sex strata for the ${j}^{th}$ year (j = 1,…, m; m = 45). Model goodness of fit was assessed using a likelihood ratio test with a nominal significance level of α = 0.01, to reduce the likelihood of a Type 1 error. We also compared the models’ ratio of residual deviance to degrees of freedom (df) [32] for both the negative binomial and Poisson regression models. The natural logarithm of the population size in each age group and sex strata was the model offset. Sex and age group were the model covariates. We estimated the annual age- and sex-adjusted prevalence and annual regression coefficients $\widehat{{{\varvec{\beta}}}_{\mathrm{j}}}=\left({b}_{kj}\right), k=0, 1, \dots ,(p-1)$, where $p=10$ is the number of estimated parameters. The regression model parameters for the ${j}^{th}$ year include the intercept (${b}_{0j}$) and the slopes $\left({b}_{1j},\dots ,{b}_{9j}\right)$ where the reference categories were female for sex and 40–44 years for age group.

We estimated the average annual rate of change, expressed as a percentage, in the age- and sex-adjusted prevalence for three time segments. In segment 1 (1974–1979) diagnosis codes were defined using ICDA-8 in both data sources (i.e., physician billing claims and hospital records). In segment 2 (1980–2004), diagnosis codes were defined using ICD-9-CM in both data sources. In segment 3 (2005–2018), diagnosis codes were defined using ICD-10-CA in hospital records, while in physician billing claims they were still defined using ICD-9-CM.

We used Hotelling’s T² control chart [33] to monitor and signal changes in the regression model parameter estimates for each year of the study period. We hypothesized that stability of the model parameters is an indicator of stability in the prevalence estimates. Hotelling’s T² statistic simultaneously monitors the regression model parameter estimates; an out-of-control signal occurs if the statistic is greater than a pre-specified control limit value [17]. This approach had been described previously [34,35,36,37]; Woodall et al. [35] noted that since the estimators of intercept and slope are dependent, it is reasonable to monitor them together.

We used the Durbin-Watson test statistic to detect the presence of autocorrelation (and to estimate the autocorrelation coefficient) between years [38], because the same individuals may be captured in prevalence estimates for subsequent years. Noorossana et al. [39] performed a simulation study to illustrate a significant decrease in control charts performance when autocorrelation is overlooked. To reduce the impact of autocorrelation, we used a U-statistic method (see U-Statistic Definition in Additional file 1), where the estimate for the ${j}^{th}$ year is adjusted for correlation in the preceding (i.e., j – 1) year [38, 40].

Hotelling’s T² statistic [17, 41] for the jth year is defined as.

$${T^2}_{j}\;={({{\mathbf{U}}}_{j}-{{\varvec{\upmu}}}_{U})}^{'}{{\varvec{\Sigma}}}^{-1}_{U}({{\mathbf{U}}}_{j}-{{\varvec{\upmu}}}_{U})$$

(1)

and plotted against an upper control limit (UCL) of.

$$UCL\;=\;\frac{p(m-1)(m+1)}{m(m-p)}F_{\alpha,p,m-p}$$

(2)

and a lower control limit (LCL) of zero. In Eq. 1, ${\mathbf{U}}_{j}$ is the vector of adjusted regression model parameters that are assumed to be independently and normally distributed with mean ${{\varvec{\upmu}}}_{U}$ and covariance matrix ${{\varvec{\Sigma}}}_{U}$. In Eq. 2, F is the critical value of the F distribution with degrees of freedom p and m—p, and $\alpha$ is the nominal level of significance. The control limit was 19.8 when α = 0.01. The mean vector ${{\varvec{\upmu}}}_{U}$ and covariance matrix ${{\varvec{\Sigma}}}_{U}$ can be estimated from a reference sample (or a training dataset) [20, 42]. In the absence of a reference sample, the mean vector and covariance matrix can be estimated from ${\mathbf{U}}_{j}$ [20, 42] by finding the mean and covariance of ${\mathbf{U}}_{j}$ across all $j$. The test statistic is said to be an in-control observation when it is within the bounds of LCL and UCL, otherwise it is an out-of-control observation. The presence of at least one out-of-control observation in a transition period was used to decide if changes in the ICD version affected the prevalence estimate of a chronic health condition. A transition period was defined as $\pm 2$ years around a transition year, where the transition year was 1979 when transitioning from ICDA-8 to ICD-9-CM and the transition year was 2005 when transitioning from ICD-9-CM to ICD-10-CA. Thus, the first transition period was from 1977 to 1981 and the second transition period was from 2003 to 2007. In a sensitivity analysis, we defined a transition period as $\pm 1$ year around a transition year. Since the DAD was the only data source that contains diagnoses recorded using ICD-10-CA, we also conducted separate analyses for each of the data sources (i.e., in physician billing claims, we focused on the transition period from ICDA-8 to ICD-9-CM, while in hospital records, we focused on the transition periods from ICDA-8 to ICD-9-CM and ICD-9-CM to ICD-10-CA).

Results

Characteristics of the study cohorts

The demographic characteristics of the study cohorts are described in Table 2 for selected study years. The annual cohort sizes increased from 360,341 in 1974 to 824,816 in 2018, while the average age increased from 37.3 years to 48.0 years, reflecting the growth and ageing of the Manitoba population over time.

Table 2 Demographic characteristics of the study cohorts in selected study years

Full size table

Goodness of fit tests for regression models

We compared the goodness of fit of $M1$ (negative binomial model) and $M2$ (Poisson model) for each year in the study period using a likelihood ratio test and rejected the null hypothesis (dispersion parameter is infinity) for all the chronic health conditions because the resulting p-values were less than $0.01$. This implies that the data was over-dispersed and $M1$ was a better fit to the data. Also, the ratios of residual deviance to df were small and close to one for $M1$, unlike for $M2$ where the ratios were high (see Figure S1, Additional file 1). This indicates that $M2$ did not account for the standard errors of the over-dispersed data.

Age- and sex-adjusted prevalence

Table 3 presents the age- and sex-adjusted prevalence (per 1,000 population) and the average annual rate of change (%) in prevalence for the chronic health conditions during the three time segments in which different ICD versions were used in the administrative data. Mood and anxiety disorders were the most common health condition in segments 1 and 2 with prevalence estimates of 74.4 and 138.8 in 1974 and 2004, respectively, and hypertension was the most prevalent health condition in segment 3 with prevalence estimates of 142.6 and 223.6 in 2005 and 2018, respectively. Skin cancer was the least diagnosed health condition in all the segments, with prevalence estimates of 0.3 and 2.1 in 1974 and 2018, respectively.

Table 3 Prevalence and average annual rate of change (%) in prevalence for chronic health conditions

Full size table

The highest average annual rate of change (%) in prevalence was recorded for hypertension in segment 1 (14.6; 95% confidence interval [CI]: 13.9, 15.2), asthma in segment 2 (7.4; 95% CI: 7.3, 7.5), and skin cancer in segment 3 (5.9; 95% CI: 5.6, 6.1). The lowest average annual rate of change in prevalence was recorded for menstrual disorders in segment 1 (2.4; 95% CI: 2.0, 2.8); acute myocardial infarction in segment 2 (0.9; 95% CI: 0.7, 1.0); and acute myocardial infarction in segment 3 (-1.6; 95% CI: -1.8, -1.4). Acute myocardial infarction and breast cancer showed a significant decline in prevalence during segment 3, while other health conditions showed an increase (Table 3).

Control chart results

Figure 1 displays the chronic health conditions with significant changes in regression model parameter estimates when transitioning from one ICD version to another. Within the first transition period, there was at least one out-of-control observation (i.e., at least one Hotelling’s T² statistic greater than the UCL) for each of the investigated chronic health conditions. The maximum Hotelling’s T² statistics for mood and anxiety disorders, menstrual disorders, and hypertension in the first transition period were 29.0, 22.5, and 20.9, respectively, which are greater than the UCL (Table 4). However, in the second transition period, there were no out-of-control observations for any chronic health conditions.

Table 4 Summary of Hotelling’s T² statistics for chronic health conditions in the transition periods

Full size table

Figure 2 displays the chronic health conditions with no out-of-control observations in any of the transition periods: acute myocardial infarction, acute cerebrovascular disease, colon cancer, and lung and respiratory cancers. Their respective maximum Hotelling’s T² statistics were 16.0, 19.5, 17.2, and 17.3 during the first transition period and 10.4, 12.8, 9.5, and 13.7 during the second transition period; these values are lower than the UCL (Table 4).

In the sensitivity analysis, in which a transition period was defined as $\pm 1$ year around a transition year, the results were similar to the results for the main analysis when a transition period was defined as $\pm 2$ year around a transition year. The exceptions were for osteoarthritis, asthma, and breast cancer. These health conditions had no out-of-control observations within the transition periods (see Figure S2, Figure S3, and Table S1, Additional file 1).

We conducted separate analyses for physician billing claims and hospital records. For the former, there were 10 chronic health conditions (i.e., mood and anxiety disorders, hypertension, anemia, diabetes, asthma, acute cerebrovascular disease, cataracts, breast cancer, prostate cancer, and skin cancer) with significant changes in regression model parameter estimates within the transition period from ICDA-8 to ICD-9-CM (see Figure S4 and Figure S5, Additional file 1). For hospital records, there were six chronic health conditions (i.e., menstrual disorders, acute cerebrovascular disease, breast cancer, colon cancer, lung and respiratory cancers, and skin cancer) with significant changes in regression model parameter estimates within the first transition period and one chronic health condition (i.e., skin cancer) with a significant change in regression model parameter estimates within the second transition period (i.e., when transitioning from ICD-9-CM to ICD-10-CA; see Figure S6 and Figure S7, Additional file 1).

Discussion

We used control charts to monitor the estimated regression model parameters for 16 chronic health conditions during a 45-year period (1974–2018) when three different ICD versions were used to record diagnoses in administrative health data. We focused on the effect of changes in the ICD version on the prevalence of chronic health conditions, which can result in real changes in the usage, meaning, and interpretation of diagnosis codes [7].

Our results showed that the estimated regression model parameters for most of the investigated chronic health conditions changed significantly during the transition from ICDA-8 to ICD-9-CM. There was no significant changes in the estimated regression model parameters during the transition from ICD-9-CM to ICD-10-CA when both data sources (i.e., physician billing claims and hospital records) were combined. Similar results were obtained when we examined trends in physician billing claims and hospital records separately. However, the exception to this was that we did detect a significant change in the estimated regression model parameters for skin cancer during the transition from ICD-9-CM to ICD-10-CA in hospital records.

These findings may also have occurred because there may have been less standardized training in diagnosis coding methodologies in the late 1970s than in the 2000s, and also less opportunity to communicate amongst healthcare coders, particularly in rural and remote areas of the province of Manitoba, to share information about coding practices.

Databases that include more than one ICD version represent a challenge for trend analyses because significant change in the prevalence estimates of chronic health conditions could emerge solely from the change in coding version, independent of true change in population health [43]. In other to mitigate the effect of change in ICD version on trend analysis, Janssen and Kunst [44] examined five ICD revisions in six European countries and recommended aggregating ICD codes into broader, clinically meaningful groups to reduce the impact of discontinuities in individual codes on trend estimates.

One of the strengths of this study is the population-based data that were used in trend estimation, which ensure generalizability of the results across the entire population of this Canadian province. Also, this study has a long time span, which captured data coded using three ICD versions. These strengths provide an opportunity for longitudinal research covering multiple ICD versions and a unique ability to examine changes within these ICD versions. The inclusion of ICDA-8 in this study aids in filling a knowledge gap; previous studies have focused on the transition from ICD-9 to ICD-10 only [45,46,47,48]. Tracking chronic health conditions across multiple decades can contribute to answering questions related to generational impacts of chronic health conditions [49, 50]. We considered 16 chronic health conditions that vary in prevalence and encompass multiple body systems, unlike previous studies that only focused on a single health condition or a single category of health conditions [51,52,53]. Finally, our methodology can be applied to other health conditions and to data from other Canadian provinces/territories, as well as to data from international jurisdictions.

This study is not without its limitations. Transitions between ICD versions may not be the only factor responsible for the significant changes in prevalence estimates. Prevalence of chronic health conditions derived from administrative health data may be influenced by changes in healthcare providers, including the number of primary care providers, the number and types of specialists, and the availability of other types of care (e.g., emergency departments, long-term care) [54,55,56,57]. Also, we limited our attention to modelling trends in prevalence and not trends in incidence of the selected chronic health conditions; different results might have arisen if we had focused on incidence. Furthermore, an out-of-control signal in our control charts analysis indicates a significant change in at least one of the regression model parameters; the specific coefficient(s) responsible for the signal is (are) not directly identified. However, this is not disadvantageous to our study since we were interested in detecting out-of-control signal(s), not the specific parameter responsible for signal detection.

Administrative health data have a number of strengths for research about chronic health conditions; they are relatively inexpensive to access and process and many data repositories now capture multiple decades of data [58, 59]. However, when integrating chronic disease information that arises across different versions of the ICD, there is the potential for misinterpretation of changes in trends if changes in ICD version are not accounted for. Changes in ICD versions captured in administrative health data require crosswalks of the diagnoses if data sources are to be integrated. Crosswalks between three ICD versions (ICDA-8, ICD-9-CM, and ICD-10-CA) for multiple chronic health conditions were recently developed [7].

Conclusion

In conclusion, we observed that the prevalence estimates of most of the investigated chronic health conditions were significantly affected when transitioning from ICDA-8 to ICD-9-CM, but not when transitioning from ICD-9-CM to ICD-10-CA. The findings of this study will benefit researchers and public health decision makers that rely on administrative health data spanning multiple decades to estimate change in chronic health condition prevalence.

Availability of data and materials

The data that support the findings of this study are not publicly available due to Manitoba privacy restrictions. These data are available with submission of appropriate ethics approval forms to the Health Research Ethics Board of the University of Manitoba (see https://www.umanitoba.ca/research/orec/ethics_medicine/forms.html for more details), and data access approval forms to the Manitoba Health Information Privacy Committee (see https://www.gov.mb.ca/health/hipc/submission.html for more details).

References

World Health Organization. International Statistical Classification of Diseases and Related Health Problems (ICD). 2021. https://www.who.int/standards/classifications/classification-of-diseases. Accessed 12 Sep 2021.
Slavova S, Costich JF, Luu H, Fields J, Gabella BA, Tarima S, et al. Interrupted time series design to evaluate the effect of the ICD-9-CM to ICD-10-CM coding transition on injury hospitalization trends. Inj Epidemiol. 2018;5:1–12. https://doi.org/10.1186/s40621-018-0165-8.
Article Google Scholar
Jetté N, Quan H, Hemmelgarn B, Drosler S, Maass C, Oec D-G, et al. The development, evolution, and modifications of ICD-10: challenges to the international comparability of morbidity data. Med Care. 2010;48:1105–10. https://www.jstor.org/stable/25767019.
Article Google Scholar
Lynge E, Sandegaard JL, Rebolj M. The Danish national patient register. Scand J Public Health. 2011;39(7_suppl):30–3. https://doi.org/10.1177/1403494811401482.
Article PubMed Google Scholar
Sund R. Quality of the finnish hospital discharge register: a systematic review. Scand J Public Health. 2012;40:505–15. https://doi.org/10.1177/1403494812456637.
Article PubMed Google Scholar
Lindström U, Exarchou S, Sigurdardottir V, Sundström B, Askling J, Eriksson JK, et al. Validity of ankylosing spondylitis and undifferentiated spondyloarthritis diagnoses in the Swedish National Patient Register. Scand J Rheumatol. 2015;44:369–76. https://doi.org/10.3109/03009742.2015.1010572.
Article CAS PubMed Google Scholar
Hamad AF, Vasylkiv V, Yan L, Sanusi R, Ayilara O, Delaney JA, et al. Mapping three versions of the international classification of diseases to categories of chronic conditions. Int J Popul Data Sci. 2021;6:1406. https://doi.org/10.23889/ijpds.v6i1.1406.
Article PubMed PubMed Central Google Scholar
Heslin KC, Barrett ML. Shifts in alcohol-related diagnoses after the introduction of international classification of diseases, tenth revision, clinical modification coding in U.S. hospitals: implications for epidemiologic research. Alcohol Clin Exp Res. 2018;42:2205–13. https://doi.org/10.1111/acer.13866.
Article PubMed Google Scholar
Annest JL, Hedegaard H, Chen LH, Warner M, Smalls EA. Proposed framework for presenting injury data using ICD-10-CM external cause of injury codes. National Center for Injury Prevention and Control, National Center for Health Statistics, Centers for Disease Control and Prevention. 2014. https://stacks.cdc.gov/view/cdc/27312. Accessed 24 Nov 2021.
Abbasi SA, Riaz M, Ahmad S, Sanusi RA, Abid M. New efficient exponentially weighted moving average variability charts based on auxiliary information. Qual Reliab Eng Int. 2020;36:2203–24. https://doi.org/10.1002/qre.2692.
Article Google Scholar
Montgomery DC. Introduction to statistical quality control. 7th edition. John Wiley & Sons Inc.; 2009.
Hand R, Piontek F, Klemka-Walden L, Inczauskis D. Use of statistical control charts to assess outcomes of medical care: pneumonia in Medicare patients. Am J Med Sci. 1994;307:329–34. https://doi.org/10.1097/00000441-199405000-00003.
Article CAS PubMed Google Scholar
Kaminsky FC, Maleyeff J, Mullins DL. Using SPC to analyze measurements in a healthcare organization. J Healthc Risk Manag. 1998;18:36–46. https://doi.org/10.1002/jhrm.5600180106.
Article CAS PubMed Google Scholar
Gabbay U, Bukchin M. Does daily nurse staffing match ward workload variability? Three hospitals’ experiences. Int J Health Care Qual Assur. 2009. https://doi.org/10.1108/09526860910986885.
Article PubMed Google Scholar
Jones MA, Steiner SH. Assessing the effect of estimation error on risk-adjusted CUSUM chart performance. Int J Qual Heal Care. 2012;24:176–81. https://doi.org/10.1093/intqhc/mzr082.
Article Google Scholar
Smith IR, Gardner MA, Garlick B, Brighouse RD, Cameron J, Lavercombe PS, et al. Performance monitoring in cardiac surgery: Application of statistical process control to a single-site database. Hear Lung Circ. 2013;22:634–41. https://doi.org/10.1016/j.hlc.2013.01.011.
Article Google Scholar
Mahmood T, Wittenberg P, Zwetsloot IM, Wang H, Tsui KL. Monitoring data quality for telehealth systems in the presence of missing data. Int J Med Inform. 2019;126:156–63. https://doi.org/10.1016/j.ijmedinf.2019.03.011.
Article PubMed Google Scholar
Altuntas S, Dereli T, Kaya İ. Monitoring patient dissatisfaction: a methodology based on SERVQUAL scale and statistical process control charts. Total Qual Manag Bus Excell. 2020;31:978–1008. https://doi.org/10.1080/14783363.2018.1457434.
Article Google Scholar
Hanslik T, Boelle P-Y, Flahault A. The control chart: an epidemiological tool for public health monitoring. Public Health. 2001;115:277–81. https://doi.org/10.1038/sj.ph.1900782.
Article CAS PubMed Google Scholar
Coory M, Duckett S, Sketcher-baker K. Using control charts to monitor quality of hospital care with administrative data. Int J Qual Heal Care. 2008;20:31–9. https://doi.org/10.1093/intqhc/mzm060.
Article Google Scholar
Suman G, Prajapati D. Control chart applications in healthcare: a literature review. Int J Metrol Qual Eng. 2018;9:5. https://doi.org/10.1051/ijmqe/2018003.
Article Google Scholar
Statistics Canada. Manitoba [Province] and Canada [Country] (table). Census Profile. 2016 Census. Statistics Canada Catalogue no. 98–316-X2016001. Ottawa. 2017. https://www12.statcan.gc.ca/census-recensement/2016/dp-pd/prof/details/page.cfm?Lang=E&Geo1=PR&Code1=46&Geo2=PR&Code2=01&Data=Count&SearchText=46&SearchType=Begins&SearchPR=01&B1=All&Custom=&TABID=3. Accessed 23 Nov 2021.
Hamm NC, Robitaille C, Ellison J, O’Donnell S, McRae L, Hutchings K, et al. At-a-glance–Population coverage of the Canadian chronic disease surveillance system: a survey of the contents of health insurance registries across Canada. Heal Promot Chronic Dis Prev Canada. 2021;41 No 7/8. https://doi.org/10.24095/hpcdp.41.7/8.04.
Lix LM, Walker R, Quan H, Nesdole R, Yang J, Chen G. Features of physician services databases in Canada. Chronic Dis Inj Can. 2012;32:186–93.
Article CAS Google Scholar
Roos LL, Nicol JP. A research registry: uses, development, and accuracy. J Clin Epidemiol. 1999;52:39–47. https://doi.org/10.1016/S0895-4356(98)00126-7.
Article CAS PubMed Google Scholar
Roos LL, Mustard CA, Nicol JP, McLerran DF, Malenka DJ, Young TK, et al. Registries and administrative data: organization and accuracy. Med Care. 1993;31:201–12. https://doi.org/10.1097/00005650-199303000-00002.
Article CAS PubMed Google Scholar
Roos LL, Gupta S, Soodeen R-A, Jebamani L. Data quality in an information-rich environment: Canada as an example. Can J Aging/La Rev Can du Vieil. 2005;24:153–70. https://doi.org/10.1353/cja.2005.0055.
Article Google Scholar
Lix LM, Yao X, Kephart G, Quan H, Smith M, Kuwornu JP, et al. A prediction model to estimate completeness of electronic physician claims databases. BMJ Open. 2015;5:e006858. https://doi.org/10.1136/bmjopen-2014-006858.
Article PubMed PubMed Central Google Scholar
Smith M, Lix LM, Azimaee M, Enns JE, Orr J, Hong S, et al. Assessing the quality of administrative data for research: a framework from the Manitoba Centre for Health Policy. J Am Med Informatics Assoc. 2018;25:224–9. https://doi.org/10.1093/jamia/ocx078.
Article Google Scholar
Chi M, Lee C, Wu S. The prevalence of chronic conditions and medical expenditures of the elderly by chronic condition indicator (CCI). Arch Gerontol Geriatr. 2011;52:284–9. https://doi.org/10.1016/j.archger.2010.04.017.
Article PubMed Google Scholar
Kannan VC, Andriamalala CN, Reynolds TA. The burden of acute disease in Mahajanga, Madagascar–a 21 month study. PLoS ONE. 2015;10:e0119029. https://doi.org/10.1371/journal.pone.0119029.
Article CAS PubMed PubMed Central Google Scholar
Jansen J. On the statistical analysis of ordinal data when extravariation is present. J R Stat Soc Ser C Appl Stat. 1990;39:75–84.
Google Scholar
Waterhouse M, Smith I, Assareh H, Mengersen K. Implementation of multivariate control charts in a clinical setting. Int J Qual Heal Care. 2010;22:408–14. https://doi.org/10.1093/intqhc/mzq044.
Article Google Scholar
Mahmoud MA, Woodall WH. Phase I analysis of linear profiles with calibration applications. Technometrics. 2004;46:380–91. https://doi.org/10.1198/004017004000000455.
Article Google Scholar
Woodall WH, Spitzner DJ, Montgomery DC, Gupta S. Using control charts to monitor process and product quality profiles. J Qual Technol. 2004;36:309–20. https://doi.org/10.1080/00224065.2004.11980276.
Article Google Scholar
Saeed U, Mahmood T, Riaz M, Abbas N. Simultaneous monitoring of linear profile parameters under progressive setup. Comput Ind Eng. 2018;125:434–50. https://doi.org/10.1016/j.cie.2018.09.013.
Article Google Scholar
Kim K, Mahmoud MA, Woodall WH. On the monitoring of linear profiles. J Qual Technol. 2003;35:317–28. https://doi.org/10.1080/00224065.2003.11980225.
Article Google Scholar
Khedmati M, Niaki STA. Phase II monitoring of general linear profiles in the presence of between-profile autocorrelation. Qual Reliab Eng Int. 2016;32:443–52. https://doi.org/10.1002/qre.1762.
Article Google Scholar
Noorossana R, Amiri A, Soleimani P. On the monitoring of autocorrelated linear profiles. Commun Stat Theory Methods. 2008;37:425–42. https://doi.org/10.1080/03610920701653136.
Article Google Scholar
Hauck DJ, Runger GC, Montgomery DC. Multivariate statistical process monitoring and diagnosis with grouped regression-adjusted variables. Commun Stat Comput. 1999;28:309–28. https://doi.org/10.1080/03610919908813551.
Article Google Scholar
Erfanian M, Sadeghpour Gildeh B, Reza AM. A new approach for monitoring healthcare performance using generalized additive profiles. J Stat Comput Simul. 2021;91:167–79. https://doi.org/10.1080/00949655.2020.1807981.
Article Google Scholar
Keller DS, Stulberg JJ, Lawrence JK, Samia H, Delaney CP. Initiating statistical process control to improve quality outcomes in colorectal surgery. Surg Endosc. 2015;29:3559–64. https://doi.org/10.1007/s00464-015-4108-y.
Article PubMed Google Scholar
Khera R, Dorsey KB, Krumholz HM. Transition to the ICD-10 in the United States: an emerging data chasm. JAMA. 2018;320:133–4. https://doi.org/10.1001/jama.2018.6823.
Article PubMed Google Scholar
Janssen F, Kunst AE. ICD coding changes and discontinuities in trends in cause-specific mortality in six European countries, 1950–99. Bull World Health Organ. 2004;82:904–13.
PubMed Google Scholar
De Coster C, Quan H, Finlayson A, Gao M, Halfon P, Humphries KH, et al. Identifying priorities in methodological research using ICD-9-CM and ICD-10 administrative data: report from an international consortium. BMC Health Serv Res. 2006;6:1–6. https://doi.org/10.1186/1472-6963-6-77.
Article Google Scholar
Hsu M, Wang C, Huang L, Lin C, Lin F, Toh S. Effect of ICD-9-CM to ICD-10-CM coding system transition on identification of common conditions: an interrupted time series analysis. Pharmacoepidemiol Drug Saf. 2021;30:1653–74. https://doi.org/10.1002/pds.5330.
Article CAS PubMed Google Scholar
Sebastião YV, Metzger GA, Chisolm DJ, Xiang H, Cooper JN. Impact of ICD-9-CM to ICD-10-CM coding transition on trauma hospitalization trends among young adults in 12 states. Inj Epidemiol. 2021;8:1–13. https://doi.org/10.1186/s40621-021-00298-x.
Article Google Scholar
Pollock NJ, Liu L, Wilson MM, Reccord C, Power ND, Mulay S, et al. Suicide in Newfoundland and Labrador, Canada: a time trend analysis from 1981 to 2018. BMC Public Health. 2021;21:1–11. https://doi.org/10.1186/s12889-021-11293-8.
Article Google Scholar
Wehby GL, Domingue BW, Wolinsky FD. Genetic risks for chronic conditions: implications for long-term wellbeing. J Gerontol A Biol Sci Med Sci. 2018;73:477–83. https://doi.org/10.1093/gerona/glx154.
Article PubMed Google Scholar
Rappaport SM. Genetic factors are not the major causes of chronic diseases. PLoS One. 2016;11:1–9. https://doi.org/10.1371/journal.pone.0154387.
Article CAS Google Scholar
Metcalfe A, Sheikh M, Hetherington E. Impact of the ICD-9-CM to ICD-10-CM transition on the incidence of severe maternal morbidity among delivery hospitalizations in the United States. Am J Obstet Gynecol. 2021. https://doi.org/10.1016/j.ajog.2021.03.036.
Article PubMed Google Scholar
Ohnuma T, Raghunathan K, Fuller M, Ellis AR, JohnBull EA, Bartz RR, et al. Trends in comorbidities and complications using ICD-9 and ICD-10 in total hip and knee arthroplasties. JBJS. 2021;103:696–704. https://doi.org/10.2106/JBJS.20.01152.
Article Google Scholar
Lee ES, Lee PSS, Xie Y, Ryan BL, Fortin M, Stewart M. The prevalence of multimorbidity in primary care: a comparison of two definitions of multimorbidity with two different lists of chronic conditions in Singapore. BMC Public Health. 2021;21:1–9. https://doi.org/10.1186/s12889-021-11464-7.
Article CAS Google Scholar
Glynn LG, Valderas JM, Healy P, Burke E, Newell J, Gillespie P, et al. The prevalence of multimorbidity in primary care and its effect on health care utilization and cost. Fam Pract. 2011;28:516–23. https://doi.org/10.1093/fampra/cmr013.
Article PubMed Google Scholar
Muggah E, Graves E, Bennett C, Manuel DG. The impact of multiple chronic diseases on ambulatory care use; a population based study in Ontario Canada. BMC Health Serv Res. 2012;12:1–6. http://www.biomedcentral.com/1472-6963/12/452.
Article Google Scholar
Ronksley PE, McKay JA, Kobewka DM, Mulpuru S, Forster AJ. Patterns of health care use in a high-cost inpatient population in Ottawa, Ontario: a retrospective observational study. C Open. 2015;3:E111–8. https://doi.org/10.9778/cmajo.20140049.
Article Google Scholar
Palladino R, Tayu Lee J, Ashworth M, Triassi M, Millett C. Associations between multimorbidity, healthcare utilisation and health status: evidence from 16 European countries. Age Ageing. 2016;45:431–5. https://doi.org/10.1093/ageing/afw044.
Article PubMed PubMed Central Google Scholar
Quam L, Ellis LBM, Venus P, Clouse J, Taylor CG, Leatherman S. Using claims data for epidemiologic research: the concordance of claims-based criteria with the medical record and patient survey for identifying a hypertensive population. Med Care. 1993;31:498–507. https://www.jstor.org/stable/3766130.
Article CAS Google Scholar
Motheral BR, Fairman KA. The use of claims databases for outcomes research: rationale, challenges, and strategies. Clin Ther. 1997;19:346–66. https://doi.org/10.1016/S0149-2918(97)80122-1.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We acknowledge the Manitoba Centre for Health Policy for use of data contained in the Population Health Research Data Repository (HIPC #: 2019/2020–52; MCHP Project #: 2020-005). The results and conclusions are those of the authors and no official endorsement by the Manitoba Centre for Health Policy, Manitoba Health and Seniors Care, or other data providers is intended or should be inferred.

Funding

This study was supported by funding from the Winnipeg Foundation Innovation Fund of the Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada. LML is supported by a Tier 1 Canada Research Chair.

Author information

Authors and Affiliations

Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, R3E0T6, Canada
Ridwan A. Sanusi, Lin Yan, Amani F. Hamad, Olawale F. Ayilara, Viktoriya Vasylkiv, Elizabeth Wall-Wieler & Lisa M. Lix
Department of Statistics, University of Manitoba, Winnipeg, MB, R3T2N2, Canada
Mohammad Jafari Jozani
CancerCare Manitoba Research Institute, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, R3EOV9, Canada
Shantanu Banerji
College of Pharmacy, University of Manitoba, Winnipeg, MB, R3E0T5, Canada
Joseph Delaney
Department of Epidemiology, University of Washington, Seattle, WA, 98195, USA
Joseph Delaney
Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, MB, R3E0J9, Canada
Pingzhao Hu

Authors

Ridwan A. Sanusi
View author publications
You can also search for this author in PubMed Google Scholar
Lin Yan
View author publications
You can also search for this author in PubMed Google Scholar
Amani F. Hamad
View author publications
You can also search for this author in PubMed Google Scholar
Olawale F. Ayilara
View author publications
You can also search for this author in PubMed Google Scholar
Viktoriya Vasylkiv
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Jafari Jozani
View author publications
You can also search for this author in PubMed Google Scholar
Shantanu Banerji
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Delaney
View author publications
You can also search for this author in PubMed Google Scholar
Pingzhao Hu
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Wall-Wieler
View author publications
You can also search for this author in PubMed Google Scholar
Lisa M. Lix
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors conceived the study and prepared the analysis plan. RAS and LML conducted the analysis and prepared the draft manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Ridwan A. Sanusi.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Health Research Ethics Board of the University of Manitoba. Data access approval was provided by the Manitoba Health Information Privacy Committee. Study cohort members were not required to provide informed consent for participation in this study because this study involved a retrospective review of electronic healthcare records. Informed consent to participate was waived by the Health Research Ethics Board at the University of Manitoba. The study protocol was carried out in accordance with relevant guidelines and regulations at the University of Manitoba.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

U-Statistic Definition. Table S1. Summary of Hotelling’s T² statistics for chronic health conditions in the transition periods. Figure S1. Goodness-of-fit statistics for negative binomial and Poisson regression models for 16 chronic health conditions. Figure S2. Chronic health conditions with significant changes in regression coefficients within the transition periods. Figure S3. Chronic health conditions with no significant changes in regression coefficients within the transition periods. Figure S4. Chronic health conditions with significant changes in regression model parameter estimates, physician billing claims. Figure S5. Chronic health conditions with no significant changes in regression model parameter estimates, physician billing claims. Figure S6. Chronic health conditions with significant changes in regression model parameter estimates, hospital records. Figure S7. Chronic health conditions with no significant changes in regression model parameter estimates, hospital records.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Sanusi, R.A., Yan, L., Hamad, A.F. et al. Transitions between versions of the International Classification of Diseases and chronic disease prevalence estimates from administrative health data: a population-based study. BMC Public Health 22, 701 (2022). https://doi.org/10.1186/s12889-022-13118-8

Download citation

Received: 25 November 2021
Accepted: 30 March 2022
Published: 09 April 2022
DOI: https://doi.org/10.1186/s12889-022-13118-8

Transitions between versions of the International Classification of Diseases and chronic disease prevalence estimates from administrative health data: a population-based study

Abstract

Background

Methods

Results

Conclusions

Introduction

Methods

Data sources

Study cohorts

Identifying chronic health conditions in administrative health data

Statistical analyses

Results

Characteristics of the study cohorts

Goodness of fit tests for regression models

Age- and sex-adjusted prevalence

Control chart results

Discussion

Conclusion

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Public Health

Contact us