INTRODUCTION
Chronic obstructive pulmonary disease (COPD) is characterized as a chronic respiratory disease with progressive and irreversible airflow limitations. The global prevalence of COPD stands at 13.1%, with a rising trend1. COPD is also the third-leading cause of death globally2 and is projected to increase in the next 40 years, leading to approximately 5 million deaths each year3. Although smoking is the primary risk factor for COPD, it is not the sole determinant. In 2019, non-smokers may constitute about one-third of the 391.9 million people worldwide aged 30–79 years with COPD4,5.
In addition to cigarette smoking, several studies have highlighted biomass fuels, air pollutants, workplace dust and fumes, a history of respiratory infections, poor nutrition, and socioeconomic disadvantage as notable risk factors for COPD6,7. Consequently, there is a growing importance in investigating COPD among smokers and never smokers within the field of global public health. However, there persists a dearth of knowledge regarding never-smoking COPD, with only a limited number of studies conducted on the prevalence and mortality of COPD among smokers and never smokers8-10. Therefore, it is essential to undertake additional research on the epidemiology of COPD among smokers and never smokers, to understand disease trends better and inform strategies for prevention and treatment.
The National Health and Nutrition Examination Survey (NHANES) is distinguished by its substantial and nationally representative sample size. This study aimed to examine the prevalence trend of self-reported COPD among smokers (including current smokers and ex-smokers) and never smokers aged 20–79 years in the US, along with its associated risk factors, using data from the NHANES collected between 1999 and 2018. Furthermore, a cohort study was carried out to investigate the all-cause mortality from self-reported COPD among current smokers, ex-smokers, and never smokers by linking NHANES data with mortality data from the National Death Index (NDI) in 2019.
METHODS
Data and study population
NHANES, conducted by the National Center for Health Statistics (NCHS), employs a nationally representative cross-sectional design utilizing a complex, multi-stage, stratified, clustered probability sampling method. Approval for the study was obtained from the NCHS Ethics Review Board (ERB), and all participants provided informed consent11. The data, publicly accessible on the Centers for Disease Control and Prevention (CDC) website12, encompassed health interviews conducted in respondents’ residences and health measurements obtained at mobile examination centers (MECs). Our study utilized data from 10 successive survey cycles of the NHANES (1999–2018, two-year cycle) and adhered to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines.
Definition of main variables
Using data from the National Health and Nutrition Examination Survey (NHANES), we included adults aged 20–79 years in our study. Never smokers were identified based on their responses to the question: ‘Have you smoked at least 100 cigarettes in your entire life?’. Participants who answered affirmatively were classified as smokers, while those who answered negatively were classified as never smokers. Subsequently, we further assessed smoking status with the question: ‘Do you now smoke cigarettes?’. Smokers who responded ‘every day’ or ‘some days’ were categorized as current smokers, whereas those who responded ‘not at all’ were classified as ex-smokers. The definition of COPD in this study was based on the following question: ‘Has a doctor or other health professional ever told you that you had emphysema/chronic bronchitis/COPD?’. A positive response to any part of this question was necessary to classify a case of self-reported COPD.
Other covariates
In addition to collecting data on smoking status and COPD, demographic characteristics were extracted, encompassing variables such as sex, age (20–39, 40–59, 60–79 years), race/ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic, Mexican American, other), citizenship status (US citizen, non-US citizen), an education level (lower than high school, high school or higher), body mass index (BMI, kg/ m2) categories (underweight: <18.5, normal weight: 18.5–24.9, overweight: 25.0–29.9, obese: ≥30.0), and poverty-to-income ratio (PIR) classifications (low level: PIR <2, medium level: PIR=2–4, high level: PIR >4).
Demographic information, including sex, age, race/ethnicity, citizenship status, and education level, was self-reported by participants in the questionnaire section. BMI was determined by trained health professionals at the Mobile Examination Center (MEC) using a standardized protocol, calculated as an individual’s weight (kg) divided by the square of their height (m).
The PIR was computed as the total household income divided by the poverty threshold defined by the US Census Bureau, which is an index developed by NHANES to reflect the annual income of a household adjusted for family size and established poverty threshold guidelines set by the US Department of Health and Human Services (HHS)13. A higher PIR indicates a higher income level.
Mortality follow-up
The NHANES data were cross-referenced with death records in the National Death Index (NDI) from 1999 through 31 December 2019 using a rigorous probabilistic matching process and review of death certificates based on seven matching criteria, such as Social Security number, sex, and date of birth. The follow-up period for each study participant was defined as the duration between the NHANES baseline examination and the date of the participant’s death or the last follow-up visit (31 December 2019). The primary outcome measure in this study was allcause mortality, which was defined as death resulting from any cause. For further details on pertinent mortality records and definitions of causes of death, please refer to the following resources: https://www.cdc.gov/nchs/data-linkage/mortality-public.htm.
Statistical analysis
We performed a descriptive analysis of demographic characteristics using R software (V.4.2.2). The Rao-Scott chi-squared test was utilized to assess the significance of the association between each demographic variable and COPD, and the weights provided by NHANES were integrated into the statistical analysis of the NHANES sample to adjust for the complex survey design. Our study also examined COPD prevalence in various demographic groups from 1999 to 2018 and standardized COPD prevalence using age data from the 2010 US Census, which involved stratifying into three age groups (20–39, 40–59, and 60–79 years) and calculating age-standardized prevalence for each survey period. Lastly, proportional estimations of demographic variables were calculated along with their corresponding 95% confidence intervals.
To explore potential explanatory factors such as demographic variables, we employed weighted univariate logistic regression models to analyze and identify variables with significant impacts. Four nested logistic regression models were constructed in these groups, each building upon the previous one by adding sequentially explanatory variables: sex, age, and race/ethnicity in Model 1, plus citizenship in Model 2, plus BMI and PIR in Model 3, plus education level in Model 4. Parameter estimation, significance tests for regression parameters, multicollinearity test, and the likelihood ratio test were performed.
Parameters with two-tailed p<0.05 were considered statistically significant. Finally, Model 3 was chosen for the current and never smokers, while Model 4 was chosen for the ex-smokers, and odds ratios with corresponding 95% confidence intervals were calculated for each parameter (Supplementary file).
Participants’ survival time was calculated from their baseline medical examination until death, loss to follow-up, or 31 December 2019. In addition to calculating the survival rates, the study also determined medians and quartiles of survival time. A Kaplan-Meier plot was created for current smokers, ex-smokers, and never smokers to examine the impact of COPD on all-cause mortality. Then, the log-rank test was used to assess the statistical significance of differences in survival rates between COPD and nonCOPD groups (Supplementary file).
RESULTS
Characteristics of study participants
The study included a sample of 50773 individuals aged 20–79 years (Figure 1). Of these participants, 11257 were categorized as current smokers (weighted n = 46600007), 11865 were categorized as ex-smokers (weighted n = 49512069), and 27651 were categorized as never smokers (weighted n = 11378367). The participants, categorized as current smokers, ex-smokers, and never smokers, were subsequently divided into two subgroups (COPD and non-COPD). Current smoking-related COPD was identified in 12.59% (n=1374) of current smokers, while ex-smoking-related COPD was observed in 9.61% (n=1210) of ex-smokers. Additionally, never smoking-related COPD accounted for 4.14% (n=1109) of never smokers. The proportions of current smoking COPD (n=789; 62.2%), ex-smoking-related COPD (n=576; 51.1%), and never smoking-related COPD (n=829; 75%) were higher in female patients compared to male patients. Among never smokers, female COPD patients (n=829; 75%) outnumbered male patients (n=280; 25%) by three times.
It is noteworthy that the age distribution of COPD prevalence varies by smoking status. Specifically, most individuals with current smoking-related COPD were aged 40–59 years (n=619; 48.9%), whereas the majority of those with ex-smoking-related COPD were aged 60–79 years (n=804; 56.7%). Furthermore, non-Hispanic Whites exhibited a higher proportion of COPD among current smokers (n=866; 78.9%), ex-smokers (n=752; 81.4%), and never smokers (n=510; 70.3%). However, no significant difference was observed among never smokers (p=0.677). Among current smokers, ex-smokers, and never smokers, demographic disparities in citizenship status, BMI, and PIR were observed between COPD and non-COPD subgroups. Notably, 98.2% of current smokers with COPD were US citizens (n=1330). COPD with low PIR (<2) was higher in current smokers (n=869; 58.3%), ex-smokers (n=629; 56.3%), and never smokers (n=531; 40.3%). Obese COPD (BMI ≥30) accounted for more than 50% of ex-smokers (n=583; 51.5%) and never smokers (n=561; 53.7%) (Table 1).
Table 1
The distribution of current smokers, ex-smokers, and never smokers within both the general population and the population with COPD
Between 1999 and 2018, the distribution of smoking status within the overall population underwent significant changes. Specifically, the proportion of current smokers declined, whereas the proportion of never smokers had an upward trajectory (Figure 2A). In the population with COPD, the proportion of current smokers and ex-smokers is notably high. Specifically, in 2018, the percentage of current smokers among COPD patients reached 38.9%. Conversely, the proportion of never smokers within this population exhibited a downward trend between 1999 and 2018 (Figure 2B).
Prevalence of COPD among current smokers, ex-smokers, and never smokers
Between 1999 and 2018, the crude and age-standardized prevalence rates of COPD among current smokers were 12.2% (n=1374) and 12.6% (n=1374), respectively. Among ex-smokers, these rates were 10.2% (n=1210) and 9.6% (n=1210), respectively. In never smokers, the crude and age-standardized prevalence rates of COPD were 4.0% (n=1109) and 4.1% (n=1109), respectively. The prevalence of COPD among current smokers exhibited a higher overall rate compared to ex-smokers and never smokers, with an increasing trend from 13.7% (n=96) in 1999 to 21.9% (n=192) in 2018. The prevalence of COPD among ex-smokers demonstrated a fluctuating upward trend, remaining relatively stable at 10.1% in both 1999 (n=115) and 2018 (n=180). Conversely, the prevalence of COPD among never smokers showed an overall decline, decreasing from 4.9% (n=321) in 1999 to 3.3% (n=493) in 2018 (Figure 3). For current smokers, disease incidence shows an overall increasing trend year by year by age, gender, race, education level, income level, citizenship status, and BMI. In contrast, disease rates among ex-smokers remained relatively stable. However, certain subgroups have notable fluctuations, particularly among older individuals (60–79 years), those with moderate income levels (PIR=2–4), and US citizens. These subgroups have shown an upward trend in disease incidence in recent years. In addition, among non-smokers, there was a significant decrease in the prevalence of COPD among women, US citizens, and individuals with low PIR levels (Supplementary file).
All-cause mortality of COPD among current smokers, ex-smokers, and never smokers
The sample for the all-cause mortality analysis included 50773 individuals aged 20–79 years, comprising current-smokers (n=11257), ex-smokers (n=11865), and never smokers (n=27651), drawn from the NHANES (1999–2018). The follow-up period was extended until 31 December 2019. Among the cohort of current smokers, a total of 336 patients with COPD succumbed, accounting for 21.1%, while 1362 non-COPD patients, accounting for 11.2%, also died. In the group of ex-smokers, there were 430 deaths attributed to COPD, constituting 29% of the COPD cases, alongside 1851 deaths among individuals without COPD, which corresponds to 12.3%. Among never smokers, 156 COPD patients (12%) and 2069 individuals without COPD (5.7%) died. The median follow-up period for the study was 114 months (IQR: 62–171.25). Kaplan-Meier survival curves illustrate a statistically significant difference in all-cause mortality between individuals with COPD and those without COPD among current smokers (Figure 4A), ex-smokers (Figure 4B), and never smokers (Figure 4C).
Risk factors for COPD among current smokers, ex-smokers, and never smokers
Initially, risk factors for self-reported COPD among current smokers, ex-smokers, and never smokers were identified through univariate logistic regression analysis (Supplementary file Figure 4). Subsequently, multivariate logistic regression analysis results are presented (Figure 5). Among current smokers, the following factors were identified as significant risk factors for COPD: being female (OR=1.90; 95% CI: 1.62–2.24), aged 60–79 years (OR=4.61; 95% CI: 3.71–5.73), US citizenship (OR=3.92; 95% CI: 2.48–6.22), underweight (OR=2.04; 95% CI: 1.35– 3.09), and obese (OR=1.46; 95% CI: 1.20–1.77). Among ex-smokers, the risk of COPD was found to be significantly higher in women (OR=1.32; 95% CI: 1.10–1.58), individuals aged 60–79 years (OR=3.29; 95% CI: 2.50–4.32), US citizens (OR=4.67; 95% CI: 3.06–7.12), and those classified as obese (OR=1.53; 95% CI: 1.21–1.93). Among never smokers, individuals aged 40–59 years (OR=1.59; 95% CI: 1.29–1.95) and 60–79 years (OR=2.20; 95% CI: 1.78–2.73), females (OR=2.08; 95% CI: 1.71–2.52), those with US citizenship (OR=2.15; 95% CI: 1.59– 2.92), individuals classified as overweight (OR=1.45; 95% CI: 1.13–1.86), and those classified as obese (OR=2.34; 95% CI: 1.86–2.93) exhibited a higher risk of COPD. Regardless of smoking status, whether individuals were current smokers, ex-smokers, or never smokers, higher levels of PIR were associated with a decreased risk of COPD. Additionally, among ex-smokers and non-smokers, Mexican Americans and non-Hispanic Blacks exhibit a lower risk of COPD compared to their non-Hispanic White counterparts (Figure 5).
DISCUSSION
This research utilized data from the NHANES cross-sectional study to explore the prevalence of self-reported COPD and its associated risk factors among current smokers, ex-smokers, and non-smokers, aged 20–79 years in the US from 1999 to 2018. Additionally, cohort studies were conducted to assess all-cause mortality. The prevalence of COPD among current smokers, ex-smokers, and never smokers was 12.6%, 9.6%, and 4.1%, respectively. The observed prevalence of COPD among current smokers, ex-smokers, and never smokers may vary from findings in other studies, potentially attributable to differences in the operational definition of COPD, specifically in relation to the higher incidence of positive spirometry results. Additionally, the age demographic under investigation, which predominantly consisted of individuals aged >40 years, and variations in environmental exposures between countries, likely contributed to these differences. The risk of COPD exhibited varying degrees of susceptibility among current smokers, ex-smokers, and never smokers, highlighting the multifactorial nature of the disease. Notably, the risk of COPD was higher in women compared to men across all smoking status categories, albeit to varying extents. Females had a significantly higher risk of developing the disease compared to men, as previously documented14,15. A variety of physiological, environmental, and behavioral factors16,17 may influence this difference in risk. Older age is a significant risk factor for COPD among current smokers, ex-smokers, and never smokers, and is also associated with increased all-cause mortality in our study. COPD is characterized as a chronic condition in which a combination of genetic predisposition and environmental factors gradually contribute to structural and functional deterioration of the lungs16. Early life adversities, such as preterm birth, low birth weight, asthma, and infections of the lower respiratory tract during childhood, can result in irreversible impairment of lung function, potentially heightening susceptibility to developing chronic lung diseases in adulthood17,18. The mortality rate associated with ex-smokers with COPD was higher than that observed in current smokers with COPD. This disparity may be partially attributable to the higher prevalence of COPD among the elderly ex-smokers. The prevalence of COPD among ex-smokers and never smokers varied by race, with non-Hispanic Whites exhibiting a higher prevalence compared to other racial groups.
However, the absence of racial differences in current smoking-related COPD suggests that any potential protective advantages associated with race do not mitigate the detrimental effects of cigarette smoke exposure. This indicates that following the cessation of cigarette exposure, the variability in protective benefits among different racial groups becomes apparent. This disparity may be attributed to genetic ancestry19, genetic variations, and familial medical history20,21. Prior research has indicated that non-US citizens, or immigrants, are more likely to experience adverse behavioral, mental, and physical health outcomes that may elevate their susceptibility to illness and mortality22. Contrary to these findings, our research findings indicate that non-US citizens have a decreased likelihood of developing COPD, but the underlying reasons for this discrepancy remain unclear. Our study also identified a correlation between individuals with high PIR levels, indicative of high incomes, and a decreased prevalence. Previous studies have consistently demonstrated a strong association between COPD and socioeconomic status23,24. For example, higher wealth scores are related to a lower incidence of airway obstruction25, indicating that public health interventions should prioritize addressing socioeconomic disparities. Prior studies have indicated a negative correlation between education level and the prevalence of COPD among never smokers6,26. However, our research did not find a significant connection between education level and the prevalence of self-reported COPD among never smokers.
The prevalence and mortality rates of COPD among both current smokers and ex-smokers remain significantly high, underscoring the urgent need for enhanced preventative measures targeting these populations in the US. Conversely, the declining prevalence of COPD among never smokers over recent years suggests that the US has made notable progress in the prevention of COPD among never smokers. The prevalence and mortality rates of COPD among current smokers, ex-smokers, and never smokers remain elevated. The overall disease burden is particularly pronounced in patients with COPD who also have comorbid conditions27. This presents a significant challenge for healthcare providers both currently and in the future. Studies have indicated that clinical symptoms of COPD in never smokers are less pronounced compared to those in smokers28,29, which complicates diagnosis and increases the likelihood of the condition being overlooked. Consequently, in addition to efforts aimed at preventing COPD in current and ex-smokers, early identification of COPD in never smokers is crucial to enable timely intervention and treatment.
Limitations
Our study has some limitations, primarily due to its cross-sectional design. This design prevents establishing a causal relationship between variables and COPD among current smokers, ex-smokers, and never smokers. Furthermore, the diagnosis of COPD was based on self-reported information and did not undergo validation through medical records. Then, social expectations, recall biases and proxy response biases can impact the accuracy of COPD prevalence estimates. Additionally, incomplete data in the NHANES, such as missing pulmonary function data (excluding the years 2007–2012), occupational exposure history, smoking intensity among smokers, and length of time since quitting among ex-smokers, also hindered the exploration of risk factors for COPD among current smokers, ex-smokers, and never smokers. Moreover, this study did not investigate the current prevalence of self-reported COPD among current smokers, ex-smokers, and never smokers from 2019 to the present, as well as acute exacerbation, due to limitations in this database. Finally, this study lacks generalizability to other countries.
CONCLUSIONS
This study offers a comprehensive update on the epidemiology, including prevalence, mortality, and risk factors, of self-reported COPD among current smokers, ex-smokers, and never smokers within the US population aged 20–79 years from 1999 to 2018. The findings underscore that COPD continues to pose a significant public health challenge across these groups. Furthermore, the study emphasizes the critical importance of early detection and intervention in managing COPD, warranting further investigation into effective public health strategies.