Tuberculosis (TB) remains one of the leading causes of death globally, ranking second after COVID-19 among infectious agents.1 TB is a prevalent opportunistic infection and a primary cause of hospitalization and mortality in people living with HIV (PLWH).1,2
Early identification of PLWH individuals TB is critical for reducing mortality and disease burden. Inadequate diagnostics remain a major obstacle to improving outcomes in HIV-TB co-infected patients.3 Some PLWH were confirmed active TB through pathogen-based methods such as sputum smear, culture, GeneXpert MTB/Rifampicin (RIF), and targeted next-generation sequencing. Studies have shown that up to 45.8% of PLWH had undiagnosed TB at the time of death, TB burden in this population may be significantly underestimated.4,5 Consequently, there is an immediate necessity to develop methodologies capable of precisely detecting TB infection among PLWH individuals. Plasma biomarkers possess significant potential for evaluating the inflammatory response and immune status of patients with active TB. Single plasma biomarkers are unlikely to differentiate between disease states in HIV-TB co-infected individuals, necessitating the use of multiple biomarkers.6,7 PLWH infected with active TB also contributes to a higher rate of drug-resistant TB, poorer treatment outcomes, and increased mortality, further complicating TB control efforts worldwide.8
Machine learning (ML) has demonstrated significant potential in enhancing predictive capabilities within healthcare, as it can significantly enhance diagnostic accuracy, anomaly detection, and disease progression prediction.9 Biomarkers’ predictive power for TB in PLWH individuals is limited, with few studies establishing reliable models linking HIV and TB.10 Therefore, developing an accurate predictive model for TB in PLWH is highly meaningful.
In this study, we developed four predictive models based on demographic and clinical data to predict active TB in PLWH with pulmonary infection and evaluated its diagnostic accuracy in both training and validation sets of patients. The aim was to assess the value of model in predicting active TB in PLWH with pulmonary infection; conducting a screening for biomarkers that possess potential predictive significance, and to compare ability of these models to predict active TB in PLWH with pulmonary infection in the random subsets and the chronological subsets.
Materials and Methods Study SubjectsThis was a retrospective study conducted at Kunming Third People’s Hospital from April 1, 2020 to June 1, 2024. PLWH who was clinically diagnosed with pulmonary infection, regardless of their antiretroviral therapy status, were enrolled in the study. PLWH who met the following criteria were considered for pulmonary infection: (i) chest imaging suggestive of pulmonary infection; (ii) fever, cough, expectoration, dyspnea, or other respiratory symptoms. (iii) the specific pathogen causing the pulmonary infection has not yet been identified. Exclusion criteria comprised individuals with taking immunosuppressive medication, those was diagnosed with malignant tumor, those undergoing TB treatment for more than a week, and pregnant. A total of 760 PLWH were included in the study.
All participants underwent a clinical assessment that included routine laboratory tests, such as complete blood count, coagulation function, and serum biochemical tests, twelve cytokines and infection indicators measurements. The flowchart of the study is shown in Figure 1.
Figure 1 Research design and numbers of participants included in the study.
Abbreviations: TB, tuberculosis; GBM, gradient boosting machine; SVM, support vector machine; CART, classification and regression trees; RF, random forest; IFN-γ, interferon-γ; IL-6, interleukin-6.
Laboratory ExaminationsThe following routine laboratory examinations were collected at the time of admission: Complete blood count, coagulation function, serum biochemical indicators, twelve cytokines and infection indicators measurements. These included:
Complete blood count: white blood cell count (WBC), neutrophil count (NEU_CT), percentage of neutrophils (NEU_PT), lymphocyte count (LYM_CT), percentage of lymphocytes (LYM_PT), monocytes count (MON_CT), percentage of monocytes (MON_PT), eosinophil count (EOS_CT), percentage of eosinophils (EOS_PT), basophil count (BAS_CT), percentage of basophils (BAS_PT), red blood cells (RBC), hemoglobin (HGB), blood platelets (PLT). Coagulation function tests: prothrombin time (PT), PT range of motion (PT_PT), international normalized ratio (INR), partial prothrombin time (APTT), fibrinogen (FIB), thrombin time (TT). Serum biochemical tests: total bilirubin (TBIL), direct bilirubin (DBIL), aspartate aminotransferase (AST), alanine aminotransferase (ALT), albumin (ALB), globulin (GLOB), prealbumin (PALB), gamma-glutamyl transferase (GGT), alkaline phosphatase (ALP), alpha-L-fucosidase (AFU), glutamate dehydrogenase (GLDH), urea (UREA), creatinine (CREA), uric acid (UA). Twelve cytokines and infection indicators measurements: interleukin-1β (IL-1β), interleukin-2 (IL-2), interleukin-4 (IL-4), interleukin-5 (IL-5), interleukin-6 (IL-6), interleukin-8 (IL-8), interleukin-10 (IL-10), interleukin-12p70 (IL-12p70), interleukin-17 (IL-17), interferon-γ (IFN-γ), interferon-α (IFN-α), tumor necrosis factor-α (TNF-α), hypersensitive C-reactive protein (hCRP), procalcitonin (PCT), erythrocyte sedimentation rate (ESR).Additionally, six demographic characteristics were collected: gender, age, marital status, smoking history, drinking history, and drug use history.
The concentrations of the 12 cytokines were quantified using flow cytometer (DxFLEX) via multiplex bead-based flow cytometric immunoassay. All measurements were performed using PLWH blood samples, and the results are expressed in pg/mL.
Data Integration and ClassificationThe clinical data and laboratory test results for each participant were integrated using their hospital identification numbers. TB status was determined based on the medical records and categorized into two groups: with active TB and without active TB. The outcome of interest was whether the patient had active TB or not.
Outcome (TB) DefinitionsThe diagnosis of active TB in participants was made by two experienced specialists based on a comprehensive assessment of both clinical manifestations and laboratory results. A definitive diagnosis of “with active TB” was microbiologically confirmed by a positive result in any of the following tests conducted on samples collected during admission: culture, GeneXpert MTB/RIF, or Mycobacterium tuberculosis (Mtb)-DNA tests. Conversely, “without active TB” was defined as individuals with negative microbiology, IGRA, and chest radiography results, and other infectious etiology for which TB infection had been ruled out. This definition was adopted from “The People’s Republic of China health industry standard for diagnosis of pulmonary TB”, which can be seen at http://www.nhc.gov.cn/ewebeditor/uploadfile/2017/11/20171128164254246.pdf.
Statistical AnalysisThe data analysis was conducted using R software (version 4.2.3). Continuous variables were assessed for normality using the Shapiro–Wilk test. For normally distributed continuous variables, the mean and standard deviation (SD) was reported, while non-normal distributed variables were presented as the median and interquartile range (IQR). Categorical variables were summarized using frequency counts and percentages. Age were analyzed as continuous variables, five variables were analyzed as categorical variables, including: gender, marital history, smoking history, drinking history, drug use history, and other variables were analyzed as continuous variables. To compare differences between PLWH diagnosed with active TB and those without active TB, the Pearson χ2 test was applied to categorical variables. For continuous variables, a t-test was used for those following a normal distribution, and the Rank-sum test was used for non-normally distributed continuous variables. Missing data were addressed using interpolation techniques implemented in the “MICE” package, which stands for Multivariate Imputation by Chained Equations, and can enhance the accuracy and reliability of subsequent statistical analyses. Applying synthetic minority oversampling technique (SMOTE) to mitigate model bias, prevent overfitting, and improve model performance.
In order to randomly divide the dataset to ensure the credibility of the data analysis, for the construction and evaluation of predictive models, the dataset was randomly divided into a training set (80%) and a test set (20%). The splitting procedure employed stratified sampling to ensure that the class distribution of TB diagnosis in both the training and test sets remained consistent with that of the original dataset. All random processes were controlled by setting a fixed random seed to ensure full reproducibility.
To explore temporal trends, we employed chronological subsetting, partitioning the data into sequential time periods. This facilitates the identification of evolving patterns and period-specific effects that a random split might obscure. However, it assumes temporal independence between subsets and may oversimplify continuous processes by treating time as discrete states.
Model Development and ValidationFour ML algorithms—Gradient boosting machine (GBM), support vector machine (SVM), classification and regression trees (CART), and random forest (RF)—were developed and validated using the caret package. Indicators with missing values exceeding 30% were excluded from the model, and 55 variables were retained for modeling. The patients were randomly divided into a training set (80%) and a validation set (20%). The performance of each model was evaluated using several metrics, including accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), and area under the curve (AUC) from the receiver operating characteristic (ROC) curve. Kappa statistics were used to assess agreement between model predictions and actual results. To ensure model stability and repeatability, 10-fold cross-validation with 10 repetitions was applied. We also conducted an analysis across the entire dataset.
Results Basic Characteristics of the Study PopulationA total of 760 PLWH were enrolled in this retrospective study, conducted from April 1, 2020 to July 1, 2024. Of these, 212 (28.7%) were diagnosed with active TB, while 548 (71.3%) were not, 572 (75.3%) were male, and median (IQR) was 48.0 (39.0, 57.0) years old. Additionally, 574 (75.5%) were married, 308 (40.5%) had a history of smoking, 391 (51.4%) had a history of alcohol consumption, and 647 (85.1%) had a history of drug use. The cohort was randomly split into a training set (80% of the participants, n=609) and a validation set (20% of the participants, n=151) (Supplemental Table 1). In the training set, 170 patients (27.9%) were diagnosed with active TB. Of the TB patients, 79.4% were male, and the mean age was 45.1 (SD=12.9) years old. Additionally, 72.9% of TB patients were married, 36.5% had a history of smoking, 46.5% had a history of alcohol consumption, and 82.4% had a history of drug use (Table 1).
Table 1 Basic Characteristics of Study Population
Table 2 Complete Blood Count, Coagulation Function, Serum Biochemical Indicators in PLWH with Pulmonary Infection
Significant differences between with active TB and without active TB patients were observed in various parameters, including age, NEU_PT, LYM_CT, LYM_PT, RBC, HGB, PT, PT_PT, APTT, FIB, ALB, GLOB, PALB, GGT, ALP, GLDH, UREA, CREA, IL-6, IL-8, IFN-γ, hCRP, PCT and ESR (each P<0.05). These characteristics are summarized in Tables 1–3. Most of the inflammatory biomarkers are increased in the active TB cases.
Table 3 Infection Indicators in PLWH with Pulmonary Infection
The results of overall baseline analysis are shown in Supplemental Table 2. Significant differences between with active TB and without active TB patients were observed in various parameters, including age, IL-6, IFN-γ, hCRP, ESR, NEU_PT, LYM_CT, LYM_PT, HGB, PALB, GGT, ALP and UREA (each P<0.001). These characteristics are summarized in Supplemental Table 2.
Construction and Performance Validations of the ML ModelsTo develop and validate TB predictive model, the cohort was randomly split into a training set (80%) and a validation set (20%). The training set comprised 170 patients with active TB and 439 without, while the validation set included 42 patients with active TB and 109 without. Four ML models—GBM, SVM, CART, and RF—were developed using demographic characteristics and clinical parameters (55 variables in total). The GBM model demonstrated superior predictive accuracy compared to the other models (Table 4). In training set, the GBM model achieved the following performance metrics: 96.6% accuracy (95% CI: 95.5% to 97.6%), 98.8% specificity (95% CI: 98.0% to 99.6%), 93.7% sensitivity (95% CI: 91.6% to 95.8%), 95.5% NPV (95% CI: 93.9% to 97.0%), 98.4% PPV (95% CI: 97.2% to 99.5%), and 0.96 AUC (95% CI: 0.95 to 0.97). In validation set: the model showed 78.2% accuracy (95% CI: 70.7% to 84.5%), 84.4% specificity (95% CI: 77.6% to 91.2%), 61.9% sensitivity (95% CI: 47.2% to 76.6%), 85.2% NPV (95% CI: 78.5% to 91.9%), 60.5% PPV (95% CI: 45.9% to 75.1%), and 0.73 AUC (95% CI: 0.65 to 0.81). The distributions of each variable in the training and validation sets are presented in Supplemental Table 1.
Table 4 Comparison of Four Models of the Training Set and the Verification Set
Importance and Venn Diagram Analysis of Variables of Four ModelThe importance of biomarkers in predicting active TB was assessed through, variable importance analysis across the four models (Figure 2). It revealed that IFN-γ was consistently the most important variable for predicting TB in four models. In the GBM model, the top ten most important variables, ranked by importance, were: IFN-γ (100.0), age (85.8), IL-6 (74.4), TT (72.4), IL-10 (63.8), GGT (54.5), ALT (47.5), TBIL (39.2), IL-17 (35.9), and BAS-CT (35.1) (Figure 2). The ranking of variables importance for all four models is provided in Supplemental Table 3.
Figure 2 Importance of variables for top ten biomarkers of four models of GBM (A), SVM (B), CART (C), and RF (D); higher values indicating greater importance in the model.
Abbreviations: ALT, alanine aminotransferase; BAS_CT, basophil count; DBIL, direct bilirubin; ESR, erythrocyte sedimentation rate; GGT, gamma-glutamyltransferase; HGB, hemoglobin; hCRP, hypersensitive C-reactive protein; IL-5, interleukin-5; IL-6, interleukin-6; IL-8, interleukin-8; IL-10, interleukin-10; IL-17, interleukin-17; IFN-γ, interferon-γ; LYM_PT, percentage of lymphocytes; NEU_PT, percentage of neutrophils; PLT, blood platelets; RBC, red blood cells; TT, thrombin time; TBIL, total bilirubin; UREA, urea.
The common element of the top ten important variables across all four models was analyzed using a Venn diagram, which revealed that IFN-γ, IL-6 and age were consistently identified as key biomarkers across all models, and both IFN-γ, IL-6 and age are key biomarkers for predicting active TB in PLWH (Figure 3).
Figure 3 Intersection of top ten predictive variables in four.
Abbreviations: ALT, alanine aminotransferase; BAS_CT, basophil count; DBIL, direct bilirubin; ESR, erythrocyte sedimentation rate; GGT, gamma-glutamyltransferase; HGB, hemoglobin; hCRP, hypersensitive C-reactive protein; IL-5, interleukin-5; IL-6, interleukin-6; IL-8, interleukin-8; IL-10, interleukin-10; IL-17, interleukin-17; IFN-γ, interferon-γ; LYM_PT, percentage of lymphocytes; NEU_PT, percentage of neutrophils; PLT, blood platelets; RBC, red blood cells; TT, thrombin time; TBIL, total bilirubin; UREA, urea.
Reanalysis of Chronological DatasetThe dataset was reanalysed using the first 609 records as the training set and subsequent 151 records as the test set. In training set, the GBM model achieved the following performance metrics: 92.7% accuracy (95% CI: 92.7% to 92.7%), 96.2% specificity (95% CI: 94.7.0% to 97.6%), 88.2% sensitivity (95% CI: 85.4% to 91.0%), 91.5% NPV (95% CI: 89.5% to 973.6%), 94.5% PPV (95% CI: 92.4.% to 96.6%), and 0.92 AUC (95% CI: 0.91 to 0.94). In validation set: the model showed only 70.4% accuracy (95% CI: 70.1% to 70.7%), 76.1% specificity (95% CI: 68.1% to 84.1%), 55.8% sensitivity (95% CI: 40.0% to 70.7%), 81.4% NPV (95% CI: 73.8% to 88.9%), 48.0% PPV (95% CI: 34.2% to 61.8%), and 0.66 AUC (95% CI: 0.58 to 0.75). Comparison of four models of the training set and the verification set is presented in randomly split datasets (Supplemental Table 4). On chronological order, GBM model trained from the first 609 records AUC of 0.92 (0.91, 0.94) poorly predicted the 151 final records with the AUC of 0.66 (0.58, 0.75).
DiscussionIn this retrospective study, we developed and validated four predictive models based on demographic characteristics and clinical parameters to predict active TB in PLWH with pulmonary infection patients. Our results show that GBM model offers robust predictive value in random dataset. Additionally, serum cytokines, particularly IFN-γ and IL-6 demonstrated stable significance in both the variable importance analysis and Venn diagram. However, inability of GBM model to have good prediction in chronological validation dataset.
GBM model is a good predictor of active TB in PLWH. It performed well in AUC, specificity and sensitivity. At present, the GBM model has been widely used in disease prediction, including the diagnosis of gastrointestinal stromal tumors,11 the prediction of survival in patients with colorectal cancer,12 the detection of coronary artery disease.13 GBM model has high predictive performances at the individual scale, and it was demonstrated significant predictive efficacy and found extensive applications in disease prediction.12
Various ML algorithms, including the GBM model, have been developed and applied to tackle many problems in TB-related research.14 There exist several models utilized for the diagnosis of TB. A retrospective study developed an ML model based on leukocyte volume, conductivity, and light scatter parameters, which demonstrated strong potential for distinguishing active TB and Latent TB infection.15 Liu et al developed a diagnostic model for active TB that showed a remarkable potential for efficient and accurate diagnosis.16 Queiroz et al used a neural network-based ML approach to identify TB in advanced PLWH using plasma concentrations of interleukin 15 and interleukin 10.17 Ye et al employed ML to identify blood-specific factors in patients with bone and joint TB, constructing a diagnostic model that could aid clinicians in future diagnosis.18 Zhang et al built logistic regression, RF, SVM, and k-nearest neighbor models using four-cytokines to identify TB among PLWH.19 Despite these advances, to the best of our knowledge, the predictive potential of biomarkers for TB in PLWH individuals has been limited, with few studies establishing reliable models linking HIV and TB.10
In our study, we found that value of IFN-γ and IL-6 are higher in patients with active TB compared with patients without active TB. Our research outcomes are in accordance with the finding documented in the current literature.19,20 Multiple possible explanations exist for the strong predictive value of these two cytokines. IFN-γ and IL-6 are elevated in TB patients, which may be related to the body’s inflammatory response. When people are infected with Mtb, IFN-γ in the body will increase to help macrophages in getting rid of the Mtb infection.21,22 IFN-γ is involved in reactive oxygen species production and Mtb clearance by inducing gene expression in the GTPase family,23 and IFN-γ can limit the replication of Mtb within macrophages by significantly reducing the use of iron by intracellular Mtb.24,25 Although HIV-associated immune dysfunction, PLWH often exhibit indeterminate or blunted IFN-γ responses. In TB patients of PLWH, IFN-γ levels may tend to increase compared to without active TB patients. The literature also indicates that IFN-γ is an important predictor of TB in PLWH,19 this aligns with our research findings. When people are infected with Mtb, the infection triggers dysregulation of IL-6 production in the body, leading to an excessive IL-6 production in epithelium.26 IL-6 induces the expression of various proteins responsible for acute inflammation, excessive production of IL-6 indicates that the body is undergoing an autoimmune and inflammatory response.27 IL-6-induced immune dysregulation is a key contributor to the pathogenesis of TB.28,29 Many studies have shown IFN-γ and IL-6 serve as pivotal biomarkers not only in the diagnosis of TB but also throughout its treatment progress, severity assessment, and prognosis, playing an indispensable role in each of these aspects.30–38
To address the limitations of Expert (which directly detects TB and drug resistance) and IGRA (which requires complex antigen stimulation and processing), the use of IFN-γ and IL-6 can help evaluate immune status, differentiate infection, or monitor treatment. This approach offers faster speed, accessibility, lower cost, and easier specimen collection. It holds particular clinical value in patients with false-negative IGRA results.39 IFN-γ and IL-6 provides different information than current tests that pathogen-based detection (eg, GeneXpert) and immune-based assays (eg, IGRA). By employing multi-factor combinatorial strategies, it can improve diagnostic accuracy, especially in complex or resource-limited settings.40
In our study, we also found that value of age is lower in patients with active TB compared with patients without active TB. It is consistent with the literature reports prevalence of TB is most common in the 15–44 age range.41 People in this age group tend to have a high prevalence rate of TB infection. Coupled with their participation in a wider range of social activities and high-risk behaviors for TB, they deserve more attention. Both hCRP, an acute-phase protein, and ESR, which reflects levels of acute-phase proteins such as fibrinogen, were elevated in cases of TB infection.42–44 Our study likewise observed this association. An increase in HGB may occur during TB infection. This is supported by the findings of our study.45 NEU_PT, commonly used as an inflammatory biomarker, tends to increase following TB infection.46 In contrast, LYM_CT and LYM_PT, which reflect the host’s immune status, are often observed to decrease in TB infection, as reported in previous studies.47 The decrease in PALB levels, which correlates with chronic consumption and inflammatory activity, is often significantly reduced in TB patients due to inflammatory consumption and loss of appetite—a finding consistent with our observations.48 Similarly, elevations in GGT and ALP, sensitive indicators of hepatobiliary disease, may occur in TB as a result of the infection itself or drug-induced liver injury from anti-TB medications,44,49 which aligns with our study results. Our findings are consistent with the observation that UREA levels may decrease in TB infection, potentially due to its utilization as a nitrogen source by Mtb.50
Our study focuses on predicting TB in PLWH with pulmonary infections. The dataset does not include other infectious diseases, the identified biomarkers might simply reflect general inflammation rather than signatures specific to TB. However, other inflammatory mediators such as CRP, NEU-PT and others were not a significant predictor for TB. Thus, the two cytokines of IL-6 and IFN-γ probably had the highest link to TB process than other cytokines. The difference in roles of different cytokines needs to be clarified by future research.
Our study offers several strengths. We are the first to utilize ML models to predict TB in PLWH with pulmonary infection. The optimal model was selected through rigorous analysis using various ML techniques and demonstrated superior predictive performance. The sufficient sample size in this study provides a robust foundation for model development and validation. However, the study has several limitations. First, all participants were PLWH presenting with pulmonary infection symptoms, their baseline levels of IFN-γ and IL-6 may differ systematically from those in HIV-negative individuals. Therefore, our diagnostic model may not be directly applicable to HIV-negative individuals. Secondly, while the GBM model exhibited good performance in both the training and the validation sets, it has not been tested on external cohorts. The single-center nature of the study may also limit the generalizability of the findings to a global population. Finally, our study cannot distinguish active TB or Latent tuberculosis infection (LTBI). Without active TB group may include some individuals with LTBI, which could affect the performance of our model.
That the GBM model has good predictions was consistently reported by the previous investigator.15,18,19 Their conclusions on high predictive values of the biomarkers were however based solely on predicting the random subsets. Theoretically, the relationship between variables in the random subset would be closed to that in the training (random) subset. This way of splitting created a flaw of overvaluing the predicting ability. The predictivity of the model in the training subset declined in the chronological subset indicated that in reality, the relationship between the set of variables had been changed over time. We strongly advocate this chronological approach in future work to avoid misleading results.
ConclusionIn conclusion, our findings indicate GBM model effectively predicts active TB in PLWH, based on the data from blood samples even without sputum and alveolar lavage fluid samples. This predictive model provides an alternative approach for diagnosing active TB in PLWH. IFN-γ and IL-6 are valuable biomarkers for predicting active TB in this population. These biomarkers may play a crucial role in detecting active TB among PLWH and offer valuable insights for monitoring and preventing TB in this high-risk group.
AbbreviationsAFU, alpha-L-fucosidase; ALB, albumin; ALP, alkaline phosphatase; ALT, alanine aminotransferase; APTT, partial prothrombin time; AST, aspartate aminotransferase; BAS_CT, basophil count; BAS_PT, percentage of basophils; CREA, creatinine; DBIL, direct bilirubin; EOS_CT, eosinophil count; EOS_PT, percentage of eosinophils; ESR, erythrocyte sedimentation rate; FIB, fibrinogen; GGT, gamma-glutamyltransferase; GLDH, glutamate dehydrogenase; GLOB, globulin; hCRP, hypersensitive C-reactive protein; HGB, hemoglobin; IFN-α, interferon-α; IFN-γ, interferon-γ; IL-10, interleukin-10; IL-12p70, interleukin-12p70; IL-1β, interleukin-1β; IL-17, interleukin-17; IL-2, interleukin-2; IL-4, interleukin-4; IL-5, interleukin-5; IL-6, interleukin-6; IL-8, interleukin-8; INR, international normalized ratio; LYM_CT, lymphocyte count; LYM_PT, percentage of lymphocytes; MON_CT, monocytes count; MON_PT, percentage of monocytes; NEU_CT, neutrophil count; NEU_PT, percentage of neutrophils; PALB, prealbumin; PCT, procalcitonin; PLT, blood platelets; PT, prothrombin time; PT_PT, PT range of motion; RBC, red blood cells; TBIL, total bilirubin; TNF-α, tumor necrosis factor-α; TT, thrombin time; UA, uric acid; UREA, urea; WBC, white blood cell count.
Ethical Approval and Consent to ParticipateEthics approval was obtained from the Ethics Committee of Kunming Third People’s Hospital (No. KSLL2023071153). This study was conducted in compliance with the Declaration of Helsinki.
The need for informed consent to participate was waived by the Ethics Committee of Kunming Third People’s Hospital because of the retrospective nature of the study. The names and identification numbers of all patients included in this study were anonymized to ensure confidentiality and protect the identities of the participants.
AcknowledgmentsThe authors are grateful for assistance of Lu Zhou and Lei Zhang in data collection and organization. Their expertise and dedication were instrumental in the successful completion of this study.
Author ContributionsAll authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.
FundingThis work was funded by Clinical Medical Center Scientific Research Project of Yunnan Province (2024YNLCYXZX0205), Yumnan Province Major Science and Technology Special Plan (202402AA310011) and Kunming Young and Middle-aged Academic and Technical Leaders (202405B040053).
DisclosureThe authors report no conflicts of interest in this work.
References1. World Health Organization. Global tuberculosis report 2024. Geneva; 2024.
2. Ford N, Matteelli A, Shubber Z, et al. TB as a cause of hospitalization and in-hospital mortality among people living with HIV worldwide: a systematic review and meta-analysis. J Int AIDS Soc. 2016;19(1):20714. doi:10.7448/IAS.19.1.20714
3. Gupta RK, Lucas SB, Fielding KL, et al. Prevalence of tuberculosis in post-mortem studies of HIV-infected adults and children in resource-limited settings: a systematic review and meta-analysis. AIDS. 2015;29(15):1987–13. doi:10.1097/QAD.0000000000000802
4. Bates M, Mudenda V, Shibemba A, et al. Burden of tuberculosis at post mortem in inpatients at a tertiary referral centre in sub-Saharan Africa: a prospective descriptive autopsy study. Lancet Infect Dis. 2015;15(5):544–551. doi:10.1016/S1473-3099(15)70058-7
5. Yang Q, Han J, Shen J, et al. Diagnosis and treatment of tuberculosis in adults with HIV. Medicine. 2022;101(35):e30405. doi:10.1097/MD.0000000000030405
6. Jacobsen M, Repsilber D, Gutschmidt A, et al. Candidate biomarkers for discrimination between infection and disease caused by Mycobacterium tuberculosis. J Mol Med. 2007;85(6):613–621. doi:10.1007/s00109-007-0157-6
7. Dheda K, Van-Zyl Smit RN, Sechi LA, et al. Clinical diagnostic utility of IP-10 and LAM antigen levels for the diagnosis of tuberculous pleural effusions in a high burden setting. PLoS One. 2009;4(3):e4689. doi:10.1371/journal.pone.0004689
8. Khan PY, Yates TA, Osman M, et al. Transmission of drug-resistant tuberculosis in HIV-endemic settings. Lancet Infect Dis. 2019;19(3):e77–e88. doi:10.1016/S1473-3099(18)30537-1
9. Chen J, Liu L, Huang J, et al. LSTM-based prediction model for tuberculosis among HIV-infected patients using structured electronic medical records: a retrospective machine learning study. J Multidiscip Health. 2024;17:3557–3573. doi:10.2147/JMDH.S467877
10. Lesosky M, Rangaka MX, Pienaar C, et al. Plasma biomarkers to detect prevalent or predict progressive tuberculosis associated with human immunodeficiency Virus-1. Clin Infect Dis. 2019;69(2):295–305. doi:10.1093/cid/ciy823
11. Hu B, Wang C, Jiang K, et al. Development and validation of a novel diagnostic model for initially clinical diagnosed gastrointestinal stromal tumors using an extreme gradient-boosting machine. BMC Gastroenterol. 2021;21(1):481. doi:10.1186/s12876-021-02048-1
12. Bibault JE, Chang DT, Xing L. Development and validation of a model to predict survival in colorectal cancer using a gradient-boosted machine. Gut. 2021;70(5):884–889. doi:10.1136/gutjnl-2020-321799
13. Guo Y, Xia C, Zhong Y, et al. Machine learning-enhanced echocardiography for screening coronary artery disease. Biomed Eng Online. 2023;22(1):44. doi:10.1186/s12938-023-01106-x
14. Hansun SJTJ. Medical informatics in Indonesia: importance, development, and future directions. TEM J. 2021;78–81. doi:10.18421/TEM101-10
15. Chen L, Yuan L, Sun T, et al. The performance of VCS (volume, conductivity, light scatter) parameters in distinguishing latent tuberculosis and active tuberculosis by using machine learning algorithm. BMC Infect Dis. 2023;23(1):881. doi:10.1186/s12879-023-08531-2
16. Liu Y, Zhang L, Wu F, et al. Identification and validation of a pyroptosis-related signature in identifying active tuberculosis via a deep learning algorithm. Front Cell Infect Mi. 2023;13:1273140. doi:10.3389/fcimb.2023.1273140
17. Queiroz ATL, Araújo-Pereira M, Barreto-Duarte B, et al. Immunologic biomarkers in peripheral blood of persons with tuberculosis and advanced HIV. Front Immunol. 2022;13:890003. doi:10.3389/fimmu.2022.890003
18. Ye Z, Zhu J, Liu C, et al. Difference between the blood samples of patients with bone and joint tuberculosis and patients with tuberculosis studied using machine learning. Front Surg. 2022;9:1031105. doi:10.3389/fsurg.2022.1031105
19. Zhang H, Li L, Liu Y, et al. Serum cytokine biosignatures for identification of tuberculosis among HIV-positive inpatients. Thorax. 2024;79(5):465–471. doi:10.1136/thorax-2023-220782
20. Manabe YC, Andrade BB, Gupte N, et al. A parsimonious host inflammatory biomarker signature predicts incident tuberculosis and mortality in advanced human immunodeficiency virus. Clin Infect Dis. 2020;71(10):2645–2654. doi:10.1093/cid/ciz1147
21. Herbst S, Schaible UE, Schneider BE. Interferon gamma activated macrophages kill mycobacteria by nitric oxide induced apoptosis. PLoS One. 2011;6(5):e19105. doi:10.1371/journal.pone.0019105
22. Zhang J, Sun B, Huang Y, et al. IFN-γ promotes THP-1 cell apoptosis during early infection with Mycobacterium bovis by activating different apoptotic signaling. FEMS Immunol Med Microbiol. 2010;60(3):191–198. doi:10.1111/j.1574-695X.2010.00732.x
23. Kim BH, Shenoy AR, Kumar P, et al. A family of IFN-γ-inducible 65-kD GTPases protects against bacterial infection. Science. 2011;332(6030):717–721. doi:10.1126/science.1201711
24. Abreu R, Essler L, Giri P, et al. Interferon-gamma promotes iron export in human macrophages to limit intracellular bacterial replication. PLoS One. 2020;15(12):e0240949. doi:10.1371/journal.pone.0240949
25. Chao A, Sieminski PJ, Owens CP, et al. Iron acquisition in Mycobacterium tuberculosis. Chem Rev. 2019;119(2):1193–1220. doi:10.1021/acs.chemrev.8b00285
26. Domingo-Gonzalez R, Prince O, Cooper A, et al. Cytokines and chemokines in Mycobacterium tuberculosis INFECTION. Microbiol Spectr. 2017;33–72.
27. Ciechowski P, Dempke WCM. Interleukin-6: a Masterplayer in the Cytokine Network. Oncology. 2020;98(3):131–137. doi:10.1159/000505099
28. Obeagu E, Leticia O, Nwanjo H, et al. Evaluation of Interferon-Gamma, Interleukin 6 and Interleukin 10 in tuberculosis patients in Umuahia. iMedPub. 2019.
29. Al-Aska AI, Al-Anazi AR, Al-Subaei SS, et al. CD4+ T-lymphopenia in HIV negative tuberculous patients at King Khalid University Hospital in Riyadh, Saudi Arabia. Eur J Med Res. 2011;16(6):285. doi:10.1186/2047-783X-16-6-285
30. Pillay K, Lewis L, Rambaran S, et al. Plasma biomarkers of risk of tuberculosis recurrence in HIV co-infected patients from South Africa. Front Immunol. 2021;12:631094. doi:10.3389/fimmu.2021.631094
31. Nie W, Wang J, Jing W, et al. Value of serum cytokine biomarkers TNF-α, IL-4, sIL-2R and IFN-γ for use in monitoring bacterial load and anti-tuberculosis treatment progress. Cytokine. 2020;2(2):100028.
32. Verma S, Du P, Nakanjako D, et al. Tuberculosis in advanced HIV infection is associated with increased expression of IFN-γ and its downstream target. BMC Infect Dis. 2018;18(1):220. doi:10.1186/s12879-018-3127-4
33. Kisuya J, Chemtai A, Raballah E, et al. The diagnostic accuracy of Th1 (IFN-γ, TNF-α, and IL-2) and Th2 (IL-4, IL-6 and IL-10) cytokines response in AFB microscopy smear negative PTB- HIV co-infected patients. Sci Rep. 2019;9(1):2966. doi:10.1038/s41598-019-39048-x
34. Seyedhosseini F, Mohammadi S, Zare Ebrahimabad M, et al. Interleukin-6, Interleukin-17 and transforming growth factor-beta are overexpressed in newly diagnosed tuberculosis patients; potent biomarkers of mycobacterial infection. Arch Clin Infect Dis. 2019;14:e68417. doi:10.5812/archcid.68417
35. Joshi L, Ponnana M, Sivangala R, et al. Evaluation of TNF-α, IL-10 and IL-6 cytokine production and their correlation with genotype variants amongst tuberculosis patients and their household contacts. PLoS One. 2015;10(9):e0137727. doi:10.1371/journal.pone.0137727
36. Sullivan ZA, Wong EB, Ndung’u T, et al. Latent and active tuberculosis infection increase immune activation in individuals co-infected with HIV. EBioMedicine. 2015;2(4):334–340. doi:10.1016/j.ebiom.2015.03.005
37. Sivro A, McKinnon LR, Yende-Zuma N, et al. Plasma cytokine predictors of tuberculosis recurrence in antiretroviral-treated human immunodeficiency virus-infected individuals from Durban, South Africa. Clin Infect Dis. 2017;65(5):819–826. doi:10.1093/cid/cix357
38. Nosik M, Ryzhov K, Rymanova I, et al. Dynamics of plasmatic levels of pro- and anti-inflammatory cytokines in HIV-infected individuals with M. tuberculosis co-infection. Microorganisms. 2021;9(11). doi:10.3390/microorganisms9112291
39. 古丽比克·木拉提, 姚丛,任卫聪,等. 细胞因子联合检测在结核病辅助诊断中的应用价值 [Auxiliary diagnostic value of combined cytokines in Tuberculosis]. 新疆医学. 2023;53(5):524–527. Chinese.
40. Li Q, Ren W, Yuan J, et al. Significant difference in Th1/Th2 paradigm induced by tuberculosis-specific antigens between IGRA-positive and IGRA-negative patients. Front Immunol. 2022;13:904308. doi:10.3389/fimmu.2022.904308
41. Yang H, Ruan X, Li W, et al. Global, regional, and national burden of tuberculosis and attributable risk factors for 204 countries and territories, 1990-2021: a systematic analysis for the Global Burden of Diseases 2021 study. BMC public Health. 2024;24(1):3111. doi:10.1186/s12889-024-20664-w
42. 中国医药教育协会感染疾病专业委员会. 感染相关生物标志物临床意义解读专家共识 [Expert Consensus on Clinical Interpretation of Infection-related Biomarkers]. 中华结核和呼吸杂志. 2017; (4). Chinese.
43. Gaeddert M, Glaser K, Chendi BH, et al. Host blood protein biomarkers to screen for tuberculosis disease: a systematic review and meta-analysis. J Clin Microbiol. 2024;62(11):14. doi:10.1128/jcm.00786-24
44. 秦坤, 李莉, 邓婷婷, 等. 肺结核预后血清学指标研究进展 [Research progress on prognostic serological indicators of pulmonary tuberculosis]. 分子诊断与治疗杂志. 2024;16(2):391–394. Chinese.
45. Barzegari S, Afshari M, Movahednia M, et al. Prevalence of anemia among patients with tuberculosis: a systematic review and meta-analysis. Indian J Tuberc. 2019;66(2):299–307. doi:10.1016/j.ijtb.2019.04.002
46. Shiratori B, Leano S, Nakajima C, et al. Elevated OPN, IP-10, and neutrophilia in loop-mediated isothermal amplification confirmed tuberculosis patients. Mediators Inflammation. 2014;513263.
47. Liu X, Xu J, Feng Y, et al. Application of γδ T cells with different memory phenotypes in clinical diagnosis of active pulmonary tuberculosis. Microb Pathogenesis. 2025;198:107032. doi:10.1016/j.micpath.2024.107032
48. Wang C, Wei -L-L, Shi L-Y, et al. Screening and identification of five serum proteins as novel potential biomarkers for cured pulmonary tuberculosis. Sci Rep. 2015;5:15615. doi:10.1038/srep15615
49. Mokondjimobe E, Longo-Mbenza B, Akiana J, et al. Biomarkers of oxidative stress and personalized treatment of pulmonary tuberculosis: emerging role of Gamma-Glutamyltransferase. Adv Pharmacol Sci. 2012;2012(7):465634. doi:10.1155/2012/465634
50. Lin W, Mathys V, Ang E, et al. Urease activity represents an alternative pathway for Mycobacterium tuberculosis Nitrogen. Metabolism Infect Immun. 2012;80:2771–2779. doi:10.1128/IAI.06195-11
Comments (0)