A Correlation Technique to Reduce the Number of Predictors to Estimate the Survival Time of HIV/ AIDS Patients on ART


  • Anurag Sharma Department of Statistics, University of Delhi, Delhi-110007, India
  • Vajala Ravi Department of Statistics, Lady Shri College, University of Delhi, Delhi-110024, India
  • Gurprit Grover Department of Statistics, University of Delhi, Delhi-110007, India
  • Rabindra Nath Das Department of Statistics, The University of Burdwan, Burdwan, West Bengal-713104, India
  • M.K. Varshney Department of Statistics, Hindu College, University of Delhi, Delhi-110007, India




AIDS, AFT, Correlation, Chi- Square test, One- Way ANOVA.


Till now, many research papers have been published which aims to estimate the survivle time of the HIV/AIDS patients taking into consideration all the predictors viz, Age, Sex, CD4, MOT, Smoking, Weight, HB, Coinfection, Time, BMI, Location Status, Marital Status, Drug etc, although all the predictors need not to be included in the model. Since some of the predictors may be correlated/ associated and may have some influence on the outcome variable, therefore, instead of taking both the significantly correlated/ associated predictors, we may take only one of the two. In this way, we may be able to reduce the number of predictors without affecting the estimated survival time. In this paper we have tried to reduce the number of predictors by determining the highly positively correlated predictors and then evaluating the effect of correlation/ association on the survival time of HIV/AIDS patients. These predictors that we have considered in the starting are Age, Sex, State, Smoking, Alcohol, Drugs, Opportunistic Infections (OI), Living Status (LS), Occupation (OC), Marital Status (MS) and Spouse for the data collected from 2004 to 2014 of AIDS patients in an ART center of Delhi, India. We have performed one – way ANOVA to test the association between a quantitative and a categorical variable and Chi-square test to test between two categorical variables. To select one of the two highly correlated/ associated predictors, a suitable model is fitted keeping one predictor independent at a time and other dependent and the model having the smaller AIC is considered and the independent variable in the model is included in the modified model. The fitted models are logistic, linear and multinomial logistic depending on the type of the independent variable to be fitted. Then the true model (having all the predictors) and the modified model (with reduced number of predictors) are compared on the basis of their AICs and the model having minimum AIC is chosen. In this way we could reduce the number of predictors by almost 50% without affecting the estimated survival time with a reduced standard error.


AIDS info Fact Sheet, http://aidsinfo.nih.gov/2012

WHO: Global update on HIV treatment 2013: Results, Impact and Opportunities 2013.

UNAIDS (2014): a GAP report 2014.

Ghate M, Deshpande S, Tripathy S, Godbole S, Nene M, et al. Mortality in HIV infected individuals in Pune, India. Ind J Med Res 2011; 133: 414-420.

Rai S, Mahapatra B, Sircar S, Raj PY, Venkatesh S, et al. Adherence to antiretroviral therapy and its effect on survival of HIV-infected individuals in Jharkhand, India. PloS One 2013. DOI: 10.1371/journal.pone.0066860 DOI: https://doi.org/10.1371/journal.pone.0066860

Kee MK, Lee JH, Kim EJ, Lee J, Nam JG, et al. Improvement in survival among HIV-infected individuals in the Republic of Korea: Need for an early HIV diagnosis. BMC Infect Dis 2009; 9: 128-128. https://doi.org/10.1186/1471-2334-9-128 DOI: https://doi.org/10.1186/1471-2334-9-128

Jerene D, Endale A, Hailu Y, Lindtjørn B. Predictors of early death in a cohort of Ethiopian patients treated with HAART. BMC Infect Dis 2006; 6: 136-136. https://doi.org/10.1186/1471-2334-6-136 DOI: https://doi.org/10.1186/1471-2334-6-136

Hernán MA, Cole SR, Margolick J, Cohen M, Robins JM. Structural accelerated failure time models for survival analysis in studies with time-varying treatments. J. Pharmacoepidemiol. Drug Safety 2005; 14: 477-491. https://doi.org/10.1002/pds.1064 DOI: https://doi.org/10.1002/pds.1064

Xue H, Lam KF, Cowling BJ, Wolf FD. Semi-parametric accelerated failure time regression analysis with application to interval-censored HIV/AIDS data. Stat Med 2006; 25: 3850-3863. https://doi.org/10.1002/sim.2486 DOI: https://doi.org/10.1002/sim.2486

Grover G, Banerjee T. Estimation of survival times of HIV-1 infected children for doubly and interval censored data. Electron J App Sta Anal 2011; 4: 155-163. DOI: 10.1285/i20705948v4n2p155

Nawumbeni DN, Luguterah A, Adampah T. Performance of Cox Proportional Hazard and Accelerated Failure Time Models in the Analysis of HIV/TB Co-infection Survival Data. Research on Humanities and Social Sciences 2014; 4(21): 94-102.

Tarekegn S. The Effect of HAART on Incidence of Tuberculosis among HIV Infected Patient in Hawassa University Referral Hospital, South Ethiopia Clinics in Rungwe District, Tanzania. Unpublished dissertation in partial fulfillment of the requirements for the degree of Master of Science, Tanzania 2011.

Musenge E, Vounatsou P, Collinson M, Tollman S, Kaln K. The Contribution of the Spatial Analysis to Understanding HIV/TB Mortality in Children a Structural Equation Modeling Approach. Global Health Action 2013 PMCID: PMC35566702 DOI: 103402/gha.v6i0.19266. DOI: https://doi.org/10.3402/gha.v6i0.19266

Grover G, Kumar SP. Determination of predictors associated with HIV/AIDS patients on ART using Accelerated Failure Time model for interval, American Journal of BioStatistics (USA) 2016; 6(1): ISSN: 1948-9897. DOI: https://doi.org/10.3844/amjbsp.2016.12.19

Grover G, Gadpayle AK, Varshney MK. On the estimation of probability of death of AIDS patients in the presence of competing risks, Aligarh Journal of Statistics 2012; 32: 69-83.

Grover G, Ravi V. On the estimation of expected survival time of AIDS patients undergoing Antiretroviral therapy using generalized Poisson regression model, Turkiye Klinikleri J BioStat (Turkey) 2014; 6(1): 538-541. ISSN: 2249-555X

Tibshirani R. The lasso method for variable selection in the Cox model. Statistics in Medicine 1997; 16(4): 385-395. https://doi.org/10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3 DOI: https://doi.org/10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3

Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2005; 67(2): 301-320. https://doi.org/10.1111/j.1467-9868.2005.00503.x DOI: https://doi.org/10.1111/j.1467-9868.2005.00503.x




How to Cite

Sharma, A., Ravi, V., Grover, G., Das, R. N., & Varshney, M. (2018). A Correlation Technique to Reduce the Number of Predictors to Estimate the Survival Time of HIV/ AIDS Patients on ART. International Journal of Statistics in Medical Research, 7(4), 129–136. https://doi.org/10.6000/1929-6029.2018.07.04.3



General Articles