Integration of contextual, behavioral, and sociodemographic variables in predicting student dropout in Higher Education: A systematic review

Main Article Content

Ronald Verástegui Sánchez
Isidora Concepción Zapata Periche
Simón Zapata Periche
Rafael Jesús Herrera Torres

Abstract

Student dropout in higher education has been a complex and persistent problem affecting institutional efficiency, educational equity, and students' academic trajectories, prompting the development of predictive models aimed at its early detection. In this context, this article aimed to analyze the integration of contextual, behavioral, and sociodemographic variables in predictive models of student dropout in higher education, examining how these dimensions have been incorporated and valued in recent scientific literature. To this end, a systematic review was conducted following standardized methodological guidelines. The methodology employed allowed for a structured synthesis of the approaches, models, and strategies for operationalizing variables used in predicting dropout. The results showed that models integrating contextual, behavioral, and sociodemographic variables tended to have greater explanatory power and a better capacity to identify risk profiles compared to those based exclusively on academic variables. In conclusion, the study reaffirmed that student dropout was addressed more effectively from a multidimensional perspective, highlighting the need to develop more comprehensive, interpretable, and contextualized predictive models capable of appropriately guiding institutional prevention and retention strategies in higher education

Downloads

Download data is not yet available.

Article Details

How to Cite
Verástegui Sánchez , R., Zapata Periche , I. C. ., Zapata Periche, S., & Herrera Torres , R. J. . (2026). Integration of contextual, behavioral, and sociodemographic variables in predicting student dropout in Higher Education: A systematic review. Aula Virtual, 7(14), 186-203. https://doi.org/10.5281/zenodo.18814655
Section
Articles

References

Aina, C., Baici, E., Casalone, G., & Pastore, F. (2022). The determinants of university dropout: A review of the socio-economic literature. Socio-Economic Planning Sciences, 79, 101102. Documento en línea. Disponible https://doi.org/10.1016/j.seps.2021.101102

Alshahrani, A. (2025). SMOTE-optimized machine learning framework for predicting retention in workforce development training. Computational Materials and Continua, 85(2). Documento en línea. Disponible https://doi.org/10.32604/cmc.2025.065211

Álvarez Quiñones, L. I., Lozano Moncada, C. A., & Bravo Montenegro, D. A. (2022). Metodología para el mantenimiento predictivo de transformadores de distribución basada en aprendizaje automático. Ingeniería, 27(3), e17742. Documento en línea. Disponible https://doi.org/10.14483/23448393.17742

Ayala Franco, E., López Martínez, R. E., & Menéndez Domínguez, V. H. (2021). Modelos predictivos de riesgo académico en carreras de computación con minería de datos educativos. Revista de Educación a Distancia (RED), 21(66). Documento en línea. Disponible https://doi.org/10.6018/red.463561

Barahona-Anguisaca, D., Vega-Calvas, P., Moyota-Paguay, A., & Porras-Ramírez, L. (2024). Análisis de modelos estadísticos para predecir el éxito académico en estudiantes universitarios. MQR Investigar, 8(2), 2951–2969. Documento en línea. Disponible https://doi.org/10.56048/mqr20225.8.2.2024.2951-2969

Cabral Acosta, B. E., Cabral Acosta, C. L., & Cabral Acosta, K. R. (2024). Estrategias de retención estudiantil implementadas por la Facultad de Ciencias Contables, Administrativas y Económicas (2018–2022). Ciencia Latina Revista Científica Multidisciplinar, 8(4), 12440–12454. Documento en línea. Disponible https://doi.org/10.37811/cl_rcm.v8i4.13446

Caraballo Julio, N. R. (2024). El aprendizaje autónomo desde la perspectiva metacognitiva de Flavell. Ciencia Latina Revista Científica Multidisciplinar, 8(1), 10587–10603. Documento en línea. Disponible https://doi.org/10.37811/cl_rcm.v8i1.10361

Cárdenas Matute, J. M., Valle Franco, A., & Tapia Segarra, J. I. (2023). Factors that influence student desertion in the academic unit of social sciences of the Catholic University of Cuenca. ConcienciaDigital, 6(3), 30–48. Documento en línea. Disponible https://doi.org/10.33262/concienciadigital.v6i3.2621

Casanova, J. R., Gomes, A., Moreira, M. A., & Almeida, L. S. (2022). Promoting success and persistence in pandemic times: An experience with first-year students. Frontiers in Psychology, 13, 815584. Documento en línea. Disponible https://doi.org/10.3389/fpsyg.2022.815584

Castro-Azuara, M., Domínguez-Ángel, R., & Nava-Nava, R. (2020). Representaciones sobre prácticas letradas de estudiantes universitarios en situaciones vulnerables. Magis. Revista Internacional de Investigación en Educación, 12(25), 3–20. Documento en línea. Disponible https://doi.org/10.11144/javeriana.m12-25.rple

Chacha, B., López, W., & Constante, M. (2023). Predictive model of student dropout based on logistic regression. ESPOCH Congresses: The Ecuadorian Journal of STEAM, 3(1), 630–656. Documento en línea. Disponible https://doi.org/10.18502/espoch.v3i1.14477

Choque, V., & Sosa-Jauregui, V. (2022). Análisis del diseño curricular como factor de deserción académica utilizando minería de datos. Yachay – Revista Científico Cultural, 11(1), 551–555. Documento en línea. Disponible https://doi.org/10.36881/yachay.v11i1.139

Contreras-Villalobos, T., Baleriola Escudero, E., & López Leiva, V. (2024). Digging and building: How transformative mixed-methods research contributes to explaining and responding to educational exclusion and school dropout. Methods in Psychology, 11, 100170. Documento en línea. Disponible https://doi.org/10.1016/j.metip.2024.100170

Coussement, K., Phan, M., De Caigny, A., Benoit, D. F., & Raes, A. (2020). Predicting student dropout in subscription-based online learning environments: The beneficial impact of the logit leaf model. Decision Support Systems, 135, 113325. Documento en línea. Disponible https://doi.org/10.1016/j.dss.2020.113325

Deleña, R. D., Dia, N. J., Sacayan, R. R., Sieras, J. C., Khalid, S. A., Macatotong, A. H. T., & Gulam, S. B. (2025). Predicting student retention: A comparative study of machine learning approaches utilizing sociodemographic and academic factors. Systems and Soft Computing, 7, 200352. Documento en línea. Disponible https://doi.org/10.1016/j.sasc.2025.200352

Delogu, M., Lagravinese, R., Paolini, D., & Resce, G. (2024). Predicting dropout from higher education: Evidence from Italy. Economic Modelling, 130, 106583. Documento en línea. Disponible https://doi.org/10.1016/j.econmod.2023.106583

Díaz-Landa, B., Romero, R., & Rodríguez, W. (2021). Rendimiento académico de estudiantes en educación superior: Predicciones de factores influyentes a partir de árboles de decisión. Telos, 23(3), 616–639. Documento en línea. Disponible https://doi.org/10.36390/telos233.08

González, B., Mendes, T. P., Pinto, R., Correia, S. V., Albuquerque, S., & Paulino, P. (2025). Predictors of higher education dropout intention in the post-pandemic era: The mediating role of academic exhaustion. PLOS ONE, 20(7), e0327643. Documento en línea. Disponible https://doi.org/10.1371/journal.pone.0327643

González-Morales, M. O., López-Aguilar, D., Álvarez-Pérez, P. R., & Toledo-Delgado, P. A. (2025). Dropping out of higher education: Analysis of variables that characterize students who interrupt their studies. Acta Psychologica, 252, 104669. Documento en línea. Disponible https://doi.org/10.1016/j.actpsy.2024.104669

Gutiérrez, A. P., Londoño, D. M. M., Landínez-Martínez, D., & Toro-Zuluaga, N. A. (2025). Las variables sociales y la conciencia metacognitiva de los jóvenes adultos colombianos. Revista Latinoamericana de Ciencias Sociales, Niñez y Juventud, 20(3), 1–32. Documento en línea. Disponible https://doi.org/10.11600/rlcsnj.20.3.5379

Kuz, A., & Morales, R. (2023). Ciencia de datos educativos y aprendizaje automático: Un estudio de caso sobre la deserción estudiantil universitaria en México. Education in the Knowledge Society (EKS), 24, e30080. Documento en línea. Disponible https://doi.org/10.14201/eks.30080

Kuzilek, J., Zdrahal, Z., & Fuglik, V. (2021). Student success prediction using student exam behaviour. Future Generation Computer Systems, 125, 661–671. Documento en línea. Disponible https://doi.org/10.1016/j.future.2021.07.009

López-Angulo, Y., Sáez-Delgado, F., Mella-Norambuena, J., Bernardo, A. B., & Díaz-Mujica, A. (2023). Predictive model of the dropout intention of Chilean university students. Frontiers in Psychology, 13, 893894. Documento en línea. Disponible https://doi.org/10.3389/fpsyg.2022.893894

Maldonado, S., Miranda, J., Olaya, D., Vásquez, J., & Verbeke, W. (2021). Redefining profit metrics for boosting student retention in higher education. Decision Support Systems, 143, 113493. Documento en línea. Disponible https://doi.org/10.1016/j.dss.2021.113493

Mallik, S., & Gangopadhyay, A. (2023). Proactive and reactive engagement of artificial intelligence methods for education: A review. Frontiers in Artificial Intelligence, 6, 1151391. Documento en línea. Disponible https://doi.org/10.3389/frai.2023.1151391

Matas Terrón, A., Leiva Olivencia, J. J., & Núñez Rojas, N. (2020). El big data desde la perspectiva de estudiantes universitarios del norte del Perú. Propósitos y Representaciones, 8(1), e376. Documento en línea. Disponible https://doi.org/10.20511/pyr2020.v8n1.376

Matz, S. C., Bukow, C. S., Peters, H., Deacons, C., Dinu, A., & Stachl, C. (2023). Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics. Scientific Reports, 13, 5705. Documento en línea. Disponible https://doi.org/10.1038/s41598-023-32484-w

Monteverde-Suárez, D., González-Flores, P., Santos-Solórzano, R., García-Minjares, M., Zavala-Sierra, I., Luna de la Luz, V., & Sánchez-Mendiola, M. (2024). Predicting students’ academic progress and related attributes in first-year medical students. BMC Medical Education, 24, 74. Documento en línea. Disponible https://doi.org/10.1186/s12909-023-04918-6

Mustofa, S., Emon, Y. R., Mamun, S. B., Akhy, S. A., & Ahad, M. T. (2025). A novel AI-driven model for student dropout risk analysis with explainable AI insights. Computers and Education: Artificial Intelligence, 8, 100352. Documento en línea. Disponible https://doi.org/10.1016/j.caeai.2024.100352

Phan, M., De Caigny, A., & Coussement, K. (2023). A decision support framework to incorporate textual data for early student dropout prediction in higher education. Decision Support Systems, 168, 113940. Documento en línea. Disponible https://doi.org/10.1016/j.dss.2023.113940

Rabelo, A. M., & Zárate, L. E. (2025). A model for predicting dropout of higher education students. Data Science and Management, 8, 72–85. Documento en línea. Disponible https://doi.org/10.1016/j.dsm.2024.07.001

Ragni, A., Ippolito, D., & Masci, C. (2024). Assessing the impact of hybrid teaching on students’ academic performance. Socio-Economic Planning Sciences, 92, 101824. Documento en línea. Disponible https://doi.org/10.1016/j.seps.2024.101824

Rodríguez, E. M., & Treviño, M. d. R. G. (2024). Causas de abandono y estrategias de permanencia universitaria en modalidad virtual: Revisión de literatura. Ciencia Latina Revista Científica Multidisciplinar, 8(5), 6217–6241. Documento en línea. Disponible https://doi.org/10.37811/cl_rcm.v8i5.14036

Sánchez Martínez, F., Barrera-Cámara, R., & Sáenz, A. (2020). Estudio evaluativo del sistema institucional de tutorías. RIDE. Revista Iberoamericana para la Investigación y el Desarrollo Educativo, 10(20). Documento en línea. Disponible https://doi.org/10.23913/ride.v10i20.609

Silva, F. C. da, Cabral, T. L. de O., & Pacheco, A. S. V. (2020). Dropout or permanence? Predictive models for higher education management. Education Policy Analysis Archives, 28, 149. Documento en línea. Disponible https://doi.org/10.14507/epaa.28.5387

Szabó, L., Zsolnai, A., & Fehérvári, A. (2024). The relationship between student engagement and dropout risk in early adolescence. International Journal of Educational Research Open, 6, 100328. Documento en línea. Disponible https://doi.org/10.1016/j.ijedro.2024.100328

Tavico Chamay, A. T. (2021). Factores que influyen en la deserción de la carrera de administración de empresas. Revista Científica Internacional, 4(1), 39–46. Documento en línea. Disponible https://doi.org/10.46734/revcientifica.v4i1.45

Vaarma, M., & Li, H. (2024). Predicting student dropouts with machine learning: An empirical study in Finnish higher education. Technology in Society, 76, 102474. Documento en línea. Disponible https://doi.org/10.1016/j.techsoc.2024.102474

Vega-Rebolledo, I., Sánchez-García, A. J., Muñoz León, J. J., Ocharán-Hernández, J. O., & Cortés-Verdín, K. (2025). Applying survival analysis and explainable artificial intelligence to understand academic success. Array, 28, 100540. Documento en línea. Disponible https://doi.org/10.1016/j.array.2025.100540

Wagner, F., Wagner, R. G., Makuapane, L. P., Masango, M., Kolanisi, U., & Gómez-Olivé, F. X. (2024). Mental distress, food insecurity and university student dropout during COVID-19. Frontiers in Psychiatry, 15, 1336538. Documento en línea. Disponible https://doi.org/10.3389/fpsyt.2024.1336538

Zheng, L., Wang, C., Chen, X., Song, Y., Meng, Z., & Zhang, R. (2023). Evolutionary machine learning builds smart education big data platforms. Applied Soft Computing, 136, 110114. Documento en línea. Disponible https://doi.org/10.1016/j.asoc.2023.110114

Most read articles by the same author(s)