Optimizing Machine Learning Models for Urinary Tract Infection Diagnostics: A Comparative Study of Logistic Regression and Random Forest
|
Keywords:
Logistic Regression, Random Forest, Urinary Tract Infections Detection, UTI Diagnostics, Machine Learning
AbstractUrinary Tract Infections (UTIs) present a significant healthcare challenge due to their prevalence and diagnostic complexity. Timely and accurate diagnosis is critical for effective treatment, yet traditional methods like microbial cultures and urinalysis are often slow and inconsistent. This study introduces machine learning (ML) as a transformative solution for UTI diagnostics, particularly focusing on logistic regression and random forest models renowned for their interpretability and robustness. We conducted a meticulous hyperparameter tuning process using a rich dataset from a clinic in Northern Mindanao, Philippines, incorporating demographic, clinical, and urinalysis data. Our research outlines a detailed methodology for applying and refining these ML models to predict UTI outcomes accurately. Through comprehensive hyperparameter optimization, we enhanced the predictive performance, demonstrating a significant improvement over standard diagnostic practice. The findings reveal a clear superiority of the random forest model, achieving a top testing accuracy of 0.9814, compared to the best-performing logistic regression model's accuracy of 0.7626. This comparative analysis not only validates the efficacy of ML in medical diagnostics but also emphasizes the potential clinical impact of these models in real-world settings. The study contributes to the burgeoning literature on ML applications in healthcare by providing a blueprint for optimizing ML models for clinical use, particularly in diagnosing UTIs. It underscores the promise of ML in augmenting diagnostic precision, thereby potentially reducing the global healthcare burden associated with UTIs. Downloads
Download data is not yet available.
ReferencesÖztürk, R., & Murt, A. (2020). Epidemiology of urological infections: a global burden. World journal of urology, 38, 2669-2679. https://doi.org/10.1007/s00345-019-03071-4 Akhtar, A., Ahmad Hassali, M. A., Zainal, H., & Khan, A. H. (2021). A cross-sectional assessment of urinary tract infections among geriatric patients: prevalence, medication regimen complexity, and factors associated with treatment outcomes. Frontiers in public health, 9, 657199. Mancuso, G., Midiri, A., Gerace, E., Marra, M., Zummo, S., & Biondo, C. (2023). Urinary tract infections: the current scenario and future prospects. Pathogens, 12(4), 623. https://doi.org/10.3390/pathogens12040623 Iriya, R., Braswell, B., Mo, M., Zhang, F., Haydel, S. E., & Wang, S. (2024). Deep Learning-Based Culture-Free Bacteria Detection in Urine Using Large-Volume Microscopy. Biosensors, 14(2), 89. https://doi.org/10.3390/bios14020089 Schinas, G., Dimopoulos, G., & Akinosoglou, K. (2023). Understanding and implementing diagnostic stewardship: a guide for resident physicians in the era of antimicrobial resistance. Microorganisms, 11(9), 2214. https://doi.org/10.3390/microorganisms11092214 Hasan, J., & Bok, S. (2024). Plasmonic Fluorescence Sensors in Diagnosis of Infectious Diseases. Biosensors, 14(3), 130. https://doi.org/10.3390/bios14030130 Jarzembowski, T., & Daca, A. (Eds.). (2024). Advances and Challenges in Urine Laboratory Analysis. Kight, E. C. (2023). Advancements in Point-of-Care Diagnostic Assays for Non-Invasive Samples in Resource Constrained Settings (Doctoral dissertation, Vanderbilt University). Sykes, J. E., Reagan, K. L., Nally, J. E., Galloway, R. L., & Haake, D. A. (2022). Role of diagnostics in epidemiology, management, surveillance, and control of leptospirosis. Pathogens, 11(4), 395. https://doi.org/10.3390/pathogens11040395 Kumar, Y., Koul, A., Sisodia, P. S., Shafi, J., Kavita, V., Gheisari, M., & Davoodi, M. B. (2021). Heart failure detection using quantum-enhanced machine learning and traditional machine learning techniques for internet of artificially intelligent medical things. Wireless Communications and Mobile Computing, 2021, 1-16. https://doi.org/10.1155/2021/1616725 Tsai, A. Y., Carter, S. R., & Greene, A. C. (2024, January). Artificial Intelligence in Pediatric Surgery. In Seminars in Pediatric Surgery (p. 151390). WB Saunders. https://doi.org/10.1016/j.sempedsurg.2024.151390 Pachiyannan, P., Alsulami, M., Alsadie, D., Saudagar, A. K. J., AlKhathami, M., & Poonia, R. C. (2024). A Novel Machine Learning-Based Prediction Method for Early Detection and Diagnosis of Congenital Heart Disease Using ECG Signal Processing. Technologies, 12(1), 4. https://doi.org/10.3390/technologies12010004 Efron, B. (2020). Prediction, estimation, and attribution. International Statistical Review, 88, S28-S59. https://doi.org/10.1111/insr.12409 Dastile, X., Celik, T., & Potsane, M. (2020). Statistical and machine learning models in credit scoring: A systematic literature survey. Applied Soft Computing, 91, 106263. https://doi.org/10.1016/j.asoc.2020.106263 Zhao, B., Song, R., Guo, X., & Yu, L. (2023). Bridging Interpretability and Performance: Enhanced Machine Learning-based Prediction of Hematoma Expansion Post-Stroke via Comprehensive Feature Selection. IEEE Access. https://doi.org/10.1109/ACCESS.2023.3348244 Rajula, H. S. R., Verlato, G., Manchia, M., Antonucci, N., & Fanos, V. (2020). Comparison of conventional statistical methods with machine learning in medicine: diagnosis, drug development, and treatment. Medicina, 56(9), 455. https://doi.org/10.3390/medicina56090455 Shu, X., & Ye, Y. (2023). Knowledge Discovery: Methods from data mining and machine learning. Social Science Research, 110, 102817. https://doi.org/10.1016/j.ssresearch.2022.102817 Mannering, F., Bhat, C. R., Shankar, V., & Abdel-Aty, M. (2020). Big data, traditional data and the tradeoffs between prediction and causality in highway-safety analysis. Analytic methods in accident research, 25, 100113. https://doi.org/10.1016/j.amar.2020.100113 De Bruyne, S., De Kesel, P., & Oyaert, M. (2023). Applications of Artificial Intelligence in Urinalysis: Is the Future Already Here? Clinical Chemistry, 69(12), 1348-1360. https://doi.org/10.1093/clinchem/hvad136 Caruccio, L., Cirillo, S., Polese, G., Solimando, G., Sundaramurthy, S., & Tortora, G. (2024). Can ChatGPT provide intelligent diagnoses? A comparative study between predictive models and ChatGPT to define a new medical diagnostic bot. Expert Systems with Applications, 235, 121186. https://doi.org/10.1016/j.eswa.2023.121186 Çubukçu, H. C., Topcu, D. İ., & Yenice, S. (2023). Machine learning-based clinical decision support using laboratory data. Clinical Chemistry and Laboratory Medicine (CCLM), (0). https://doi.org/10.1515/cclm-2023-1037 Morado, F., & Wong, D. W. (2022). Applying diagnostic stewardship to proactively optimize the management of urinary tract infections. Antibiotics, 11(3), 308. https://doi.org/10.3390/antibiotics11030308 Xu, R., Deebel, N., Casals, R., Dutta, R., & Mirzazadeh, M. (2021). A new gold rush: a review of current and developing diagnostic tools for urinary tract infections. Diagnostics, 11(3), 479. https://doi.org/10.3390/diagnostics11030479 Santos, M., Mariz, M., Tiago, I., Martins, J., Alarico, S., & Ferreira, P. (2022). A review on urinary tract infections diagnostic methods: Laboratory-based and point-of-care approaches. Journal of Pharmaceutical and Biomedical Analysis, 219, 114889. https://doi.org/10.1016/j.jpba.2022.114889 Bischl, B., Binder, M., Lang, M., Pielok, T., Richter, J., Coors, S., ... & Lindauer, M. (2023). Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 13(2), e1484. https://doi.org/10.1002/widm.1484 Asif, D., Bibi, M., Arif, M. S., & Mukheimer, A. (2023). Enhancing heart disease prediction through ensemble learning techniques with hyperparameter optimization. Algorithms, 16(6), 308. https://doi.org/10.3390/a16060308 Quinton, F., Presles, B., Leclerc, S., Nodari, G., Lopez, O., Chevallier, O., ... & Alberini, J. L. (2024). Navigating the nuances: comparative analysis and hyperparameter optimisation of neural architectures on contrast-enhanced MRI for liver and liver tumour segmentation. Scientific Reports, 14(1), 3522. https://doi.org/10.1038/s41598-024-53528-9 Naik, N., Talyshinskii, A., Shetty, D. K., Hameed, B. Z., Zhankina, R., & Somani, B. K. (2024). Smart Diagnosis of Urinary Tract Infections: is Artificial Intelligence the Fast-Lane Solution?. Current Urology Reports, 25(1), 37-47. https://doi.org/10.1007/s11934-023-01192-3 Anahtar, M. N., Yang, J. H., & Kanjilal, S. (2021). Applications of machine learning to the problem of antimicrobial resistance: an emerging model for translational research. Journal of clinical microbiology, 59(7), 10-1128. https://doi.org/10.1128/jcm.01260-20 Luz, C. F., Vollmer, M., Decruyenaere, J., Nijsten, M. W., Glasner, C., & Sinha, B. (2020). Machine learning in infection management using routine electronic health records: tools, techniques, and reporting of future technologies. Clinical Microbiology and Infection, 26(10), 1291-1299. https://doi.org/10.1016/j.cmi.2020.02.003 Avarice02. (2024). Urinalysis Test Results Dataset. Kaggle Dataset. Retrieved from https://www.kaggle.com/datasets/avarice02/urinalysis-test-results |
Published
2024-03-31
Section
Articles
How to Cite
Airlangga, G. (2024). Optimizing Machine Learning Models for Urinary Tract Infection Diagnostics: A Comparative Study of Logistic Regression and Random Forest. Jurnal Informatika Ekonomi Bisnis, 6(1), 246-250. https://doi.org/10.37034/infeb.v6i1.854
![]() This work is licensed under a Creative Commons Attribution 4.0 International License. |


















