Human Metapneumovirus (HMPV) is a prominent respiratory pathogen, particularly affecting children, the elderly, and immunocompromised populations. Early detection of HMPV is critical for timely intervention and improved patient outcomes; however, traditional diagnostic methods are often hindered by overlapping symptoms with other respiratory illnesses. This research explores the application of machine learning models for HMPV detection using synthetic clinical data designed to replicate real-world scenarios. The dataset incorporates vital clinical features such as fever, cough, fatigue, symptom duration, oxygen saturation, heart rate, and respiratory rate. To address data imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was employed, resulting in improved sensitivity toward minority class cases. A tuned XGBoost classifier demonstrated robust performance, achieving an accuracy of 73.54%, an F1-score of 0.7063, and a ROC-AUC of 0.7990. Key visualizations, including confusion matrices, ROC curves, and feature importance analyses, provided insights into the model’s efficacy and clinical relevance. This study underscores the potential of machine learning in augmenting clinical decision-making processes for early and accurate detection of HMPV, while also highlighting the importance of preprocessing techniques like data balancing in enhancing model performance. These findings pave the way for scalable, AI-driven diagnostic solutions that can be extended to other respiratory illnesses.
Cite this paper
Filippis, R. D. and Foysal, A. A. (2025). Machine Learning-Based Detection of Human Metapneumovirus (HMPV) Using Clinical Data. Open Access Library Journal, 12, e3193. doi: http://dx.doi.org/10.4236/oalib.1113193.
Panda, S., Mohakud, N.K., Pena, L. and Kumar, S. (2014) Human Metapneumovirus: Review of an Important Respiratory Pathogen. International Journal of Infectious Diseases, 25, 45-52. https://doi.org/10.1016/j.ijid.2014.03.1394
Pa-penburg, J. and Boivin, G. (2010) The Distinguishing Features of Human Metapneumovirus and Respiratory Syncytial Virus. Reviews in Medical Virology, 20, 245-260. https://doi.org/10.1002/rmv.651
Crowe, J.E. (2004) Human Metap-neumovirus as a Major Cause of Human Respiratory Tract Disease. Pediatric Infectious Disease Journal, 23, S215-S221. https://doi.org/10.1097/01.inf.0000144668.81573.6d
Kroll, J. and Weinberg, A. (2011) Human Metapneu-movirus. Seminars in Respiratory and Critical Care Medicine, 32, 447-453. https://doi.org/10.1055/s-0031-1283284
Principi, N., Bosis, S. and Esposito, S. (2006) Human Metapneumovirus in Paediatric Patients. Clinical Microbiology and Infection, 12, 301-308. https://doi.org/10.1111/j.1469-0691.2005.01325.x
Gandhi, L., Maisnam, D., Rathore, D., Chauhan, P., Bonagiri, A. and Venkataramana, M. (2022) Respiratory Illness Virus Infections with Special Emphasis on COVID-19. European Journal of Medical Research, 27, Article No. 236. https://doi.org/10.1186/s40001-022-00874-x
Weng, L., Su, X. and Wang, X. (2021) Pain Symptoms in Patients with Coronavirus Disease (COVID-19): A Literature Review. Journal of Pain Research, 14, 147-159. https://doi.org/10.2147/jpr.s269206
Ji, W., Chen, Y., Han, S., Dai, B., Li, K., Li, S., et al. (2024) Clinical and Epidemi-ological Characteristics of 96 Pediatric Human Metapneumovirus Infections in Henan, China after COVID-19 Pandemic: A Retrospective Analysis. Virology Journal, 21, Article No. 100. https://doi.org/10.1186/s12985-024-02376-0
Feng, Y., He, T., Zhang, B., Yuan, H. and Zhou, Y. (2024) Epidemiology and Diagnosis Technologies of Human Metapneumovirus in China: A Mini Review. Virology Journal, 21, Article No. 59. https://doi.org/10.1186/s12985-024-02327-9
Larcher, C., Geltner, C., Fischer, H., Nachbaur, D., Müller, L.C. and Huemer, H.P. (2005) Human Metapneumovirus Infection in Lung Transplant Recipients: Clinical Presentation and Epidemi-ology. The Journal of Heart and Lung Transplantation, 24, 1891-1901. https://doi.org/10.1016/j.healun.2005.02.014
Chen, L., Han, X., Bai, L. and Zhang, J. (2020) Clinical Characteristics and Outcomes in Adult Patients Hospi-talized with Influenza, Respiratory Syncytial Virus and Human Metapneumovirus Infections. Expert Review of Anti-infective Therapy, 19, 787-796. https://doi.org/10.1080/14787210.2021.1846520
Mastrolia, M. and Esposito, S. (2016) Metapneumovirus Infections and Respiratory Complications. Seminars in Respiratory and Critical Care Medicine, 37, 512-521. https://doi.org/10.1055/s-0036-1584800
Filippis, R.D. and Foysal, A.A. (2024) Harnessing the Power of Artificial Intelligence in Neuromuscular Disease Rehabilitation: A Comprehensive Review and Algorithmic Approach. Advances in Bioscience and Biotechnology, 15, 289-309. https://doi.org/10.4236/abb.2024.155018
Yang, Y., Cui, J., Kumar, A., Luo, D., Murray, J., Jones, L., et al. (2025) Multiplex Detection and Quantification of Virus Co-Infections Using Label-Free Surface-Enhanced Raman Spectroscopy and Deep Learning Algorithms. ACS Sensors, 10, 1298-1311. https://doi.org/10.1021/acssensors.4c03209
Pereira, M.M., Brown, A. and Vogel, T. (2018) 2018 CIS Annual Meet-ing: Immune Deficiency & Dysregulation North American Conference. Journal of Clinical Immunology, 38, 330-444.
Kelly, C.J., Karthikesalingam, A., Suleyman, M., Corrado, G. and King, D. (2019) Key Challenges for Deliver-ing Clinical Impact with Artificial Intelligence. BMC Medicine, 17, Article No. 195. https://doi.org/10.1186/s12916-019-1426-2
Kaur, H., Pannu, H.S. and Malhi, A.K. (2019) A Systematic Review on Imbalanced Data Challenges in Machine Learning. ACM Computing Surveys, 52, 1-36. https://doi.org/10.1145/3343440
Albahri, A.S., Duhaim, A.M., Fadhel, M.A., Alnoor, A., Baqer, N.S., Alzubaidi, L., et al. (2023) A Systematic Review of Trustworthy and Explainable Artificial Intelligence in Healthcare: Assessment of Quality, Bias Risk, and Data Fusion. Information Fusion, 96, 156-191. https://doi.org/10.1016/j.inffus.2023.03.008
Zhu, R., Vora, B., Menon, S., Younis, I., Dwivedi, G., Meng, Z., et al. (2023) Clinical Pharmacology Applications of Real-World Data and Real-World Evidence in Drug Development and Approval—An Industry Perspective. Clinical Pharmacology & Thera-peutics, 114, 751-767. https://doi.org/10.1002/cpt.2988
Wang, J.K., Ahn, S., Dalal, T., Zhang, X.D., et al. (2024) Augmented Risk Prediction for the Onset of Alzheimer’s Disease from Electronic Health Records with Large Language Mod-els.
Sydney, A., Singh, M.K. and Nyavor, H. (2024) Advancing Clinical Trial Outcomes Using Deep Learning and Pre-dictive Modelling: Bridging Precision Medicine and Patient-Centered Care.
Alkhawaldeh, I.M., Albalkhi, I. and Naswhan, A.J. (2023) Challenges and Limitations of Synthetic Minority Oversampling Techniques in Machine Learning. World Journal of Methodology, 13, 373-378. https://doi.org/10.5662/wjm.v13.i5.373
Alex, S.A., Jesu Vedha Na-yahi, J. and Kaddoura, S. (2024) Deep Convolutional Neural Networks with Genetic Algorithm-Based Synthetic Minority Over-Sampling Technique for Improved Imbalanced Data Classification. Applied Soft Computing, 156, Article 111491. https://doi.org/10.1016/j.asoc.2024.111491
Dablain, D., Krawczyk, B. and Chawla, N.V. (2023) Deepsmote: Fusing Deep Learning and SMOTE for Imbalanced Data. IEEE Transactions on Neural Networks and Learning Systems, 34, 6390-6404. https://doi.org/10.1109/tnnls.2021.3136503
Ogunleye, A. and Wang, Q. (2020) XGBoost Model for Chronic Kidney Disease Diagnosis. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 17, 2131-2140. https://doi.org/10.1109/tcbb.2019.2911071
Tseng, C. and Tang, C. (2023) An Optimized XGBoost Technique for Accurate Brain Tumor Detection Using Feature Selection and Image Segmentation. Healthcare Analytics, 4, Article 100217. https://doi.org/10.1016/j.health.2023.100217
Soltanzadeh, P. and Hashemzadeh, M. (2021) RCSMOTE: Range-Controlled Synthetic Minority Over-Sampling Technique for Handling the Class Imbalance Problem. Information Sci-ences, 542, 92-111. https://doi.org/10.1016/j.ins.2020.07.014
Jude, A. and Uddin, J. (2024) Explainable Software Defects Classification Using SMOTE and Machine Learning. Annals of Emerging Technologies in Computing, 8, 36-49. https://doi.org/10.33166/aetic.2024.01.004
Vanhoeyveld, J. and Martens, D. (2017) Imbalanced Classification in Sparse and Large Behaviour Datasets. Data Mining and Knowledge Discovery, 32, 25-82. https://doi.org/10.1007/s10618-017-0517-y
Lin, K. and Jamrus, T. (2024) Industrial Data-Driven Modeling for Imbalanced Fault Diagnosis. Industrial Management & Data Systems, 124, 3108-3137. https://doi.org/10.1108/imds-12-2023-0927
Joloudari, J.H., Marefat, A., Nematollahi, M.A., Oyelere, S.S. and Hussain, S. (2023) Effective Class-Imbalance Learning Based on SMOTE and Convolutional Neural Networks. Applied Sci-ences, 13, Article 4006. https://doi.org/10.3390/app13064006
Van Den Hoogen, B.G., Osterhaus, D.M.E. And Fouchier, R.A.M. (2004) Clinical Impact and Diagnosis of Human Metapneumovirus Infection. Pediatric Infectious Disease Journal, 23, S25-S32. https://doi.org/10.1097/01.inf.0000108190.09824.e8
Purwar, A. and Singh, S.K. (2015) Hybrid Prediction Model with Missing Value Imputation for Medical Data. Expert Systems with Applications, 42, 5621-5631. https://doi.org/10.1016/j.eswa.2015.02.050
Ahmad, Z., Rahim, S., Zubair, M. and Abdul-Ghafar, J. (2021) Artificial Intelligence (AI) in Medicine, Current Applications and Future Role with Special Emphasis on Its Potential and Promise in Pathology: Present and Future Impact, Obstacles Including Costs and Acceptance among Pathologists, Practical and Philo-sophical Considerations. a Comprehensive Review. Diagnostic Pathology, 16, Article No. 24. https://doi.org/10.1186/s13000-021-01085-4
Dhahbi, S., Barhoumi, W., Kurek, J., Swiderski, B., Kruk, M. and Zagrouba, E. (2018) False-positive Reduction in Computer-Aided Mass Detection Using Mammographic Texture Analysis and Classification. Computer Methods and Programs in Biomedicine, 160, 75-83. https://doi.org/10.1016/j.cmpb.2018.03.026
Foysal, A.A. and Sultana, S. (2025) AI-Driven Pneumonia Diagnosis Using Deep Learning: A Comparative Analysis of CNN Models on Chest X-Ray Images. Open Access Library, 12, 1-17. https://doi.org/10.4236/oalib.1112899
Adeniran, A.A., Onebunne, A.P. and William, P. (2024) Explainable AI (XAI) in Healthcare: Enhancing Trust and Transparency in Critical Decision-Making. World Journal of Advanced Research and Reviews, 23, 2447-2658. https://doi.org/10.30574/wjarr.2024.23.3.2936
Beeler, P., Bates, D. and Hug, B. (2014) Clinical Decision Support Systems. Swiss Medical Weekly, 144, Article 14073. https://doi.org/10.4414/smw.2014.14073
Bright, T.J., Wong, A., Dhurjati, R., Bristow, E., Bastian, L., Coeytaux, R.R., et al. (2012) Effect of Clinical Decision-Support Systems. Annals of Internal Medicine, 157, 29-43. https://doi.org/10.7326/0003-4819-157-1-201207030-00450
Pawloski, P.A., Brooks, G.A., Nielsen, M.E. and Olson-Bullis, B.A. (2019) A Systematic Review of Clinical Decision Support Systems for Clini-cal Oncology Practice. Journal of the National Comprehensive Cancer Network, 17, 331-338. https://doi.org/10.6004/jnccn.2018.7104
Liu, F. and Panagiotakos, D. (2022) Real-World Data: A Brief Review of the Methods, Applications, Challenges and Opportunities. BMC Medical Research Methodology, 22, Article No. 287. https://doi.org/10.1186/s12874-022-01768-6
Sherman, R.E., Anderson, S.A., Dal Pan, G.J., Gray, G.W., Gross, T., Hunter, N.L., et al. (2016) Real-World Evidence—What Is It and What Can It Tell Us? New England Journal of Medicine, 375, 2293-2297. https://doi.org/10.1056/nejmsb1609216
Verkerk, K. and Voest, E.E. (2024) Generating and Us-ing Real-World Data: A Worthwhile Uphill Battle. Cell, 187, 1636-1650. https://doi.org/10.1016/j.cell.2024.02.012