Bipolar disorder is a complex psychiatric condition characterized by high variability in treatment response, posing a major challenge for clinicians striving to personalize care. Traditional machine learning models often require centralized data access, which is typically restricted in healthcare settings due to privacy regulations and institutional barriers. To address this, we propose a Federated Learning (FL) framework that enables collaborative model training across multiple institutions without the need to share sensitive patient-level data. In this study, we generated synthetic datasets representing five distinct hospitals, each comprising 1000 virtual bipolar disorder patients. These datasets included a range of features encompassing
demographic characteristics (e.g., age, gender), clinical history (e.g., illness duration, episode counts), and comorbid conditions. A fully connected neural network was trained using a federated approach over 15 communication rounds. Each institution performed local training, followed by weight averaging to update a shared global model. Our global model achieved an overall test accuracy of 73.0% and an area under the receiver operating characteristic curve (AUC) of 0.84, demonstrating robust performance across diverse simulated institutional settings. Performance improved steadily over training rounds, with the highest-performing site reaching 84.5% accuracy. Permutation-based feature importance analysis revealed that illness duration and baseline mood were the most predictive features of treatment response. These results support the feasibility of federated learning for psychiatric prediction tasks, offering a promising path toward building generalizable, privacy-preserving models in real-world healthcare environments. Future work will extend this framework to real electronic health records and explore fairness-aware training to address institutional bias and population heterogeneity.
Cite this paper
Foysal, A. A. and Filippis, R. D. (2025). Federated Learning for Treatment Response Prediction in Bipolar Disorder: A Simulation-Based Institutional Study. Open Access Library Journal, 12, e13954. doi: http://dx.doi.org/10.4236/oalib.1113954.
Eichler, H., Abadie, E., Breckenridge, A., Flamion, B., Gustafsson, L.L., Leufkens, H., et al. (2011) Bridging the Efficacy-Effectiveness Gap: A Regulator’s Perspective on Addressing Variability of Drug Response. Nature Reviews Drug Discovery, 10, 495-506. https://doi.org/10.1038/nrd3501
Gallagher, R.M. (1999) Treatment Planning in Pain Medicine: Integrating Medical, Physical, and Be-havioral Therapies. Medical Clinics of North America, 83, 823-849. https://doi.org/10.1016/s0025-7125(05)70136-x
Post, R.M., Denicoff, K.D., Frye, M.A., Dunn, R.T., Leverich, G.S., Osuch, E., et al. (1998) A History of the Use of Anticonvulsants as Mood Stabilizers in the Last Two Decades of the 20th Cen-tury. Neuropsychobiology, 38, 152-166. https://doi.org/10.1159/000026532
Nayak, R., Rosh, I., Kustanovich, I. and Stern, S. (2021) Mood Stabilizers in Psychiatric Disorders and Mechanisms Learnt from in Vitro Model Systems. International Journal of Molecular Sciences, 22, Article 9315. https://doi.org/10.3390/ijms22179315
Calabrese, J.R., Fatemi, S.H., Kujawa, M. and Woyshville, M.J. (1996) Predictors of Response to Mood Stabilizers. Journal of Clinical Psychopharmacology, 16, 24S-31S. https://doi.org/10.1097/00004714-199604001-00004
Bzdok, D. and Meyer-Lindenberg, A. (2018) Machine Learning for Precision Psychiatry: Op-portunities and Challenges. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 3, 223-230. https://doi.org/10.1016/j.bpsc.2017.11.007
Manchia, M., Pisanu, C., Squassina, A. and Carpiniello, B. (2020) Challenges and Future Prospects of Precision Medicine in Psychiatry. Pharmacogenomics and Personalized Medi-cine, 13, 127-140. https://doi.org/10.2147/pgpm.s198225
Tai, A.M.Y., Albuquerque, A., Carmona, N.E., Subramanieapillai, M., Cha, D.S., Sheko, M., et al. (2019) Machine Learning and Big Data: Implications for Disease Modeling and Therapeutic Discovery in Psychiatry. Artificial Intelligence in Medicine, 99, Article 101704. https://doi.org/10.1016/j.artmed.2019.101704
Dinov, I.D. (2016) Methodological Challenges and Analytic Opportunities for Modeling and Interpreting Big Healthcare Data. GigaScience, 5, 2-15. https://doi.org/10.1186/s13742-016-0117-6
Miotto, R., Wang, F., Wang, S., Jiang, X. and Dudley, J.T. (2017) Deep Learning for Healthcare: Review, Op-portunities and Challenges. Briefings in Bioinformatics, 19, 1236-1246. https://doi.org/10.1093/bib/bbx044
Ngiam, K.Y. and Khor, I.W. (2019) Big Data and Machine Learning Algorithms for Health-Care Delivery. The Lancet Oncology, 20, e262-e273. https://doi.org/10.1016/s1470-2045(19)30149-4
Ettaloui, N., Arezki, S. and Gadi, T. (2023) An Overview of Blockchain-Based Electronic Health Records and Compliance with GDPR and HIPAA. Data and Metadata, 2, 166. https://doi.org/10.56294/dm2023166
Mello, M.M., Adler‐Milstein, J., Ding, K.L. and Savage, L. (2018) Legal Barriers to the Growth of Health Infor-mation Exchange—Boulders or Pebbles? The Milbank Quarterly, 96, 110-143. https://doi.org/10.1111/1468-0009.12313
Gonçalves-Ferreira, D., Sousa, M., Bacelar-Silva, G.M., Frade, S., Antunes, L.F., Beale, T., et al. (2019) OpenEHR and General Data Protection Regulation: Evaluation of Principles and Requirements. JMIR Medical Informatics, 7, e9845. https://doi.org/10.2196/medinform.9845
Herath, H.M.S.S., Herath, H.M.K.K.M.B., Madhusanka, B.G.D.A. and Guruge, L.G.P.K. (2024) Data Protec-tion Challenges in the Processing of Sensitive Data. In: Hewage, C., Yasakethu, L. and Jayakody, D.N.K., Eds., Data Protection, Springer, 155-179. https://doi.org/10.1007/978-3-031-76473-8_8
Richter, M., Emden, D., Leenings, R., Winter, N.R., Mikola-jczyk, R., Massag, J., et al. (2025) Generaliza-bility of Clinical Prediction Models in Mental Health. Molecular Psychiatry, 30, 3632-3639.
Meehan, A.J., Lewis, S.J., Fazel, S., Fusar-Poli, P., Steyerberg, E.W., Stahl, D., et al. (2022) Clinical Prediction Models in Psychiatry: A System-atic Review of Two Decades of Progress and Challenges. Molecular Psychiatry, 27, 2700-2708. https://doi.org/10.1038/s41380-022-01528-4
Cearns, M., Hahn, T. and Baune, B.T. (2019) Recommendations and Future Directions for Supervised Machine Learning in Psychiatry. Translational Psychiatry, 9, Ar-ticle No. 271. https://doi.org/10.1038/s41398-019-0607-2
Ebrahimi, M., Sahay, R., Hosseinalipour, S. and Akram, B. (2025) The Transition from Centralized Machine Learning to Federated Learning for Mental Health in Edu-cation: A Survey of Current Methods and Future Directions. arXiv:2501.11714.
Rauniyar, A., Hagos, D.H., Jha, D., Håkegård, J.E., Bagci, U., Rawat, D.B., et al. (2024) Federated Learning for Medical Applica-tions: A Taxonomy, Current Trends, Challenges, and Future Research Direc-tions. IEEE Internet of Things Journal, 11, 7374-7398. https://doi.org/10.1109/jiot.2023.3329061
Antunes, R.S., Stoffel, R., André da Costa, C., Küderle, A., Yari, I.A. and Eskofier, B. (2022) Federated Learning for Healthcare: Systematic Review and Architecture Proposal. ACM Transactions on Intelligent Systems and Technology, 13, 1-23. https://doi.org/10.1145/3501813
Akande, O.A. (2022) Inte-grating Blockchain with Federated Learning for Privacy-Preserving Data Ana-lytics Across Decentralized Governmental Health Information Systems. Interna-tional Journal of Computer Applications Technology and Research, 11, 622-637.
Joshi, M., Pal, A. and Sankarasubbu, M. (2022) Federated Learning for Healthcare Domain-Pipeline, Applications and Challenges. ACM Transactions on Computing for Healthcare, 3, 1-36. https://doi.org/10.1145/3533708
Aledhari, M., Razzak, R., Parizi, R.M. and Saeed, F. (2020) Federated Learning: A Survey on Enabling Technologies, Protocols, and Applications. IEEE Access, 8, 140699-140725. https://doi.org/10.1109/access.2020.3013541
Prasad, V.K., Bhattacharya, P., Maru, D., Tanwar, S., Verma, A., Singh, A., et al. (2022) Fed-erated Learning for the Internet-of-Medical-Things: A Survey. Mathematics, 11, Article 151. https://doi.org/10.3390/math11010151
Mosquera, C., Fer-rer, L., Milone, D.H., Luna, D. and Ferrante, E. (2024) Class Imbalance on Med-ical Image Classification: Towards Better Evaluation Practices for Discrimination and Calibration Performance. European Radiology, 34, 7895-7903. https://doi.org/10.1007/s00330-024-10834-0
Thabtah, F., Hammoud, S., Kamalov, F. and Gonsalves, A. (2020) Data imbalance in classification: Ex-perimental Evaluation. Information Sciences, 513, 429-441. https://doi.org/10.1016/j.ins.2019.11.004
Parker, B.J., Günter, S. and Bedo, J. (2007) Stratification Bias in Low Signal Microarray Studies. BMC Bioin-formatics, 8, Article No. 326. https://doi.org/10.1186/1471-2105-8-326
Araf, I., Idri, A. and Chairi, I. (2024) Cost-Sensitive Learning for Imbalanced Medical Data: A Review. Artifi-cial Intelligence Review, 57, Article No. 80. https://doi.org/10.1007/s10462-023-10652-8
Fernández, A., García, S., Galar, M., Prati, R.C., Krawczyk, B. and Herrera, F. (2018) Learning from Im-balanced Data Sets. Vol. 10, Springer. https://doi.org/10.1007/978-3-319-98074-4
Mi, X., Zou, B., Zou, F. and Hu, J. (2021) Permutation-based Identification of Important Biomarkers for Complex Diseases via Machine Learning Models. Nature Communications, 12, Article No. 3008. https://doi.org/10.1038/s41467-021-22756-2
Gómez-Ramírez, J., ávi-la-Villanueva, M. and Fernández-Blázquez, M.á. (2020) Selecting the Most Im-portant Self-Assessed Features for Predicting Conversion to Mild Cognitive Im-pairment with Random Forest and Permutation-Based Methods. Scientific Re-ports, 10, Article No. 20630. https://doi.org/10.1038/s41598-020-77296-4
Biswas, S., Grundlingh, N., Boardman, J., White, J. and Le, L. (2025) A Target Permutation Test for Sta-tistical Significance of Feature Importance in Differentiable Models. Electronics, 14, Article 571. https://doi.org/10.3390/electronics14030571
Espinosa, R., Sánchez, G., Pal-ma, J. and Jiménez, F. (2025) Permutation-Based Multi-Objective Evolutionary Feature Selection for High-Dimensional Data. arXiv:2501.14310.
Ali, S., Ahsan, M., Tasnim, L., Afrin, S., Biswas, K., Hossain, M., Ahmed, M., Hashan, R., Islam, K. and Raman, S. (2024) Federated Learning in Healthcare: Model Mis-conducts, Security, Challenges, Applications, and Future Research Directions—A Systematic Review. arXiv:2405.13832.
Abbas, S.R., Abbas, Z., Zahir, A. and Lee, S.W. (2024) Federated Learning in Smart Healthcare: A Comprehensive Review on Privacy, Security, and Predictive Analytics with IoT Integration. Healthcare, 12, Article 2587. https://doi.org/10.3390/healthcare12242587
Dasaradharami Reddy, K. and Gadekallu, T.R. (2023) A Comprehensive Survey on Federated Learning Techniques for Healthcare Informatics. Computational Intelligence and Neuro-science, 2023, Article ID: 8393990. https://doi.org/10.1155/2023/8393990
Zulueta, N.M. (2024) Development of Federated Learning Models for Improved Genetic Variant As-sessment in a Multi-Site Clinical Setting. PhD Diss., Université Paris Cité.
Nasajpour, M., Pouriyeh, S., Parizi, R.M., Han, M., Mosaiyebzadeh, F., Liu, L., et al. (2025) Federated Learning in Smart Healthcare: A Survey of Ap-plications, Challenges, and Future Directions. Electronics, 14, Article 1750. https://doi.org/10.3390/electronics14091750
Sadilek, A., Liu, L., Nguyen, D., Kamruzzaman, M., Serghiou, S., Rader, B., et al. (2021) Priva-cy-First Health Research with Federated Learning. npj Digital Medicine, 4, Arti-cle No. 132. https://doi.org/10.1038/s41746-021-00489-2
Li, S., Cai, T. and Duan, R. (2023) Targeting Underrepresented Populations in Precision Medicine: A Federated Transfer Learning Approach. The Annals of Applied Sta-tistics, 17, Article No. 2970. https://doi.org/10.1214/23-aoas1747
Kang, S. (2020) Model Validation Failure in Class Imbalance Problems. Expert Systems with Ap-plications, 146, Article 113190. https://doi.org/10.1016/j.eswa.2020.113190
Johnson, J.M. and Khosh-goftaar, T.M. (2019) Survey on Deep Learning with Class Imbalance. Journal of Big Data, 6, Article No. 27. https://doi.org/10.1186/s40537-019-0192-5
Nguyen, G.H., Bou-zerdoum, A. and Phung, S.L. (2009) Learning Pattern Classification Tasks with Imbalanced Data Sets. Pattern Recognition, 10, 1322-1328.
Walsh, C.G., Ripperger, M.A., Hu, Y., Sheu, Y., Lee, H., Wilimitis, D., et al. (2024) Development and Mul-ti-Site External Validation of a Generalizable Risk Prediction Model for Bipolar Disorder. Translational Psychiatry, 14, Article No. 58. https://doi.org/10.1038/s41398-023-02720-y
Bişkin, O.T., Candemir, C. and Selver, M.A. (2025) Detection of Bipolar Disorder and Schizophrenia Em-ploying Bayesian-Optimized Grad-Cam-Driven Deep Learning. Applied Sciences, 15, Article 1717. https://doi.org/10.3390/app15041717
Ford, T., Stewart, R. and Downs, J. (2020) Surveillance, Case Registers, and Big Data. In: Jayati Das-Munshi, et al., Eds., Practical Psychiatric Epidemiology, Oxford Uni-versity Press, 219-C13.P111. https://doi.org/10.1093/med/9780198735564.003.0013
Lu, Z., Pan, H., Dai, Y., Si, X. and Zhang, Y. (2024) Federated Learning with Non-IID Data: A Survey. IEEE Internet of Things Journal, 11, 19188-19209. https://doi.org/10.1109/jiot.2024.3376548
Nandan, U., Sai, C., Sai, N. and Viswanadapalli, A. (2024) Enhanced Brain Tumor Detection and Privacy Preserving Using Federated Learning. International Journal of Scientific Re-search in Science and Technology, 11, 131-144. https://doi.org/10.32628/IJSRST24116166
Hsieh, K., Phanishayee, A., Mutlu, O. and Gibbons, P. (2020) The Non-IID Data Quagmire of Decentralized Machine Learning. In: Daumé, H. and Singh, A., Eds., International Conference on Machine Learning, PMLR, 4387-4398.
Benmalek, M. and Seddiki, A. (2024) Bias in Federated Learning: Factors, Effects, Mitigations, and Open Is-sues. Ingénierie des Systèmes d Information, 29, 2137-2160. https://doi.org/10.18280/isi.290605
Gadekallu, T.R., Dev, K., Khowaja, S.A., Wang, W., Feng, H., Fang, K., Pandya, S. and Wang, W. (2025) Framework, Standards, Applications and Best practices of Responsible AI: A Comprehensive Survey. arXiv:2504.13979.