A Framework for Managing Configuration and Version Control of AI Models in DevOps-Driven Healthcare Systems
Keywords:
AI Model Governance, Healthcare DevOps, Configuration Management, Version Control, MLOps, Continuous Deployment, Model Drift, ReproducibilityAbstract
AI models in healthcare systems demand precision, adaptability, and traceability, especially within DevOps-driven environments where rapid iteration and deployment are routine. However, most healthcare organizations lack robust mechanisms for configuration and version control of AI models, leading to reproducibility issues, compliance risks, and integration failures. This paper proposes a structured framework that unifies DevOps principles with AI model lifecycle governance, focusing on configuration management, version control, and traceable workflows. The proposed model integrates continuous integration/continuous delivery (CI/CD), model registries, and audit trails tailored for healthcare standards. We evaluate the framework's theoretical underpinnings using historical best practices and highlight its implications in minimizing model drift and maximizing traceability.
References
Amershi, Saleema, et al. “Software Engineering for Machine Learning: A Case Study.” Proceedings of the IEEE/ACM 41st International Conference on Software Engineering, vol. 1, 2019, pp. 291–300.
Sculley, D., et al. “Hidden Technical Debt in Machine Learning Systems.” Advances in Neural Information Processing Systems, vol. 28, 2015, pp. 2503–2511.
Sendak, Mark, et al. “A Path for Translation of Machine Learning Products into Healthcare Delivery.” npj Digital Medicine, vol. 3, no. 1, 2020, pp. 1–6.
Wiens, Jenna, et al. “Do No Harm: A Roadmap for Responsible Machine Learning for Health Care.” Nature Medicine, vol. 25, no. 9, 2019, pp. 1337–1340.
Beaulieu-Jones, Brett K., and Casey S. Greene. “Reproducibility of Computational Workflows Is Automated Using Continuous Analysis.” Nature Biotechnology, vol. 35, no. 4, 2017, pp. 342–346.
Kavuri, S. (2025). The future of QA leadership: Balancing human expertise and automation in software testing teams. International Journal of Applied Mathematics, 38(9s), 1942–1953.
Breck, Eric, et al. “The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction.” Google Research, vol. 1, 2017, pp. 1–8.
Chen, M. et al. “Smart Healthcare: AI-Driven Medical Diagnosis.” IEEE Access, vol. 7, 2019, pp. 13913–13926.
Rajkomar, A., et al. “Machine Learning in Medicine.” New England Journal of Medicine, vol. 380, no. 14, 2019, pp. 1347–1358.
Ribeiro, M. T., et al. “Why Should I Trust You?: Explaining the Predictions of Any Classifier.” Proceedings of the 22nd ACM SIGKDD, vol. 1, no. 1, 2016, pp. 1135–1144.
Ghassemi, M., et al. “A Review of Challenges and Opportunities in Machine Learning for Health.” AMIA Annual Symposium Proceedings, vol. 2018, 2018, pp. 191–200.
Cabitza, Federico, et al. “Unintended Consequences of Machine Learning in Medicine.” JAMA, vol. 318, no. 6, 2017, pp. 517–518.
He, J., et al. “The Practical Implementation of Artificial Intelligence Technologies in Medicine.” Nature Medicine, vol. 25, no. 1, 2019, pp. 30–36.
Topol, Eric. “High-Performance Medicine: The Convergence of Human and Artificial Intelligence.” Nature Medicine, vol. 25, no. 1, 2019, pp. 44–56.
Kavuri, S. (2025). AI-driven test automation frameworks: Enhancing efficiency and accuracy in software quality assurance. International Journal of Applied Mathematics, 38(10s), 699–710.
Obermeyer, Z., et al. “Dissecting Racial Bias in an Algorithm Used to Manage the Health of Populations.” Science, vol. 366, no. 6464, 2019, pp. 447–453.
Esteva, A., et al. “A Guide to Deep Learning in Healthcare.” Nature Medicine, vol. 25, no. 1, 2019, pp. 24–29.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Afram Enim, Khanita Ruhia (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
