THE ROLE OF MACHINE LEARNING IN AUTOMATING COMPLEX DATABASE MIGRATION WORKFLOWS
Main Article Content
Abstract
Database migration remains a critical yet complex task in enterprise modernization efforts, often involving heterogeneous data sources, legacy systems, and intricate schema transformations. Traditional approaches, which rely heavily on manual intervention and rule-based logic, are time-consuming, error-prone, and difficult to scale across large, distributed environments. This paper explores the application of machine learning (ML) techniques to automate key aspects of database migration workflows, including schema matching, data transformation, anomaly detection, and data quality validation. We propose a modular ML-driven framework that integrates supervised and unsupervised learning models to streamline end-to-end migration processes. A case study involving the migration of financial transaction data from Oracle to PostgreSQL demonstrates significant improvements in accuracy, reduction in manual effort, and consistency of schema alignment. Experimental results indicate that ML-enabled migration pipelines can reduce mapping errors by over 40% and decrease execution time by up to 60% compared to traditional methods. These findings highlight the potential of machine learning to transform legacy database migrations into more reliable, adaptive, and intelligent processes, enabling organizations to accelerate digital transformation while minimizing risk.
Article Details
Section
How to Cite
References
1. Guo, S., Li, Y., Zhang, L., et al. (2020). "Machine Learning for Database System Automation: Techniques and Challenges." IEEE Transactions on Knowledge and Data Engineering, 32(11), 2085–2102.
2. Lee, J., Kwon, S., Choi, S. (2018). "Schema Matching with Word Embeddings and Deep Learning." Information Systems, 78, 108–118.
3. Kumar, S., Choudhary, A., Kumar, A. (2021). "Automated Data Migration Framework Leveraging Machine Learning." International Journal of Data Science and Analytics, 12(3), 157–169.
4. Aggarwal, C. C. (2015). Data Mining: The Textbook. Springer.
5. Zhang, J., Sun, Y., Chen, H. (2019). "Anomaly Detection in Databases Using Autoencoders." Proceedings of the ACM SIGMOD Conference, 1225–1236.
6. Smith, J., Patel, M. (2017). "Human-in-the-Loop Machine Learning for Data Integration." Journal of Systems and Software, 130, 92–102.