Real-Time Bayesian Risk Intelligence AI-Augmented Threat Detection in Cloud–Lakehouse Systems for Data-Limited Environments

Main Article Content

Lucas Vinícius Almeida Lima

Abstract

As organizations increasingly adopt cloud–lakehouse architectures to unify analytical and operational workloads, the need for timely and reliable threat detection becomes critical—especially in environments with limited labeled data. This work proposes a Real-Time Bayesian Risk Intelligence framework that integrates probabilistic modeling, streaming analytics, and AI-augmented inference to detect cyber-security threats under uncertainty. The system leverages Bayesian networks and hierarchical priors to dynamically update risk estimates as new telemetry arrives from distributed lakehouse components, enabling robust reasoning even when observations are sparse, noisy, or partially missing. We incorporate lightweight edge-side feature extraction, generative models for imputing incomplete signals, and online learning mechanisms to maintain model calibration while respecting compute and cost constraints. Experimental evaluations across simulated and real-world cloud telemetry streams demonstrate that the Bayesian approach outperforms conventional anomaly detectors in low-data regimes, reducing false positives while improving early detection of stealthy behaviors such as lateral movement and privilege escalation. The proposed architecture offers a principled, explainable, and resource-efficient pathway for operationalizing cyber risk intelligence in modern data ecosystems.

Article Details

Section

Articles

How to Cite

Real-Time Bayesian Risk Intelligence AI-Augmented Threat Detection in Cloud–Lakehouse Systems for Data-Limited Environments. (2025). International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 8(Special Issue 1), 48-54. https://doi.org/10.15662/IJRPETM.2025.0801809

References

1. Bellotti, T., & Crook, J. (2009). Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications, 36(2), 3302–3308.

2. Muthusamy, M. (2024). Cloud-Native AI metrics model for real-time banking project monitoring with integrated safety and SAP quality assurance. International Journal of Research and Applied Innovations (IJRAI), 7(1), 10135–10144. https://doi.org/10.15662/IJRAI.2024.0701005

3. Balaji, K. V., Sugumar, R., Mahendran, R., & Subramanian, P. (2025). Weather forecasting model using attentive residual gated recurrent unit for urban flood prediction. GLOBAL NEST JOURNAL, 27(5).

4. Binu, C. T., Kumar, S. S., Rubini, P., & Sudhakar, K. (2024). Enhancing Cloud Security through Machine Learning-Based Threat Prevention and Monitoring: The Development and Evaluation of the PBPM Framework. https://www.researchgate.net/profile/Binu-C-T/publication/383037713_Enhancing_Cloud_Security_through_Machine_Learning-Based_Threat_Prevention_and_Monitoring_The_Development_and_Evaluation_of_the_PBPM_Framework/links/66b99cfb299c327096c1774a/Enhancing-Cloud-Security-through-Machine-Learning-Based-Threat-Prevention-and-Monitoring-The-Development-and-Evaluation-of-the-PBPM-Framework.pdf

5. Poornima, G., & Anand, L. (2024, May). Novel AI Multimodal Approach for Combating Against Pulmonary Carcinoma. In 2024 5th International Conference for Emerging Technology (INCET) (pp. 1-6). IEEE.

6. Konda, S. K. (2024). AI Integration in Building Data Platforms: Enabling Proactive Fault Detection and Energy Conservation. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 7(3), 10327-10338.

7. Kotapati, V. B. R., & Yakkanti, B. (2023). Real-Time Analytics Optimization Using Apache Spark Structured Streaming: A Lambda Architecture-based Scala Framework. American Journal of Data Science and Artificial Intelligence Innovations, 3, 86-119.

8. Konatham, M. R., Uddandarao, D. P., & Vadlamani, R. K. Engineering Scalable AI Systems for Real-Time Payment Platforms. https://www.jisem-journal.com/download/33_Engineering%20Scalable%20AI%20Systems%20for%20Real-Time%20Payment%20Platforms.pdf

9. Vinay Kumar Ch, Srinivas G, Kishor Kumar A, Praveen Kumar K, Vijay Kumar A. (2021). Real-time optical wireless mobile communication with high physical layer reliability Using GRA Method. J Comp Sci Appl Inform Technol. 6(1): 1-7. DOI: 10.15226/2474-9257/6/1/00149

10. Kumar, R. K. (2023). Cloud-integrated AI framework for transaction-aware decision optimization in agile healthcare project management. International Journal of Computer Technology and Electronics Communication (IJCTEC), 6(1), 6347–6355. https://doi.org/10.15680/IJCTECE.2023.0601004

11. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative adversarial nets. In Advances in Neural Information Processing Systems (Vol. 27).

12. Kingma, D. P., & Welling, M. (2014). Auto-Encoding Variational Bayes. Proceedings of ICLR.

13. Thangavelu, K., Muthusamy, P., & Das, D. (2024). Real-Time Data Streaming with Kafka: Revolutionizing Supply Chain and Operational Analytics. Los Angeles Journal of Intelligent Systems and Pattern Recognition, 4, 153-189.

14. Kumar, S. N. P. (2025). Regulating Autonomous AI Agents: Prospects, Hazards, and Policy Structures. Journal of Computer Science and Technology Studies, 7(10), 393-399.

15. Molnar, C. (2020). Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. Leanpub.

16. Plattner, H. (2013). A course in in-memory data management: The SAP HANA database system. Springer.

17. Suchitra, R. (2023). Cloud-Native AI model for real-time project risk prediction using transaction analysis and caching strategies. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 6(1), 8006–8013. https://doi.org/10.15662/IJRPETM.2023.0601002

18. Vasugi, T. (2023). AI-empowered neural security framework for protected financial transactions in distributed cloud banking ecosystems. International Journal of Advanced Research in Computer Science & Technology, 6(2), 7941–7950. https://doi.org/0.15662/IJARCST.2023.0602004

19. Nagarajan, G. (2022). An integrated cloud and network-aware AI architecture for optimizing project prioritization in healthcare strategic portfolios. International Journal of Research and Applied Innovations, 5(1), 6444–6450. https://doi.org/10.15662/IJRAI.2022.0501004

20. Chiranjeevi, Y., Sugumar, R., & Tahir, S. (2024, November). Effective Classification of Ocular Disease Using Resnet-50 in Comparison with Squeezenet. In 2024 IEEE 9th International Conference on Engineering Technologies and Applied Sciences (ICETAS) (pp. 1-6). IEEE.

21. HV, M. S., & Kumar, S. S. (2024). Fusion Based Depression Detection through Artificial Intelligence using Electroencephalogram (EEG). Fusion: Practice & Applications, 14(2).

22. Poornima, G., & Anand, L. (2024, April). Effective Machine Learning Methods for the Detection of Pulmonary Carcinoma. In 2024 Ninth International Conference on Science Technology Engineering and Mathematics (ICONSTEM) (pp. 1-7). IEEE.

23. Karanjkar, R., & Karanjkar, D. Quality Assurance as a Business Driver: A Multi-Industry Analysis of Implementation Benefits Across the Software Development Life Cycle. International Journal of Computer Applications, 975, 8887.

24. Kusumba, S. (2025). Unified Intelligence: Building an Integrated Data Lakehouse for Enterprise-Wide Decision Empowerment. Journal Of Engineering And Computer Sciences, 4(7), 561-567.

25. Kandula, N. Evolution and Impact of Data Warehousing in Modern Business and Decision Support Systems

26. Adari, V. K. (2020). Intelligent Care at Scale AI-Powered Operations Transforming Hospital Efficiency. International Journal of Engineering & Extended Technologies Research (IJEETR), 2(3), 1240-1249.

27. Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., & Tzoumas, K. (2015). Apache Flink™: Stream and batch processing in a single engine. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering.