Generative AI–Driven Trusted Credit Scoring with Threat-Aware Analytics for Project and Healthcare Data
Main Article Content
Abstract
Credit scoring is a foundational component of modern financial systems: it determines who can borrow, at what cost, and under what conditions. Traditional credit scoring models rely on historical financial data and statistical models such as logistic regression or decision trees. However, as financial institutions seek greater accuracy, scalability, and robustness, generative artificial intelligence (AI) methods—especially Generative Adversarial Networks (GANs) and large language models (LLMs)—are emerging as promising tools. Yet, the adoption of generative AI in credit scoring raises critical concerns around trust: adversarial attacks, lack of interpretability, regulatory compliance, and computational efficiency.
In this work, we propose a trusted generative AI framework for credit scoring that is threat-aware, explainable, and Apache-accelerated. First, we design a adversarially robust credit scoring model: we use GAN-based data augmentation to mitigate class imbalance in default versus non-default classes, while simultaneously analyzing potential adversarial vulnerabilities by generating counterfactual perturbations. Second, we integrate explainability modules using SHAP values and counterfactual explanations to provide both global and local interpretability of credit decisions. Third, we accelerate the training and inference pipeline using Apache Spark (or Apache Flink) to scale to large datasets typical of financial institutions, ensuring that risk scoring can be done in near real-time.
We evaluate our framework on a realistic credit dataset (simulated or proprietary) and demonstrate that (a) generative augmentation improves predictive performance (AUC, F1) compared to baseline models, (b) adversarial counterfactuals help uncover fragile regions in the model decision space, and (c) explainability techniques provide human-comprehensible rationales aligned with regulatory needs. Furthermore, leveraging Apache Spark for distributed training significantly reduces latency and increases throughput, making the system practically deployable in enterprise settings.
Our contributions are threefold: (1) a unified architecture combining generative AI, adversarial robustness, and interpretability; (2) an empirical evaluation showing trade-offs between performance, security, and explainability; (3) a scalable implementation leveraging Apache big-data frameworks demonstrating feasibility in production. We argue that this trusted generative AI approach can reconcile the power of advanced models with the transparency and regulation requirements of credit risk management, paving the way for more inclusive, secure, and efficient lending decisions.
Article Details
Section
How to Cite
References
1. Hashemi, M., & Fathi, A. (2020). PermuteAttack: Counterfactual Explanation of Machine Learning Credit Scorecards. arXiv. (arXiv)
2. McGrath, R., Costabello, L., Le Van, C., Sweeney, P., Kamiab, F., Shen, Z., & Lecue, F. (2018). Interpretable Credit Application Predictions With Counterfactual Explanations. arXiv. (arXiv)
3. Vasugi, T. (2023). AI-empowered neural security framework for protected financial transactions in distributed cloud banking ecosystems. International Journal of Advanced Research in Computer Science & Technology, 6(2), 7941–7950. https://doi.org/0.15662/IJARCST.2023.0602004
4. Suchitra, R. (2023). Cloud-Native AI model for real-time project risk prediction using transaction analysis and caching strategies. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 6(1), 8006–8013. https://doi.org/10.15662/IJRPETM.2023.0601002
5. Dastile, X., Pretorius, A., & Venter, P. (2022). Model-Agnostic Counterfactual Explanations in Credit Scoring. IEEE Access, (via Genetic Algorithm) (ResearchGate)
6. Nagarajan, G. (2022). An integrated cloud and network-aware AI architecture for optimizing project prioritization in healthcare strategic portfolios. International Journal of Research and Applied Innovations, 5(1), 6444–6450. https://doi.org/10.15662/IJRAI.2022.0501004
7. Kumar, Sanjay Nakharu Prasad. "Navigating the AI Horizon: Transformations, Ethical Imperatives, and Pathways to Responsible Innovation." Journal Of Applied Sciences 5.10 (2025): 34-43.
8. Kotapati, V. B. R., Perumalsamy, J., & Yakkanti, B. (2022). Risk-Adapted Investment Strategies using Quantum-enhanced Machine Learning Models. American Journal of Autonomous Systems and Robotics Engineering, 2, 279-312.
9. Kesavan, E., Srinivasulu, S., & Deepak, N. M. (2025, July). Cloud Computing for Internet of Things (IoT): Opportunities and Challenges. In 2025 2nd International Conference on Computing and Data Science (ICCDS) (pp. 1-6). IEEE.
10. Mohile, A. (2021). Performance Optimization in Global Content Delivery Networks using Intelligent Caching and Routing Algorithms. International Journal of Research and Applied Innovations, 4(2), 4904-4912.
11. Sabin Begum, R., & Sugumar, R. (2019). Novel entropy-based approach for cost-effective privacy preservation of intermediate datasets in cloud. Cluster Computing, 22(Suppl 4), 9581-9588.
12. Konatham, M. R., Uddandarao, D. P., Vadlamani, R. K., & Konatham, S. K. R. (2025, July). Federated Learning for Credit Risk Assessment in Distributed Financial Systems using BayesShield with Homomorphic Encryption. In 2025 International Conference on Computing Technologies & Data Communication (ICCTDC) (pp. 1-6). IEEE.
13. Kusumba, S. (2025). Modernizing US Healthcare Financial Systems: A Unified HIGLAS Data Lakehouse for National Efficiency and Accountability. International Journal of Computing and Engineering, 7(12), 24-37.
14. Kumar, R. K. (2022). AI-driven secure cloud workspaces for strengthening coordination and safety compliance in distributed project teams. International Journal of Research and Applied Innovations (IJRAI), 5(6), 8075–8084. https://doi.org/10.15662/IJRAI.2022.0506017
15. Thangavelu, K., Kota, R. K., & Mohammed, A. S. (2022). Self-Serve Analytics: Enabling Business Users with AI-Driven Insights. Los Angeles Journal of Intelligent Systems and Pattern Recognition, 2, 73-112.
16. Sudhan, S. K. H. H., & Kumar, S. S. (2015). An innovative proposal for secure cloud authentication using encrypted biometric authentication scheme. Indian journal of science and technology, 8(35), 1-5.
17. Konda, S. K. (2024). AI Integration in Building Data Platforms: Enabling Proactive Fault Detection and Energy Conservation. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 7(3), 10327-10338.
18. Udayakumar, M. A. K. D. (2023). Assessing learning behaviors using gaussian hybrid fuzzy clustering (ghfc) in special education classrooms.
19. Christadoss, J., Devi, C., & Mohammed, A. S. (2024). Event-Driven Test-Environment Provisioning with Kubernetes Operators and Argo CD. American Journal of Data Science and Artificial Intelligence Innovations, 4, 229-263.
20. Adari, V. K. (2020). Intelligent Care at Scale AI-Powered Operations Transforming Hospital Efficiency. International Journal of Engineering & Extended Technologies Research (IJEETR), 2(3), 1240-1249.
21. Poornima, G., & Anand, L. (2024, April). Effective Machine Learning Methods for the Detection of Pulmonary Carcinoma. In 2024 Ninth International Conference on Science Technology Engineering and Mathematics (ICONSTEM) (pp. 1-7). IEEE.
22. Muthusamy, M. (2024). Cloud-Native AI metrics model for real-time banking project monitoring with integrated safety and SAP quality assurance. International Journal of Research and Applied Innovations (IJRAI), 7(1), 10135–10144. https://doi.org/10.15662/IJRAI.2024.0701005
23. Peram, S. (2024). Cost Optimization in AI Systems Leveraging DEMATEL for Machine Learning and Cloud Efficiency. https://www.researchgate.net/profile/Sudhakara-Peram/publication/396293333_Cost_Optimization_in_AI_Systems_Leveraging_DEMATEL_for_Machine_Learning_and_Cloud_Efficiency/links/68e5f179f3032e2b4be76f3f/Cost-Optimization-in-AI-Systems-Leveraging-DEMATEL-for-Machine-Learning-and-Cloud-Efficiency.pdf
24. Kandula N (2023). Gray Relational Analysis of Tuberculosis Drug Interactions A Multi-Parameter Evaluation of Treatment Efficacy. J Comp Sci Appl Inform Technol. 8(2): 1-10.
25. Adari, V. K. (2024). How Cloud Computing is Facilitating Interoperability in Banking and Finance. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(6), 11465-11471.
26. Pasumarthi, A. (2023). Dynamic Repurpose Architecture for SAP Hana Transforming DR Systems into Active Quality Environments without Compromising Resilience. International Journal of Engineering & Extended Technologies Research (IJEETR), 5(2), 6263-6274.