Adaptive Chaos Engineering and AI-Driven Dependability Modeling for Resilient Cloud-Native and Safety-Critical Systems
Abstract
The increasing reliance on cloud-native architectures, serverless computing, and artificial intelligence-driven systems has introduced new complexities in ensuring system dependability, resilience, and safety. Traditional reliability engineering approaches, while foundational, are often insufficient in addressing the dynamic, distributed, and failure-prone nature of modern cloud ecosystems. This research presents a comprehensive, theoretically grounded framework that integrates chaos engineering, machine learning-based reliability modeling, and human-centered safety principles to enhance system robustness across cloud-native and safety-critical domains, including healthcare and autonomous systems.
The study synthesizes interdisciplinary perspectives from cloud computing, dependability engineering, fault injection methodologies, and AI-based safety analysis. It explores how experimental fault injection, particularly through chaos engineering practices, can be combined with predictive analytics to proactively identify and mitigate system vulnerabilities. Furthermore, the research emphasizes the importance of realism in error injection, the role of serverless architectures in resilience testing, and the integration of human factors in safety-critical environments.
A qualitative, theory-driven methodology is employed to construct a unified framework that bridges gaps between cloud system resilience and safety engineering in domains such as healthcare. The findings suggest that integrating chaos engineering with machine learning enhances predictive fault detection, improves failure propagation understanding, and supports adaptive system recovery mechanisms. Additionally, the study highlights that human-centered design and error taxonomy integration significantly contribute to reducing systemic risks in critical infrastructures.
The proposed framework offers a novel contribution by aligning chaos engineering practices with AI-driven reliability assessment and safety assurance principles. It provides a scalable and adaptable approach for organizations seeking to build resilient, trustworthy, and high-performance systems in increasingly complex technological landscapes.
Β
Keywords
References
Similar Articles
- Sanjay K. Morello, Securing Multi-Tenant FPGA Clouds: Architectures, Threats, and Integrated Defenses for Trusted Reconfigurable Computing , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 08 (2025): Volume 02 Issue 08
- Dr. Adrian Keller, Queuing-Integrated Deep Reinforcement Learning For Adaptive Task Scheduling In Cloud Data Centers , International Journal of Next-Generation Engineering and Technology: Vol. 3 No. 01 (2026): Volume 03 Issue 01
- Evan Richman, Advanced Evolutionary Optimization and Intelligent Sensor Integration for Electromagnetic Compatibility and Signal Integrity in Autonomous Vehicle Architectures , International Journal of Next-Generation Engineering and Technology: Vol. 3 No. 01 (2026): Volume 03 Issue 01
- Mateo Villarreal, Cloud-Enabled Big Data Analytics: Architectural Foundations, Security Challenges, And Sectoral Applications in The Era of Scalable Digital Intelligence , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 12 (2025): Volume 02 Issue 12
- Dr. Rico Fernandez, HARNESSING SOLAR ENERGY FOR COOLING: INNOVATIONS IN SOLAR THERMAL COOLING SYSTEMS , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 01 (2025): Volume 02 Issue 01
- Aghasi Gevorgyan, Cybersecurity in Networks Supporting Card Payment Systems , International Journal of Next-Generation Engineering and Technology: Vol. 3 No. 02 (2026): Volume 03 Issue 02
- Dr. Alistair J. Sterling, Architectural Frameworks for Multimodal Learning Analytics and Autonomic System Feedback: Integrating Physiological, Inertial, And Temporal Data for Enhanced Skill Acquisition , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 12 (2025): Volume 02 Issue 12
- Linh Thuy Nguyen, Kofi Mensah, OPTIMIZING SOFTWARE EFFORT ESTIMATION: A SYNERGISTIC HYBRID DEEP LEARNING FRAMEWORK WITH ENHANCED METAHEURISTIC OPTIMIZATION , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 11 (2025): Volume 02 Issue 11
- Paul Hathaway, A Comparative Analysis of Data-Driven Decision Support Systems: Bridging Clinical Epidemiology, Public Health Informatics, And Predictive E-Commerce Analytics in The Era of Big Data , International Journal of Next-Generation Engineering and Technology: Vol. 3 No. 01 (2026): Volume 03 Issue 01
- Dr. Jonathan R. Whitmore, Architecting Resilient Continuous Integration and Delivery Ecosystems for Large-Scale Java Enterprises: An Integrated Perspective on Information Needs, Modular Evolution, and Pipeline Governance , International Journal of Next-Generation Engineering and Technology: Vol. 2 No. 10 (2025): Volume 02 Issue 10
You may also start an advanced similarity search for this article.