Chaos Engineering for PCF — SpringOne Platform 2018

September 29, 2018

Modern Internet-scale microservice architectures exhibit complex communication behavior and failure scenarios with chaotic behavior (a.k.a the Butterfly Effect) that may lead to large scale disruptive events. This complexity comes from the Pivotal Cloud Foundry (PCF) components, services running thereon, and the underlying infrastructure necessary to provide highly available compute, network, security, storage, persistence services. For a distributed microservice architecture to function ideally, these elements must all work in tandem and tolerate failure. To systematically verify that a system can tolerate failure, a disciplined approach is necessary. One such approach is Chaos Engineering. This proposal demonstrates the approach and the custom tools T-Mobile is building to purposefully break systems, identify weaknesses and take corrective actions. It's an enhanced API on top of the ChaosLemur project for introducing more complex failure scenarios into the PCF environment. Speakers: Karun Chennuri and Ramesh Krishnaram , T-Mobile

Previous
The Anatomy of Building a Compliant PCF Service in a Limited Connectivity Environment #BoomSauce
The Anatomy of Building a Compliant PCF Service in a Limited Connectivity Environment #BoomSauce

Raytheon, a multinational Defense and Cybersecurity industry leader, is on a Digital Transformation journey...

Next
Continuous Delivery to the Cloud Wreaks of Humanity (and That’s a Good Thing!)
Continuous Delivery to the Cloud Wreaks of Humanity (and That’s a Good Thing!)