A list of my chaos engineering related blog posts and open-source projects.
Series on chaos engineering
This is a collection of three articles on chaos engineering that present and discuss the different phases of the chaos engineering process.
Part 1: The art of breaking things purposefully
In Part 1 of this series, I introduce chaos engineering and explain how it helps uncover and fix unknowns in your system before they become outages in production; and also how it fosters positive cultural change inside organizations.
Chaos Engineering — Part 1
The art of breaking things purposefullymedium.com
Part 2: Planning your first experiment
In Part 2, I discuss areas to invest in to start designing your first chaos engineering experiments and pick up the right hypothesis.
Chaos Engineering — Part 2
Planning your first experimentmedium.com
Part 3: Failure Injection — Tools and Methods
In Part 3, I focus on the experiment itself and present a collection of tools and methods that cover the broad spectrum of failure injection necessary for running chaos engineering experiments.
Chaos Engineering — Part 3
Failure Injection — Tools and Methodsmedium.com
Climbing out of the Chasm: Unleashing the Potential of Chaos Engineering
Climbing out of the Chasm
Unleashing the Potential of Chaos Engineeringmedium.com
Chaos Engineering with AWS Fault Injection Simulator, Introduction
What is AWS Fault Injection Simulator and why you should care.
In this blog post I will explain a little bit about distributed systems and why chaos engineering, a relatively new…medium.com
Injecting custom faults with AWS Fault Injection Simulator
Injecting custom faults with AWS Fault Injection Simulator
Part 2 — AWS Fault Injection Simulator seriesmedium.com
Testing Spot Instance interruptions with AWS Fault Injection Simulator
Testing Spot Instance interruptions with AWS Fault Injection Simulator
Data-driven chaos engineeringmedium.com
Schedule a recurring experiment with AWS Fault Injection Simulator (FIS)
Schedule a recurring experiment with AWS Fault Injection Simulator (FIS)
Chaos Engineering on AWSmedium.com
Monitoring Chaos Engineering Experiments with Datadog
Monitoring Chaos Engineering Experiments with Datadog
Tutorial — Using Datadog Alarms with Amazon EventBridge to Safely Monitor AWS Fault Injection Simulator Experimentsmedium.com
How AWS Lambda uses AWS FIS to validate their Resilience
AWS Lambda: Resilience under-the-hood
medium.com
Chaos Engineering — Looking back to look forward: Thoughts on chaos engineering and AWS Fault Injection Simulator
Looking back to look forward
Thoughts on chaos engineering and AWS Fault Injection Simulatormedium.com
Chaos Engineering — What and who is a chaos engineer?
Chaos Engineering Q&A — What and who is a chaos engineer?
Answering questions from my chaos engineering webinarmedium.com
Chaos Engineering — How to safely inject failure in your application?
Chaos Engineering Q&A — How to safely inject failure?
Answering questions from my webinarmedium.com
Practical Chaos Engineering
A set of articles presenting practical implementations of chaos engineering experiments.
Building resilient services at Prime Video with chaos engineering
In this article, we present an open source library, AWSSSMChaosRunner based on AWS System Manager. We also provide an example of integration and explain how Prime Video used this library to prevent potentially customer-impacting outages.
Building resilient services at Prime Video with chaos engineering
Originally published at https://aws.amazon.com on August 18, 2020 by Varun Jewalikar and Adrian Hornsbymedium.com
Creating your own Chaos Monkey with AWS Systems Manager Automation
In this article, I show how to create your own Chaos Monkey (randomly killing EC2 instances) and opened source ready-made failure injection to get started. Give it a try — you won’t regret it!
Creating your own Chaos Monkey with AWS Systems Manager Automation
Chaos Engineering on AWSmedium.com
Injecting Chaos to Amazon EC2 using AWS System Manager
In this article, I show how to inject failure into your application using AWS System Manager and opened source plenty of ready-made failure injection to get started. Try it — it’s pretty awesome!
Injecting Chaos to Amazon EC2 using AWS System Manager
Ansible, Chef, and Bastion host — these are the standard, widely known tools and techniques, used to execute ad-hoc…medium.com
adhorn/chaos-ssm-documents
Collection of SSM Documents. This allows to conduct small chaos engineering experiments for your Amazon EC2 instances…github.com
Injecting Chaos to AWS Lambda functions using Lambda Layers
In this article, I explain how to use AWS Lambda Layers to conduct chaos engineering experiments on Lambda functions.
Original post:
Injecting Chaos to AWS Lambda functions using Lambda Layers
“As far as the laws of mathematics refer to reality, they are not certain, and as far as they are certain, they do not…medium.com
Update:
Injecting Chaos to AWS Lambda functions with Lambda Layers- RELOADED
medium.com
adhorn/aws-lambda-chaos-injection
chaos_lambda is a small library injecting chaos into AWS Lambda. It offers simple python decorators to do delay…github.com
adhorn/aws-lambda-layer-chaos-injection
chaos_lib is a small library injecting chaos into AWS Lambda Layers. It offers simple python decorators to do delay…github.com
Collection of python scripts to run failure injection on AWS infrastructure
adhorn/aws-chaos-scripts
⚠️USE AT YOUR OWN RISK ⚠️ Using these scripts may create an unreasonable risk. If you choose to use the scripts…github.com