Recreating 3 Common Outages with Gremlin Scenarios

Webinar

In this live tutorial, we show you how Gremlin Scenarios can be used to recreate complex failure conditions and proactively prepare your systems to withstand them.

On-demand

About this webinar

In October 2016, large swathes of the internet were knocked offline. The cause, as we would eventually learn, was a distributed denial-of-service (DDoS) attack against the DNS provider Dyn. Many of the internet’s most popular sites relied on Dyn and experienced extended downtime, with a significant impact on revenue, engineering velocity, and brand reputation.

Since the Dyn outage, many major services have built in redundancy around their DNS provider, allowing them to fail over gracefully to a backup service should the primary provider become unavailable.

This is just one example of a failure scenario that caused widespread outages. Other, more common scenarios can cause problems too.

You’ll walk away understanding how Gremlin can be used to recreate complex failure conditions and proactively prepare your systems to withstand them. You’ll also be able to put your questions to our experts during the Q&A segment.

Agenda

In this live session, we will introduce 3 failure scenarios that can cause downtime, ranging from simple to complex: autoscaling errors, unreliable networks, and DNS outages.
You'll see real-world examples of incidents caused by these failure scenarios.
We will demonstrate how you can recreate these failure conditions and test your systems for resilience against them using Gremlin’s new Recommended Scenarios.
Finally, you will get a framework for building your own Custom Scenarios specific to your use case.
