Chaos Engineering¶
Architectural Context
Detailed reference for Chaos Engineering in the context of Platform & Site Reliability.
Standard Reference¶
- Chaos Mesh β 7710 [ENTERPRISE-STABLE]
- pingcap.com: chaos-mesh-action: Integrate Chaos Engineering into Your CI [COMMUNITY-TOOL]
- openshift.com: Introduction to Kraken, a Chaos Tool for OpenShift/Kubernetes [COMMUNITY-TOOL]
- blog.flant.com: Open Source solutions for chaos engineering in Kubernetes [COMMUNITY-TOOL] β - kube-monkey
- chaoskube
- Chaos Mesh
- Litmus Chaos
- Chaos Toolkit
- KubeInvaders
- blog.palark.com: Attaining harmony of chaos in Kubernetes with Chaos Mesh [COMMUNITY-TOOL]
- Azure Chaos Studio [COMMUNITY-TOOL] β - techcommunity.microsoft.com: Announcing the Public Preview of Azure Chaos Studio
- Awesome Chaos Engineering β 6564 [ENTERPRISE-STABLE]
- thenewstack.io: Chaos Engineering Is Not Just for Ops [COMMUNITY-TOOL]
- thenewstack.io: Why Chaos Engineering Isnβt Just for Operations [COMMUNITY-TOOL]
- medium.com/adidoescode: Chaos Engineering: How simulating adversity can' help build eCommerce Resilience [COMMUNITY-TOOL]
- opsmx.com: What is Chaos Engineering? [COMMUNITY-TOOL]
- aws.amazon.com: Verify the resilience of your workloads using Chaos Engineering [COMMUNITY-TOOL]
- faun.pub: What is Chaos Engineering? [COMMUNITY-TOOL]
- reddit: Help with Kube Monkey setup [COMMUNITY-TOOL]
- GitHub: kube-monkey β 3055 [ENTERPRISE-STABLE]
- GitHub: monkey-ops, Openshift compliant, no cluster-admin required [COMMUNITY-TOOL]
- Litmus Chaos is a toolset to do chaos engineering in a kubernetes native way. Litmus provides chaos CRDs for Cloud-Native developers and SREs to inject, orchestrate and monitor chaos to find weaknesses in Kubernetes deployments β 5407 [ENTERPRISE-STABLE]
- thenewstack.io: Using Chaos Engineering to Improve the Resilience of Stateful' Applications on Kubernetes [COMMUNITY-TOOL]
- infoq.com: Chaos Engineering on Kubernetes : Chaos Mesh Generally Available' with v1.0 [COMMUNITY-TOOL]
- chaos-mesh.org: Chaos Mesh 1.0: Chaos Engineering on Kubernetes Made Easier [COMMUNITY-TOOL]
- thenewstack.io: Develop a Daily Reporting System for Chaos Mesh to Improve' System Resilience [COMMUNITY-TOOL]
- thenewstack.io: Chaos Engineering Progressively Moves to Production [COMMUNITY-TOOL]
- PowerfulSeal β 1977 [COMMUNITY-TOOL]
- BuggyApp: Simulate performance problems [COMMUNITY-TOOL]
- medium.com: Getting Started with Chaos Engineering [COMMUNITY-TOOL]
- Chaos Mesh π [COMMUNITY-TOOL]
- opensource.com: 5 lessons I learned about chaos engineering for Kubernetes [COMMUNITY-TOOL]
- thenewstack.io: Chaos Engineering Made Simple [COMMUNITY-TOOL]
- thenewstack.io: Use Chaos Engineering to Strengthen Your Incident Response [COMMUNITY-TOOL]
- thenewstack.io: Operationalizing Chaos Engineering with GitOps [COMMUNITY-TOOL]
- medium.com/better-practices: Learn how your Kubernetes clusters respond' to failure using Gremlin and Grafana [COMMUNITY-TOOL]
- aws.amazon.com: Chaos Engineering with LitmusChaos on Amazon EKS [COMMUNITY-TOOL]
- blog.container-solutions.com: Comparing Chaos Engineering Tools for Kubernetes' Workloads [COMMUNITY-TOOL]
- awstip.com: Kubernetes Chaos Monkey: A Scheduled Random Pod Deletion Python' Script for Testing Cluster Resilience [COMMUNITY-TOOL]
- medium.com/@alex.ivenin: Chaos engineering in kubernetes [COMMUNITY-TOOL]
- thenewstack.io: Breaking Serverless on Purpose with Chaos Engineering [COMMUNITY-TOOL]
- chaosblade β 6334 [ENTERPRISE-STABLE]
- aws.amazon.com: Automating and Scaling Chaos Engineering using AWS Fault' Injection Simulator [COMMUNITY-TOOL]
Platform Engineering¶
Architectural Patterns¶
Internal Developer Platforms¶
- Platform Democracy: Rethinking Who Builds and Consumes Your Internal Platform [ADVANCED LEVEL] [COMMUNITY-TOOL] β An analytical piece explaining Platform Democracy as an operational framework. Discusses user-centric workflows when designing internal developer platform structures (IDPs).
Public Cloud Platforms¶
AWS¶
Chaos Engineering (1)¶
- Chaos engineering on Amazon EKS using AWS Fault Injection Simulator [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β Guided workflow utilizing AWS FIS (Fault Injection Simulator) to execute controlled resilience and disruption experiments against EKS node groups and containers. Demonstrates monitoring system reaction metrics and reinforcing application failover.
π‘ Explore Related: DevOps | QA | Project Management Methodology