Managed Kubernetes in Public Cloud¶
Architectural Context
Detailed reference for Managed Kubernetes in Public Cloud in the context of Cloud Providers (Hyperscalers).
Standard Reference¶
- neal-davis.medium.com: ECS vs EC2 vs Lambda [COMMUNITY-TOOL]
- Banzai Cloud π [COMMUNITY-TOOL]
- medium.com/@ishana98dadhich: Integrating AWS Secret Manager with EKS and' use Secrets inside the Pods: Part-1 [COMMUNITY-TOOL]
- mehighlow.medium.com: Hardened-AKS/Secrets [COMMUNITY-TOOL]
- faun.pub: External Secret Operator on AKS (with Terraform) for Azure Key' Vault Integration (with Workload Identity) [COMMUNITY-TOOL]
- medium.com: Kubernetes Cloud Services: Comparing GKE, EKS and AKS [COMMUNITY-TOOL]
- medium: State of Managed Kubernetes 2020 [COMMUNITY-TOOL]
- medium: Managed Kubernetes Services Compared: GKE vs. EKS vs. AKS [COMMUNITY-TOOL]
- otomi.io π [COMMUNITY-TOOL]
- udemy.com: amazon eks starter kubernetes on aws [COMMUNITY-TOOL]
- magalix.com: Deploying Kubernetes Cluster With EKS π [COMMUNITY-TOOL]
- Deploying Infrastructure (FrontEnd + BackEnd) on AWS using Amazon EKS [COMMUNITY-TOOL]
- medium: Building the CI/CD of the Future, Creating the EKS Cluster π [COMMUNITY-TOOL]
- daveops.xyz: Administrar usuarios en EKS [COMMUNITY-TOOL]
- medium: Designing a Kubernetes Cluster with Amazon EKS From Scratch π [COMMUNITY-TOOL]
- en.sokube.ch: AWS + Kubernetes = AWS Elastic Kubernetes Service (EKS) π [COMMUNITY-TOOL]
- medium: Run Kubernetes Production Environment on EC2 Spot Instances With' Zero Downtime: A Complete Guide [COMMUNITY-TOOL]
- releaseops.io: Scaling Kubernetes Deployments in AWS with Container Insights' Metrics [COMMUNITY-TOOL]
- medium: Create Kubernetes Cluster On AWS EKS [COMMUNITY-TOOL]
- info.acloud.guru: Scaling the hottest app in tech on AWS and Kubernetes [COMMUNITY-TOOL]
- medium: How to Deploy an EKS stack in AWS? [COMMUNITY-TOOL]
- faun.pub: Upgrading and Scaling Kubernetes cluster in AWS [COMMUNITY-TOOL]
- particule.io: Create Kubernetes federated clusters on AWS [COMMUNITY-TOOL]
- betterprogramming.pub: Amazon EKS Is Eating My IPs! [COMMUNITY-TOOL]
- blog.usejournal.com: Spice up Your Kubernetes Environment with AWS Lambda' π [COMMUNITY-TOOL]
- faun.pub: Kubernetes Multi-tenancy with Amazon EKS: Best practices and considerations' π [COMMUNITY-TOOL]
- aws.plainenglish.io: 6 Tips to Improve Availability with AWS Load Balancers' and Kubernetes [COMMUNITY-TOOL]
- blog.searce.com: Optimise cost for AWS EKS cluster using Spotinst π [COMMUNITY-TOOL]
- medium.com/@abhinav.ittekot: Granting IAM permissions to pods in EKS using' OIDC [COMMUNITY-TOOL]
- medium.com/@radha.sable25: Enabling IAM users/roles Access on Amazon EKS' cluster [COMMUNITY-TOOL]
- medium.com/avmconsulting-blog: Installing Vault On EKS With TLS And Persistent' Storage [COMMUNITY-TOOL]
- dzone.com: How to Use AWS IAM Role on AWS EKS PODs π [COMMUNITY-TOOL]
- akintola-lonlon.medium.com: AWS Kubernetes: The #1 Rule You Need To Master' Before Going To Production. [COMMUNITY-TOOL]
- amod-kadam.medium.com: Are there two Load Balancer Controllers with EKS?' π [COMMUNITY-TOOL]
- joachim8675309.medium.com: ExternalDNS with EKS and Route53 [COMMUNITY-TOOL]
- opssorry.substack.com: GitOps: A Simple Approach to using AWS Secrets' Manager with Kubernetes π [COMMUNITY-TOOL]
- medium.com/@chandranathmondal: Self-service Amazon EKS Cluster provisioning' with Kubernetes configuration applied π [COMMUNITY-TOOL]
- eng.grip.security: Enabling AWS IAM Group Access to an EKS Cluster Using' RBAC [COMMUNITY-TOOL]
- medium.com/@andriikrymus: DNS config for EKS [COMMUNITY-TOOL]
- silvr.medium.com: Using Kyverno To Enforce AWS Load Balancer Annotations' For Centralized Logging To S3 [COMMUNITY-TOOL]
- blog.jimmyray.io: Kubernetes Workload Identity with AWS SDK for Go v2 [COMMUNITY-TOOL]
- medium.com/geekculture: EKS β Kubernetes β Not Ready nodes [COMMUNITY-TOOL]
- faun.pub: How to access AWS services from EKS [COMMUNITY-TOOL]
- faun.pub: AWS EKS: The Ultimate Guide To Deploy AWS Load Balancer Controller' add-on [COMMUNITY-TOOL]
- medium.com/@ankit.wal: Understanding IAM roles for service accounts, IRSA,' on AWS EKS [COMMUNITY-TOOL]
- levelup.gitconnected.com: Running Workflows on windows with Jenkins pipeline' and Kubernetes [COMMUNITY-TOOL]
- nivogt.medium.com: Boost your Kubernetes clusterβs Autoscaler on AWS EKS' with Karpenter [COMMUNITY-TOOL]
- towardsaws.com: Autoscale Kubernetes Metrics Server on Amazon EKS [COMMUNITY-TOOL]
- faun.pub: Analyze AWS EKS Audit logs with Falco [COMMUNITY-TOOL]
- hardiks.medium.com: Where should you manage your Kubernetes in 2023? Amazon' ECS or EKS [COMMUNITY-TOOL]
- awstip.com: Amazon Elastic Kubernetes Service (Amazon EKS) β The Only Resource' Hub You Ever Need [COMMUNITY-TOOL]
- awstip.com: Working The Amazon EKS Immersion Workshop β Chapter 1 β Deploying' A Microservices Application In A Kubernetes Cluster [COMMUNITY-TOOL]
- blog.antoinechoula.ga: Native EKS Ingress with AWS Load Balancer Controller [COMMUNITY-TOOL]
- devopslearning.medium.com: Lesson learned while scaling Kubernetes cluster' to 1000 pods in AWS EKS [COMMUNITY-TOOL]
- sitepoint.com: Getting Started With Kubernetes on AWS Tutorial (2023 Update) [COMMUNITY-TOOL]
- medium.com: Saving costs in Google Kubernetes Engine using Spot VMs [COMMUNITY-TOOL]
- medium.com/@benjamin.christmann_12432: Setting up your first EKS cluster' on AWS: some practical tips [COMMUNITY-TOOL]
- blog.ratnopamc.com: Reduce cross-AZ traffic costs on EKS using topology' aware hints [COMMUNITY-TOOL]
- medium.com/@danielresponda: Testing Spot Reclamation Mechanisms with AWS' Node Termination Handler and Kubernetes Autoscaler [COMMUNITY-TOOL]
- medium.com/@leocherian: Simple CDK app to create EKS Cluster [COMMUNITY-TOOL]
- blog.clouddrove.com: AWS EKS Blue/Green Deployment with Best Practices [COMMUNITY-TOOL]
- blog.stackademic.com: Create the AWS EKS Cluster with a Managed Node Group' Using Custom Launch Templates [COMMUNITY-TOOL]
- blog.devops.dev: HACKING KUBERNETES in AWS [COMMUNITY-TOOL]
- rahulbhatia1998.medium.com: Designing A Multi-Region Kubernetes Cluster' For Disaster Recovery On AWS EKS [COMMUNITY-TOOL]
- towardsaws.com: From Scratch to Production: Deploying EKS Clusters and Applications' with CI/CD using Jenkins and Terraform [COMMUNITY-TOOL]
- awstip.com: Per-pod PIDs limit on EKS [COMMUNITY-TOOL]
- medium.com/ekino-france: Addressing private IPv4 shortage: 5 Strategies' for Amazon EKS [COMMUNITY-TOOL]
- medium.com/scout24-engineering: How did we upgrade our EKS clusters from' 1.15 to 1.22 without K8s knowledge? [COMMUNITY-TOOL]
- marcincuber.medium.com: Amazon EKS Upgrade Journey From 1.24 to 1.25 [COMMUNITY-TOOL]
- gokulchandrapr.medium.com: Amazon EKS Anywhere & EKS Connector [COMMUNITY-TOOL]
- ambar-thecloudgarage.medium.com: EKS Anywhere., decoding the architecture. [COMMUNITY-TOOL]
- blog.techknowtrendz.com: Taking Amazon EKS Anywhere for a spin [COMMUNITY-TOOL]
- medium: Kubernetes + EKS + Canary Deployment [COMMUNITY-TOOL]
- mehmetozkaya.medium.com: Deploying .Net Microservices to Azure Kubernetes' Services(AKS) and Automating with Azure DevOps [COMMUNITY-TOOL]
- faun.pub: How to implement Azure Kubernetes Service (AKS) in Cloud? [COMMUNITY-TOOL]
- joachim8675309.medium.com: AKS with GRPC and ingress-nginx [COMMUNITY-TOOL]
- medium: AKS with Calico Network Policies [COMMUNITY-TOOL]
- joachim8675309.medium.com: AKS with Istio Service Mesh [COMMUNITY-TOOL]
- blog.kasten.io: AKS and Storage: How to Design Storage for Cloud Native' Applications [COMMUNITY-TOOL]
- blog.kasten.io: AKS and Storage: Performance Differences Among K8s Storage' Services [COMMUNITY-TOOL]
- medium: AKS β different load balancing options. When to use what? [COMMUNITY-TOOL]
- medium: Going multicloud with kubernetes and Azure Front Door [COMMUNITY-TOOL]
- akhilsharma.work: How to list Azure RBAC Roles to Secure AKS Clusters [COMMUNITY-TOOL]
- logz.io: Collecting Metrics from Windows Kubernetes Nodes in AKS π [COMMUNITY-TOOL]
- medium.com/kocsistem: Installation Internal Nginx Ingress for a Private' AKS Cluster [COMMUNITY-TOOL]
- joachim8675309.medium.com: ExternalDNS with AKS & Azure DNS [COMMUNITY-TOOL]
- medium.com/dzerolabs: Accessing Azure Key Vault Secrets in Azure Kubernetes' with Secrets Store CSI Driver π [COMMUNITY-TOOL]
- medium.com/@gjoshevski: Reduce the cost of running AKS cluster by leveraging' Azure Spot VMs| 70% and more ππ [COMMUNITY-TOOL]
- medium.com/@vamsi.lakshman: Overview of Azure Kubernetes Services Networking' Models [COMMUNITY-TOOL]
- medium.com/credera-engineering: How to blue-green deploy an AKS cluster [COMMUNITY-TOOL]
- medium.com/@danieljimgarcia: The Application Gateway Ingress Controller' is broken π [COMMUNITY-TOOL]
- medium.com/@ershivamgupta: Disaster Recovery Solution for Azure Kubernetes' Service (AKS) Persistent Volume Storage π [COMMUNITY-TOOL]
- medium.com/microsoftazure: Automating Managed Prometheus and Grafana with' Terraform for scalable observability on Azure Kubernetes Service and Istio π [COMMUNITY-TOOL]
- medium.com/@GiantSwarm: Deep Dive Into Kubernetes Networking in Azure [COMMUNITY-TOOL]
- medium.com/@lfoster49203: Kubernetes on Azure: Setting up a cluster on Microsoft' Azure (with Azure AKS) [COMMUNITY-TOOL]
- medium.com/@pauldotyu: Effortlessly Deploy to AKS with Open Source Tools' Draft and Acorn [COMMUNITY-TOOL]
- medium.com/adessoturkey: Azure DevOps Agents on AKS with the kaniko Option [COMMUNITY-TOOL]
- inder-devops.medium.com: AKS Networking Deep Dive: Kubenet vs Azure-CNI' vs Azure-CNI (overlay) [COMMUNITY-TOOL]
- medium.com/@anjkeesari: Install Grafana Loki-Stack Helmchart in Azure Kubernetes' Services (AKS) [COMMUNITY-TOOL]
- blog.stackademic.com: Advanced End-to-End DevSecOps Kubernetes Three-Tier' Project using Azure AKS, fluxCD, Prometheus, Grafana, and GitLab [COMMUNITY-TOOL]
- blog.doit-intl.com: How to Set Up Multi-Cluster Load Balancing with GKE [COMMUNITY-TOOL]
- medium: How to provision Kubernetes Cluster in GCP Cloud (K8s)? π [COMMUNITY-TOOL]
- faun.pub: How to automate the setup of a Kubernetes cluster on GCP [COMMUNITY-TOOL]
- medium.com/@glen.yu: Getting started with eBPF and Cilium on GKE [COMMUNITY-TOOL]
- medium.com/@glen.yu: NGINX Ingress or GKE Ingress? [COMMUNITY-TOOL]
- medium.com/google-developer-experts: Getting started with GKE Gateway controller [COMMUNITY-TOOL]
- medium.com/google-cloud: Monitoring Kubernetes Clusters on GKE (Google Container' Engine) [COMMUNITY-TOOL]
- blog.devgenius.io: Explore API Priority and Fairness to Ease the Load of' the APIServer [COMMUNITY-TOOL]
- faun.pub: Make Your Kubernetes Cluster Highly Available and Fault Tolerant' π [COMMUNITY-TOOL]
- medium.com/@pbijjala: reCap: Kube vrs Cloud DNS in GKE [COMMUNITY-TOOL]
- medium.com/google-cloud: Ingress in Google Kubernetes Products [COMMUNITY-TOOL]
- medium.com/@pbijjala: Considerations for Hardening your GKE, a workload' perceptive [COMMUNITY-TOOL]
- medium.com/@jjlakis: GCP Secret Manager with self-hosted Kubernetes [COMMUNITY-TOOL]
- tech.loveholidays.com: GKE Multi-Cluster Services β one bad probe away from' disaster [COMMUNITY-TOOL]
- Looking for GPU Capacity ? DWS got you covered ! [COMMUNITY-TOOL]
- medium.com/google-cloud: Understanding health checks in GKE & Gateway API [COMMUNITY-TOOL]
- medium: Multizone Kubernetes and VPC Load Balancer Setup with terraform [COMMUNITY-TOOL]
- Linode Kubernetes Engine (LKE) [COMMUNITY-TOOL]
- medium: Create Kubernetes Cluster Using Linode LKE [COMMUNITY-TOOL]
- blog.ediri.io: DigitalOcean Kubernetes Challenge [COMMUNITY-TOOL]
Application Delivery¶
CICD and GitOps¶
- (2023) insights.project-a.com: Using GitHub Actions to deploy to Kubernetes in GKE π [EN CONTENT] [GUIDE] π [COMMUNITY-TOOL] [GUIDE] β Outlines pipeline setup using GitHub Actions to deploy application loads onto GKE. Focuses on setting up Google Workload Identity Federation to secure registry authentication and cluster connections.
- (2022) blog.baeke.info: Trying out Draft 2 on AKS [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Evaluates features in Azure Draft v2 on a running AKS instance. Demonstrates bootstrapping manual code, automated configuration outputs, and continuous build integration tests on Azure.
- Azure/Draft π β 642 [EN CONTENT] [COMMUNITY-TOOL] β The official Azure Draft project designed to ease early-stage developer transitions onto Kubernetes. Scans source directories to dynamically output standard Dockerfiles, Kubernetes manifests, Helm deployments, and pipeline workflows.
- youtube: Day -25 | No Dockerfile, No K8s Manifests | Setup CI/CD in 5' minutes for any programming language [EN CONTENT] [COMMUNITY-TOOL] β A video guide evaluating rapid deployment processes. Demonstrates using Azure Draft to generate necessary Dockerfiles and manifest definitions directly from code to build functional CI/CD loops with minimal overhead.
Cloud Infrastructure¶
Orchestration¶
AWS EKS Tools¶
- (2026) eksctl: EKS installer β 5202 [EN CONTENT] [ADVANCED LEVEL] πππππ [DE FACTO STANDARD] β The official CLI tool for creating and managing EKS clusters on AWS. Automates CloudFormation stacks, node group configurations, IAM integration (IRSA), and VPC provisions.
Cluster Resource Management¶
- (2022) Allocatable memory and CPU in Kubernetes Nodes π [EN CONTENT] [ADVANCED LEVEL] πππ [COMMUNITY-TOOL] β Technical breakdown of node allocatable resources in Kubernetes. Explains how
kube-reserved,system-reserved, and eviction thresholds reduce physical capacity available for user pods.
Cluster Security¶
- (2021) Amazon EKS Security Best Practices [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Exhaustive architectural guide compiling key security recommendations for EKS. Addresses IAM integration, VPC configurations, network segmentation, and host vulnerability hardening.
- EKS Service Accounts Explained [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Architectural deep-dive explaining IAM Roles for Service Accounts (IRSA) in EKS. Demystifies OIDC providers, identity mapping, and least-privilege pod-level AWS credential injection.
Managed Kubernetes¶
- (2023) community.aws/kubernetes [EN CONTENT] [COMMUNITY-TOOL] β AWS builder portal hub focusing on EKS and cloud-native practices, featuring deep-dives, developer tutorials, and best practices.
- (2021) infoworld.com: 6 reasons to switch to managed Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Explores key drivers for offloading Kubernetes cluster administration to managed services. Examines control-plane management, security patching, and scaling benefits.
- (2021) redhat.com: What architects need to know about managed Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Strategic overview from Red Hat outlining what technical architects must consider regarding portability, vendor lock-in, and operational boundaries when adopting cloud-managed Kubernetes.
- (2021) acloudguru.com: AKS vs EKS vs GKE: Managed Kubernetes services compared [EN CONTENT] [COMMUNITY-TOOL] β Compares EKS, GKE, and AKS across performance, simplicity, and pricing benchmarks to help users choose a provider depending on their existing cloud footprint.
- armosec.io: Which Managed Kubernetes Is Right for Me? [EN CONTENT] [COMMUNITY-TOOL] β Comparative analysis evaluating EKS, AKS, and GKE. Focuses on security defaults, networking models, IAM integration, and pricing models to assist architects in selection.
- dev.to/thenjdevopsguy: AKS vs EKS vs GKE [EN CONTENT] [COMMUNITY-TOOL] β Community comparison comparing control plane cost, upgrade reliability, networking plugins, and developer experience across AKS, EKS, and GKE.
- youtube: Kubernetes Comparison [EN CONTENT] [COMMUNITY-TOOL] β Video walkthrough assessing the features and integration depths of EKS, GKE, AKS, and self-hosted k3s deployments.
Market Trends¶
- (2022) infoworld.com: CNCF survey: Managed Kubernetes becomes the norm [EN CONTENT] [COMMUNITY-TOOL] β Reviews CNCF annual survey results showing massive adoption of managed Kubernetes over self-hosted alternatives, mapping out industrial patterns in enterprise deployments.
Platform Engineering¶
- thenewstack.io: Otomi Container Platform Offers an Integrated Kubernetes' Bundle [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β In-depth profile of Otomi Container Platform, highlighting how it integrates open-source tools (such as Cert-Manager, Knative, Prometheus) into an out-of-the-box developer platform.
Storage¶
Cloud-Native Storage¶
- thenewstack.io: Install and Configure OpenEBS on Amazon Elastic Kubernetes' Service [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Tutorial on integrating OpenEBS as a container-attached storage engine on AWS EKS to manage local and persistent block storage natively.
Cloud Providers¶
AWS¶
Continuous Deployment¶
- Using CDK to perform continuous deployments in multi-region Kubernetes environments [ADVANCED LEVEL] [COMMUNITY-TOOL] β AWS Container Blog technical post showing how to orchestrate multi-cluster and multi-region deployments using AWS CDK. Demonstrates declarative CD pipelines, traffic management, and code configurations for reliable, global application synchronization.
AWS EKS¶
Autoscaling¶
- aws.amazon.com: Autoscaling EKS on Fargate with custom metrics [ADVANCED LEVEL] [COMMUNITY-TOOL] β Explores the architectural patterns for scaling serverless Kubernetes pods on AWS Fargate using Prometheus metrics processed via KEDA. Since traditional DaemonSet-based collectors are incompatible with Fargate, this guide establishes a robust sidecar pattern for metric extraction. It bridges the gap between serverless execution and custom metric-driven elasticity.
- itnext.io: Running resilient workloads in EKS using Spot instances [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A practical operational guide highlighting strategies for reliable execution of workloads on AWS Spot Instances within EKS. It showcases how to leverage Karpenter or AWS Node Termination Handler alongside pod disruption budgets (PDBs) to handle instance interruptions gracefully. Reduces platform overhead by up to 70% while preserving uptime.
- Eliminate Kubernetes node scaling lag with pod priority and over-provisioning [COMMUNITY-TOOL] β Introduces a smart autoscaling architectural pattern: using low-priority 'placeholder' pods to reserve capacity inside an AWS EKS cluster. When a real, higher-priority pod is scheduled, Kubernetes evicts the placeholders, initiating immediate node-level preemption while a new node spins up asynchronously. Eliminates scaling delay in performance-sensitive services.
Batch Workloads¶
- thenewstack.io: Amazon Web Services Gears Elastic Kubernetes Service for' Batch Work [ADVANCED LEVEL] [COMMUNITY-TOOL] β An analytical report outlining AWS upgrades focused on optimizing EKS for high-performance computing (HPC) and batch processing tasks. It explores the native integrations with Karpenter and AWS Batch, aimed at resolving historical scheduling bottlenecks. It details how EKS adapts to heavy-load machine learning and computational workloads.
Case Studies¶
- Scaling Amazon EKS and Cassandra Beyond 1,000 Nodes [ADVANCED LEVEL] [COMMUNITY-TOOL] β An engineering case study detailing the technical constraints and performance tunings executed to host Apache Cassandra on a 1,000+ node EKS cluster. It addresses VPC IP limits, CoreDNS bottlenecks, etcd performance under high resource counts, and AWS storage throughput tuning. Exceptional resource for massive-scale system design.
Development Tools¶
- (2020) github.com/rebataur/djkube β 27 π [LEGACY] β A lightweight, community-driven development aid designed to bridge local filesystems with Kubernetes volumes. Live Grounding indicates the project has had minimal recent activity, classifying it as a legacy utility. It may serve as a historical reference implementation for simple synchronization mechanisms.
FinOps¶
- aws.amazon.com: Understanding and Cost Optimizing Amazon EKS Control Plane' Logs [COMMUNITY-TOOL] β Analyzes CloudWatch logging costs generated by EKS API server audit logs, offering practical strategies to filter and optimize them. It details how to use Logstash, FluentBit, or CloudWatch filter patterns to eliminate verbose, low-value telemetry. Crucial for enterprise platform administrators looking to cut hidden SaaS expenses.
- AWS and Kubecost collaborate to deliver cost monitoring for EKS customers [COMMUNITY-TOOL] β Documents the native integration of Kubecost with EKS to offer real-time, granular cost visibility for cloud platform operators. It highlights cost attribution strategies across namespaces, controller types, and pods. This collaboration ensures users have access to reliable financial telemetry directly within their cluster control systems.
GitOps¶
- aws.amazon.com: GitOps model for provisioning and bootstrapping Amazon' EKS clusters using Crossplane and Argo CD [ADVANCED LEVEL] [ENTERPRISE-STABLE] β An advanced GitOps architectural pattern demonstrating the unification of Crossplane and Argo CD on AWS. By leveraging Crossplane to declare AWS resources as Kubernetes Custom Resources, teams can manage both physical EKS infrastructure and application deployments through a single unified GitOps pipeline. Enhances cloud control-plane convergence.
- aws.amazon.com: Blue/Green Kubernetes upgrades for Amazon EKS Anywhere using' Flux [ADVANCED LEVEL] [COMMUNITY-TOOL] β Details a zero-downtime, blue/green cluster upgrade strategy designed for on-premises EKS Anywhere clusters. Leveraging the Flux GitOps controller, this pattern automates the provisioning of parallel target cluster versions and safe traffic shifts. Bridges GitOps automation with on-prem infrastructure orchestration rules.
Hybrid Cloud¶
- EKS Anywhere: github.com/aws/eks-anywhere β 2095 [ADVANCED LEVEL] [ENTERPRISE-STABLE] β An open-source tool that allows operators to easily create and run on-premises Kubernetes clusters using the curated distribution of Amazon EKS. It brings EKS lifecycle management tooling, security tooling, and optimization practices into local bare-metal or VMware environments. Bridges hybrid cloud operations with consistent tooling.
- aws.amazon.com: Amazon EKS Anywhere β Now Generally Available to Create' and Manage Kubernetes Clusters on Premises [COMMUNITY-TOOL] β The GA announcement for Amazon EKS Anywhere, describing its initial support matrix, licensing structure, and architectural goals. It explores how platform operators can achieve consistent cluster management interfaces across local data centers and public cloud clusters. A landmark shift in AWS's hybrid cloud execution strategy.
- anywhere.eks.amazonaws.com: Compare EKS Anywhere and EKS [DOCUMENTATION] [ENTERPRISE-STABLE] β The official comparison page mapping the functional differences, feature matrices, and pricing structures of standard EKS versus EKS Anywhere. It clearly details how control plane hosting, support SLA boundaries, and operating systems differ across deployment models. A vital document for hybrid architecture planning.
- aws.amazon.com: Getting started with Amazon EKS Anywhere [COMMUNITY-TOOL] β An introductory walkthrough from the AWS Container team illustrating the step-by-step setup of EKS Anywhere clusters on VMware vSphere. It covers the preparation of local hardware resources, networking topologies, and the use of the
eksctl anywhereCLI command. Highly practical starting guide for hybrid trials. - aws/eks-distro β 1457 [ADVANCED LEVEL] [ENTERPRISE-STABLE] β Amazon EKS Distro provides the exact open-source Kubernetes components, patches, and dependencies validated by Amazon Web Services for its own managed EKS clusters. Live Grounding verifies its role in letting teams run identical, secure, and long-term-supported Kubernetes distributions locally or on non-AWS nodes. Facilitates absolute platform consistency across physical and cloud clusters.
Infrastructure as Code¶
- (2024) aws-quickstart/cdk-eks-blueprints: Amazon EKS Blueprints for CDK β 511 [ADVANCED LEVEL] ππππ [ENTERPRISE-STABLE] β An AWS Cloud Development Kit (CDK) based framework that simplifies bootstrapping and configuring production-ready EKS clusters. Synthesizing developer insight with live deployment footprints, it provides programmatic control over EKS configurations, core add-ons, and IAM integrations. It is ideal for teams seeking TypeScript/Python program-based IaC over static YAML or HCL configurations.
- github.com/aws-ia/terraform-aws-eks-blueprints (examples) πππ β 3021 [ADVANCED LEVEL] [DE FACTO STANDARD] [ENTERPRISE-STABLE] β A highly opinionated, production-ready collection of Terraform modules designed to accelerate Amazon EKS cluster deployments. Live Grounding highlights its architecture for bootstrapping clusters with essential add-ons like Karpenter, AWS Load Balancer Controller, and Prometheus. It represents the industry standard for declarative EKS infrastructure provisioning.
Lifecycle Management¶
- (2026) docs.aws.amazon.com: Managing Amazon EKS add-ons [DOCUMENTATION] ππππ [ENTERPRISE-STABLE] β Official AWS documentation explaining the management of curated, enterprise-grade EKS cluster add-ons (such as VPC CNI, CoreDNS, and kube-proxy). It outlines the lifecycle workflow, covering configuration options, upgrade patterns, and IAM role integration via EKS Pod Identity. A mandatory guide for cluster operators.
- aws.amazon.com: Amazon EKS announces native support for autoscaling CoreDNS' Pods [DOCUMENTATION] [ENTERPRISE-STABLE] β Details the introduction of native autoscaling capabilities for the CoreDNS cluster DNS service inside Amazon EKS. Rather than relying on custom Horizontal Pod Autoscalers or manual tuning, EKS automatically adjusts replica configurations to protect cluster name resolution from traffic spikes. Ensures out-of-the-box system availability.
- Updating a managed node group [DOCUMENTATION] [ENTERPRISE-STABLE] β The official AWS guide to executing rolling upgrades on EKS Managed Node Groups with zero-downtime guarantees. It details node drain strategies, maximum unavailable parameters, and pod disruption considerations. This documentation serves as the base reference blueprint for cluster updates.
- aws.amazon.com: Planning Kubernetes Upgrades with Amazon EKS [ADVANCED LEVEL] [ENTERPRISE-STABLE] β Presents a strategic operational playbook for planning and executing Kubernetes version upgrades on Amazon EKS clusters. It addresses schema deprecations, API version migration strategies, and testing methodologies inside staging structures. An essential read for ensuring update continuity.
- repost.aws: How do I plan an upgrade strategy for an Amazon EKS cluster? [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A structured AWS Knowledge Center guide explaining step-by-step procedures to minimize disruption during EKS control plane and data plane upgrades. It covers validation checks, API compatibility, and dependency tracking (e.g., matching the ECR credential helper and VPC CNI versions). Highly practical troubleshooting-oriented runbook.
Migration¶
- github.com/awslabs: Kubernetes Migration Factory User Guide π β 131 [ADVANCED LEVEL] [LEGACY] β The AWS Kubernetes Migration Factory provides an automated, programmatic framework for migrating legacy VM-based or on-premises workloads into Amazon EKS. Curator Insight notes its structured pipelines that reduce migration errors, while Live Grounding confirms its utility in enterprise-scale rehosting plans. Key features include source-to-target automation, pre-migration validation, and automated target cluster provisioning.
Networking¶
- dev.to: One technique to save your AWS EKS IP addresses 10x [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A highly practical guide detailing how to mitigate IPv4 exhaustion in EKS clusters by leveraging custom networking features within the AWS VPC CNI. Curator Insight highlights the use of secondary non-routable CIDRs (such as CGNAT blocks) for pod allocation. This enables efficient IP utilization without requiring major network topology redesigns.
- aws.github.io/aws-eks-best-practices: Amazon EKS Best Practices Guides' πππ [ADVANCED LEVEL] [DOCUMENTATION] [DE FACTO STANDARD] β A specialized subset of the Amazon EKS Best Practices, focusing strictly on high-performance networking architectures. It covers crucial configurations like AWS VPC CNI optimization, security groups for Pods, custom networking, and prefix delegation. Indispensable for designing reliable and secure network planes within AWS.
- github.com/kubernetes-sigs/aws-load-balancer-controller β 4292 [ADVANCED LEVEL] [DE FACTO STANDARD] [ENTERPRISE-STABLE] β The core controller that manages AWS Elastic Load Balancers (ALB and NLB) on behalf of a Kubernetes cluster. Live Grounding verifies its continuous support for advanced features like target grouping by IP, ACM certificate integration, and shared ALBs. It acts as the primary ingress controller for modern AWS EKS network architectures.
- docs.aws.amazon.com: Access container applications privately on Amazon EKS' using AWS PrivateLink and a Network Load Balancer [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A prescriptive AWS design pattern illustrating private, cross-VPC service consumption using AWS PrivateLink and a Network Load Balancer (NLB). It highlights secure service exposition strategies without exposing internal IP routing configurations to external peers. Critical for multi-account and enterprise compliance structures.
- aws.amazon.com: Addressing IPv4 address exhaustion in Amazon EKS clusters' using private NAT gateways [ADVANCED LEVEL] [COMMUNITY-TOOL] β Provides a specialized architectural blueprint for handling IP exhaustion in enterprise environments using Private NAT Gateways. It explains how to scale Pod topologies using non-routable CIDRs while seamlessly maintaining egress translation to enterprise networks. Solves complex IP coordination issues in hybrid cloud setups.
Observability and Alerting¶
- aws.amazon.com: Streaming Kubernetes Events in Slack [COMMUNITY-TOOL] β This technical post outlines the architecture for exporting real-time Kubernetes cluster events to Slack channels using Amazon EventBridge and AWS Lambda. It demonstrates decoupling event streams from cluster internals to prevent slack spamming while maintaining critical alerting. The blueprint integrates with standard AWS observability mechanisms.
- aws.amazon.com: Troubleshooting Amazon EKS API servers with Prometheus' and Grafana [ADVANCED LEVEL] [COMMUNITY-TOOL] β An in-depth guide to monitoring and debugging the managed Amazon EKS API server's performance. It details metric exposition patterns for latency, request depth, and response codes using standard Prometheus operators. It empowers platform teams to localize control plane issues and establish defensive alerts.
- awslabs/eks-node-viewer β 1633 [ENTERPRISE-STABLE] β A CLI tool that visualizes the current cost, resource usage, and allocation of nodes within an EKS cluster. Highly valued by teams using dynamic scaling engines like Karpenter, it aggregates financial metrics to show real-time workload-to-infrastructure pricing efficiency. It is an invaluable operational diagnostic utility.
Performance¶
- aws.amazon.com: Start Pods faster by prefetching images [COMMUNITY-TOOL] β Analyzes latency bottlenecks during container initialization caused by large image pull steps. The post outlines the architecture of 'image prefetching' patterns, leveraging daemonsets or custom Karpenter startup scripts to warm up worker nodes with target container layers before runtime allocation. Critical for latency-sensitive applications.
Security¶
- cast.ai: EKS Security Checklist: 10 Best Practices for a Secure Cluster [GUIDE] [COMMUNITY-TOOL] [GUIDE] β An actionable security checklist compiled by Cast.ai, detailing major cluster isolation and hardening vectors for EKS. Highlights include IAM role configurations, network policy enforcement, control plane logging, and image scanning pipelines. Ideal for rapid architectural audits before promotion to production.
- aws-samples/hardeneks β 957 [ENTERPRISE-STABLE] β A command-line interface tool designed to run programmatic audits against an EKS cluster to identify violations of EKS Best Practices. It reviews cluster networking, IAM, configuration control, and pod policies. The output acts as an actionable hardening roadmap for platform operators.
- itnext.io: Top 10 Ways to Protect EKS Workloads from Ransomware [GUIDE] [COMMUNITY-TOOL] [GUIDE] β An industry-focused checklist detailing tactical maneuvers to secure EKS against ransomware and supply chain attacks. It covers immutable storage configurations (EFS/EBS backups), strict RBAC permissions, runtime threat detection, and cluster isolation strategies. A helpful handbook for security engineering teams.
- Amazon EKS introduces EKS Pod Identity [ADVANCED LEVEL] [DOCUMENTATION] [ENTERPRISE-STABLE] β Highlights the launch of EKS Pod Identity, an evolved architectural alternative to IRSA (IAM Roles for Service Accounts). By utilizing a highly optimized local agent daemon on worker nodes, it simplifies IAM association, scales beyond IRSA session limits, and works across multiple clusters with ease. A fundamental improvement to EKS access management.
- itnext.io: AWS Elastic Kubernetes Service: RBAC Authorization via AWS IAM' and RBAC Groups [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Details the inner workings of mapping AWS IAM identity vectors to Kubernetes internal Role-Based Access Control (RBAC) groups via
aws-authConfigMaps or newer EKS Access Entries. It presents strategies for configuring secure, least-privilege team cluster access. Vital for security compliance in multi-tenant environments.
Service Mesh¶
- aws.amazon.com: Addressing latency and data transfer costs on EKS using' Istio [ADVANCED LEVEL] [COMMUNITY-TOOL] β Addresses the significant cloud cost vector of cross-AZ data transfer within EKS multi-AZ setups. This article describes how to configure Istio Service Mesh to enforce zone-aware routing policies, keeping internal network traffic localized inside the same Availability Zone. It provides a real-world optimization strategy for high-throughput microservices.
- solo.io: Connect Your Services Seamlessly with Amazon EKS Anywhere and Istio [ADVANCED LEVEL] [COMMUNITY-TOOL] β Analyzes the integration of enterprise-grade Solo.io Istio setups with Amazon EKS Anywhere. It details the setup of cross-environment service mesh networks that span from local physical data centers into public AWS EKS. This enables unified security policies, service discovery, and traffic shaping in hybrid clouds.
Storage (1)¶
- aws.amazon.com: Persistent storage for Kubernetes [COMMUNITY-TOOL] β An architectural breakdown of storage options for Kubernetes on AWS, evaluating AWS EBS, Amazon EFS, and Amazon FSx. It defines best practices for stateful workloads by mapping technical requirements to the appropriate storage driver. This serves as a foundational reference for stateful app topologies.
- aws.amazon.com: Machine Learning with Kubeflow on Amazon EKS with Amazon' EFS [ADVANCED LEVEL] [COMMUNITY-TOOL] β A specialized guide showing how to build highly scalable Machine Learning pipelines using Kubeflow and Amazon EFS for shared model storage. It outlines multi-node parallel processing layouts with distributed storage configurations. The blueprint is crucial for ML platform engineers building workflows on EKS.
- dev.to: Autoprovisioning NFS volumes in EKS with CDK [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A developer-oriented tutorial for programmatically provisioning NFS volume drivers within EKS using the AWS CDK. It demonstrates dynamic persistent volume claim (PVC) binding backed by managed EFS resources. This simplifies dynamic storage allocations for development and test environments.
- Simplifying Amazon EBS volume migration and modification on Kubernetes using the EBS CSI Driver [GERMAN CONTENT] [COMMUNITY-TOOL] β Explains how the out-of-tree Amazon EBS CSI Driver handles live volume modification, type conversion (e.g., gp2 to gp3), and resizing without downtime. Curator Insight highlights the declarative nature of these infrastructure changes within native PVC specs. A key resource for maintaining persistent databases under evolving workloads.
Alternative Clouds¶
DigitalOcean¶
- (2022) digitalocean.com: Kubernetes for startups: Why, when, and how to adopt [EN CONTENT] [COMMUNITY-TOOL] β Business and engineering decision guide analyzing when, why, and how early-stage startups should adopt Kubernetes. Focuses on balancing administrative overhead with microservice scalability requirements.
- (2021) digitalocean.com: Automating GitOps and Continuous Delivery With DigitalOcean Kubernetes (Terraform, Helm and Flux) [EN CONTENT] [COMMUNITY-TOOL] β An interactive tutorial showcasing GitOps pipeline construction using Terraform, Helm, and Flux CD on DigitalOcean. Outlines structural separation between application repositories and declarative cluster states.
- docs.digitalocean.com: Kubernetes on DigitalOcean [EN CONTENT] [DOCUMENTATION] [COMMUNITY-TOOL] β Official DigitalOcean Kubernetes (DOKS) product documentation. Details storage configurations, container networking setups, and cluster architecture requirements for developer-focused managed Kubernetes instances.
Linode¶
- dev.to: Practical Introduction to Kubernetes Autoscaling Tools with Linode' Kubernetes Engine π [EN CONTENT] [COMMUNITY-TOOL] β Technical guide to deploying and tuning Cluster Autoscaler and Horizontal Pod Autoscaler (HPA) on the Linode Kubernetes Engine (LKE). Features step-by-step metric-server configurations and traffic simulations.
Oracle Cloud¶
- arnoldgalovics.com: GitHub Actions CI/CD For Oracle Cloud Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Outlines building a streamlined CI/CD pipeline using GitHub Actions targeting Oracle Container Engine for Kubernetes (OKE). Details secure credential rotation and container registry synchronization.
Azure¶
AKS Updates¶
- (2022) techcommunity.microsoft.com: Azure Kubernetes Service Microsoft Ignite announcements ππ [COMMUNITY-TOOL] β Roadmap summary of AKS features announced at Microsoft Ignite 2022. Covers the launch of Fleet Manager, automated node lifecycle tools, and native cost profiling integrations. Useful for understanding product evolution timelines.
- Azure Updates AKS π [DOCUMENTATION] [ENTERPRISE-STABLE] β Official update tracking feed detailing Azure Kubernetes Service platform improvements, retired APIs, and native feature promotions. Curator insights mark it as a vital operational pulse for infrastructure engineers, while live grounding confirms its role in tracking Kubernetes version deprecations and control plane releases.
Cost Optimization¶
- (2025) docs.microsoft.com: Start and stop an Azure Kubernetes Service (AKS) node pool π [DOCUMENTATION] ππππ [ENTERPRISE-STABLE] β Official Microsoft guide on using native commands to safely pause and start individual node pools on AKS. This operational strategy helps organizations significantly reduce compute costs on non-production clusters without losing configuration states.
- zartis.com: How To Save A Fortune On Azure Kubernetes Service [COMMUNITY-TOOL] β FinOps-centric strategy guide detailing actionable ways to minimize AKS resource costs. Covers Spot VM integrations, autoscaler tuning, node pool start/stop patterns, and fine-tuning pod CPU and memory requests to prevent cluster over-provisioning.
- returngis.net: Desescalar nodos de AKS apagando las mΓ‘quinas en lugar de' eliminarlas [SPANISH CONTENT] [COMMUNITY-TOOL] β Technical guide explaining how to scale down AKS node pools by deallocating (stopping) virtual machines rather than deleting them, preserving their state for faster scale-ups. [SPANISH CONTENT]
Ecosystem Strategy¶
- thenewstack.io: Microsoftβs Practical Approach to Kubernetes Management [COMMUNITY-TOOL] β Press analysis detailing Microsoft's strategic efforts to reduce Kubernetes operational complexities. Explains their engineering initiatives on hybrid management through Azure Arc, ecosystem tool integrations, and the simplification of AKS management planes.
Enterprise Strategy¶
- buchatech.com/2022: A Guide to Navigating the AKS Enterprise Documentation' & Scripts ππ [COMMUNITY-TOOL] β Detailed exploration of Microsoft's AKS Enterprise Landing Zone architectures and provisioning templates. Provides a structured overview of design patterns for security, hub-spoke peering, and compliance boundaries in regulated enterprise clouds.
Hybrid Cloud (1)¶
- (2022) techcommunity.microsoft.com: Azure Kubernetes Service and Azure Container Registry Service on Azure Stack Hub [ADVANCED LEVEL] ππ [COMMUNITY-TOOL] β Reference detailing how to run AKS and ACR on-premises using Azure Stack Hub environments. Outlines patterns for entirely disconnected or strictly regulated on-premises data centers that require standardized Azure cloud workflows.
Infrastructure as Code (1)¶
- build5nines.com: Terraform: Create an AKS Cluster π [COMMUNITY-TOOL] β Step-by-step walkthrough explaining the provisioning of fully functioning AKS clusters using Terraform HCL. Provides modular templates containing standard configurations for nodes, subnets, and identity profiles. Excellent for starting GitOps infrastructure-as-code patterns.
- docs.cloudblue.com: Deploying an AKS Cluster with Custom IP Ranges (ARM' template) [ADVANCED LEVEL] [DOCUMENTATION] [LEGACY] β Technical reference for deploying AKS clusters with specific, custom IP ranges via ARM Templates. While modern architectures have transitioned to Bicep or Terraform, this offers structural networking reference configurations for legacy templates.
Microservices Architecture¶
- kyverno.io: Check deprecated APIs π (AKS) π [ADVANCED LEVEL] [DOCUMENTATION] [DE FACTO STANDARD] β Official microservices architectural blueprint detailing application deployments in AKS clusters. Focuses on networking bounds, CI/CD pipeline structures, and enterprise data security. Crucial pattern reference for cloud migration pipelines.
- optisolbusiness.com: Implementing Microservices Architecture in AKS [COMMUNITY-TOOL] β Introduction to shifting monolithic platforms to decentralized microservices topologies within AKS. Highlights container separation, database patterns, API gateway configurations, and key cluster operations. Best for project planners and system designers.
Migration (1)¶
- (2021) techcommunity.microsoft.com: Containerize and migrate applications to AKS with the Azure Migrateβs new App Containerization tool πππ [LEGACY] β Guide highlighting the application containerization utility within Azure Migrate for porting legacy ASP.NET or Java apps to AKS. Details packaging automation, basic ingress mapping, and network routing configurations. Ideal for legacy application transformations.
- dev.to: Moving Azure Functions from AKS to Container Apps [COMMUNITY-TOOL] β Comparative guide detailing migration of Azure Functions from AKS environments to serverless Azure Container Apps (ACA). Focuses on scaling efficiencies using KEDA, reduction of cluster management overhead, and resource cost structures.
Networking (1)¶
- (2025) docs.microsoft.com: Configure Azure CNI networking in Azure Kubernetes Service (AKS) [ADVANCED LEVEL] [DOCUMENTATION] ππππ [ENTERPRISE-STABLE] β Microsoft operational guide detailing Azure CNI configurations to assign direct VNET IPs to Kubernetes pods. Live grounding highlights dynamic IP allocation patterns designed to reduce subnet pressure. Highly recommended for complex, high-throughput hybrid enterprise networks.
- (2024) docs.microsoft.com: Use kubenet networking with your own IP address ranges in Azure Kubernetes Service (AKS) π [DOCUMENTATION] πππ [COMMUNITY-TOOL] β Practical documentation for configuring kubenet networking inside AKS to optimize and preserve corporate IP structures. Outlines NAT translation mechanisms and route-table modifications managed by Azure. Crucial for infrastructure environments with strict subnet limits.
- (2023) docs.microsoft.com: Create an HTTPS ingress controller on Azure Kubernetes Service (AKS) [DOCUMENTATION] πππ [LEGACY] β Archived Microsoft documentation for configuring Nginx Ingress Controllers with cert-manager on AKS. Serves as a reliable baseline reference for legacy self-managed TLS ingress setups. Modern teams should look toward AGIC or Application Gateway for Containers.
- itnext.io: Kubernetes Ingress on Azure using the Application Gateway [COMMUNITY-TOOL] β Integration guide detailing ingress design utilizing Azure Application Gateway (AGIC) with AKS. Explains L7 load balancing, SSL termination, and Azure Web Application Firewall (WAF) integration. Provides highly secure, managed access profiles.
- itnext.io: Network Isolated AKS β Part 1: Controlling network traffic [ADVANCED LEVEL] [ENTERPRISE-STABLE] β Detailed technical framework for deploying completely network-isolated AKS clusters. Walks through outbound proxy paths, Azure Firewall integrations, and user-defined routing (UDR) controls. Essential reading for secure bank or government cloud architectures.
- thenewstack.io: Turbocharging AKS Networking with Calico eBPF [ADVANCED LEVEL] [COMMUNITY-TOOL] β Architectural breakdown of the performance gains of running Calico eBPF on AKS. Explains how the eBPF data plane bypasses traditional IPTables limits, offering superior routing performance, direct packet insights, and reduced resource footprint.
- tigera.io: Turbocharging AKS networking with Calico eBPF [ADVANCED LEVEL] [COMMUNITY-TOOL] β Tigera-published analysis showing direct performance gains when enabling Calico eBPF networking inside AKS. Contrasts packet processing throughput and latency metrics against standard IPTable configurations. Recommended for low-latency microservice architectures.
- tigera.io: Calico WireGuard support with Azure CNI [ADVANCED LEVEL] [COMMUNITY-TOOL] β Technical guide showing how to set up pod-to-pod network encryption in AKS using Calico WireGuard with Azure CNI. Discusses how to configure high-performance, secure cross-pod tunnels without the complex setups of traditional IPSec mesh architectures.
- dev.to/javiermarasco: HTTPs with Ingress controller, cert-manager and DuckDNS' (in AKS/Kubernetes) [COMMUNITY-TOOL] β Step-by-step lab tutorial for establishing dynamic public DNS routing on AKS using DuckDNS, Nginx Ingress, and automatic TLS configurations with cert-manager. Ideal for setting up proof-of-concept projects or budget sandbox clusters.
- pixelrobots.co.uk: Bring your own Container Network Interface (CNI) plugin' with Azure Kubernetes Service (AKS) (PREVIEW) [ADVANCED LEVEL] [COMMUNITY-TOOL] β Overview of 'Bring Your Own CNI' patterns in AKS. Explains how platform teams can provision bare control planes to manually deploy customized network providers (such as tailored Cilium or Calico engines) to meet unique compliance requirements.
- isovalent.com: Announcing Azure CNI Powered by Cilium [ADVANCED LEVEL] [DE FACTO STANDARD] β Release announcement detailing Azure CNI Powered by Cilium. Explains how this integration pairs Microsoft's virtual network control plane with Cilium's high-performance eBPF data plane for advanced routing and security policies. Marks a significant milestone in native AKS networking.
- blog.coffeeapplied.com: Securing AKS in peered virtual networks using only' network security groups (NSGs) [ADVANCED LEVEL] [COMMUNITY-TOOL] β Technical guide detailing how to secure peered virtual networks in AKS using only Network Security Groups (NSGs). Explains how to block unauthorized lateral traffic in hub-and-spoke models without the cost overhead of deploying Azure Firewall.
Observability¶
- dev.to/thenjdevopsguy: Monitoring AKS With Prometheus and Grafana π [COMMUNITY-TOOL] β Hands-on guide to implementing monitoring on AKS using Prometheus and Grafana. Explains how to deploy scraping targets, configure local metric storage, and design dashboards independent of Azure Monitor. Perfect for teams wanting a unified multi-cloud observability stack.
Performance (1)¶
- itnext.io: AKS Performance: Limit Ranges [COMMUNITY-TOOL] β Technical article examining the configuration of LimitRanges in AKS namespaces. Demonstrates how setting default container resource requests prevents multi-tenant environments from experiencing noisy neighbor syndromes or complete node exhaustion.
Reference Architecture¶
- (2025) docs.microsoft.com: Baseline architecture for an Azure Kubernetes Service (AKS) cluster π [ADVANCED LEVEL] [DOCUMENTATION] πππππ [DE FACTO STANDARD] β The baseline production-grade architecture blueprint designed for AKS clusters following the Well-Architected Framework. Live grounding verifies its emphasis on secure private networks, ingress traffic patterns, and identity integration. Serves as the authoritative starting point for corporate infrastructure setups.
Scheduling¶
- trstringer.com: Run Kubernetes Pods on Specific VM Types in AKS [COMMUNITY-TOOL] β A practical guide to scheduling Kubernetes pods onto specific Azure VM types within AKS. Explains how to leverage node selectors, taints, and tolerations to isolate workloads. Perfect for separating compute-intensive microservices from general services.
Security (1)¶
- (2025) docs.microsoft.com: AKS-managed Azure Active Directory integration [ADVANCED LEVEL] [DOCUMENTATION] ππππ [ENTERPRISE-STABLE] β Documentation covering native Microsoft Entra ID integration within AKS control planes. Enables infrastructure architects to map cluster RBAC profiles directly to corporate identity databases. Simplifies credential workflows and security audits by eliminating static admin certs.
- (2025) docs.microsoft.com: Best practices for cluster isolation in Azure Kubernetes Service (AKS) [DOCUMENTATION] ππππ [ENTERPRISE-STABLE] β Official best practices outlining physical and logical isolation boundaries inside AKS clusters. Details namespaces limits, network policy rules, and multi-tenant isolation topologies. Vital for running shared enterprise-grade platforms safely.
- (2022) blog.baeke.info: AKS Workload Identity Revisited [ADVANCED LEVEL] πππ [COMMUNITY-TOOL] β Deep-dive analysis of Azure AD Workload Identity's internal mechanics on AKS. Covers OIDC issuer configurations, federated identity setups, and the mutation processes used to inject tokens. Essential reading for platform architects implementing zero-trust identity policies.
- github.com: AKS: Use AAD identity for pods and make your SecOps happy β 6 [ADVANCED LEVEL] [COMMUNITY-TOOL] β Historical journal covering the implementation of Azure AD Pod Identity to secure pod communication with Azure resources. Note that while this highlights core security concepts, live grounding demonstrates this pattern has been succeeded by Entra Workload Identity.
- itnext.io: Running Your Microservices Securely on AKS [ADVANCED LEVEL] [COMMUNITY-TOOL] β In-depth guide addressing microservices application boundaries inside AKS. Focuses on secure pod credentials, identity translation, and network segmentation to limit horizontal attack paths. Vital resource for cluster security design.
- dev.to: Implement Azure AD Workload Identity on AKS with terraform [ADVANCED LEVEL] [ENTERPRISE-STABLE] [LEGACY] β Highly structured guide demonstrating how to use Terraform to set up Azure AD Workload Identity on AKS. Explains how to establish federation credentials between Kubernetes service accounts and Azure AD, successfully replacing deprecated pod identity methods.
- dev.to: Access Secrets in AKV using Managed identities for AKS π [COMMUNITY-TOOL] β Detailed guide on using Managed Identities to access secrets in Azure Key Vault (AKV) from AKS. Explains how to configure the Secrets Store CSI Driver to securely mount sensitive parameters directly into container workloads without exposing secret strings in files.
Storage (2)¶
- adamrushuk.github.io: Increasing the volumeClaimTemplates Disk Size in a' Statefulset on AKS [ADVANCED LEVEL] [COMMUNITY-TOOL] β Practical resource detailing how to execute volume resizing inside AKS StatefulSet templates without recreating resources. Outlines safe manual modifications and PVC override protocols. Solves common stateful storage expansion challenges.
- carlos.mendible.com: AKS: Persistent Volume Claim with an Azure File Storage' protected with a Private Endpoint [ADVANCED LEVEL] [COMMUNITY-TOOL] β Hands-on guide to configuring Persistent Volume Claims in AKS bound to Azure Files storage via secure Private Endpoints. Eliminates public access vectors to storage targets, enhancing data sovereignty and meeting enterprise compliance baselines.
Troubleshooting¶
- blog.nillsf.com: Customize core dump in Azure Kubernetes [ADVANCED LEVEL] [COMMUNITY-TOOL] β Advanced debugging guide outlining how to customize Linux core dump generations on AKS nodes. Explains node configurations, file mounting steps, and safe extraction of system-level crash dumps for deep application analysis.
- community.ops.io: One day I woke up to a crashed AKS cluster and this is' what I did to get it back to life [COMMUNITY-TOOL] β Incident response case study details diagnostic processes used to recover a crashed AKS cluster. Walks through API troubleshooting, control plane investigations, and network connectivity repairs. Offers valuable real-world context for post-mortem recovery planning.
- learn.microsoft.com: Connect with RDP to Azure Kubernetes Service (AKS)' cluster Windows Server nodes for maintenance or troubleshooting [DOCUMENTATION] [COMMUNITY-TOOL] β Official Microsoft guide to establishing secure RDP connections to Windows Server nodes on AKS for troubleshooting and maintenance. Explains the step-by-step setup of helper pods, bastions, and security groups to allow safe low-level OS debugging.
Windows Containers¶
- nillsf.com: Running Windows containers on the Azure Kubernetes Service (AKS) [LEGACY] β Technical walkthrough covering the instantiation of Windows Server node pools inside hybrid AKS environments. Addresses container setup, active directory domain joins, and specific scheduling parameters. Useful for legacy enterprise .NET migrations.
- dev.to: Getting started with Windows Containers on Azure Kubernetes Service [LEGACY] β Getting started guide for provisioning Windows Container pools on AKS. Details networking adjustments, pod YAML configurations, and active directory credential settings. Valuable for platform engineering teams managing legacy enterprise applications.
Azure AKS¶
Best Practices¶
- the-aks-checklist.com: The Azure Kubernetes Service Checklist πππ [ENTERPRISE-STABLE] β A highly interactive, community-driven checklist platform designed to validate AKS architectures against recommended practices for security, scalability, operation, and costs. Synthesizing experience from elite Microsoft field engineers, it organizes action items into an intuitive, status-tracked dashboard. Ideal for enterprise pre-production audits.
Infrastructure as Code (2)¶
- azure.github.io/AKS-Construction π [GUIDE] [ENTERPRISE-STABLE] [GUIDE] β An interactive, wizard-based configuration tool hosted by the Microsoft Azure team to dynamically generate ARM/Bicep or Terraform files for building production-ready AKS environments. Live Grounding emphasizes its value in generating compliant topologies following Azure Landing Zone best practices. It minimizes error rates during initial bootstrap phases.
Learning Path¶
- learn.microsoft.com: Introduction to Kubernetes on Azure [DOCUMENTATION] [COMMUNITY-TOOL] β The primary architectural learning path offered by Microsoft, designed to introduce platform teams to Azure Kubernetes Service (AKS). It details container register integration, core networking layouts, and standard Microsoft Entra ID authentication configurations. An excellent baseline instructional curriculum.
Google Cloud Platform¶
Config Management¶
- seroter.com: Using the new Google Cloud Config Controller to provision and' manage cloud services via the Kubernetes Resource Model [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Detailed technical walk-through of the Google Cloud Config Controller. Demonstrates how to leverage the Kubernetes Resource Model (KRM) to declare and manage external GCP resources natively.
Cost Optimization (1)¶
- cloud.google.com: Announcing Spot Pods for GKE Autopilotβsave on fault tolerant' workloads [EN CONTENT] [COMMUNITY-TOOL] β Announces and reviews the engineering mechanics of GKE Autopilot Spot Pods. Analyzes how fault-tolerant workloads can utilize dynamic pricing discounts of up to 60-80% using automated Spot scheduling policies.
- cloud.google.com: Know more, spend less: how GKE cost optimization insights' help you optimize Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Discusses the general availability of GKE's built-in cost optimization dashboards, highlighting over-provisioned namespaces and idle node resources. Analyzes methods to proactively scale down allocations using automated telemetry suggestions.
Data Protection¶
- cloud.google.com: Announcing Backup for GKE: the easiest way to protect' GKE workloads [EN CONTENT] [COMMUNITY-TOOL] β Deep dive into the native GKE Backup service, designed for stateful application restoration and cluster disaster recovery. Discusses how to safeguard configuration metadata and persistent volume states natively through GCP APIs.
GKE Networking¶
- (2024) Kubernetes Cloud DNS [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β A technical implementation guide for GKE Cloud DNS integration using VPC-scope DNS architectures. Explains how to achieve seamless name resolution across hybrid clusters without running local DNS caching daemons.
Governance¶
- google/gke-policy-automation β 526 [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Google-backed tool designed to automate policy checks on GKE configurations against best practices using OPA Gatekeeper. Relies on structured GKE cluster dumps to evaluate configuration posture and vulnerability profiles.
Multi-Cluster Architectures¶
- cloud.google.com: How to do multi-cluster Kubernetes in the real worldβone' GKE shopβs approach [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Enterprise architectural case study outlining a multi-cluster GKE topology. Explores ingress-routing, fleet management policies, and configuration synchronization patterns for global-scale telemetry networks.
Observability (1)¶
- cloud.google.com: Introducing Kubernetes control plane metrics in GKE [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Details GKE control plane performance exposure via Prometheus-compatible endpoints. Covers observability practices for API server request latency, etcd queue duration, and controller manager performance.
Performance Optimization¶
- (2021) acloudguru.com: GKE ludicrous speed! GKE Image Streaming speeds up container starts [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Evaluates GKE Image Streaming capabilities, which substantially reduce pod startup latencies by decoupling container initiation from full image caching. Underpins how streaming chunked assets reduces average start times from minutes to seconds.
IBM Cloud¶
Managed Kubernetes (1)¶
- (2025) IKS [EN CONTENT] [DOCUMENTATION] [COMMUNITY-TOOL] β Official enterprise landing portal for IBM Cloud Kubernetes Service (IKS). Highlights its native integration with OpenShift, hardware isolation options, and compliance frameworks for secure corporate deployments.
Microsoft Azure¶
AKS Community¶
- youtube: The AKS Community π [EN CONTENT] [COMMUNITY-TOOL] β Dedicated video portal hosted by the Microsoft Azure Kubernetes Service (AKS) product team. Details monthly community calls, deep technical system updates, and architecture roundtables.
Cluster Management¶
Ecosystem Platforms¶
Enterprise Managed¶
- Giant Swarm [EN CONTENT] [DOCUMENTATION] [COMMUNITY-TOOL] β Portal for Giant Swarm's fully managed enterprise Kubernetes management service. Emphasizes modern platform engineering workflows, governance tooling, and continuous operations support.
- giantswarm.io: [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Explores Cluster API (CAPI) mechanics and nested control plane virtualization. Reviews how to conceptualize management vs. workload cluster nodes under Cluster-as-a-Resource topologies.
KubeSphere¶
- (2021) youtube: Create a Jenkins Pipeline on Kubernetes with CI/CD Pipeline Template in KubeSphere [EN CONTENT] [COMMUNITY-TOOL] β Screencast guide focusing on the CI/CD DevOps engine built directly within KubeSphere. Demonstrates writing dynamic pipelines using visual and declarative Jenkinsfiles.
- kubesphere.io [EN CONTENT] [DOCUMENTATION] [COMMUNITY-TOOL] β Official site for KubeSphere, a pluggable multi-cluster management platform. It encapsulates underlying vanilla Kubernetes clusters with multi-tenant, GitOps, observability, and service mesh management layers.
- itnext.io: KubeSphere: A New Pluggable Kubernetes Application Management' Platform [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Outlines the modular and pluggable microservice-oriented design of KubeSphere's platform control plane. Focuses on system independence and API federation.
Multi-Cloud Evaluation¶
- (2022) Compare tools for multi-cloud Kubernetes management π [EN CONTENT] [COMMUNITY-TOOL] β Multi-faceted comparison of major ecosystem orchestrators (OpenShift, Rancher, Cloudify, Platform9) evaluating cost, modularity, workload migration ease, and administrative complexity.
Installation Tools¶
KubeKey¶
- kubekey β 2815 [EN CONTENT] [ADVANCED LEVEL] [ENTERPRISE-STABLE] β A Go-based command-line utility used to rapidly provision Kubernetes, KubeSphere, and underlying container runtimes (containerd/Docker) on bare metal or cloud instances. Extremely efficient for offline air-gapped installations.
- kubesphere.io: Install Kubernetes 1.22 and containerd the Easy Way with' kubekey [EN CONTENT] [COMMUNITY-TOOL] β Demonstrates setting up Kubernetes v1.22 using container runtime interface (CRI) containerd via KubeKey. Covers migrating away from Dockershim configurations seamlessly.
- kubesphere.io: Scaling a Kubernetes Cluster: One of the Best Practices for' Using KubeKey [EN CONTENT] [COMMUNITY-TOOL] β Step-by-step tutorial highlighting programmatic node scaling in live production environments. Details configuration modifications to hosts.yaml and idempotent runtime adjustments.
- itnext.io: Adding Master Nodes to Achieve HA: One of the Best Practices' for Using KubeKey [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Technical execution guide explaining how to scale a single control plane setup into a multi-master High Availability (HA) cluster. Integrates load balancers and key-value store configurations.
Databases¶
SQL Server¶
Storage (3)¶
- (2023) techcommunity.microsoft.com: SQL Server containers on Kubernetes with S3-compatible object storage - Getting started [ADVANCED LEVEL] πππ [COMMUNITY-TOOL] β Guide exploring SQL Server container setups on Kubernetes using S3-compatible object storage backends for backup and recovery patterns. Discusses storage performance metrics and high-availability options in containerized database configurations.
Public Cloud Platforms¶
AWS (1)¶
Chaos Engineering¶
- Chaos engineering on Amazon EKS using AWS Fault Injection Simulator [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β Guided workflow utilizing AWS FIS (Fault Injection Simulator) to execute controlled resilience and disruption experiments against EKS node groups and containers. Demonstrates monitoring system reaction metrics and reinforcing application failover.
- thenewstack.io: Deploy Gremlin to Amazon EKS Using AWS CloudFormation [COMMUNITY-TOOL] [GUIDE] β This article demonstrates setting up Gremlin on EKS clusters using AWS CloudFormation templates to bootstrap chaos daemonsets. Discusses using disruption tests to validate real-time alerts and state tracking reliability.
Cluster Provisioning¶
- POKE - Provision Opinionated Kubernetes on EKS [COMMUNITY-TOOL] [LEGACY] β An opinionated tool written to simplify and streamline provisioning of EKS clusters with specific default addons. Due to inactivity for more than four years, it is classified as legacy; teams should look to Terraform, Pulumi, or EKS Blueprints instead.
Container Orchestration Comparison¶
- cloudonaut.io: Scaling Container Clusters on AWS: ECS and EKS [COMMUNITY-TOOL] [GUIDE] β Deep comparison analyzing container scaling mechanics and metrics between ECS (using ASGs) and EKS (using Cluster Autoscaler). The analysis explains scale-up and scale-down behaviors, node provisioning latencies, and resource utilization optimizations.
- cast.ai: AWS EKS vs. ECS vs. Fargate: Where to manage your Kubernetes? [COMMUNITY-TOOL] [GUIDE] β Comparative evaluation analyzing resource isolation, infrastructure management, and compute overhead between EKS, ECS, and AWS Fargate. Highlights scheduling efficiency, control plane pricing, and cost-of-scale dynamics for enterprise systems.
Continuous Delivery¶
Canary Deployments¶
- Create a pipeline with canary deployments for Amazon EKS with AWS App Mesh π [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β Step-by-step post on architecting GitOps pipelines for automated canary releases using App Mesh service mesh on Amazon EKS. Outlines traffic-shifting mechanisms, virtual node configurations, and automated rolling rollbacks based on dynamic CloudWatch performance metrics.
GitOps (1)¶
- pages.awscloud.com: GitOps on AWS for High Performing Team Operations (eBook) [COMMUNITY-TOOL] [GUIDE] β Collaboratively published eBook introducing declarative GitOps operating models for EKS using Weaveworks technology. Outlines infrastructure-as-code, audit tracks, and drift correction techniques key to reliable pipeline operations.
Preview Environments¶
- thenewstack.io: How We Built Preview Environments on Kubernetes and AWS [ADVANCED LEVEL] [CASE STUDY] [COMMUNITY-TOOL] β This architectural case study reviews building dynamic preview environments on Amazon EKS. Outlines how to automate sandbox teardown, schedule resources effectively, and route unique domain endpoints back to feature-branch deployments.
EKS Compute Options¶
Machine Learning¶
- Amazon EKS Now Supports EC2 Inf1 Instances [ENTERPRISE-STABLE] β AWS announcement outlining AWS Inferentia (Inf1) support on EKS to execute high-performance deep learning inference models. Covers scheduling configurations, AWS Neuron SDK integration, and daemonsets required to expose hardware accelerators inside containers.
EKS Cost Management¶
- Amazon EKS Price Reduction [LEGACY] β Historic AWS announcement introducing the 50% price reduction for EKS cluster management fees. While highly significant for cloud budget projections at the time, it serves as archival context for operational billing structures.
EKS Cost Optimization¶
Spot Management¶
- (2019) Running spot instances effectively with Amazon EKS [ADVANCED LEVEL] πππ [CASE STUDY] [COMMUNITY-TOOL] β A real-world operational overview detailing strategies for running production workloads on cost-effective EC2 Spot Instances inside Amazon EKS. The architectural analysis examines handling interruption signals, cluster autoscaler node-group configuration, and stateless workload segregation.
- aws/aws-node-termination-handler π β 1755 [DE FACTO STANDARD] [ENTERPRISE-STABLE] β High-efficiency agent ensuring EKS pod rescheduling during abrupt EC2 instance maintenance events, Spot interruptions, or ASG rebalance recommendations. Gracefully drains affected nodes, maintaining overall cluster operational reliability.
- itnext.io: Deploy Kubernetes (K8s) on Amazon AWS using mixed on-demand and' spot instances π [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β Deep technical walkthrough of designing highly resilient clusters on AWS mixing On-Demand and Spot Node Groups. Demonstrates taints, tolerations, and affinity configuration policies designed to protect critical workloads from Spot interruptions.
- (2022) cast.ai: 8 best practices to reduce your AWS bill for Kubernetes ππππ [ENTERPRISE-STABLE] [GUIDE] β A deep-dive guide specifying concrete mechanisms to scale down unnecessary cloud spending inside EKS clusters. Provides strategies for dynamic right-sizing, Spot-instance scheduling, automated node-consolidation, and down-scaling non-production environments during idle hours.
EKS Fundamentals¶
Learning Resources¶
- eksworkshop.com π [DE FACTO STANDARD] β The absolute premier, hands-on learning resource for Amazon EKS managed and updated by AWS engineers. It takes practitioners from basic cluster installation to advanced patterns in storage, GitOps, security, multi-tenancy, and monitoring.
- youtube/StackSimplify: Kubernetes Deployments on AWS EKS | Amazon Elastic' Kubernetes Service | Amazon EKS π [COMMUNITY-TOOL] [GUIDE] β Comprehensive video walkthrough illustrating EKS cluster provisioning, node group configurations, and service deployments on AWS. Covers how to map services to ingress systems and run test deployments securely.
- hackerxone.com: 13 Steps Guide to Create Kubernetes Cluster on AWS [GUIDE] [LEGACY] β A beginner-oriented 13-step guide detailing raw cluster creation steps on AWS. Highly manual instructions compared to standard GitOps-driven IaC flows but useful for understanding fundamental cloud dependencies.
- hackerxone.com: Steps to Create Amazon EKS node group on Amazon web Service' (AWS) [GUIDE] [LEGACY] β Step-by-step tutorial addressing the deployment of EKS managed node groups via the AWS Console. Describes configuring instance types, scaling limits, and node execution roles.
- howtoforge.com: How to Create a Kubernetes Cluster with AWS CLI [COMMUNITY-TOOL] [GUIDE] β Manual walkthrough of constructing an EKS cluster directly via AWS CLI commands. It describes basic networking configuration steps and control-plane deployment procedures.
EKS Infrastructure¶
Helm Repositories¶
- github.com/aws/eks-charts π β 1294 [DE FACTO STANDARD] [ENTERPRISE-STABLE] β The official Helm chart repository maintained by Amazon Web Services to bootstrap essential cluster add-ons. Hosts deployment packages for tools such as the App Mesh Controller, AWS Load Balancer Controller, Node Termination Handler, and VPC CNI.
Kubernetes Distributions¶
- linkedin.com: Amazon EKS Distro (EKS-D): The Kubernetes Distribution Used' by Amazon EKS π [COMMUNITY-TOOL] [GUIDE] β An analysis of Amazon EKS Distro (EKS-D), a reliable, upstream-aligned Kubernetes distribution managed and secured by AWS. The post describes its role in facilitating hybrid cloud architectures and maintaining release alignment between on-premise deployments and cloud managed engines.
EKS Multi-Cluster Architecture¶
- Onfidoβs Journey to a Multi-Cluster Amazon EKS Architecture [ADVANCED LEVEL] [CASE STUDY] [ENTERPRISE-STABLE] β Onfido's real-world engineering case study describing their architectural pivot to a highly resilient multi-cluster AWS EKS layout. Demonstrates regional fault tolerance, ingress load splitting, and centralized operations management.
EKS Multi-Cluster Management¶
- Optimizing Your Kubernetes Clusters with Rancher and Amazon EKS π [ENTERPRISE-STABLE] β Details how enterprise teams use Rancher to coordinate, monitor, and enforce access policies on EKS clusters. It emphasizes unified management across multi-cloud networks, centralized security profiles, and unified developer dashboard setups.
EKS Multi-Region Architecture¶
High Availability¶
- aws.amazon.com: Operating a multi-regional stateless application using Amazon' EKS [ADVANCED LEVEL] [ENTERPRISE-STABLE] [GUIDE] β This AWS architectural guide presents a multi-region deployment pattern for stateless services using EKS. It explores cross-region Route 53 routing, continuous delivery strategies across distinct clusters, and handling geographic failover to guarantee enterprise business continuity.
EKS Networking¶
Ingress Control¶
- (2021) stacksimplify.com: AWS ALB Ingress Service - Basics π [GUIDE] ππ [COMMUNITY-TOOL] [GUIDE] β An operational guide walking through the basics of routing external traffic into an AWS EKS cluster using the Application Load Balancer (ALB) Ingress Controller. It explains target groups, listener rules, and routing configurations essential for initial ingress setups.
- AWS Load Balancer Controller π [ADVANCED LEVEL] [DE FACTO STANDARD] β The authoritative Kubernetes controller managing AWS Application (ALB) and Network (NLB) load balancers on behalf of Kubernetes Ingress and Service objects. It enables high-performance target grouping, TLS termination offloading, and AWS WAF integration.
- aws.amazon.com: Kubernetes Ingress with AWS ALB Ingress Controller [LEGACY] β Architectural post exploring the initial implementations of the AWS ALB Ingress Controller. Serves as highly valuable structural history, although this project has since evolved into the modern AWS Load Balancer Controller.
Load Balancing¶
- itnext.io: Using AWS NLB manually targeting an EKS Service exposing UDP' traffic [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β A technical post details routing high-performance, low-latency UDP traffic through an AWS Network Load Balancer (NLB) into EKS pods. It covers manual target group registrations and service mapping before such integrations were fully automated in newer ingress controllers.
Scale Optimization¶
- engineering.salesforce.com: Optimizing EKS networking for scale [ADVANCED LEVEL] [CASE STUDY] [ENTERPRISE-STABLE] β Technical breakdown of Salesforce's journey optimizing AWS VPC CNI on EKS to support massive container scale. Covers strategies to bypass IP address exhaustion, manage warm IP targets, configure custom networking, and optimize node sizing.
EKS Observability¶
APIServer Troubleshooting¶
- aws.amazon.com: Troubleshooting Amazon EKS API servers with Prometheus [ADVANCED LEVEL] [ENTERPRISE-STABLE] [GUIDE] β Operational manual on diagnosing performance bottlenecks inside Amazon EKS Control Plane API servers with Prometheus metrics. Breaks down API request latencies, error codes, and request volumes to improve overall system stability.
Autoscaling (1)¶
- aws.amazon.com: Using Prometheus Adapter to autoscale applications running' on Amazon EKS [ADVANCED LEVEL] [ENTERPRISE-STABLE] [GUIDE] β Details how to use the Prometheus Adapter to automatically scale workloads based on metrics stored in Amazon Managed Prometheus. Outlines mapping Prometheus PromQL queries into custom Kubernetes metrics for the Horizontal Pod Autoscaler (HPA).
Logging¶
- aws.amazon.com: Fluent Bit Integration in CloudWatch Container Insights' for EKS [ENTERPRISE-STABLE] β Technical overview of the native Fluent Bit integration with AWS CloudWatch Container Insights. Discusses migrating away from resource-heavy Fluentd agents to lightweight Fluent Bit configurations to process container logs efficiently at scale.
EKS Security and Isolation¶
Compliance¶
- aws whitepapers: Architecting Amazon EKS for PCI DSS Compliance (pdf) ππ [ADVANCED LEVEL] [CASE STUDY] [ENTERPRISE-STABLE] [GUIDE] β High-density security blueprint analyzing steps to implement PCI DSS compliance requirements on Amazon EKS. It explores data protection strategies, auditable logging mechanisms, network segmentation rules, and strict operational access controls.
IAM Integration¶
- (2021) nextlinklabs.com: Handling Auth in EKS Clusters: Setting Up Kubernetes User Access Using AWS IAM πππ [COMMUNITY-TOOL] [GUIDE] β Detailed architectural analysis of authentication handling in EKS clusters. Covers security boundaries between AWS IAM roles, AWS IAM Authenticator, and internal Kubernetes Role-Based Access Control (RBAC) configurations.
- azon EKS Pod Identity Webhook β 681 [ADVANCED LEVEL] [ENTERPRISE-STABLE] β An essential mutation webhook that automatically injects AWS IAM variables and credentials into Kubernetes Pod structures. Enables fine-grained authorization policies (IRSA) allowing pods to securely access specific AWS cloud API actions without cluster-wide node roles.
- dev.to: EKS IAM Deep Dive π [ADVANCED LEVEL] [COMMUNITY-TOOL] [GUIDE] β High-density security deep-dive analyzing EKS cluster credential boundaries. It contrasts AWS IAM authentication mechanics with standard OIDC federated identity providers, outlining optimal credential isolation policies for pods.
Multi-Tenancy¶
- (2021) clickittech.com: Kubernetes Multi tenancy with Amazon EKS: Best practices and considerations πππ [COMMUNITY-TOOL] [GUIDE] β This guide details patterns for sharing an EKS cluster among multiple tenants safely. It highlights namespace isolation, network policies, IAM Roles for Service Accounts (IRSA), and resource quotas designed to secure isolated developer environments.
Policy Management¶
- aws.amazon.com: Easy as one-two-three policy management with Kyverno on' Amazon EKS π [ENTERPRISE-STABLE] [GUIDE] β Walkthrough detailing how to manage native policy rules on EKS clusters using Kyverno instead of raw Rego. Illustrates automated resource validation, generation, and mutation patterns to enforce corporate configuration compliance.
EKS Storage¶
Shared Volumes¶
- (2020) Kubernetes PVCs with EFS provisioner ππ [GUIDE] [LEGACY] β This technical blog outlines mounting persistent storage to EKS clusters using the legacy Amazon EFS Provisioner. It covers IAM policy settings, storage class mapping, and volume claim binding, which has since transitioned into the modern EFS CSI driver paradigm.
- aws.amazon.com: Mount Amazon EFS file systems cross-account from Amazon' EKS, and utilize AWS Organizations more effectively [ADVANCED LEVEL] [ENTERPRISE-STABLE] [GUIDE] β Architectural design details on connecting Amazon EFS file-shares safely across different AWS accounts into EKS containers. Addresses multi-tenant access control policies, transit encryption, and cross-account IAM delegations.
Infrastructure as Code (3)¶
CDK and EKS¶
- aws.amazon.com: Continuous Delivery of Amazon EKS Clusters Using AWS CDK' and CDK Pipelines [ADVANCED LEVEL] [ENTERPRISE-STABLE] [GUIDE] β Guide illustrating how to write software-defined infrastructure to orchestrate and deploy Amazon EKS clusters using AWS Cloud Development Kit (CDK) Pipelines. Describes code-to-infrastructure deployments, testing stages, and lifecycle automation.
Terraform and EKS¶
- youtube: CloudGeeks - Terraform Eks Kubernetes RDS Secrets Manager Eksctl' Cloudformation ALB Controller (Redmine App) [COMMUNITY-TOOL] [GUIDE] β Complete orchestration video guide setting up an end-to-end containerized application on AWS EKS using Terraform. Features RDS database attachments, Secrets Manager integrations, and ALB Ingress configurations.
Resource Provisioning¶
- AWS Controllers for Kubernetes (ACK) π β 2627 [DE FACTO STANDARD] β Official community hub and development ecosystem for ACK (AWS Controllers for Kubernetes). Enables teams to model and provision standard cloud resources like RDS databases, SQS queues, and S3 buckets directly using native Kubernetes YAML configurations.
- Announcing the AWS Controllers for Kubernetes Preview [ENTERPRISE-STABLE] β The AWS Controllers for Kubernetes (ACK) allows developers to define and manage AWS resources directly from within Kubernetes using custom resources. This bridges the declarative Kubernetes API model with external cloud infrastructure lifecycle management.
Public Cloud Providers¶
Azure Kubernetes Service AKS¶
CICD and Deployment¶
- azuredevopslabs.com: Deploying a multi-container application to Azure' Kubernetes Services [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A guided hands-on lab explaining multi-container deployment pipelines on AKS utilizing Azure DevOps. Details continuous deployment loops, packaging application layers via Helm charts, and managing network routing parameters for multi-tier microservice structures.
Cluster Management (1)¶
- (2024) techcommunity.microsoft.com: Leveraging Azure Copilot for Azure Kubernetes Services (AKS) [EN CONTENT] [COMMUNITY-TOOL] β Analyzes capabilities of Azure Copilot in assisting operations teams within AKS. Investigates how LLM-guided prompts can help discover, review, and troubleshoot standard cluster configurations and YAML structures.
- (2023) techcommunity.microsoft.com: Azure Kubernetes Service Free tier and Standard tier [EN CONTENT] [COMMUNITY-TOOL] β This architectural breakdown contrasts Azure Kubernetes Service (AKS) Free and Standard tiers. While curator notes highlight cost-efficiency, live grounding verifies that the Standard Tier's SLA-backed API server is essential for production scalability, offering dedicated resources to prevent control plane throttling.
- adamtheautomator.com: Getting Started with the Azure Kubernetes Service' (AKS) [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β An administrative deployment guide for initiating, scaling, and managing AKS clusters via the Azure CLI and portal. Walks through foundational networking profiles, node-pool scaling logic, and cluster validation techniques.
- learn.microsoft.com: AKS landing zone accelerator [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β The Cloud Adoption Framework (CAF) reference deployment guide for the AKS Landing Zone Accelerator. Focuses on hub-and-spoke virtual networks, policy governance, identity boundaries, and standard security Baselines required for enterprise landing zones.
- piotrminkowski.com: Getting Started with Azure Kubernetes Service π [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Provides a comprehensive getting started guide for AKS using modern Infrastructure as Code with Terraform. Walks through cluster creation, ACR container integrations, and primary routing configuration setups.
- github.com/stephaneey/azure-and-k8s-architecture: Azure and K8s Architecture' π [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β An open architecture repository featuring curated blueprints for running production workloads on Azure and AKS. Demonstrates network security zone mapping, private endpoint routing, and hub-spoke cluster topologies.
- dinantpaardenkooper.nl: Azure Day with Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Provides community insights and event takeaways regarding advanced AKS engineering, covering cloud scaling trends, managing internal mesh complexity with Istio, and secure authentication models.
- pixelrobots.co.uk: Exploring Azure Kubernetes Serviceβs Node Autoprovision:' A Deep Dive into the Latest Public Preview Feature [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Deep dive into AKS Node Autoprovisioning (NAP) based on Karpenter technology. Explains how the cluster dynamically provisions and optimizes worker node sizes directly matching scheduling requirements, reducing unused resource allocations.
Edge Computing¶
- infoq.com: Microsoft Brings Kubernetes to the Edge with AKS Edge Essentials [EN CONTENT] [COMMUNITY-TOOL] β Outlines Microsoft's strategy for running Kubernetes at the resource-constrained edge via AKS Edge Essentials. Synthesizes early product announcements with actual deployment frameworks, detailing how Arc-enabled management simplifies lightweight Windows-and-Linux IoT orchestrations.
- thenewstack.io: Microsoft Takes Kubernetes to the Edge with AKS Lite [EN CONTENT] [COMMUNITY-TOOL] β An industry report detailing AKS Lite (AKS Edge Essentials). Outlines Microsoft's strategy to deploy lightweight Kubernetes clusters onto small-footprint IoT hardware while keeping them linked via Azure Arc.
High Availability and Storage¶
- (2024) techcommunity.microsoft.com: A Practical Guide to Zone Redundant AKS Clusters and Storage [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β An enterprise blueprint addressing multi-zone AKS design and zone-redundant storage volumes. Investigates availability zone constraints, node-affinity impacts, and persistent volume failover latency to design zero-data-loss topologies.
Networking (2)¶
- (2023) techcommunity.microsoft.com: Kubernetes External DNS for Azure DNS & AKS [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Covers the dynamic configuration of ExternalDNS inside AKS linked to Azure DNS zones. Demonstrates how to automate external DNS record registration and lifecycle tasks, eliminating manual DNS updates for public-facing ingress zones.
- (2023) learn.microsoft.com: Deploy AKS and API Management with mTLS [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Provides guidelines to establish mutual TLS (mTLS) pipelines from external clients, through Azure API Management (APIM), down to AKS backend systems. Vital for compliance and high-security enterprise network architectures.
- azure.microsoft.com: Announcing the general availability of Azure CNI Overlay' in Azure Kubernetes Service [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Introduces the structural architecture of Azure CNI Overlay in AKS. Demonstrates how this overlay network design solves severe IP address exhaustion problems by decoupling the pod IP allocation space from the host VNet subnets, allowing massive scale limits.
- returngis.net: Configurar mΓ‘s de un Application Gateway con AGIC para AKS [ES CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Explica la configuraciΓ³n avanzada de mΓΊltiples instancias de Azure Application Gateway mediante AGIC para segmentar trΓ‘fico en AKS. Detalla tΓ©cnicas de aislamiento de rutas e Ingress classes para redes empresariales hΓbridas.
[SPANISH CONTENT] - returngis.net: Azure Application Gateway con WAF y wildcard + Nginx Controller' para AKS [ES CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Detalla la implementaciΓ³n hΓbrida de un Application Gateway frontal con reglas WAF junto a un controlador Nginx Ingress interno en AKS. Ideal para configuraciones SSL multi-dominio con comodines y polΓticas de mitigaciΓ³n OWASP.
[SPANISH CONTENT] - learn.microsoft.com: Use Application Gateway Ingress Controller (AGIC) with' a multitenant Azure Kubernetes Service [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Official architectural pattern for deploying AGIC (Application Gateway Ingress Controller) in multi-tenant AKS cluster environments. Highlights namespace-level ingress isolation, path-based load routing, and central SSL offloading models. - returngis.net: Exponer APIs en AKS a travΓ©s de Azure API Management [ES CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Muestra cΓ³mo establecer de manera segura una capa de Azure API Management (APIM) delante de un clΓΊster de AKS. Aborda la comunicaciΓ³n de red privada, el uso de Ingress y el control granular de polΓticas de consumo de API.
[SPANISH CONTENT]
Observability and Monitoring¶
- (2024) learn.microsoft.com: Monitor Azure Kubernetes Service (AKS) control plane metrics (preview) [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Outlines features designed to expose and monitor AKS control plane internal metrics using managed Prometheus and Grafana instances. Details monitoring etcd latency, scheduler run queues, and API server throughput.
Performance Optimization (1)¶
- danielstechblog.io: Mitigating slow container image pulls on Azure Kubernetes' Service [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Investigates mitigation patterns for cold-start latency issues associated with large container image pulls in AKS. Examines dynamic caching, optimal Azure Container Registry (ACR) alignment, and the deployment of advanced artifact streaming features to maximize application scaling speeds.
Security and Secret Management¶
- (2022) kristhecodingunicorn.com: Setting Up OAuth 2.0 Authentication for Applications in AKS With NGINX and OAuth2 Proxy ππ [EN CONTENT] [ADVANCED LEVEL] [GUIDE] ππ [COMMUNITY-TOOL] [GUIDE] β Step-by-step implementation of OAuth 2.0 authentication offloaded to the AKS ingress layer. Contrasts standard microservice authentication overhead with an edge-routed proxy configuration using NGINX Ingress and OAuth2 Proxy, enforcing centralized authentication checks prior to downstream request forwarding.
- (2024) techcommunity.microsoft.com: Simplifying Azure Kubernetes Service Authentication Part 2 [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Focuses on modern passwordless authentication setups within AKS. Covers Entra ID identity integrations, RBAC mapping, and the elimination of credential-rotation fatigue by migrating from classic API keys to managed credentials.
- (2023) techcommunity.microsoft.com: Securing Windows workloads on Azure Kubernetes Service with Calico [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [GUIDE] [LEGACY] β Demonstrates the implementation of Calico Network Policies for multi-platform AKS clusters featuring both Windows and Linux nodes. Highlights container-to-container isolation patterns and egress traffic restrictions to secure legacy Windows enterprise workloads.
- community.ops.io: Configuring AKS to read secrets and certificates from' Azure KeyVaults [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Details the structural mapping required to link AKS with Azure Key Vault using the Secrets Store CSI Driver. Synthesizes community configurations with security best practices, highlighting pod-identity integration, dynamic secret rotation, and declarative YAML definitions to replace static inline environment variables.
- kristhecodingunicorn.com: Setting Up OAuth 2.0 Authentication for Applications' in AKS With NGINX and OAuth2 Proxy [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β An in-depth configuration guide targeting NGINX Ingress controller annotations and OAuth2 Proxy settings. Synthesizes ingress routing flows with OpenID Connect (OIDC) identity providers to build secure, identity-aware application gateways on AKS without modification of backend application code.
- returngis.net: Desplegar AGIC en AKS utilizando workload identity [ES CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β GuΓa prΓ‘ctica para migrar el controlador de entrada AGIC de AKS hacia Azure AD Workload Identity. Reemplaza el uso inseguro de secretos persistentes con identidades federadas y cuentas de servicio con privilegios mΓnimos.
[SPANISH CONTENT]
Specialized Workloads¶
- (2024) techcommunity.microsoft.com: Running GPU accelerated workloads with NVIDIA GPU Operator on AKS π [EN CONTENT] [ADVANCED LEVEL] [GUIDE] π [COMMUNITY-TOOL] [GUIDE] β Highlights the installation and usage of the NVIDIA GPU Operator on AKS. Explains how to orchestrate high-performance computing, AI, and GPU partition options (MIG) for parallel deep learning training directly within Kubernetes worker nodes.
Troubleshooting and Diagnostics¶
- github.com/OvidiuBorlean/kubectl-windumps [EN CONTENT] [ADVANCED LEVEL] [LEGACY] β A specialized kubectl plugin facilitating raw packet capturing on AKS Windows worker nodes. Live grounding indicates the repository has been inactive for over four years, yet it remains a valuable conceptual reference for troubleshooting deep TCP/IP anomalies on legacy Windows container deployments.
Google Kubernetes Engine GKE¶
Cluster Management (2)¶
- (2024) cloud.google.com: GKE Autopilot π [EN CONTENT] [DOCUMENTATION] π [COMMUNITY-TOOL] β Primary conceptual documentation for GKE Autopilot. Highlights SLA details, security-hardening parameters, dynamic pricing models, and specific restrictions compared to standard node environments.
- (2021) youtube: GKE Autopilot - Fully Managed Kubernetes Service From Google π [EN CONTENT] π [COMMUNITY-TOOL] β A video introduction detailing GKE Autopilot operations and setup tasks. Explains how autopilot takes over day-two operational scaling and outlines fundamental node architecture abstractions.
- (2020) Looking ahead as GKE, the original managed Kubernetes, turns 5 [EN CONTENT] [COMMUNITY-TOOL] β Reflects on GKE's history and core features. Explains earlier architectural trends, the introduction of multi-cluster designs, and foundations that shaped Google's managed Kubernetes solutions.
- Google Kubernetes Engine [EN CONTENT] [DOCUMENTATION] [DE FACTO STANDARD] β The main technical documentation page for GKE (Google Kubernetes Engine). Details foundational and advanced options, covering Autopilot architecture, GKE Datapath V2 routing, and multi-cluster orchestrations.
- Introducing GKE Autopilot: a revolution in managed Kubernetes π [EN CONTENT] [COMMUNITY-TOOL] β Announces the launch of GKE Autopilot. Discusses its billing models based on active pod specifications and automated node scaling, shifting infrastructure control tasks directly to Google SREs.
- techcrunch.com: Google Cloud puts its Kubernetes Engine on autopilot [EN CONTENT] [COMMUNITY-TOOL] β A commercial analysis of GKE Autopilot's introduction. Evaluates how removing raw VM management options shifts operations tasks towards microservice scaling and application value delivery.
- zdnet.com: Google introduces GKE Autopilot for hands-off Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Examines industry trends towards hands-off Kubernetes management via GKE Autopilot. Details benefits for small to medium enterprise environments looking to cut container runtime operations costs.
- thenewstack.io: Googleβs New βAutopilotβ for Kubernetes [EN CONTENT] [COMMUNITY-TOOL] β Examines GKE Autopilot structural designs from an administrator point of view. Explains differences in scaling strategies, pod sizing mechanics, and embedded security parameters relative to traditional nodes.
Networking (3)¶
- (2023) Setting up NodeLocal DNSCache [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β Covers setting up NodeLocal DNSCache in GKE clusters. Explains how running a lightweight DNS caching daemon as a DaemonSet helps mitigate connection-tracking overhead and latency bottlenecks.
- (2021) Using new traffic control features in External HTTP(S) load balancer [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Outlines advanced load balancing features on Google Cloud external application proxies. Shows how request mirroring, canary weighted splits, and URL manipulations integrate with backend GKE Ingress controllers.
- cloud.google.com: Discover and invoke services across clusters with GKE' multi-cluster services [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β Introduces GKE Multi-Cluster Services (MCS). Focuses on cross-cluster discovery models that let disparate GKE instances interact securely without needing complex overlay networks or manual endpoint syncing.
Observability and Monitoring (1)¶
- codeburst.io: Google Kubernetes Engine Logging by Example [EN CONTENT] [GUIDE] [COMMUNITY-TOOL] [GUIDE] β A practical exploration of structural application logging inside GKE via Cloud Logging. Covers structured JSON formatting, log filtration techniques, and exports for audit compliance tasks.
Security and Secret Management (1)¶
- Fetches all Primitive and Predefined GCP IAM Roles [EN CONTENT] [ADVANCED LEVEL] [COMMUNITY-TOOL] β An open-source tool analyzing and exporting detailed lists of GCP IAM permissions and pre-defined roles. Highly beneficial for configuring least-privilege Workload Identity bounds inside secure GKE environments.
Security (2)¶
Cluster Hardening¶
Best Practices (1)¶
- Amazon EKS Best Practices Guide for Security π [EN CONTENT] [ADVANCED LEVEL] [GUIDE] [DE FACTO STANDARD] [GUIDE] β Curator Insight: The definitive handbook for securing AWS EKS environments, curated by AWS security engineers. Live Grounding: Serves as the primary operational baseline for hardening network, IAM, data, and compute resources in AWS.
π‘ Explore Related: AWS Tools Scripts | Azure | AWS