wave
https://github.com/wave-k8s/wave
Wave watches Deployments, StatefulSets and DaemonSets within a Kubernetes cluster and ensures that their Pods always have up to date configuration.
By monitoring mounted ConfigMaps and Secrets, Wave can trigger a Rolling Update of the Deployment when the mounted configuration is changed.
https://github.com/wave-k8s/wave
winter-soldier
https://github.com/devtron-labs/winter-soldier
Winter Soldier can be used to
- cleans up (delete) Kubernetes resources
- reduce workload pods to 0
at user defined time of the day and conditions. Winter Soldier is an operator which expects conditions to be defined using CRD hibernator.
https://github.com/devtron-labs/winter-soldier
Terraforming Your Grafana Alerts for Kubernetes Clusters
https://medium.com/h7w/terraforming-your-grafana-alerts-for-kubernetes-clusters-958616f89cc2
https://medium.com/h7w/terraforming-your-grafana-alerts-for-kubernetes-clusters-958616f89cc2
terraform-mcp-server
https://github.com/thrashr888/terraform-mcp-server
A Model Context Protocol (MCP) server that provides tools for interacting with the Terraform Registry API. This server enables AI agents to query provider information, resource details, and module metadata.
https://github.com/thrashr888/terraform-mcp-server
Don't make these feature flag mistakes
https://newsletter.posthog.com/p/dont-make-these-classic-feature-flag
https://newsletter.posthog.com/p/dont-make-these-classic-feature-flag
rybbit
https://github.com/rybbit-io/rybbit
Rybbit is the modern open source and privacy friendly alternative to Google Analytics. It takes only a couple minutes to setup and is super intuitive to use.
https://github.com/rybbit-io/rybbit
Using MCPs to Run Terraform
https://masterpoint.io/blog/using-mcps-to-run-terraform
We jump into a hands-on exploration of Model Context Protocol (MCP), sharing our experiment using a MCP client to run terraform init, plan, apply. We share our take on where agents add value and highlight security considerations when adding MCPs to your workflow.
https://masterpoint.io/blog/using-mcps-to-run-terraform
Upgrading ECK Operator: A Side-by-Side Kubernetes Operator Upgrade Approach
https://engineering.mercari.com/en/blog/entry/20250428-upgrading-eck-operator-a-side-by-side-kubernetes-operator-upgrade-approach/
To leverage the advancements in recently released ECK operator versions, we embarked on an upgrade project. Operator upgrades are inherently complex and risky, often involving significant changes that can affect system stability.
In this article, I’ll delve into the challenges we encountered and the strategies we employed to manage operator upgrades for stateful workloads like Elasticsearch. Additionally, I’ll detail how we modified the ECK operator to facilitate a more resilient side-by-side upgrade process.
https://engineering.mercari.com/en/blog/entry/20250428-upgrading-eck-operator-a-side-by-side-kubernetes-operator-upgrade-approach/
Zero-Touch Bare Metal at Scale
https://blog.railway.com/p/data-center-build-part-two
In this episode, we talk about how we operationalize the hardware once it’s installed.
https://blog.railway.com/p/data-center-build-part-two
A Case Study in Synchronizing Database Schema Updates between Projects and Environments
https://medium.com/doubleverify-engineering/a-case-study-in-synchronizing-database-schema-updates-between-projects-and-environments-a69a3cc38985
If you’ve come across this post, you probably know about relational database schema migrations. You’ve likely worked with a relational database like PostgreSQL or MySQL and a tool for managing schema migrations via code.
This post doesn’t walk you through the basics of schema migrations but rather explains an especially complicated and non-typical schema migrations use case. This post will also show you the nuances of the custom schema migration solution we created here at DoubleVerify.
https://medium.com/doubleverify-engineering/a-case-study-in-synchronizing-database-schema-updates-between-projects-and-environments-a69a3cc38985
Amazon EKS Auto Mode vs Azure AKS Automatic: The Better Managed Kubernetes Solution?
https://pixelrobots.co.uk/2024/12/amazon-eks-auto-mode-vs-azure-aks-automatic-the-better-managed-kubernetes-solution
https://pixelrobots.co.uk/2024/12/amazon-eks-auto-mode-vs-azure-aks-automatic-the-better-managed-kubernetes-solution
A CNI 'Chicken-and-Egg' Dilemma: How Does Calico Assign IPs to Itself?
https://midbai.com/en/post/cni-chicken-egg-problem
While research CNI recently, I recalled an interesting issue I encountered during the development of network plugins and investigation of Calico: Calico assigns IP addresses to its own components’ Pods (e.g., calico-kube-controllers). How does Calico achieve this? From the installation of the Calico network plugin to assigning IPs to its own Pods, what happens at the underlying level?
This essentially poses a “chicken-and-egg” problem: running a Pod requires the CNI plugin, while the CNI plugin’s operation depends on the proper functioning of other Pods.
This analysis is based on Cilium v1.16.5, Calico v3.29.1, and Kubernetes v1.23.
https://midbai.com/en/post/cni-chicken-egg-problem
How we tested scaling to 10,000 Kubernetes clusters without missing a beat
https://www.spectrocloud.com/blog/how-we-tested-scaling-to-10-000-kubernetes-clusters-without-missing-a-beat
https://www.spectrocloud.com/blog/how-we-tested-scaling-to-10-000-kubernetes-clusters-without-missing-a-beat
kro
https://github.com/kro-run/kro
This project aims to simplify the creation and management of complex custom resources for Kubernetes.
Kube Resource Orchestrator (kro) helps you to define complex multi-resource constructs as reusable components in your applications and systems. It does this by providing a Kubernetes-native, vendor agnostic way to define groupings of Kubernetes resources.
kro's fundamental custom resource is the ResourceGraphDefinition. A ResourceGraphDefinition defines collections of underlying Kubernetes resources. It can define any Kubernetes resources, either native or custom, and can specify the dependencies between them. This lets you define complex custom resources, and include default configurations for their use.
The kro controller will determine the dependencies between resources, establish the correct order of operations to create and configure them, and then dynamically create and manage all of the underlying resources for you.
kro is Kubernetes native and integrates seamlessly with existing tools to preserve familiar processes and interfaces.
https://github.com/kro-run/kro
The Hidden Risk of Running WordPress on Kubernetes: Debugging an Unexpected Downtime Issue
https://medium.com/1000farmacie/the-hidden-risk-of-running-wordpress-on-kubernetes-debugging-an-unexpected-downtime-issue-e810bf4fb577
https://medium.com/1000farmacie/the-hidden-risk-of-running-wordpress-on-kubernetes-debugging-an-unexpected-downtime-issue-e810bf4fb577
Understanding the 1MB Limit of Etcd in Kubernetes: Challenges with Helm Deployments
https://logeshbalu1998.medium.com/understanding-the-1mb-limit-of-etcd-in-kubernetes-challenges-with-helm-deployments-47ef41f37e9c
https://logeshbalu1998.medium.com/understanding-the-1mb-limit-of-etcd-in-kubernetes-challenges-with-helm-deployments-47ef41f37e9c
kubewall
https://github.com/kubewall/kubewall
A single binary to manage your multiple kubernetes clusters.
kubewall provides a simple and rich real time interface to manage and investigate your clusters.
https://github.com/kubewall/kubewall
Terratags: Enforce Tags on your AWS Terraform configuration
https://dev.to/quixoticmonk/terratags-enforce-tags-on-your-aws-terraform-configuration-1ck5
https://dev.to/quixoticmonk/terratags-enforce-tags-on-your-aws-terraform-configuration-1ck5
Azure Verified Module - Azure Landing Zones
P1: https://mikeguy.co.uk/posts/azure-verified-module-landing-zones-part-1
P2: https://mikeguy.co.uk/posts/azure-verified-module-landing-zones-part-2
In this article, we take a look at the Azure Verified Module for Azure Landing Zones, and how we can customise deployments.
P1: https://mikeguy.co.uk/posts/azure-verified-module-landing-zones-part-1
P2: https://mikeguy.co.uk/posts/azure-verified-module-landing-zones-part-2
What I Really Mean When I Say “Good Communication” in Incident Response
https://uptimelabs.io/good-communication-in-incident-response
https://uptimelabs.io/good-communication-in-incident-response