Steve Wade

The Pragmatic CNCF Guy

Simplifying Cloud-Native Journeys

About Steve

Steve Wade is the "Pragmatic CNCF Guy" who helps enterprises simplify their cloud-native journey. After 50+ Kubernetes migrations, he's developed a superpower: knowing which CNCF tools you actually need (hint: it's fewer than you think). His approach is simple: simplification over sophistication. While others push complex architectures, Steve helps teams ship features daily with the minimum viable CNCF stack, saving his clients an average of £1.5M by NOT implementing unnecessary tools.

His career path has been unconventional. Steve started breaking things in production (on purpose) to understand how they work. This led him to become the person companies call when their "Netflix-inspired" architecture is 6 months behind schedule and hemorrhaging cash. He's built his reputation on telling enterprises hard truths: you probably don't need Istio (95% of his clients don't), your Kubernetes cluster is too complex (he can prove it), and following Google's architecture will bankrupt you (they have different problems).

Current Focus: CNCF Fast Track

Steve is currently building an ecosystem to help enterprises navigate CNCF without the BS. His CNCF Fast Track program is a 30-day intensive that takes platform teams from "drowning in complexity" to "shipping daily with confidence." It's the opposite of the typical 6-month consulting engagement that leaves teams more confused than when they started.

Beyond this, Steve publishes a weekly newsletter where 300+ platform engineers get his unfiltered take on CNCF adoption, runs free monthly masterclasses with live architecture reviews, and is developing a "Reality Check" assessment tool to score enterprise readiness for Kubernetes adoption.

If we're still writing HCL in 2035, we will have failed.

Q1: Where has Terraform stood the test of time, and where have you hit limitations?

Terraform has been my go-to for one reason: it's boring, and boring is good for infrastructure. I've done 50+ migrations. The successful teams used Terraform for one thing – managing cloud resources declaratively.

But here's where it breaks down: state management at scale. I've seen a £2M project nearly fail because someone corrupted the state file. Another client had 47 different state files because they feared the blast radius. That's not sustainable.

This is precisely why I created my "CNCF Reality Check" assessment – to help teams avoid these £2M mistakes before they happen.

I've seriously evaluated the alternatives. Crossplane? Brilliant idea – Kubernetes managing infrastructure. But after watching three enterprises try it, I'm skeptical. The problem? Your team now needs to be experts in both Kubernetes AND infrastructure provisioning. One financial services client spent 4 months trying to get Crossplane working before calling me to rip it out.

Pulumi? I love using real programming languages for infrastructure. But developers treat infrastructure like application code. It's not. Plus someone always writes "clever" infrastructure code that no one else can understand.

Terraform still wins for most enterprises. It's not perfect, but it's predictable.

Q2: Is Kubernetes' continuous reconciliation model the right approach for infrastructure?

This is where I get controversial: Kubernetes operators for infrastructure are solving the wrong problem.

The promise sounds amazing. Infrastructure that heals itself. Continuous reconciliation. GitOps everywhere. But here's what actually happens: Your PostgreSQL operator decides to "reconcile" your production database at 3 AM. It sees a config drift as the desired state. Suddenly, you're explaining to the board why the infrastructure "fixed itself" into an outage.

This is precisely the kind of expensive mistake I help teams avoid in my monthly masterclasses, where I show which parts of your infrastructure should use operators and which shouldn't. This one decision can save you thousands.

I've successfully implemented operators for stateless infrastructure. Load balancers, DNS, certificates – these work well. But for stateful infrastructure? It's a nightmare. One telecom client had their Kafka operator "helpfully" rebalance partitions during Black Friday. That cost them £500K in lost sales.

The model makes sense for cattle infrastructure – things that can be destroyed and recreated without consequence. However, most enterprise infrastructure is comprised of pets pretending to be cattle. Until we accept that, operators will cause more problems than they solve.

Most enterprise infrastructure is pets pretending to be cattle.

Q3: Are visual tools replacing code for infrastructure management?

This question hits at something I see every week: we've made infrastructure so complex that only a priesthood of experts can manage it. This complexity is why 73% of Kubernetes migrations fail – a stat I share in my newsletter along with how to avoid being part of that statistic.

I'm watching System Initiative closely. The real innovation isn't the visual interface – it's the real-time feedback loop. When an architect can instantly see that their beautiful design will cost £50,000 per month, that's powerful. However, my concern is that every visual tool eventually encounters the "escape hatch" problem. What happens when you need something the tool doesn't support?

I've seen Brainboard demos that made executives understand their infrastructure! However, when the team attempted to implement their existing Terraform modules, they encountered wall after wall.

The future isn't visual OR code – it's both. Visual for reasoning and planning, code for precision and repeatability. The tools that will win are those that allow you to transition between both modes seamlessly. Think Figma for infrastructure – you can design visually but drop into code when needed.

The future isn't visual OR code – it's both. Think Figma for infrastructure.

Q4: What does daily infrastructure work look like in 2035?

By 2035, most infrastructure work will be conducted conversationally. Not chatbots writing YAML (please no), but infrastructure engineers having actual dialogues with AI systems that understand context, trade-offs, and business requirements.

This shift is why I now focus on teaching pragmatic simplification. The engineers who'll thrive in 2035 aren't the ones who can write the most complex Terraform – they're the ones who can simplify complexity into business value.

Picture this: "I need a Kubernetes cluster for our new payment service. It needs to handle 10K transactions per second, comply with PCI DSS, and cost less than our current setup."

The AI responds: "Based on your requirements, I recommend a 15-node cluster across three zones. This exceeds your TPS requirement by 40% for burst capacity. For PCI compliance, I'll enable encryption at rest, network policies, and Pod Security Standards. Total cost: £8,500/month, 23% less than the current rate. Should I show you the trade-offs if we reduce to 12 nodes?"

However, and this is crucial, the AI shows its work. Every decision is traceable, every trade-off explicit. Because when that cluster fails at 3 AM, you need to understand WHY those decisions were made.

The engineers of 2035 won't write HCL. They'll be infrastructure architects who understand business requirements, compliance, and cost optimization deeply. They'll guide AI systems that handle the implementation details while they focus on the complex problems: What should we build? Why? What are the trade-offs?

The biggest change? Infrastructure will be boring again. And that's exactly what enterprises need.

Josh's Note

Steve's pragmatic approach to infrastructure evolution cuts through the hype. His focus on making infrastructure boring again isn't about lack of innovation – it's about maturity. When infrastructure becomes conversational and transparent, with AI handling complexity while engineers focus on business value, we finally achieve what we've been promising for years: infrastructure that just works. At Terrateam, we share this vision of simplification through intelligent automation, not complexity for complexity's sake.

Follow Steve's work:

LinkedIn - Daily insights on pragmatic CNCF adoption
Newsletter - Join 300+ platform engineers getting reality checks on CNCF
CNCF Fast Track - 30-day intensive program for platform teams
Free Monthly Masterclasses - Register through the newsletter for live architecture reviews

Upcoming Speaking:

Bit Summit - Hamburg, September 4th (Get 15% off with code: STEVE_BITSUMMIT)
DevFest.cz 2025 - Prague, October 23-24, 2025
devopsdays Warsaw - November 4-5, 2025

More in the Series

Anton Babenko on Moving Beyond 'As-Code' →

Matt Gowie on Why We'll Stop Writing HCL →

Features

Learn More

Learn

Connect