Thoughts & Lessons
Writing about SRE, network engineering, observability, and the lessons learned building reliable infrastructure at scale.
Adopting AI in SRE, Service Assurance, and Observability
As the new Team Lead for SRE, Service Assurance and Observability, I'm exploring how AI can transform the way we detect, diagnose, and resolve network incidents — and where the hype falls short.
From Network Analyst to SRE Team Lead: My 11-Year Journey
How I went from configuring enterprise gateways in Manitoba to leading SRE observability and service assurance for a national core network — and what I learned along the way.
What SRE Means in a Telecom / ISP Context
SRE was born at Google, but the principles apply everywhere — including telecom. Here's how SRE practices translate when your 'services' are MPLS cores, DWDM transport, and fixed wireless access networks.
Designing SLIs and SLOs for Network Services
A practical guide to defining Service Level Indicators and Objectives for network infrastructure — from choosing the right metrics to setting targets that drive the right behavior.