Become a Site Reliability Engineer
Master reliability engineering, incident response, and observability — structured for working professionals.
Is this for you?
This track is designed for people at these stages:
You're a sysadmin or DevOps engineer who wants to specialise in reliability and resilience
You're a backend developer who wants to own production systems and on-call responsibilities
You're already in an SRE-adjacent role but need structured skills for UK SRE positions
Career Path Timeline
Where this track can take you in the UK job market.
Junior SRE
£35,000–£48,000/yrMid-level SRE
£55,000–£72,000/yrSenior SRE
£75,000–£95,000/yrStaff / Principal SRE
£95,000–£120,000+/yrJunior SRE
£35,000–£48,000/yrMid-level SRE
£55,000–£72,000/yrSenior SRE
£75,000–£95,000/yrStaff / Principal SRE
£95,000–£120,000+/yrThe Curriculum
10 structured modules. Progress at your pace — your training path adapts to you.
SRE Foundations & Principles
Understand the philosophy behind Site Reliability Engineering and how it differs from traditional ops
Service Level Objectives (SLOs)
Define, measure, and track SLIs, SLOs, and error budgets for production systems
Monitoring & Alerting
Build effective monitoring strategies with Prometheus, Grafana, and PagerDuty
Incident Management
Develop incident response workflows, on-call rotations, and blameless postmortems
Capacity Planning
Forecast resource needs, manage scaling, and optimise infrastructure costs
Chaos Engineering
Proactively test system resilience with fault injection and game days
Distributed Systems Reliability
Understand consensus, replication, and failure modes in distributed architectures
Toil Reduction & Automation
Identify and eliminate repetitive operational work with automation
Performance Engineering
Profile, benchmark, and optimise application and infrastructure performance
Capstone Project
Design and implement an SRE framework for a production-grade application
What You'll Build
Real projects that become your portfolio — the kind UK hiring managers actually ask about.
SLO Dashboard & Error Budget Tracker
Build a real-time SLO monitoring dashboard with error budget burn rate alerts and automated notifications
Incident Response Platform
Create an automated incident management system with escalation policies, runbooks, and postmortem generation
Chaos Engineering Framework
Design and execute chaos experiments to validate system resilience under failure conditions
Your Trainer
Sam
SRE Trainer at CareerForge
Sam has mentored 40+ engineers into SRE roles across UK fintech and e-commerce companies. Calm under pressure, methodical, and always ready with a debugging strategy.
Ready to start your SRE journey?
Cohort places are limited. Applications reviewed within 48 hours.
Apply for the SRE trackInvite-only · No self-service signup