The Role & Client Context
- Core platforms support revenue, users, and downstream systems in real time
- Reliability issues are already visible and commercially consequential
- Engineering teams are capable but stretched, with resilience debt in the system
- The Platform / SRE Lead operates with authority across infrastructure, engineering, and operations
This role exists to turn fragility into stability without slowing delivery.
THE CRITICAL MANDATE
Risk & Focus
- Production incidents are not hypothetical – they are happening.
- Stability has historically been secondary to feature delivery, and the cost is now visible.
- You will own uptime, incident response, and recovery decisions in environments where failure is public and traceable.
If you are uncomfortable being accountable for systems at 3am, this will not suit you.
If you are calm under operational pressure, this is where your judgement matters.
THE LONG-TERM FIT
Growth & Impact
- Long-term ownership of platform reliability and resilience strategy
- Authority to shape operational standards, tooling, and response models
- Trusted influence across engineering leadership and senior stakeholders
This mandate suits operators who value stability as a strategic advantage, not an afterthought.
Start a Confidential Conversation
Submit your CV if this mandate reflects how you work.
Every profile is reviewed personally.
We do not share CVs without consent.