Site Reliability Engineer
About the role
Site Reliability / Resilience Engineer
Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.
Hybrid: Sheffield, UK(2-3days)
Contract: 6 months rolling contract
Rate: £600/day(Inside IR35) via Umb
Role Overview
We are seeking a Site Reliability / Resilience Engineer to support a large-scale, enterprise technology environment. This role focuses on improving the reliability, availability, and resilience of critical services across complex, distributed systems.
You will work across cloud, infrastructure, and application ecosystems, helping ensure services are observable, recoverable, and aligned with both engineering best practices and regulatory resilience requirements.
Key Responsibilities
- Support reliability and resilience across cloud platforms (AWS, Azure, GCP)
- Work across infrastructure, networks, data centres, and application platforms
- Analyse and map service dependencies and critical service chains
- Contribute to the design and implementation of resilience and recovery strategies (RTO/RPO, failover patterns)
- Support vulnerability identification and risk reduction activities
- Enhance observability, monitoring, and resilience tooling across services
- Ensure alignment with UK Operational Resilience Policy Framework (PRA/FCA/Bank of England)
- Support ITIL-aligned processes, including incident, change, and release management
- Drive improvements in service stability, reliability, and performance
Skills & Experience
- Strong experience across enterprise technology environments:
- Cloud platforms (AWS, Azure, GCP)
- Infrastructure, networking, and data centres
- Application platforms and integration layers
- Strong understanding xwzovoh of:
- Service chain and dependency mapping
- Vulnerability and risk management
- Recovery models (RTO/RPO) and resilience patterns
- ITIL-based service management practices
- Experience with enterprise tooling such as ServiceNow
- Exposure to observability or monitoring platforms (beneficial but not essential)
- Familiarity with UK Operational Resilience frameworks (PRA/FCA/Bank of England)
This is a strong opportunity for someone who combines Site Reliability Engineering principles with a focus on operational resilience, observability, and large-scale enterprise systems.
About this listing
This role passed our automated spam and quality filters and was active in our feed when last checked. Joboru is an aggregator — here is how we screen listings. If anything looks off, tell us.
Similar jobs you may like
Strategic Customer Success Manager
1 day agoClaranet Limited
Seasonal Brand Home Guides - Caol Ila
1 day agoBrightwork Ltd
Strategic Customer Success Manager
1 day agoClaranet Limited
Visitor Services Assistant
1 day agoNational Trust Scotland
Resident Service Associate (Lettings & Tenancy Management)
1 day agoGreat Places Housing Association
Seasonal Brand Home Guides - Talisker
1 day agoBrightwork Ltd
Customer Experience Coordinator
1 day agoCreative Support Ltd
Welfare Benefits Advisor
1 day agoAmplius
Customer Service Representative
1 day agoRandstad Technologies