Site Reliability Engineer III (Tue - Sat)

CMETS CME Technology and Support Services Ltd.

ScreenedHybridJust posted

Belfast, Northern Ireland

Posted 1 day ago

Apply Now

About the role

Site Reliability Engineer III (Tue - Sat)

CME Group is seeking a Site Reliability Engineer III (Tue - Sat) to take a key role in building, operating, and scaling systems in our Markets portfolio. As an SRE III, you will apply your experience to the complex challenges of the CME Globex trading platform, where our systems deliver an exceptional combination of low-latency performance and rock-solid reliability.

Key Responsibilities

Own Observability: design, build, and refine monitoring, alerting, and observability solutions; drive continuous improvement of SLIs & SLOs to enable faster issue detection and resolution.
Drive Reliability Projects: take ownership of reliability-focused projects from design to implementation, collaborating with product teams to ensure new features are scalable, resilient, and safe.
Lead Technical Solutions: lead technical discussions for your work, presenting solution options and proposals with clear trade-offs.
Automate Intelligently: proactively identify and eliminate toil through robust automation, improving both system reliability and team velocity.
Manage Incidents: take a leading role in incident response, owning the resolution of significant incidents, ensuring rapid system recovery, and driving meaningful action from blameless post‑mortems.
Mentor & Coach: act as a technical mentor and point of escalation for L1 and L2 SREs, fostering their growth through code reviews and paired work.
Architect for the Future: contribute ideas to the product backlog and play an active role in the architectural design for the migration to Google Cloud Platform (GCP).

What We're Looking For

3‑5+ years of professional experience in a Site Reliability, DevOps, Software, or Systems Engineering role.
Strong, hands‑on experience administering and troubleshooting Linux‑based production systems.
Proficient programming skills in a language like Python or Go, with a track record of automating complex operational tasks.
Proven ability to lead technical initiatives and solve complex problems with a high degree of autonomy.
Excellent communication skills, with the ability to articulate complex technical concepts to diverse audiences.
A proactive and ownership‑oriented mindset.

Desirable Skills

Cloud Platforms: Deep experience with Google Cloud Platform (GCP), especially GCE, GKE, and cloud networking.
Monitoring Tools: Expertise in designing and managing monitoring stacks (e.g., Prometheus, Grafana, OpenTelemetry).
Distributed Systems: Strong practical knowledge of building and maintaining large‑scale distributed systems.
Containerisation: Advanced experience with Kubernetes and Docker in a production environment.
Networking: Solid understanding of networking protocols (HTTP, TCP/UDP, IP) and network architecture.
Domain Knowledge: Experience in financial markets, low‑latency systems, or with message‑oriented middleware.

Company Benefits

Bonus Programme
Generous shift allowance
Equity Programme
Employee Stock Purchase Plan (ESPP)
Private Medical and Dental coverage
Mental Health Benefit Programme
Group Pension Plan
Income Protection
Life Assurance
Cycle To Work
EV Car Benefit Scheme
Gym Membership
Family Leave
Education Assistance – MBA/Advanced Degree/Bachelor Degree
Ongoing Employee Development Training/Certification
Hybrid Working

As an equal‑opportunity employer, we consider all potential employees without regard to any protected characteristic.

#J-18808-Ljbffr

About this listing

Screened by Joboru

This role passed our automated spam and quality filters and was active in our feed when last checked. Joboru is an aggregator — here is how we screen listings. If anything looks off, tell us.