Site Reliability Engineer III (Tue - Sat)

ScreenedHybridJust posted
Belfast, Northern Ireland
Posted 1 day ago
Apply Now

About the role

Site Reliability Engineer III (Tue - Sat)

CME Group is seeking a Site Reliability Engineer III (Tue - Sat) to take a key role in building, operating, and scaling systems in our Markets portfolio. As an SRE III, you will apply your experience to the complex challenges of the CME Globex trading platform, where our systems deliver an exceptional combination of low-latency performance and rock-solid reliability.

Key Responsibilities

  • Own Observability: design, build, and refine monitoring, alerting, and observability solutions; drive continuous improvement of SLIs & SLOs to enable faster issue detection and resolution.
  • Drive Reliability Projects: take ownership of reliability-focused projects from design to implementation, collaborating with product teams to ensure new features are scalable, resilient, and safe.
  • Lead Technical Solutions: lead technical discussions for your work, presenting solution options and proposals with clear trade-offs.
  • Automate Intelligently: proactively identify and eliminate toil through robust automation, improving both system reliability and team velocity.
  • Manage Incidents: take a leading role in incident response, owning the resolution of significant incidents, ensuring rapid system recovery, and driving meaningful action from blameless post‑mortems.
  • Mentor & Coach: act as a technical mentor and point of escalation for L1 and L2 SREs, fostering their growth through code reviews and paired work.
  • Architect for the Future: contribute ideas to the product backlog and play an active role in the architectural design for the migration to Google Cloud Platform (GCP).

What We're Looking For

  • 3‑5+ years of professional experience in a Site Reliability, DevOps, Software, or Systems Engineering role.
  • Strong, hands‑on experience administering and troubleshooting Linux‑based production systems.
  • Proficient programming skills in a language like Python or Go, with a track record of automating complex operational tasks.
  • Proven ability to lead technical initiatives and solve complex problems with a high degree of autonomy.
  • Excellent communication skills, with the ability to articulate complex technical concepts to diverse audiences.
  • A proactive and ownership‑oriented mindset.

Desirable Skills

  • Cloud Platforms: Deep experience with Google Cloud Platform (GCP), especially GCE, GKE, and cloud networking.
  • Monitoring Tools: Expertise in designing and managing monitoring stacks (e.g., Prometheus, Grafana, OpenTelemetry).
  • Distributed Systems: Strong practical knowledge of building and maintaining large‑scale distributed systems.
  • Containerisation: Advanced experience with Kubernetes and Docker in a production environment.
  • Networking: Solid understanding of networking protocols (HTTP, TCP/UDP, IP) and network architecture.
  • Domain Knowledge: Experience in financial markets, low‑latency systems, or with message‑oriented middleware.

Company Benefits

  • Bonus Programme
  • Generous shift allowance
  • Equity Programme
  • Employee Stock Purchase Plan (ESPP)
  • Private Medical and Dental coverage
  • Mental Health Benefit Programme
  • Group Pension Plan
  • Income Protection
  • Life Assurance
  • Cycle To Work
  • EV Car Benefit Scheme
  • Gym Membership
  • Family Leave
  • Education Assistance – MBA/Advanced Degree/Bachelor Degree
  • Ongoing Employee Development Training/Certification
  • Hybrid Working

As an equal‑opportunity employer, we consider all potential employees without regard to any protected characteristic.

#J-18808-Ljbffr

About this listing

Screened by Joboru

This role passed our automated spam and quality filters and was active in our feed when last checked. Joboru is an aggregator — here is how we screen listings. If anything looks off, tell us.