Senior Site Reliability Engineer

Job summary
Permanent contract
Praha
No remote work
Salary: Not specified
Skills & expertise
Cloud & infrastructure
Containerization and orchestration
Programming languages
Incident response
Java
+13
Apply

Pure Storage
Pure Storage

Interested in this job?

Apply
Questions and answers about the job

The position

Job description

THE ROLE

In today's cloud-centric world, the reliability of cloud platforms underpins everything.  Ensuring the heartbeat of these systems is crucial.  At the frontier of cloud technology, Site Reliability Engineering (SRE) works diligently to bolster the availability of our cloud infrastructure and services.  As we pivot to a cloud-first strategy, Pure Storage seeks Site Reliability Engineers, with the ability to play a leading role in our cloud-focused transformation across the broader engineering organization. This team will be passionate about ensuring impeccable uptime, seamless scalability, observability, and unmatched availability.

You will be part of a globally distributed SRE team partnering closely with engineering teams across the US and Europe to architect, automate, and operate services that our customers rely on every second. This role is ideal for someone who wants to influence cloud architecture, drive operational excellence, and build scalable, observable systems from the ground up.

WHAT YOU'LL DO

  • Own the reliability and availability of core cloud services by building robust operational frameworks, proactive monitoring, and scalable automation that directly reduce downtime and improve customer experience.
  • Lead incident response and root cause analysis, ensuring rapid recovery, high-quality follow-ups, and long-term improvements that eliminate recurring issues.
  • Architect and implement automation and Infrastructure-as-Code to streamline deployments, operations, and service management at scale.
  • Partner with product and engineering teams to influence service architecture, embed SRE best practices, and guide the design of highly available cloud-native systems.
  • Develop and evolve observability systems, including metrics, logging, tracing, and actionable alerting to provide deep visibility into system health.
  • We are primarily an in-office environment and therefore, you will be expected to work from the Prague office in compliance with Pure’s policies, unless you are on PTO, or work travel, or other approved leave.

WHAT YOU BRING

  • Strong experience designing, operating, and improving highly available cloud services, including deep understanding of service uptime, SLOs, and production operational excellence.
  • Expertise with public cloud platforms (AWS, Azure, or GCP) and hands-on experience with cloud-native architectures.
  • Proficiency in Infrastructure-as-Code and automation using tools such as Terraform, Ansible, CloudFormation, Puppet, or similar.
  • Practical experience running containerized environments and orchestration systems such as Kubernetes.
  • Ability to build and operate observability stacks (e.g., ELK, Prometheus, OpenTelemetry) and manage on-call processes using tools like PagerDuty.
  • Strong programming skills in languages such as Python, Go, Java, Ruby, or similar.
  • Deep understanding of Linux systems, networking fundamentals, and modern software delivery practices.

#LI-ONSITE

Want to know more?

These job openings might interest you!

These companies are also recruiting for the position of “Cloud Computing and DevOps”.

Apply