Site Reliability Engineer Object Storage

Shrnutí práce
Plný úvazek
Paris
Plat: Neuvedeno
Několik dní doma
Dovednosti a odbornost
Písemná a ústní komunikace
React
Kibana
Gitlab
Grpc
+10

Scaleway
Scaleway

Máte zájem o tuto nabídku?

jobs.faq.title

Pozice

Popis pozice

About the Job

The Object Storage team is a cornerstone of Scaleway. Our mission is to provide S3-compatible Object Storage to our clients but also to all the other Scaleway Elements products that rely on it (Instances, Databases, Registry, and more). In a challenging environment, we (and hopefully you soon) manage hundreds of Object Storage servers across various regions, dealing with petabytes of data while ensuring high availability. As a Site Reliability Engineer in our team, you will be responsible for developing, automating, and enhancing Scaleway’s Object Storage solution. On top of your daily activities within the team, you will need to interact with all of Scaleway’s teams, especially Instance, Network, Hardware, and Platform.

Responsibilities

  • Design, implement, and maintain highly available and resilient Object Storage solutions to ensure scalability, availability and performance.
  • Develop automation tools and workflows to streamline provisioning, monitoring, and management of Object Storage infrastructure, ensuring that it scales effectively.
  • React to incident and troubleshooting activities in collaboration with other teams
  • Design technical solutions that address market defined challenges 
  • Present your work during tech meetings
  • Technical Stack

  • Linux (Ubuntu servers)
  • Go, C
  • gRPC, Protobuf
  • PostgreSQL, Redis
  • Vector, ElasticSearch, Kibana
  • VictoriaMetrics, Prometheus, Grafana
  • Ansible
  • GitLab CI/CD, Git
  • HAProxy, ExaBGP
  • Minimum qualifications

  • Strong Linux knowledge
  • Good system-level programming skills
  • Good understanding of C
  • Basic understanding of Go
  • Experience with Git and CI/CD.
  • Proactive mindset with a focus on identifying and addressing issues before they impact scalability and reliability.
  • Great oral and written communication skills
  • Preferred qualifications

  • Experience in designing, implementing, or maintaining storage infrastructure in production environments
  • Experience with (and love for) distributed systems
  • Experience with incident management and on-call support in a production environment.
  • Passion for automation and tooling
  • Infrastructure deployment with Ansible
  • Strong problem-solving skills
  • Experience with the S3 API
  • Logging and monitoring (Vector, VictoriaMetrics, Grafana, …)
  • Able to work efficiently in written English
  • Location

    This position is based in our offices in Paris or Lille (France).

    Recruitment Process  

    Screening call - 30 mins with the recruiter 

    Manager Interview - 45 mins

    Technical Interviews 

    HR Interview - 45 mins

    Head of Interview - 45 mins

    Offer sent

    Chcete se dozvědět více?

    Tato volná pracovní místa by vás mohla zajímat!

    Tyto společnosti rovněž nabírají pracovníky na pozici "{profese}".

    Podívat se na všechny nabídky