Senior Site Reliability Engineer

Job summary
Permanent contract
Paris, Rennes
Fully-remote
Salary: Not specified
Experience: > 5 years
Skills & expertise
Innovation
Grafana
Kubernetes
Ansible
Harbor
+14

Sekoia.io
Sekoia.io

Interested in this job?

Questions and answers about the job

The position

Job description

We’re creating a self-hosted SaaS platform designed for the most critical, regulated and air-gapped environments. As a Senior SRE, you will be the owner of the installer, updater, and debug toolkit — the solution that make it deployable, upgradable, and operable without outside help.

📍 The position is available in Rennes, Paris or fully remote in Europe.

Your missions :

  • Shape and maintain the overall deployment architecture, define the reference architectures, participate in design reviews to ensure reliability, operability, and security are built into the core deployment model.

  • Support software engineers in building an offline-capable installer and a one-click upgrade/rollback orchestrator using state-of-the-art modern cloud technologies, such as immutable & distroless Docker/OCI images, Helm charts, and Kubernetes operators.

  • Deliver the air-gap debug system, including diagnostics, log/metric capture, auto-redaction, and signed export artifacts so operators can resolve most issues independently.

  • Implement self-diagnostics, health checks, and recovery steps to enable operators to respond quickly without relying on external support.

  • Produce operator runbooks, install/upgrade guides, supportability matrices, and clear reference architectures.

  • Enforce runtime integrity checks, non-root/distroless images, anti-tamper protections, and per-customer watermarking of delivered artifacts.

Our technical stack ⚙️:

  • Kubernetes: k3s, Cilium, Ceph, ArgoCD, Helm

  • Observability: Thanos, Prometheus, Grafana, Loki

  • Tools: Python, Ansible, SaltStack, Terraform

  • Databases: Kafka, Clickhouse, Redis, KeyDB, PostgreSQL, ArangoDB, Quickwit

  • CI/CD: GitHub Actions, Harbor


Preferred experience

🤩 We are excited to meet you if :

  • A minimum of 5 years of experience in an SRE job working on a similar infrastructure,

  • Strong expertise in Kubernetes operators, Helm, and containerized deployments.

  • Proven experience building installation and upgrade frameworks for complex distributed systems.

  • A knack for making sophisticated systems operator-friendly, even in air-gapped settings.

  • Developed several useful tools/automation scripts in Python,

  • Participated in a 24x7 on-call rotation,

  • A solid understanding of cloud networking, load balancing, and firewall configurations,

  • A sense for innovation and implementing changes.


Recruitment process

📝 Here’s what’s in store for you if you apply :

  1. HR Interview with Clémentine, Talent Acquisition Manager (30’)

  2. Use case to do at home (60’)

  3. Skills fit interview with 2 SREs (60’)

  4. Final N+2 Interview with Georges, CTPO, and Léo, Head of Infrastructure (60’)

Our process usually takes about 3 weeks, depending on availability. The process includes reference calls. The program: discussions rather than trick questions! These discussions will help you understandhow Sekoia.io works and what it stands for. But they are also (and above all) an opportunity for you to tell us about your career path and your expectations for your next job!

Sekoia.io is an equal opportunity employer for any minority, disability, gender identity, or sexual orientation. We are committed to hiring and supporting diverse teams of people from all backgrounds, experiences, and perspectives.

Want to know more?

These job openings might interest you!

These companies are also recruiting for the position of “Cloud Computing and DevOps”.

See all job openings