- Shape and maintain the overall deployment architecture, define the reference architectures, participate in design reviews to ensure reliability, operability, and security are built into the core deployment model.
- Support software engineers to build an offline-capable installer and a one-click upgrade/rollback orchestrator using state of the art modern cloud technologies such as immutable & distroless Docker/OCI images, Helm charts, and Kubernetes operators.
- Deliver the air-gap debug system including diagnostics, log/metric capture, auto-redaction, and signed export artifacts so operators can resolve most issues independently.
- Implement self-diagnostics, health checks, and recovery steps so operators can operate quickly without relying on external support.
- Produce operator runbooks, install/upgrade guides, supportability matrices, and clear reference architectures.
- Enforce runtime integrity checks, non-root/distroless images, anti-tamper protections, and per-customer watermarking of delivered artifacts.
-A minimum of 5 years of experience in an SRE job working on a similar infrastructure,
- Strong expertise in Kubernetes operators, Helm, and containerized deployments.
- Proven experience building installation and upgrade frameworks for complex distributed systems. - A knack for making sophisticated systems operator-friendly, even in air-gapped settings.
- Developed several useful tools/automation scripts in Python,
- Participated in a 24x7 on-call rotation,
- A solid understanding of cloud networking, load balancing, and firewall configurations,
- A sense for innovation and implementing changes.
Rencontrez Maud, Talent recruter
Rencontrez Christophe, Dirigeant
Ces entreprises recrutent aussi au poste de “Cloud Computing and DevOps”.