Site Reliability Engineer

Job summary
Permanent contract
Paris
Occasional remote work
Salary: Not specified
Experience: > 4 years
Education: Bachelor's / Master's degree
Skills and expertise
Observational skills
Automation tools
Collaboration and teamwork
Grafana
Kubernetes

MakiPeople

The role

Job description

Our reliability vision

Our product is built on top of complex infrastructure: AI systems, enterprise integrations, and large-scale data pipelines. For our customers, downtime or latency is not an option. Reliability is not only about keeping the lights on; it’s about designing systems that scale predictably, self-heal, and deliver enterprise-grade SLAs.

As our first Site Reliability Engineer, you will own the foundations that make Maki stable and scalable. You’ll set the standards for observability, incident response, and automation, ensuring that reliability becomes a core product feature.

What you will do

  • Build resilient infrastructure: Design and maintain scalable, fault-tolerant systems that support our AI-powered HR platform.

  • Improve observability: Implement monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.

  • Automate operations: Develop tools and processes that eliminate manual toil, making deployments, scaling, and incident response smoother and faster.

  • Ensure performance & SLAs: Define and enforce service-level objectives (SLOs) and service-level agreements (SLAs) that meet enterprise expectations.

  • Run incident response: Lead incident management processes; run blameless post-mortems and continuously improve reliability practices.

  • Collaborate with engineering: Work closely with product and AI engineers to embed reliability, scalability, and security into every feature.

  • Stay ahead of growth: Anticipate scaling needs as Maki grows internationally and across enterprise customers.


Requirements

Who we’re looking for

  • Experience: 4+ years in site reliability engineering, DevOps, or infrastructure roles in SaaS or enterprise environments.

  • Technical skills: Proficiency in cloud environments (AWS, GCP, or Azure), container orchestration (Kubernetes, Docker), and infrastructure-as-code (Terraform, Pulumi, etc.).

  • Observability mindset: Experience with monitoring and alerting tools (Prometheus, Grafana, Datadog, etc.).

  • Automation first: Strong scripting/programming skills (Python, Go, or similar) to automate workflows.

  • Mindset: Pragmatic, rigorous, and proactive—you care about reliability as much as innovation.

  • Collaboration: Clear communicator who can work with both engineers and leadership to prioritize reliability.

Bonus points if you:

  • Have scaled systems in a startup or hyper-growth environment.

  • Have worked with AI/ML infrastructure and understand the unique reliability challenges.

  • Have experience running security or compliance-related reliability audits.

  • Are passionate about incident culture—blameless post-mortems, chaos engineering, continuous improvement.


Hiring process

  • Mochi screen – 15 min
    A call with Mochi, our AI recruiter, to check your eligibility and assess your skills in a structured way.

  • Intro call – 30 min
    Intro with Ben (Cofounder & CPO). A first conversation to get to know you and share more context on Maki and the role.

  • Technical case – 90 min
    With our engineering team. You’ll solve a reliability-focused challenge and walk us through your approach.

  • Founder interview – 30 min
    Meet one of our founders to validate culture and values fit.

  • Final wrap-up
    Offer call (and ideally, a celebration!).
