Site Reliability Engineer

Job summary
Permanent contract
Paris
Occasional remote
Salary: Not specified
Experience: > 4 years
Education: Master's Degree
Skills & expertise
Observation skills
Automation tools
Collaboration and teamwork
Python
Kubernetes
+8

MakiPeople
MakiPeople

Interested in this job?

Questions and answers about the job

The position

Job description

Our reliability vision

Our product is built on top of complex infrastructure: AI systems, enterprise integrations, and large-scale data pipelines. For our customers, downtime or latency is not an option. Reliability is not only about keeping the lights on; it’s about designing systems that scale predictably, self-heal, and deliver enterprise-grade SLAs.

As our first Site reliability engineer, you will own the foundations that make Maki stable and scalable. You’ll set the standards for observability, incident response, and automation, ensuring that reliability becomes a core product feature.

What you will do

  • Build resilient infrastructure: Design and maintain scalable, fault-tolerant systems that support our AI-powered HR platform.

  • Improve observability: Implement monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.

  • Automate operations: Develop tools and processes to eliminate manual toil—making deployments, scaling, and incident response smoother and faster.

  • Ensure performance & SLAs: Define and enforce service-level objectives (SLOs) and service-level agreements (SLAs) that meet enterprise expectations.

  • Run incident response: Lead incident management processes; run blameless post-mortems and continuously improve reliability practices.

  • Collaborate with engineering: Work closely with product and AI engineers to embed reliability, scalability, and security into every feature.

  • Stay ahead of growth: Anticipate scaling needs as Maki grows internationally and across enterprise customers.


Preferred experience

Who we’re looking for

  • Experience: 4+ years in site reliability engineering, DevOps, or infrastructure roles in SaaS or enterprise environments.

  • Technical skills: Proficiency in cloud environments (AWS, GCP, or Azure), container orchestration (Kubernetes, Docker), and infrastructure-as-code (Terraform, Pulumi, etc.).

  • Observability mindset: Experience with monitoring and alerting tools (Prometheus, Grafana, Datadog, etc.).

  • Automation first: Strong scripting/programming skills (Python, Go, or similar) to automate workflows.

  • Mindset: Pragmatic, rigorous, and proactive—you care about reliability as much as innovation.

  • Collaboration: Clear communicator who can work with both engineers and leadership to prioritize reliability.

Bonus points if you:

  • Have scaled systems in a startup or hyper-growth environment.

  • Have worked with AI/ML infrastructure and understand the unique reliability challenges.

  • Have experience running security or compliance-related reliability audits.

  • Are passionate about incident culture—blameless post-mortems, chaos engineering, continuous improvement.


Recruitment process

  • Mochi screen - 15 min
    A call with Mochi, our AI recruiter, to check your eligibility criteria and unveil your skills in a structured way.

  • Intro call – 30 min
    Intro with Ben (Cofounder & CPO). A first conversation to get to know you and share more context on Maki and the role.

  • Technical case – 90 min
    With our engineering team. You’ll solve a reliability-focused challenge and walk us through your approach.

  • Founder interview – 30 min
    Meet one of our founders to validate culture and values fit.

  • Final wrap-up
    Offer call (and ideally, a celebration!).

Want to know more?

These job openings might interest you!

These companies are also recruiting for the position of “Cloud Computing and DevOps”.

  • Signaturit Group

    SysOps Engineer*

    Signaturit Group
    Signaturit Group
    Permanent contract
    Puteaux
    A few days at home
    Software, Artificial Intelligence / Machine Learning
    400 employees

  • Sekoia.io

    Site Reliability Engineer

    Sekoia.io
    Sekoia.io
    Permanent contract
    Rennes, Paris
    Fully-remote
    Software, Artificial Intelligence / Machine Learning
    110 employees

  • Gireve

    Lead DevOps Engineer F/H

    Gireve
    Gireve
    Permanent contract
    Sèvres
    A few days at home
    Salary: €65K to 72K
    Software, Environment / Sustainable Development
    66 employees

  • Lenstra

    Senior Cloud Engineer

    Lenstra
    Lenstra
    Permanent contract
    Paris
    A few days at home
    Software, Artificial Intelligence / Machine Learning
    30 employees

  • Prismic

    Service Reliability Engineer

    Prismic
    Prismic
    Permanent contract
    Paris
    Fully-remote
    Software, SaaS / Cloud Services
    70 employees

  • amo

    Lead Site Reliability Engineer (SRE)

    amo
    amo
    Permanent contract
    Paris
    No remote work
    Mobile Apps, Software
    40 employees

See all job openings