Head of SRE

Permanent contract
Paris
Fully-remote
Salary: Not specified
Experience: > 7 years
Apply

YOUSIGN (soon-to-be YOUTRUST)
YOUSIGN (soon-to-be YOUTRUST)

Interested in this job?

Apply
Questions and answers about the job

The position

Job description

Position Overview

Yousign is seeking an experienced Head of SRE to lead our Site Reliability Engineering team through a pivotal transformation phase. You will own the technical leadership and people management of a mature 6-person SRE team, driving our infrastructure migration to completion while shaping the future of our Platform organization.

This role is critical as we complete our multi-semester infrastructure migration (90% by H1 2026, 100% by EOY 2026) and transition into a phase focused on resilience, scalability, and Platform Engineering excellence.

About the job

As Head of SRE, you will provide strategic and operational leadership for our Site Reliability Engineering function. You'll be responsible for ensuring the availability, performance, and security of our critical B2B Trust platform serving thousands of customers and millions of users.

Key differentiator: Our infrastructure runs on European sovereign cloud (OVH) rather than hyperscalers (AWS, GCP, Azure). This requires deep expertise in non-managed infrastructure, IaaS, and bare-metal operations - offering unique technical challenges and ownership.

You will manage a team with complementary profiles (Infrastructure and Application SREs), drive our Platform Engineering vision (IDP/DevHub), and act as the technical voice during production incidents and crisis situations.

This is a leadership role requiring both strategic vision (semester roadmap, 1-2 years vision) and hands-on management capabilities (team development, incident leadership, cross-functional collaboration).

Your Team

As Head of SRE, you will lead a team of 6 Site Reliability Engineers with complementary expertise:

  • Infrastructure SREs: Focus on infrastructure lifecycle, networking, capacity planning, and our ongoing infrastructure migration, housing management.

  • Application SREs: Focus on application stack, deployments, observability, and app performance

The team is mature and experienced (10 years of SRE practice at Yousign) with established processes for incident management, on-call, and production operations. However, the team currently lacks dedicated leadership, with SREs reporting directly to the Director of Engineering Platform.

Your challenge: Reconstitute strong SRE leadership to guide the team through the final migration phase, build long-term strategic vision, and empower individuals.

Team culture: The Platform organization values pragmatism over perfection ("Keep it simple"), ownership and autonomy, and business-driven decisions. Technology is a means, not an end. You'll foster a blameless culture focused on continuous improvement and learning from incidents.

Collaboration: You'll work closely with Engineering Managers, Product teams, Security, and Engineering Council stakeholders. The Platform team operates with a Platform-as-a-Product mindset, where product development teams are your users.


Your Missions

Team Management and Leadership (40%)

  • Lead, inspire, and grow a team of 6 SREs with diverse profiles (Infrastructure and Application backgrounds)

  • Create a unified SRE vision while respecting each profile's specificities and career development needs

  • Provide coaching and mentorship, including for senior/staff engineers on technical leadership

  • Manage workload, prioritize and make structured recommendations that will be discussed with Engineering management.

  • Manage external resources: freelancers and specific expertise as needed

  • Embody and transmit Yousign's vision, values, and Operating Principles

Strategic Vision and Planning (25%)

  • Complete infrastructure migration successfully: Drive 90% decommissioning by H1 2026, 100% by EOY 2026

  • Define post-migration strategy: resilience (eliminate SPOFs), scalability (support growth), rationalization (observability overhaul)

  • Drive Platform Engineering vision: IDP/DevHub implementation to reduce cognitive load for product teams

  • Perform long-term capacity planning (1-2 years): project technical, budget, and headcount needs aligned with business growth

  • Participate in infrastructure strategy, architecture decisions, and technology trade-offs

  • Drive sovereign cloud strategy (OVH) and non-hyperscaler expertise

  • Present strategic recommendations to top management with clear business value articulation

Production Management and Crisis Leadership (20%)

  • Ensure SLA & SLO for critical B2B trust signature service (thousands of customers, millions of users)

  • Lead incident management with mature processes (on-call, runbooks, war rooms)

  • Act as credible spokesperson during crises: manage communication with internal teams, Engineering Council, and external customers

  • Establish and execute crisis communication plans maintaining trust and transparency

  • Drive blameless post-mortems and continuous improvement culture

  • Manage on-call organization and team mental load

  • Proactively identify and mitigate risks; strengthen resilience mechanisms

  • Implement disaster recovery and business continuity plans

Platform Engineering and DevOps Culture (15%)

  • Drive Platform-as-a-Product mindset: product teams as customers, focus on reducing their cognitive load

  • Minimize impact on product teams during infrastructure transformations

  • Own Build topics: Automation, CI/CD, technical framing, IDP/DevHub implementation, Developer Experience (self-service, golden paths)

  • Own Run topics: Supervision, monitoring, observability rationalization post-migration

  • Work closely with Engineering Managers to address cross-team needs

  • Diffuse DevSecOps/SRE culture within Engineering teams

  • Lead cross-functional initiatives to improve platform efficiency and reliability


Who are you ?

Infrastructure and Production Expertise

  • Non-hyperscaler infrastructure: Proven experience with sovereign cloud (OVH, Scaleway) or traditional hosting—not just AWS/GCP/Azure with managed services

  • Critical production management: Real experience managing major incidents with external customer and stakeholder communication during crises

Leadership and Management

  • Confirmed management experience: Minimum 3-5 years managing technical teams (5+ people)

  • SRE transformation experience: Successfully led SRE transformation in a scale-up environment

  • Diverse profile management: Experience managing teams with different technical backgrounds (Infrastructure + Application, or similar hybrid profiles)

Strategic Vision and Communication

  • Long-term vision: Ability to project over 6-12 months on technical evolution, budget, and people planning

  • Crisis communication: Credible spokesperson in high-pressure situations with internal and external stakeholders

  • Energy and dynamism: Engaging communication style that inspires and motivates teams

Mindset and Culture Fit

  • Pragmatism over dogma: "Keep it simple" mindset-technology as a means, not an end

  • Trade-off thinking: Makes pragmatic decisions based on business context, not only theoretical best practices

  • Systemic improvement culture: Focuses on long-term solutions rather than "hero mode"

Recruitment process

1 Interview TAM with Guillhem, Talent Acquisition Manager – 30 min
2 Interview with Kevin, Engineering Director – deep dive into your experience – 45 min
3 Case study presentation – showcase your strategic approach with Nicolas (CTO) et Kevin (Engineering Director) – 1 hour

Want to know more?

These job openings might interest you!

These companies are also recruiting for the position of “Cloud computing et DevOps”.

See all job openings
Apply