L'envoi d'un CV est-il obligatoire pour postuler à cette offre ?

Pour postuler à cette offre, l'envoi de votre CV est obligatoire.

Le télétravail est-il possible pour ce poste ?

Le télétravail occasionnel est autorisé pour ce poste.

Quel est le type de contrat pour ce poste ?

Le contrat pour ce poste est de type {contract_type}.

Une lettre de motivation est-elle obligatoire pour postuler à cette offre ?

Non, la lettre de motivation n'est pas nécessaire pour postuler à cette offre.

MLOps Infrastructure Engineer - Sigma Nova

Sigma Nova

MLOps Infrastructure Engineer

Join our team as an MLOps Infrastructure Engineer, where you'll design and deploy a high-performance platform for distributed machine learning. You'll work with cloud and Kubernetes architecture, develop internal tools for MLOps, and implement DevOps best practices. This role requires 3-4 years of experience in cloud infrastructure, DevOps, or MLOps, as well as proficiency in Kubernetes, cloud GPU management, Python, and CI/CD. Bonus skills include low-level optimization, backend/API experience, and designing partner-facing tools.

Résumé suggéré par Welcome to the Jungle

CDI

Paris

Télétravail occasionnel

Salaire : Non spécifié

Expérience : > 5 ans

Éducation : Bac +5 / Master

Missions clés

Concevoir et déployer une plateforme pour rendre les GPU, les clusters et l'entraînement distribué transparents.

Développer et améliorer l'orchestrateur interne pour simplifier l'entraînement distribué.

Mettre en œuvre l'Infrastructure-as-Code (Terraform/Pulumi) pour la reproductibilité et l'évolutivité.

il y a 7 jours

Sigma Nova

Cette offre vous tente ?

Questions et réponses sur l'offre

Le poste

Descriptif du poste

The Challenge: Build the Platform That Powers Research and Beyond

Your mission: Design and deploy the platform that makes GPUs, clusters, and distributed training transparent, not just for internal research, but also as a foundation for monetizable capabilities (e.g., managed training services, optimised inference pipelines for partners).

What You’ll Do

Cloud & Kubernetes Architecture:
- Build and maintain a high-performance, multi-tenant environment on Scaleway and GENCI, optimised for distributed ML.
- Deploy and supervise a Slurm cluster for research workload, ensuring seamless integration with Scaleway’s infrastructure.
- Automate scaling, resource allocation, and cost management to avoid technical debt.
MLOps & Internal Tools:
- Develop and enhance our internal orchestrator to simplify distributed training (FSDP, data pipelines) for both researchers and external users.
- Create reusable frameworks for monitoring, logging, efficiency, and cost tracking.
- Collaborate with research teams to industrialise workflows (e.g., model alignment, large-scale finetuning) and package them as deployable capabilities.
DevOps & Software Craftsmanship:
- Implement Infrastructure-as-Code (Terraform/Pulumi) for reproducibility and scalability.
- Write clean, typed, and documented Python code
- Troubleshoot at the intersection of hardware (GPUs, networking) and software (PyTorch, CUDA), ensuring robustness for both internal and external use cases.

Profil recherché

Key Skills

Experience: 3–4 years in cloud infrastructure, DevOps, or MLOps (research or industry).
Technologies:
- Kubernetes/Docker: Advanced orchestration and containerization.
- Cloud GPU Management: Scaleway, AWS/GCP (clusters, networking, storage).
- Python: Proficiency in PEP standards, typing, and testing.
- MLOps: Data pipelines, distributed training (PyTorch, FSDP), monitoring.
- CI/CD: Pipeline setup and maintenance.
- Fluent English (the team speaks English in the day-to-day)

Bonus Skills

Low-level optimisation (Triton, CUDA), HPC, or large-scale training experience.
Backend/APIs (FastAPI, gRPC) for exposing models or services.
Experience designing partner-facing tools or managed services.

Beyond Technical Skills:

While technical excellence is critical, we place equal importance on how we work together. We believe the best teams are built on:

Integrity & Respect
- We are striving for honesty, kindness, and fairness. We value people who treat others with dignity and foster an environment where everyone feels heard.
Open Communication & Humility
- Great ideas come from collaboration. We look for teammates who listen actively, communicate clearly, and approach challenges with self-awareness and humility.
Psychological Safety & Camaraderie
- We strive to create a space where people feel safe to take risks, ask questions, and grow.

Déroulement des entretiens

Prescreen with Paul (Head of People)
Technical Screen with one Research Scientist or Research Engineer
On-site (Take-home exercise and restitution OR On site live interviews + Behavioural interview)

Envie d’en savoir plus ?

Rencontrez Paul, Head of Talent Acquisition

Découvrez l'entreprise

Explorez la vitrine de l’entreprise ou suivez-la pour savoir si elle vous correspond vraiment !

Explorer l’entreprise

Ils sont sociables

L'entreprise

Sigma Nova

Intelligence artificielle / Machine Learning

16 collaborateurs

Qui sont-ils ?

Our Vision: Interdisciplinary AI for Complex Domains

We believe the future of AI isn’t just about scaling compute or scraping more internet data. The most impactful breakthroughs will come from domains where:

Data is complex and multimodal (think brain signals, molecular structures, industry machinery sensor data, astronomical observations)
Data is scarce or hard to collect (highly regulated industries, expensive instrumentation)
Impact is enormous (healthcare, climate, materials science, robotics)

These domains share common challenges from research, data, and technical standpoints. We’re building reusable blueprints and foundational building blocks that can transfer across scientific fields—what we learn from brain modelling can inform physics, chemistry, or beyond.

Starting with the Brain

Our first proving ground is AI for neuroscience. We’re encouraged by early partnerships with research labs and our team’s performance in the EEG NeurIPS Challenge December 2025), where we achieved beyond state-of-the-art results.

The challenge we’re tackling is urgent: Brain illness and neurodegenerative disorders represent one of the most threatening health crises of our era. Applying frontier AI to complex, multimodal brain datasets could unlock transformative clinical applications.

But we’re also investing in theoretical foundations. Our team is publishing in top-tier ML conferences, ex:

This dual focus—rigorous science and applied impact—defines our approach.

Les avantages salariés

Horaires de travail flexibles
Entre 1-2 jours de télétravail
Collations à volonté
RTT / Jour de repos
Participation
Retraite complémentaire

Voir tous les avantages

Le lieu de travail

Rue de Mogador, 75009 Paris, France

Besoin de plus d’infos ?

Vie d’entreprise, ambiance, réalisations... On a encore plein de choses à vous dire !

Découvrir

D’autres offres vous correspondent !

Ces entreprises recrutent aussi au poste de “Data / Business Intelligence”.

Expert Data Management (H/F/NB)
Keyrus
CDI
Levallois-Perret
Télétravail fréquent
Intelligence artificielle / Machine Learning, IT / Digital
3 000 collaborateurs
il y a 9 heures
Data Steward - Lille
skiils
CDI
Suresnes
Télétravail fréquent
Salaire : 50K à 60K €
Intelligence artificielle / Machine Learning, Transformation
150 collaborateurs
il y a 13 heures
Candidature spontanée - Theodo Data & AI
Theodo Data & AI
CDI
Paris
Télétravail non autorisé
Intelligence artificielle / Machine Learning, IT / Digital
70 collaborateurs
il y a 14 heures
Data Engineer
Sekoia.io
CDI
Paris
Télétravail fréquent
Logiciels, Intelligence artificielle / Machine Learning
140 collaborateurs
il y a 15 heures
Data Scientist x Software Engineer - Intermediate
Metroscope
CDI
Paris
Télétravail fréquent
Salaire : 55K à 65K €
Logiciels, Intelligence artificielle / Machine Learning
55 collaborateurs
il y a 15 heures
Lead MLOps
QuantCube Technology
CDI
Paris
Télétravail fréquent
Salaire : ≥ 65K €
Intelligence artificielle / Machine Learning, FinTech / InsurTech
78 collaborateurs
il y a 16 heures