L'envoi d'un CV est-il obligatoire pour postuler à cette offre ?

Pour postuler à cette offre, l'envoi de votre CV est obligatoire.

Le télétravail est-il possible pour ce poste ?

Le télétravail occasionnel est autorisé pour ce poste.

Quel est le type de contrat pour ce poste ?

Le contrat pour ce poste est de type {contract_type}.

Une lettre de motivation est-elle obligatoire pour postuler à cette offre ?

La lettre de motivation est optionnelle pour postuler à cette offre.

ML Performance Engineer - Sigma Nova

Sigma Nova

ML Performance Engineer

CDI

Paris

Télétravail occasionnel

Salaire : Non spécifié

Expérience : > 3 ans

Éducation : Bac +5 / Master

il y a 13 heures

Sigma Nova

Cette offre vous tente ?

Questions et réponses sur l'offre

Le poste

Descriptif du poste

We are looking for a Performance ML Engineer to optimise the efficiency and cost of our machine learning models, preparing them for large-scale deployment.

Currently, we operate with 8x H100 GPUs in a single-node configuration, which serves our immediate research and development needs, but we anticipate scaling up to clusters of 64+ H100 in the coming years, with compute needs reaching millions of GPU hours annually.

Your mission: Maximise model performance (latency, throughput, cost per inference) through advanced optimisation techniques (quantisation, distillation, CUDA/Triton kernels, GPU profiling), while preparing our architecture for future scalability.

What You’ll Do

Designing systems for distributed training and inference:
- Build scalable distributed training pipelines
- Debug foundational model trainings
- Oversee and scale up GPU clusters
Optimisation:
- Profile GPU usage and bottlenecks
- Optimise models, implementing techniques such as quantisation, distillation or kernel improvements
Collaboration:
- Work with the R&D team to integrate optimisations into production pipelines.
- Document benchmarks and performance gains (latency, cost, accuracy).
- Stay up to date on new architectures (e.g., H200, TPU).

Profil recherché

Required Skills

Model Optimisation Experience: Quantisation, distillation, or kernel optimisation (CUDA/Triton).
- PyTorch Mastery: Deep understanding of internals (memory, attention, custom ops).
GPU Profiling: Experience with Nsight Systems/Compute or similar tools (e.g., PyTorch Profiler).
Distributed training: DDP, FSDP.
Mixed Precision: FP16/BF16, loss scaling, and troubleshooting.
Problem-Solving: Ability to diagnose performance issues and propose innovative solutions.
Fluent English (the team speaks English in the day-to-day)

Bonus Skills

Experience with Triton or custom CUDA kernels.
Knowledge of ML compilers (e.g., Apache TVM, TensorRT).
Experience with large-scale GPU clusters (even modest ones).
Open-source contributions or publications on model optimisation.
Experience with one or more of these domains: neurology, multi-modality, (conditional) generation or interpretability.

Beyond Technical Skills:

While technical excellence is critical, we place equal importance on how we work together. We believe the best teams are built on:

Integrity & Respect
- We are striving for honesty, kindness, and fairness. We value people who treat others with dignity and foster an environment where everyone feels heard.
Open Communication & Humility
- Great ideas come from collaboration. We look for teammates who listen actively, communicate clearly, and approach challenges with self-awareness and humility.
Psychological Safety & Camaraderie
- We strive to create a space where people feel safe to take risks, ask questions, and grow.

Déroulement des entretiens

Prescreen with Paul (Head of People)
Technical Screen with one Research Engineer
On-site (Take-home exercise and restitution OR onsite interviews + Behavioural interview)

Envie d’en savoir plus ?

Rencontrez Paul, Head of Talent Acquisition

Découvrez l'entreprise

Explorez la vitrine de l’entreprise ou suivez-la pour savoir si elle vous correspond vraiment !

Explorer l’entreprise

Ils sont sociables

L'entreprise

Sigma Nova

Intelligence artificielle / Machine Learning

16 collaborateurs

Qui sont-ils ?

Our Vision: Interdisciplinary AI for Complex Domains

We believe the future of AI isn’t just about scaling compute or scraping more internet data. The most impactful breakthroughs will come from domains where:

Data is complex and multimodal (think brain signals, molecular structures, industry machinery sensor data, astronomical observations)
Data is scarce or hard to collect (highly regulated industries, expensive instrumentation)
Impact is enormous (healthcare, climate, materials science, robotics)

These domains share common challenges from research, data, and technical standpoints. We’re building reusable blueprints and foundational building blocks that can transfer across scientific fields—what we learn from brain modelling can inform physics, chemistry, or beyond.

Starting with the Brain

Our first proving ground is AI for neuroscience. We’re encouraged by early partnerships with research labs and our team’s performance in the EEG NeurIPS Challenge December 2025), where we achieved beyond state-of-the-art results.

The challenge we’re tackling is urgent: Brain illness and neurodegenerative disorders represent one of the most threatening health crises of our era. Applying frontier AI to complex, multimodal brain datasets could unlock transformative clinical applications.

But we’re also investing in theoretical foundations. Our team is publishing in top-tier ML conferences, ex:

This dual focus—rigorous science and applied impact—defines our approach.

Les avantages salariés

Horaires de travail flexibles
Entre 1-2 jours de télétravail
Collations à volonté
RTT / Jour de repos
Participation
Retraite complémentaire

Voir tous les avantages

Le lieu de travail

Rue de Mogador, 75009 Paris, France

Besoin de plus d’infos ?

Vie d’entreprise, ambiance, réalisations... On a encore plein de choses à vous dire !

Découvrir

D’autres offres vous correspondent !

Ces entreprises recrutent aussi au poste de “Data / Business Intelligence”.

Data Scientist - IA Générative & Agents IA
MP DATA
CDI
Boulogne-Billancourt
Télétravail fréquent
Intelligence artificielle / Machine Learning, IT / Digital
150 collaborateurs
il y a 11 heures
Senior Machine Learning Engineer
Aive
CDI
Paris
Télétravail fréquent
Salaire : 55K à 75K €
Intelligence artificielle / Machine Learning, SaaS / Cloud Services
33 collaborateurs
il y a 12 heures
MLOps Infrastructure Engineer
Sigma Nova
CDI
Paris
Télétravail occasionnel
Intelligence artificielle / Machine Learning
16 collaborateurs
il y a 13 heures
Data engineer Pyspark / Databricks
Visian
CDI
Courbevoie
Télétravail occasionnel
Salaire : < 63K €
Intelligence artificielle / Machine Learning, IT / Digital
200 collaborateurs
il y a 15 heures
Marketing & Data Strategy - Associate Manager
JAKALA
CDI
Paris
Télétravail occasionnel
Intelligence artificielle / Machine Learning, Digital Marketing / Data Marketing
250 collaborateurs
il y a 18 heures
Senior ML Engineer - F/H
ChapsVision
CDI
Paris
Télétravail fréquent
Intelligence artificielle / Machine Learning, Big Data
1 200 collaborateurs
hier