We’re building foundational models for scientific signals, starting with the brain (EEG, fMRI, ultrasound), and we aim to extend these approaches to other complex domains.
We work with temporal signals, collected under real-world constraints (e.g., clinical settings with limited sample sizes): our data is noisy and heterogeneous.
As the architect of our “Data & Training Factory,” you’ll:
Scale Pre-training to New Frontiers
Implement and supervise large-scale pre-training runs on cutting-edge architectures (Transformers, SSMs/Mamba, long-context windows).
Ensure convergence and performance across distributed resources (Scaleway/GENCI).
Practice “Deep Debugging”: Diagnose why models converge—or fail—at a mathematical and technical level.
Finetune pretrained models on both open-source and proprietary data
Engineer Multimodal Data Pipelines
Build robust pipelines to aggregate, align, and clean heterogeneous datasets (EEG/fMRI).
Tackle the complexity of signal normalisation, temporal/spatial alignment, and data versioning.
Champion Data-Centric AI: Ensure every byte feeding our models is pristine and traceable.
Uphold Scientific Rigor & Software Standards
Establish internal leaderboards and data tests to validate progress against the SOTA.
Write production-ready research code: Typed, tested, and documented by default.
Collaborate with the team on research papers.
3–7 years of experience in applied research or R&D.
Distributed Training: Proven track record with large-scale model training.
PyTorch Mastery: Deep knowledge of internals (memory, kernels, attention mechanisms).
Data Engineering: Robust pipelines, versioning, and data quality.
Scientific Rigor: Experience in research environments with high software standards.
Multimodal Data: EEG, fMRI, time-series, or sensor signals.
Low-Level Optimisation: FlashAttention, CUDA kernels, or custom hardware.
Cloud Infrastructure: Scaleway or similar GPU cloud environments.
Research Publications: NeurIPS, ICML, ICLR, or equivalent.
Data-Centric AI: Curation, alignment, and cleaning of heterogeneous datasets.
While technical excellence is critical, we place equal importance on how we work together. We believe the best teams are built on:
Integrity & Respect
Open Communication & Humility
Psychological Safety & Camaraderie
Prescreen with Paul (Head of People)
Technical Screen with one Research Scientist or Research Engineer
On-site (Take-home exercise and restitution + Behavioural interview)
Rencontrez Paul, Head of Talent Acquisition
Ces entreprises recrutent aussi au poste de “Engineering R&D”.