Cette offre n’est plus disponible.

Jeune docteur (PhD) - NLP | LLM

Rejoignez notre équipe en tant que Data Scientist au sein de l'équipe Legal Data. Vous travaillerez sur des modèles d'apprentissage automatique pour le traitement de documents juridiques multilingues. Vous aurez l'opportunité de participer à des défis internationaux, d'écrire des articles scientifiques et d'utiliser le superordinateur français Jean Zay pour entraîner nos modèles.

Résumé suggéré par Welcome to the Jungle

Résumé du poste
CDI
Paris
Télétravail fréquent
Salaire : Non spécifié
Expérience : > 3 ans
Éducation : > Bac +5 / Doctorat
Compétences & expertises
Multilingue
TensorFlow
Pytorch
Elasticsearch
PostgreSQL
+4
Missions clés

Créer des modèles d'apprentissage automatique pour le traitement de documents juridiques et les tâches de traitement du langage naturel (NLP).

Travailler avec des modèles de langage de grande taille (LLMs), y compris leur utilisation, leur ajustement, leur évaluation et leur benchmarking.

Collaborer avec l'équipe technique et l'équipe cloud pour mettre en production les modèles.

Jus Mundi
Jus Mundi

Cette offre vous tente ?

Questions et réponses sur l'offre

Le poste

Descriptif du poste

As a Data Scientist, you will work within the Legal Data Team, a multidisciplinary team where Data Scientists and Lawyers work together in creating tools that can analyze and process multilingual legal documents from around the world.

⚡ Your main tasks

  • Creating machine-learning models for processing legal documents and dealing with NLP tasks where:

    • Annotated data is scarce

    • Documents can be multilingual

    • Long and specialized documents

  • Working with LLMs, such as using, fine-tuning, evaluating, benchmarking

  • Creating packages and tools for the Data Science Team

  • Shipping quality tools

  • Working with the Tech and Cloud Team to put into production models

🤖 About our Data Science Team

  • 🥇 We love to participate in international challenges, such as Semeval and TREC

  • 🔬 We write scientific papers for NLP conferences

  • 🚀 We use Jean Zay, the French supercomputer, for training our models

⚒️ Technical stack:

  • Main: Python, FastAPI, Docker, Temporal

  • Databases: PostgreSQL, Elasticsearch, and Neo4J


Profil recherché

📚 Education and experience

  • You have completed a Ph.D. (BAC+8: Doctorat) in Computer Science, Mathematics, or any other related career. You may have obtained your Ph.D. from a French or foreign university.

  • You are looking for your very first job with a permanent contract (Contrat à Durée Indéterminée) since obtaining your Ph.D.

  • You have at least 3 years of experience in Natural Language Processing (NLP) tasks, such as Named Entity Recognition, Text Classification, Entity Linking, Coreference Resolution, or Text Summarization.

  • It is highly appreciated if you have one Post-doctorate or if you have worked on, either a national or international, project with multiple parties involved (e.g. Horizon 2020, Horizon Europe, ANR).

  • Ideally:

    • You have worked with multidisciplinary teams.

    • You have worked with closed or open source LLMs

    • You have worked with NLP models in languages different than English.

    • Experience in having deployed NLP projects using APIs, Docker, or Temporal.

✨ Skills

  • You have an excellent understanding of NLP and key aspects of machine learning.

  • You have good programming skills, including OOP, especially in Python.

  • You have a good understanding of how to create machine learning models and ML frameworks, such as PyTorch and TensorFlow.

  • You have a positive, “can-do” attitude and a desire to learn, help and grow.

  • You can communicate and present Data Science work to a multi-disciplinary team.

  • English is a requirement, French is optional.

Company Perks and Benefits:

  • 😍 Working for a fast-growing global legal tech offering a disruptive product that is revolutionizing the way lawyers around the world interconnect and conduct legal research,

  • 💻 Hybrid working organization, mix between remote and on-site,

  • 💰Competitive salary & equity

  • 🏖 5 weeks of vacation

  • 🍼  Paid parental leave (under specific conditions)

  • 🩺 A great complementary private health insurance (paid 100% for the employee and his children by the company)

  • 🚊50% of public transportation reimbursed.

  • 🍴Personal credit card to buy lunches during the week (Swile)

  • 😍 Every quarter we organize a company-wide summit to

  • 🌍 Travel (work abroad) policy: 8 weeks per year, you can live and work from where you want across the globe,

  • ✈️ Relocation Package (to France)

Envie d’en savoir plus ?