Data Scientist - Internship

Join Cure51, a startup focused on revolutionizing cancer care through advanced data analysis technology. As a Data Scientist intern, you will work under the supervision of a senior data scientist to enable decision-makers to better understand their environment and make informed decisions. Your tasks will include developing an OCR and NER pipeline, creating data guardrails to prevent PII leaks, and designing AI agents to analyze and extract insights from multiple data sources. The position is based in Paris with a hybrid work model.

Résumé suggéré par Welcome to the Jungle

Résumé du poste
Stage(6 à 12 mois)
Paris
Télétravail fréquent
Salaire : Non spécifié
Début : 08 février 2026
Compétences & expertises
Metabase
Scala
PostgreSQL
DBT
Slack
+5
Missions clés

Développer un pipeline OCR et NER pour extraire des données clés des rapports de pathologie et d'IRM.

Créer des garde-fous de données pour prévenir les fuites d'informations personnelles dans les données non structurées.

Concevoir, mettre en œuvre et optimiser des agents d'IA pour analyser et extraire des informations de plusieurs sources de données.

Cure51
Cure51

Cette offre vous tente ?

Questions et réponses sur l'offre

Le poste

Descriptif du poste

Summary

Join us in our ambitious journey at Cure51, a dynamic startup committed to revolutionizing cancer care through advanced data analysis technology.

Under the supervision of Data Scientist senior, your mission will be to enable all the company’s decision-makers to better master their environment and make the best possible decisions.

Example of missions:

  • Develop an OCR and NER pipeline to extract key data points from pathology and MRI reports, including treatment protocols, tumor events and biological parameters.

  • Create a data guardrails to prevent PII (personally identifiable information) leaks in unstructured data.

  • Design, implementation, and optimization of AI agents to analyze and extract insights from multiple data sources

Stack:

  • Hosting: AWS

  • Decisional: DBT + SQL

  • Analytics : Python

  • Visualization : Metabase

  • Database: PostgreSQL (RDS)

  • Scheduling : Airflow

  • Misc: Jira, Confluence, Google workspace, Slack

Remote work:

  • 3 days / week in the office 19 rue Richer, 75009 Paris

  • 2 days of remote work / week

Date to start :

  • March 2025 or ASAP

Profil recherché

Qualifications:

  • Master’s degree in Computer Science or a related field.

  • Work / academic projects experience in data science related fields

  • Proficient in data science, data modeling, data analytics and SQL.

  • Comfortable to work on NLP, Computer Vision and Agent subjects

  • Familiarity with programming languages such as Python, Scala etc.

  • Comfortable to work in English (written & spoken)

  • Rigorous, curious, with a good sense of service

  • Must be legally authorized to work in France (French nationality or valid work visa required)


Déroulement des entretiens

Recruitment process:

  • 1st interview: visio with Guillaume, Data Scientist (1h)

  • 2nd interview: visio with Louis-Baptiste, Head of Engineer (1h)

  • 3nd interview on site with the COO and 2 peoples from other teams (1h)

Envie d’en savoir plus ?

D’autres offres vous correspondent !

Ces entreprises recrutent aussi au poste de “Données/Business Intelligence”.