Data Scientist - Internship

Join Cure51, a startup focused on revolutionizing cancer care through advanced data analysis technology. As a Data Scientist intern, you will work under the supervision of a senior data scientist to enable decision-makers to better understand their environment and make informed decisions. Your tasks will include developing an OCR and NER pipeline, creating data guardrails to prevent PII leaks, and designing AI agents to analyze and extract insights from multiple data sources. The position is based in Paris with a hybrid work model.


Job summary
Internship (6 to 12 months)
Paris
A few days at home
Salary: Not specified
Start date: February 8, 2026
Skills and expertise
Metabase
Scala
PostgreSQL
DBT
Slack
Key missions

Develop an OCR and NER pipeline to extract key data from pathology and MRI reports.

Create data guardrails to prevent leaks of personal information in unstructured data.

Design, implement, and optimize AI agents to analyze and extract information from multiple data sources.

Cure51

The position

Job description

Summary

Join us in our ambitious journey at Cure51, a dynamic startup committed to revolutionizing cancer care through advanced data analysis technology.

Under the supervision of a senior Data Scientist, your mission will be to help the company's decision-makers better understand their environment and make the best possible decisions.

Example missions:

  • Develop an OCR and NER pipeline to extract key data points from pathology and MRI reports, including treatment protocols, tumor events and biological parameters.

  • Create data guardrails to prevent PII (personally identifiable information) leaks in unstructured data.

  • Design, implement, and optimize AI agents to analyze and extract insights from multiple data sources.

Stack:

  • Hosting: AWS

  • Decisional: DBT + SQL

  • Analytics: Python

  • Visualization: Metabase

  • Database: PostgreSQL (RDS)

  • Scheduling: Airflow

  • Misc: Jira, Confluence, Google Workspace, Slack

Remote work:

  • 3 days / week in the office (19 rue Richer, 75009 Paris)

  • 2 days of remote work / week

Start date:

  • March 2025 or ASAP

Requirements

Qualifications:

  • Master’s degree in Computer Science or a related field.

  • Work or academic project experience in data science-related fields.

  • Proficient in data science, data modeling, data analytics and SQL.

  • Comfortable working on NLP, Computer Vision and Agent subjects

  • Familiarity with programming languages such as Python or Scala.

  • Comfortable working in English (written and spoken)

  • Rigorous, curious, with a good sense of service

  • Must be legally authorized to work in France (French nationality or valid work visa required)


Selection process

Recruitment process:

  • 1st interview: video call with Guillaume, Data Scientist (1h)

  • 2nd interview: video call with Louis-Baptiste, Head of Engineering (1h)

  • 3rd interview: on site with the COO and 2 people from other teams (1h)
