Join Cure51, a startup focused on revolutionizing cancer care through advanced data analysis technology. As a Data Scientist intern, you will work under the supervision of a senior data scientist to enable decision-makers to better understand their environment and make informed decisions. Your tasks will include developing an OCR and NER pipeline, creating data guardrails to prevent PII leaks, and designing AI agents to analyze and extract insights from multiple data sources. The position is based in Paris with a hybrid work model.
Résumé suggéré par Welcome to the Jungle
Développer un pipeline OCR et NER pour extraire des données clés des rapports de pathologie et d'IRM.
Créer des garde-fous de données pour prévenir les fuites d'informations personnelles dans les données non structurées.
Concevoir, mettre en œuvre et optimiser des agents d'IA pour analyser et extraire des informations de plusieurs sources de données.
Join us in our ambitious journey at Cure51, a dynamic startup committed to revolutionizing cancer care through advanced data analysis technology.
Under the supervision of Data Scientist senior, your mission will be to enable all the company’s decision-makers to better master their environment and make the best possible decisions.
Example of missions:
Develop an OCR and NER pipeline to extract key data points from pathology and MRI reports, including treatment protocols, tumor events and biological parameters.
Create a data guardrails to prevent PII (personally identifiable information) leaks in unstructured data.
Design, implementation, and optimization of AI agents to analyze and extract insights from multiple data sources
Stack:
Hosting: AWS
Decisional: DBT + SQL
Analytics : Python
Visualization : Metabase
Database: PostgreSQL (RDS)
Scheduling : Airflow
Misc: Jira, Confluence, Google workspace, Slack
Remote work:
3 days / week in the office 19 rue Richer, 75009 Paris
2 days of remote work / week
Date to start :
Qualifications:
Master’s degree in Computer Science or a related field.
Work / academic projects experience in data science related fields
Proficient in data science, data modeling, data analytics and SQL.
Comfortable to work on NLP, Computer Vision and Agent subjects
Familiarity with programming languages such as Python, Scala etc.
Comfortable to work in English (written & spoken)
Rigorous, curious, with a good sense of service
Must be legally authorized to work in France (French nationality or valid work visa required)
Recruitment process:
1st interview: visio with Guillaume, Data Scientist (1h)
2nd interview: visio with Louis-Baptiste, Head of Engineer (1h)
3nd interview on site with the COO and 2 peoples from other teams (1h)
Rencontrez Maria, Clinical network manager
Rencontrez Clarisse, Développeuse Fullstack
Ces entreprises recrutent aussi au poste de “Données/Business Intelligence”.