Is it mandatory to attach a resume?

Yes, you are required to attach your resume to apply for this job.

Does this position allow remote work?

Remote work is allowed for this position.

What type of contract is offered for this position?

This position is offered as a permanent contract.

Is a cover letter mandatory to apply for this position?

A cover letter is mandatory to apply for this position.

AI Data Engineer M/F – CAST – Permanent contract in Meudon

This position is no longer available.

CAST

AI Data Engineer M/F

Permanent contract

Meudon

A few days at home

Salary: Not specified

Experience: > 3 years

Education: Master's Degree

3 months ago

The position

Job description

Context

At CAST, the world leader in Software Intelligence, we are building the foundation to ground AI with AAA data (Aggregated, Accurate, and Augmented) sourced from real-world software and technology projects.

We go beyond manual curation: this role is about using AI to empower AI.
You will design intelligent pipelines leveraging LLMs, embeddings, and NLP tools to clean, enrich, and validate data, ensuring that AI systems and autonomous agents can rely on it for training, reasoning, and contextual understanding.

Your Mission

As a Data Engineer specialized in AI Enablement, you will be responsible for building robust, intelligent, and traceable data pipelines that power AI models and agents with high-quality, semantically rich information.

Your Responsibilities:

Aggregate and structure data from diverse software ecosystems (codebases, APIs, tickets, documentation, architecture specs).
Apply LLMs, embeddings, and NLP techniques to automate data cleaning, entity extraction, metadata tagging, and semantic annotation.
Build and maintain semantic data pipelines for LLM fine-tuning and Retrieval-Augmented Generation (RAG).
Organize datasets for Agent-to-Agent (A2A) interactions using APIs, vector databases, and knowledge graphs.
Collaborate with AI research and engineering teams to evolve schemas, prompts, labeling strategies, and evaluation datasets.
Ensure data lineage, reproducibility, and version control across all workflows.

Preferred experience

Your Profile

We’re looking for a hands-on Data Engineer who understands both the rigor of data pipelines and the creativity of AI enablement.
You’re analytical, curious, and passionate about leveraging AI to make data smarter.

Core Qualifications

Degree from a leading engineering school (Grande École) or equivalent university program.
3+ years of experience in data engineering, ML data operations, or structured data curation.
Proficiency in Python and data pipeline tools (Pandas, PyArrow, regex, Airflow).
Experience with LLM or NLP frameworks (Hugging Face, spaCy, LangChain).
Ability to use AI to clean, enrich, classify, and organize technical or unstructured content.
Strong understanding of tokenization, chunking, and model input preparation.
Experience working with software project data (Git repositories, APIs, documentation).

Bonus Skills

Knowledge of vector databases (FAISS, Qdrant, Weaviate) or knowledge graphs (Neo4j, RDF, SPARQL).
Exposure to agentic AI or autonomous AI frameworks (LangChain Agents, AutoGPT, OpenAgents).
Experience with RAG architectures, LLMOps, or prompt pipelines.
Background in software engineering or technical documentation.

Recruitment process

Our recruitment process consists of three steps:

Initial interview with our HR team.
Discussion with Guillaume, our Product Management Director, and Christophe, our R&D Director.
Final meeting to share our decision and next steps.
With us, the recruitment process moves quickly and efficiently!

Why Join Us?

Be part of a global AI innovation hub shaping the next generation of Software Intelligence.
Work at the intersection of data, AI, and software engineering, with real-world impact.
Collaborate with top AI experts and contribute to groundbreaking initiatives in AI enablement and automation.

Want to know more?

Meet Émile, Senior Software Engineer

The company

CAST

Software, SaaS / Cloud Services

350 employees

Founded in 1990

Revenue: €54M

Who are they?

CAST is the leader in Software Intelligence: everything you need to know about how complex software systems work under the hood. For over 25 years, we’ve been building advanced tools that automatically analyze software architecture and code, helping tech teams master complexity, reduce technical debt, and accelerate modernization.

With deep R&D rooted in France and a solid global footprint, CAST serves hundreds of major organizations across all industries (50% USA, 40% Europe, 10% India & China).

If you’re passionate about semantic code analysis, software architecture, and building tools that make the invisible visible, CAST is definitely a good place to grow, innovate, and have a real impact.

The workplace

3 Rue Marcel Allégot, 92190 Meudon, France
