Senior Data Engineer @ Entalpic

Company: Breega
Location: Paris 92210

We are a dedicated team at the forefront of AI and chemistry, working to accelerate the energy transition.

Our focus is on discovering new chemicals and materials that enable more sustainable practices in sectors with urgent decarbonization needs.

Specifically, we are developing a modern generative AI platform to discover new catalysts that optimize chemical reactions, significantly reduce CO₂ emissions, and help transform carbon-intensive industries.

As an early-stage, AI-driven startup with over €5M in funding, our approach is grounded in state-of-the-art academic research, with a strong focus on simplicity, clarity, and constant optimization.

Join Entalpic to be part of a passionate, fast-growing team united by the belief that technology can drive meaningful impact toward a more sustainable future.

Entalpic is committed to equal opportunity employment and a diverse, inclusive workplace.

We encourage applications from all backgrounds—even if you don’t meet every requirement.

If you’re passionate about our mission and think you can contribute, we want to hear from you.

Reporting & Job Location

You will report to the CTO of Entalpic and be based in our Paris office.

Mission Highlights

As a key team member, you will contribute to two main areas:

Data Infrastructure Development

Design, build, and maintain scalable data infrastructure to integrate diverse data sources (text, simulations, experiments) in support of ML and LLM applications.

Lead the development of internal tools to enable efficient, AI-enhanced access to data and promote a data-centric culture across the organization.

Role & Responsibilities

Data Engineering:

Build and optimize scalable data pipelines for simulation (e.g. DFT), textual (e.g. patents, papers), and experimental data (e.g. time series, imagery).

Data Storage Solutions:

Implement and manage secure, scalable data storage systems supporting analytics and ML workflows.

Automation and Scripting:

Create tools and scripts to automate data ingestion, transformation, and processing.

Data Governance and Lineage:

Establish policies for data quality, lineage tracking, and regulatory compliance.

Infrastructure Support:

Work closely with DevOps to integrate solutions with system architecture (AWS/GCP).

Collaboration and Support:

Partner with scientists and experts to meet data needs and enable data-driven decisions.

Open Source Engagement:

Contribute tools and learnings to open-source projects to support the broader community.

Profile

Master’s or PhD in Computer Science, Data Engineering, or a related field

7+ years of experience in data engineering, with proven experience managing diverse data types and building scalable architectures

Proficiency in at least two programming languages (e.g., Python, Rust, Scala, Go)

Strong experience with both SQL (MySQL, PostgreSQL) and NoSQL (MongoDB)

Deep understanding of data modeling, ETL, and data warehousing

Cloud experience (AWS or GCP) and infrastructure-as-code tools (e.g., Terraform)

Strong communication skills in English

Ability to thrive in a fast-paced startup environment

Bonus Skills

Experience with ML pipelines and AI infrastructure

Contributions to open-source projects

Familiarity with scientific data, especially in materials science

Expertise

Programming:

Strong in Python and at least one other language, with best practices in version control (Git)

Data Management:

Expertise in both SQL and NoSQL for large-scale data processing

Cloud Platforms:

Proficient with AWS or GCP and infrastructure-as-code (Terraform)

DevOps Collaboration:

Comfortable with CI/CD and containerization (Docker, Kubernetes)

Open Source:

Experience in contributing to and maintaining open-source libraries and communities

We are a no-nonsense startup focused on sustainable work culture and meaningful rewards.

We offer:

Equity package (BSPCE)

Paid time off aligned with French standards

A dynamic and supportive work environment with flexibility for remote work
