We are a dedicated team at the forefront of AI and chemistry, working to accelerate the energy transition.
Our focus is on discovering new chemicals and materials that enable more sustainable practices in sectors with urgent decarbonization needs.
Specifically, we are developing a modern generative AI platform to discover new catalysts that optimize chemical reactions, significantly reduce CO? emissions, and help transform carbon-intensive industries.
As an early-stage, AI-driven startup with over €5M in funding, our approach is grounded in state-of-the-art academic research, with a strong focus on simplicity, clarity, and constant optimization.
Join Entalpic to be part of a passionate, fast-growing team united by the belief that technology can drive meaningful impact toward a more sustainable future.
Entalpic is committed to equal opportunity employment and a diverse, inclusive workplace.
We encourage applications from all backgrounds—even if you don’t meet every requirement.
If you’re passionate about our mission and think you can contribute, we want to hear from you.
Reporting & Job Location
You will report to the CTO of Entalpic and be based in our Paris office.
Mission Highlights
As a key team member, you will contribute to two main areas :
Data Infrastructure Development
Design, build, and maintain scalable data infrastructure to integrate diverse data sources (text, simulations, experiments) in support of ML and LLM applications.
Lead the development of internal tools to enable efficient, AI-enhanced access to data and promote a data-centric culture across the organization.
Role & Responsibilities
DataEngineering :
Build and optimize scalable data pipelines for simulation (e.g.
DFT), textual (e.g.
patents, papers), and experimental data (e.g.
time series, imagery).
Data Storage Solutions :
Implement and manage secure, scalable data storage systems supporting analytics and ML workflows.
Automation and Scripting :
Create tools and scripts to automate data ingestion, transformation, and processing.
Data Governance and Lineage :
Establish policies for data quality, lineage tracking, and regulatory compliance.
Infrastructure Support :
Work closely with DevOps to integrate solutions with system architecture (AWS / GCP).
Collaboration and Support :
Partner with scientists and experts to meet data needs and enable data-driven decisions.
Open Source Engagement :
Contribute tools and learnings to open-source projects to support the broader community.
Profile
Master’s or PhD in Computer Science, DataEngineering, or a related field
7+ years of experience in dataengineering, with proven experience managing diverse data types and building scalable architectures
Proficiency in at least two programming languages (e.g., Python, Rust, Scala, Go)
Strong experience with both SQL (MySQL, PostgreSQL) and NoSQL (MongoDB)
Deep understanding of data modeling, ETL, and data warehousing
Cloud experience (AWS or GCP) and infrastructure-as-code tools (e.g., Terraform)
Strong communication skills in English
Ability to thrive in a fast-paced startup environment
Bonus Skills
Experience with ML pipelines and AI infrastructure
Contributions to open-source projects
Familiarity with scientific data, especially in materials science
Expertise
Programming :
Strong in Python and at least one other language, with best practices in version control (Git)
Data Management :
Expertise in both SQL and NoSQL for large-scale data processing
Cloud Platforms :
Proficient with AWS or GCP and infrastructure-as-code (Terraform)
DevOps Collaboration :
Comfortable with CI / CD, containerization (Docker, Kubernetes)
Open Source :
Experience in contributing to and maintaining open-source libraries and communities
We are a no-nonsense startup focused on sustainable work culture and meaningful rewards.
We offer :
Equity package (BSPCE)
Paid time off aligned with French standards
A dynamic and supportive work environment with flexibility for remote work