Jobs
Required skills
Python
PyTorch
Machine Learning
Deep Learning
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
Role Summary:
About the Research Engineering team:
The team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research↔production spectrum as needs or interests evolve.
As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll either join:
- Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or
- Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, …) and turn fresh ideas into repeatable, scalable code.
What will you do
- Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
- Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
- Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
- Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
- Deliver prototypes that become production-grade components for Le Chat and our enterprise API.
About you
- Master’s or PhD in Computer Science (or equivalent proven track record).
- 4 + years working on large-scale ML codebases.
- Hands-on with Py Torch, JAX or Tensor Flow; comfortable with distributed training (Deep Speed / FSDP / SLURM / K8s).
- Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
- Strong software-design instincts: testing, code review, CI/CD.
- Self-starter, low-ego, collaborative.
Total Views
0
Apply Clicks
0
Weekly mock applicants
0
Bookmarks
0
Similar jobs

Internship, Software Machine Learning Engineer, Reliability Energy Engineering (Summer 2026)
Tesla · Palo Alto, California

Applied AI Engineer
Augment Code · Palo Alto, California, United States

Software Engineer III - AI/ML Deep Learning & GPU ML Serving
JPMorgan Chase · Palo Alto, CA, United States, US

AI Engineer, 3D Computer Vision, Self-Driving
Tesla · Palo Alto, California

Applied Scientist, Rufus Experience Science
Amazon · Palo Alto, CA, USA
About Mistral AI

Mistral AI
Series BMistral AI is a French artificial intelligence company that develops and provides large language models and AI solutions. The company focuses on creating efficient and powerful AI models for various applications.
51-200
Employees
Paris
Headquarters
$6.0B
Valuation
Reviews
3.8
10 reviews
Work-life balance
2.5
Compensation
4.0
Culture
4.2
Career
3.5
Management
2.3
72%
Recommend to a friend
Pros
Supportive team environment
Good compensation and benefits
Innovative projects and cutting-edge technology
Cons
Poor management and lack of direction
Work-life balance issues and heavy workload
Fast-paced stressful environment
Salary Ranges
37 data points
Mid/L4
Senior/L5
Staff/L6
Mid/L4 · Applied AI Engineer
2 reports
$214,500
total per year
Base
$165,000
Stock
-
Bonus
-
$195,000
$234,000
Interview experience
1 interviews
Difficulty
3.0
/ 5
Duration
21-35 weeks
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Interview
4
Research Presentation
5
Team Matching
6
Offer
Common questions
Machine Learning/AI Algorithms
Research Experience
Technical Knowledge
Coding/Implementation
Behavioral/STAR
News & Buzz
Generative AI Platforms - Trend Hunter
Trend Hunter
News
·
4d ago
How France’s Mistral Built A $14 Billion AI Empire By Not Being American - Forbes
Forbes
News
·
4d ago
Connect the dots: Build with built-in and custom MCPs in Studio - Mistral AI
Mistral AI
News
·
5d ago
The OpenAI / TBPN Audit: Why Anthropic’s Next Acquisition Should Be a Regulatory Network
https://preview.redd.it/q7ltkacfu2tg1.jpg?width=3000&format=pjpg&auto=webp&s=261ce6e7090baf84297a882ffa5b7e62f0d09955 # Forensic Audit: OpenAI’s TBPN Acquisition, the Enterprise Trust Gap, and the Dawn of Regulatory Media **Listen to audio at** [**https://enoumen.substack.com/p/the-openai-tbpn-audit-why-anthropics**](https://enoumen.substack.com/p/the-openai-tbpn-audit-why-anthropics) OpenAI just spent hundreds of millions to buy the Silicon Valley narrative. It’s a brilliant cons
·
2w ago
·
1
·
1