招聘

Software Engineer, TT-Distributed
Austin, Texas, United States; Santa Clara, California, United States; Toronto, Ontario, Canada
·
On-site
·
Full-time
·
1mo ago
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As our TT-Distributed Software Engineer, you will develop and optimize distributed software systems that power the most efficient and highest-performing AI and HPC clusters. In this role, you'll work on distributed programming across multiple nodes, utilizing systems programming, inter-node communication, and Tenstorrent’s scalable architectures to advance the state-of-the-art distributed inference and training infrastructure.
This role ishybrid, based out of Santa Clara, CA; Austin, TX; or Toronto, ON.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are
-
Strong C or C++ engineer with solid foundations in systems programming, operating systems, and distributed systems principles.
-
Enthusiastic about distributed computing, including IPC, socket programming, and cluster resource coordination.
-
Comfortable reasoning about scalability, fault tolerance, and performance across multi-node environments.
-
Curious and first-principles thinker who challenges conventional approaches to distributed system design.
-
Motivated to grow into a deep technical expert in large-scale distributed AI infrastructure.
What We Need
-
Architect, implement, and optimize distributed software systems that coordinate computation and communication across clusters of AI accelerators and CPUs.
-
Design and build distributed APIs enabling data-parallel and tensor-parallel AI workloads.
-
Leverage MPI-based technologies and related frameworks to scale programming models across multiple hosts and compute nodes.
-
Develop robust systems using IPC, inter-node sockets, and distributed communication primitives to ensure reliability and high performance.
-
Build and maintain testing, debugging, profiling, and monitoring tools for large-scale distributed workloads and collaborate with model and systems teams on cluster bring-up.
What You Will Learn
-
How large-scale distributed inference and training systems are architected across thousands of accelerators.
-
Advanced techniques in collective communication, synchronization, and parallel workload distribution.
-
The performance characteristics of Tenstorrent hardware in multi-node cluster environments.
-
How distributed programming models integrate with compilers, runtimes, and AI frameworks.
-
The real-world challenges of deploying, debugging, and scaling next-generation AI clusters.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Software Development Engineer , Alexa Device Platform Team
Amazon · Austin, TX, USA

Compiler Software Engineer
Qualcomm · Austin, Texas, United States of America

Software Engineer - Cloud Technology Application Development
Emerson · Austin, TX, United States, US

Systems Infrastructure Software Engineer
Saronic · Austin, TX

Robot Platform Software Engineer
Apptronik · Austin, TX
关于Tenstorrent

Tenstorrent
Series CTenstorrent is a semiconductor company that develops AI accelerator chips and software for machine learning workloads. The company focuses on creating scalable processor architectures for data centers and edge computing applications.
201-500
员工数
Toronto
总部位置
$2.6B
企业估值
评价
3.8
10条评价
工作生活平衡
3.2
薪酬
2.8
企业文化
4.1
职业发展
3.4
管理层
4.0
72%
推荐给朋友
优点
Supportive management and strong leadership
Great team culture and fantastic colleagues
Cutting-edge technology and challenging projects
缺点
Heavy workload and frequent overtime
Fast-paced and stressful environment
Below industry standard salary
薪资范围
24个数据点
Staff/L6
Staff/L6 · Staff Field Application Engineer
1份报告
$261,520
年薪总额
基本工资
$201,323
股票
-
奖金
-
$261,520
$261,520
新闻动态
Former Tenstorrent Execs Launch AI& to Build Japan’s Full-stack AI Infrastructure - EE Times Asia
EE Times Asia
News
·
2w ago
Ex-Tenstorrent Execs Start Cloud Provider, AI Lab in Japan - EE Times
EE Times
News
·
4w ago
Interview with Toloka CEO (in Russian)
If you happen to understand Russian, here is a 2h interview with Toloka CEO Olga Megorskaya. Few things that I noted: * Industry is called Human Data, the biggest competitor is Scale AI. Basically they know how to produce human-generated data of a high quality that is used to train/post-train AI. These days it's a highly skilled people, sometimes with PhDs or many years of experience, but humans are unreliable, don't follow instructions, etc. - so it is a challenge to produce high quality data
·
4w ago
·
28
·
7
These AI Workstations Look Like PCs but Pack a Stronger Punch - IEEE Spectrum
IEEE Spectrum
News
·
4w ago