招聘
必备技能
Python
Go
At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users & 100k organizations who collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k+ stars on Github.
About the Role:
As our first Data/Infrastructure Advocate Engineer, you’ll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers. You’ll champion Xet storage on the Hugging Face Hub, empowering users to efficiently store, version, and collaborate on large-scale datasets. This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy—helping define the future of open data workflows.
You’ll collaborate with teams like Datasets, Hub, and Infrastructure to shape how developers interact with data on our platform, and inspire a community to build better, faster, and more scalable data pipelines.
Your Main Missions:
- Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges. Engage with communities like Apache Parquet, Open Tables Formats, and data engineering forums to promote best practices and Hugging Face tools.
- Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.
- Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
- Create demos, benchmarks, and tools(e.g., Colab notebooks) to illustrate best practices for data storage and versioning.b Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
- Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
- Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
- Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
- Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.
About you
You’re a great fit if you:
- Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
- Are a hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
- Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
- Are active in developer communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing.
- Thrive in fast-moving environments and enjoy building in public to inspire others.
If you're interested in joining us but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.
More about Hugging Face
**We are actively working to build a culture that values diversity, equity, and inclusivity.**We are intentionally building a workplace where you feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.
We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.
We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.
**We support our employees wherever they are.**While we have office spaces in NYC and Paris, we're very distributed, and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.
**We want our teammates to be shareholders.**All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Mixed Methods Researcher - Music Mission
Spotify · New York, NY

AI Performance Optimization Engineer
Lightning AI · New York, New York, United States; San Francisco, California, United States

Audio Engineer- Irving Plaza
Live Nation · New York, NY, USA

Audio Inference Engineer, Model Efficiency
Cohere · New York

Lighting Engineer - Gramercy
Live Nation · New York, NY, USA
关于Hugging Face

Hugging Face
Series DHugging Face, Inc., is an American company based in New York City that develops computation tools for building applications using machine learning.
201-500
员工数
New York City that develops computation
总部位置
$4.5B
企业估值
评价
4.3
10条评价
工作生活平衡
4.0
薪酬
4.2
企业文化
4.5
职业发展
3.8
管理层
4.3
82%
推荐给朋友
优点
Supportive team and collaborative environment
Flexible work arrangements and remote options
Innovative and cutting-edge technology projects
缺点
Fast-paced and sometimes overwhelming environment
Heavy workload and long hours during peak times
Limited career advancement opportunities
薪资范围
19个数据点
Mid/L4
Mid/L4 · Developer Advocate
1份报告
$130,000
年薪总额
基本工资
$130,000
股票
-
奖金
-
$100,000
$160,000
面试经验
3次面试
难度
3.0
/ 5
时长
14-28周
录用率
33%
体验
正面 33%
中性 67%
负面 0%
面试流程
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Virtual Onsite
5
Team Matching
6
Offer
常见问题
Coding/Algorithm
Machine Learning/Data Science
System Design
Behavioral/STAR
Technical Knowledge
新闻动态
Malicious npm Package Hijacks Hugging Face for Malware Delivery - gbhackers.com
gbhackers.com
News
·
2d ago
NPM Menace Exposes Hugging Face As Backend For Data Theft and Malware Delivery - cyberpress.org
cyberpress.org
News
·
2d ago
Hugging Face Releases ml-intern: An Open-Source AI Agent that Automates the LLM Post-Training Workflow - MarkTechPost
MarkTechPost
News
·
3d ago
Attackers Weaponize CVE-2026-39987 to Spread Blockchain-Based Backdoor Via Hugging Face - CyberSecurityNews
CyberSecurityNews
News
·
1w ago