refresh

Trending Companies

Trending

Jobs

JobsTikTok

Machine Learning Engineer Intern (Training Pre-processing) - 2025 Summer (PhD)

TikTok

Machine Learning Engineer Intern (Training Pre-processing) - 2025 Summer (PhD)

TikTok

San Jose, CA

·

On-site

·

Full-time

·

1mo ago

Benefits & Perks

Top Tier compensation with equity

Annual team offsites

Health, dental, and vision coverage

Wellness benefits

Required Skills

Apache Spark

Airflow

Python

Machine Learning Engineer Intern (Training Pre-processing) - 2025 Summer (PhD)

3+ months ago• San Jose, CA

About Us

Tik Tok is the leading destination for short-form mobile video and our mission is to inspire creativity and bring joy.
Size: 5001-10000 employees

Industry: Entertainment & Gaming, Social Media, Technology

Responsibilities:

Team Introduction: The Tik Tok Flink Ecosystem Team plays a critical role in delivering real-time computing capabilities to power Tik Tok's massive-scale recommendation, search, and advertising systems. This team is focused on building the infrastructure for stream processing at exabyte scale - enabling ultra-low-latency, high-reliability, and cost-efficient real-time data transformations.

We are deeply involved in developing and optimizing Apache Flink and surrounding components like connectors, state backends, and runtime execution models to meet Tik Tok's rapidly evolving data needs at EB-level throughput and scale.

We also collaborate closely with ML infrastructure teams to bridge real-time stream processing and machine learning. This includes integrating Velox to accelerate model training, building multimodal data pipelines, and utilizing frameworks like Ray to orchestrate large-scale distributed ML workflows.

Responsibilities:

  • Design and develop core Flink operators, connectors, or runtime modules to support Tik Tok's exabyte-scale real-time processing needs.
  • Build and maintain low-latency, high-throughput streaming pipelines powering online learning, recommendation, and ranking systems.
  • Collaborate with ML engineers to design end-to-end real-time ML pipelines, enabling efficient feature generation, training data streaming, and online inference.
  • Leverage Velox for compute-optimized ML data transformation and training acceleration on multimodal datasets (e.g., video, audio, and text).
  • Use Ray to coordinate distributed machine learning workflows and integrate real-time feature pipelines with ML model training/inference.
  • Optimize Flink job performance, diagnose bottlenecks, and deliver scalable solutions across EB-scale streaming workloads.

Qualifications

Minimum Qualifications:

  • Currently pursuing a PhD's degree in Computer Science, Software Engineering, Data Engineering, or a related technical field.
  • Strong programming skills in Java, Scala, or Python.
  • Understanding of distributed systems, stream processing, and event-driven architecture.
  • Familiar with system design concepts such as fault tolerance, backpressure, and horizontal scalability.
  • Demonstrated ability to debug and analyze complex distributed jobs in production environments.

Preferred Qualifications:

  • Graduating in December 2025 or later, with the intent to return to your academic program.
  • Experience with Apache Flink, Spark Streaming, or Kafka Streams.
  • Hands-on experience with Ray for distributed ML or workflow orchestration.
  • Familiarity with Velox, Arrow, or similar columnar execution engines for training/feature pipelines.
  • Understanding of multimodal data processing (e.g., combining video, audio, and text in model training pipelines).
  • Experience working with data lake ecosystems (e.g., Iceberg, Hudi, Delta Lake) and cloud-native storage at PB-EB scale.
  • Contributions to open-source projects or participation in ML/engineering hackathons or competitions.

Want more jobs like this?

Get jobs in

San Jose, CA delivered to your inbox every week.

Email Address

Send me The Muse newsletters for the best in career advice and job search tips.

Get jobs!
By signing up, you agree to our & .

Client-provided location(s): San Jose, CA

Job ID: Tik Tok-7498908264136165650

Employment Type: INTERN

Posted: 2025-05-02T00:32:28
Search all jobs

Perks and Benefits

Health and Wellness

  • Health Insurance
  • Dental Insurance
  • Vision Insurance
  • HSA
  • Life Insurance
  • Fitness Subsidies
  • Short-Term Disability
  • Long-Term Disability
  • On-Site Gym
  • Mental Health Benefits
  • Virtual Fitness Classes

Parental Benefits

  • Fertility Benefits
  • Adoption Assistance Program
  • Family Support Resources

Work Flexibility

  • Flexible Work Hours
  • Hybrid Work Opportunities

Office Life and Perks

  • Casual Dress
  • Snacks
  • Pet-friendly Office
  • Happy Hours
  • Some Meals Provided
  • Company Outings
  • On-Site Cafeteria
  • Holiday Events

Vacation and Time Off

  • Paid Vacation
  • Paid Holidays
  • Personal/Sick Days
  • Leave of Absence

Financial and Retirement

  • 401(K) With Company Matching
  • Performance Bonus
  • Company Equity

Professional Development

  • Promote From Within
  • Access to Online Courses
  • Leadership Training Program
  • Associate or Rotational Training Program
  • Mentor Program

Diversity and Inclusion

  • Diversity, Equity, and Inclusion Program
  • Employee Resource Groups (ERG)

Company Videos

Hear directly from employees about what it is like to work at Tik Tok.

Search all jobs

Similar Jobs

Suggested Searches

internship jobsTikTok jobsAll jobs

Search Additional Jobs

Machine Learning Engineer Intern Jobs in San Jose, CAJobs in San Jose, CA

of Use](https://www.themuse.com/user/Popular Jobs

Get Involved

Join The Conversation:

Total Views

0

Apply Clicks

0

Mock Applicants

0

Scraps

0

About TikTok

TikTok

TikTok

Late Stage

A short-form video entertainment app and social network platform

10,001+

Employees

Los Angeles

Headquarters

$220B

Valuation

Reviews

3.1

3 reviews

Work Life Balance

1.5

Compensation

2.0

Culture

1.2

Career

1.8

Management

1.0

5%

Recommend to a Friend

Pros

Limited positive feedback available

Company size allows for potential opportunities

Technology platform experience

Cons

Mass layoffs and poor handling of terminations

Unprofessional management and HR behavior

Exposure to traumatic content without adequate support

Salary Ranges

52 data points

Mid/L4

Senior/L5

Mid/L4 · Applied AI Product Data Scientist

1 reports

$273,000

total / year

Base

$210,000

Stock

-

Bonus

-

$273,000

$273,000

Interview Experience

4 interviews

Difficulty

3.5

/ 5

Duration

21-35 weeks

Experience

Positive 0%

Neutral 25%

Negative 75%

Interview Process

1

Application Review

2

Recruiter Screen

3

Online Assessment

4

Technical Phone Screen

5

Onsite/Virtual Interviews

6

Team Matching

7

Offer

Common Questions

Coding/Algorithm

System Design

Behavioral/STAR

Technical Knowledge

Data Structures