
Inventing the technologies the world loves.
LLM Serving Engineer (Cloud AI Engineering), Senior / Staff Engineer
Required skills
Python
PyTorch
Machine Learning
Company:
Qualcomm Technologies, Inc.
Job Area:
Engineering Group, Engineering Group > Machine Learning Engineering
General Summary:LLM Serving Engineer (Cloud AI Engineering)
Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration.
We are hiring LLM Serving Engineers at multiple levels to join our dynamic, collaborative team. This role spans the full product lifecycle—from cutting-edge research and development to commercial deployment—and demands strategic thinking, strong execution, and excellent communication skills.
This role involves the following activities:
-
Building a scalable LLM inference platform using inference techniques (e.g. disaggregated serving and KV-Cache management, advanced parallelism, speculative algorithms, model optimization, specialized kernels).
-
Contribute to the development of LLM Serving packages (e.g. vLLM, SGLang, TGI, Triton-Inference server, Dynamo, LLM-d).
-
Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
-
Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, Mo Es) and numerics to identify new optimization opportunities.
-
Drive efficient serving through smart autoscaling, load balancing and routing.
-
Engage with open-source serving communities to evolve the framework.
Candidates for this position will demonstrate the following:
-
Hands-on experience in one or more of the following LLM serving/Orchestration packages (Triton-Inference Server, vLLM, SGLang, Ollama, llm-d, KServe, LMCache, Moon Cake)
-
Deep understanding of foundational LLMs, VLMs, SLMs, transformer-based architectures.
-
Strong experience in developing language models using Py Torch.
-
Strong computer science fundamentals - algorithms, data structures, parallel and distributed programming.
-
Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems.
-
Strong Python development skills for large-scale projects with passion for software engineering.
-
Experience in analyzing, profiling, and optimizing deep learning workloads.
-
Proactive learning about the latest inference optimization techniques.
-
Excellent communication and problem-solving skills, with the ability to thrive in a fast-paced and collaborative environment.
-
MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.
Bonus Skills:
-
Open-source contribution to any GenAI package.
-
Experience architecting and developing large-scale distributed systems.
-
High-level kernel design experience (Py Torch, CUDA, Triton).
-
Knowledge of torch.compile or torch Dynamo
-
PhD in Computer Science, Computer Engineering or Machine Learning
Minimum Qualifications:
- Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. OR
Master's degree in Computer Science, Engineering, Information Systems, or related field and 3+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
EEO Employer: Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
Pay range and Other Compensation & Benefits:
$158,400.00 - $237,600.00
The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer – and you can review more details about our US benefits at this link.
If you would like more information about this role, please contact Qualcomm Careers.
Total Views
1
Total Apply Clicks
0
Total Mock Apply
0
Total Bookmarks
0
Similar jobs

Principal Core Science Machine Learning Scientist
Paramount · New York, NY, US, 10036

Senior Applied Scientist, UI Control Models
Apple · Cupertino, CA

Senior Bioimage Scientist
Veracyte · San Diego, California, United States

Senior Machine Learning Engineer, Simulation
Waymo · Mountain View, CA, USA; New York, NY, USA

Staff AI / ML Engineer - Embodied AI
General Motors · Mountain View, California, United States of America
About Qualcomm

Qualcomm
PublicInventing the technologies the world loves.
10,001+
Employees
San Diego
Headquarters
$136B
Valuation
Reviews
3 reviews
3.0
3 reviews
Work-life balance
3.0
Compensation
2.0
Culture
2.5
Career
3.5
Management
2.0
45%
Recommend to a friend
Pros
Opportunity to work at reputable company
Interesting work and new skill development
Strong brand name recognition
Cons
Low compensation compared to market rates
Poor communication from employees
No benefits provided
Salary Ranges
21 data points
Junior/L3
Junior/L3 · Data Scientist
0 reports
$196,000
total per year
Base
$150,000
Stock
$33,000
Bonus
$13,000
$166,600
$225,400
Interview experience
8 interviews
Difficulty
2.8
/ 5
Duration
14-28 weeks
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
Common questions
Coding/Algorithm
Technical Knowledge
System Design
Behavioral/STAR
Past Experience
Latest updates
Narwhal Capital Management Sells 13,601 Shares of Qualcomm Incorporated $QCOM - MarketBeat
MarketBeat
News
·
1w ago
Intel, Qualcomm Alert: Analyst Says Some Chip Stocks Are 'Living In A Bad Neighborhood' - Benzinga
Benzinga
News
·
1w ago
Why Qualcomm Is Set To Disrupt The AI Market (NASDAQ:QCOM) - Seeking Alpha
Seeking Alpha
News
·
1w ago
Tesla, Qualcomm, Apple, Domino’s Pizza, Micron, Sandisk, Meta, Organon, Avis, and More Stock Movers - Barron's
Barron's
News
·
1w ago