refresh

Trending Companies

Trending

Jobs

JobsWipro

AI Researcher

Wipro

AI Researcher

Wipro

San Francisco, CA

·

On-site

·

Full-time

·

2w ago

Compensation

$200,000 - $280,000

Benefits & Perks

Healthcare

Disability Insurance

Paid Time Off

Healthcare

Required Skills

Python

Machine Learning

Reinforcement Learning

PyTorch

Model Evaluation

Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a leading technology services and consulting company focused on building innovative solutions that address clients' most complex digital transformation needs. Leveraging our holistic portfolio of capabilities in consulting, design, engineering, and operations, we help clients realize their boldest ambitions and build future-ready, sustainable businesses. With over 230,000 employees and business partners across 65 countries, we deliver on the promise of helping our customers, colleagues, and communities thrive in an ever-changing world. For additional information, visit us at www.wipro.com.

Job Description:

Job Description Job Title: AI Researcher (SFT, RLHF, RL Environments & Model Evaluation)

About the Role We are seeking an AI Researcher with strong hands-on experience in Supervised Fine-Tuning (SFT),Reinforcement Learning from Human Feedback (RLHF),RL environments (gyms), and model evaluation. The role focuses on training, aligning, and evaluating models-particularly for STEM, coding, robotics, reasoning, and real-world problem-solving capabilities.You will help build systems that not only perform well on benchmarks, but also reason effectively, generalize to real-world scenarios, and align with human intent.

Key Responsibilities Design and implement SFT pipelines for training models on STEM subjects, coding tasks, robotics concepts, logical reasoning, and real-world problem-solving

Develop and execute RLHF workflows**, including preference data collection, reward modeling, and policy optimization Create and maintain** RL environments / gyms for reasoning tasks, coding challenges, robotics simulations, and applied real-world scenarios Train models to improve step-by-step reasoning, tool use, and structured problem solving

Design and run model evaluation frameworks

covering: STEM and mathematical reasoning Code correctness, efficiency, and robustness Robotics task success and planning Real-world decision-making and generalization Perform error analysis to identify reasoning failures, hallucinations, or misalignment Collaborate with engineers, educators, and domain experts to curate high-quality training and evaluation datasets Translate research insights into scalable, production-ready training and evaluation systems Document experiments, results, and best practices with strong reproducibility standards Required Qualifications Strong background inmachine learning, reinforcement learning, or AI research

Hands-on experience with SFT and RLHF**, especially for reasoning-intensive tasks Experience building or using** RL gyms / environments**, including task-driven or simulation-based setups Solid understanding of** model evaluation**, including automated metrics and human-in-the-loop evaluation Proficiency in** Python and ML frameworks such as Py Torch

Ability to reason deeply about model behavior, generalization, and alignment

Experience training or evaluating models on STEM, coding, or real-world problem domains

Preferred / Nice-to-Have Experience with LLMs, multimodal models, or foundation models

Background in robotics, simulation environments, or embodied AI

Familiarity with program synthesis, code evaluation, or formal reasoning

Experience with large-scale or distributed training Interest or experience in AI safety, alignment, or robustness

Publications, open-source contributions, or applied research experience What We Offer Opportunity to work on cutting-edge AI reasoning and alignment challenges

Direct impact on real-world AI capabilities in STEM, coding, and robotics Collaborative, research-driven environment Competitive compensation and benefits

DO:

  • At least 15 years of experience in selling IT Services in Tier-1 or Tier-2 competitive organizations.
  • Strong knowledge of global delivery model (GDM) and methodologies. Should be familiar with cross selling various service lines for customers
  • Ability to present and interact at all levels, and have consultative sales capability.
  • Ability to work and collaborate across other teams in various service lines and anchor together for the account.
  • Exposure to delivery, sales or pre-sales roles will be required
  • Should have managed a multi-million USD account, across various geos.
  • Strong Account Management - building and managing client relationships at the all levels.
  • Carry targets on revenue, bookings and OM.
  • Get involved in resolving any people management issue within Wipro teams
  • Generating leads by interacting with the customers in various lines of business to expand our footprint.
  • Presenting and publishing the proposals (proactive ones as well as responses to RFP/RFIs)
  • Interacting with Procurement and Supplier relationship team from customer organization and maintain smoother flow of contracts, invoices and payments.
  • Work closely with senior customer team (CIO, VPs and Directors) to suggest, advice, evaluate, and prime business growth
    Ã,Â

Expected annual pay for this role ranges from $200,000.00 to $280,000.00. Based on the position, the role is also eligible for Wipro's standard benefits including a full range of medical and dental benefits options, disability insurance, paid time off (inclusive of sick leave), other paid and unpaid leave options

Reinvent your world. We are building a modern Wipro. We are an end-to-end digital transformation partner with the boldest ambitions. To realize them, we need people inspired by reinvention. Of yourself, your career, and your skills. We want to see the constant evolution of our business and our industry. It has always been in our DNA - as the world around us changes, so do we. Join a business powered by purpose and a place that empowers you to design your own reinvention.

Applications from people with disabilities are explicitly welcome.

Total Views

0

Apply Clicks

0

Mock Applicants

0

Scraps

0

About Wipro

Wipro

A technology services and consulting company focused on building solutions that address clients' digital transformation needs.

10,001+

Employees

Bengaluru

Headquarters

$8.5B

Valuation

Reviews

3.4

4 reviews

Work Life Balance

1.5

Compensation

2.0

Culture

1.5

Career

2.0

Management

1.5

15%

Recommend to a Friend

Pros

Good for resume/brand name

Broad technical experience

Exposure to multiple tech stacks

Cons

Poor management quality

Low compensation

Toxic work environment

Salary Ranges

41,395 data points

Mid/L4

Mid/L4 · Analyst - Business Process L2

1 reports

$128,283

total / year

Base

$111,550

Stock

-

Bonus

-

$128,283

$128,283

Interview Experience

5 interviews

Difficulty

2.0

/ 5

Duration

14-28 weeks

Offer Rate

40%

Experience

Positive 100%

Neutral 0%

Negative 0%

Interview Process

1

Application Review

2

Online Assessment/Aptitude Test

3

Technical Interview

4

HR Interview

5

Offer

Common Questions

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

Past Experience

Culture Fit