Jobs
Benefits & Perks
•Healthcare
•Disability Insurance
•Paid Time Off
•Healthcare
Required Skills
Python
Machine Learning
Reinforcement Learning
PyTorch
Model Evaluation
Wipro Limited (NYSE: WIT, BSE: 507685, NSE: WIPRO) is a leading technology services and consulting company focused on building innovative solutions that address clients' most complex digital transformation needs. Leveraging our holistic portfolio of capabilities in consulting, design, engineering, and operations, we help clients realize their boldest ambitions and build future-ready, sustainable businesses. With over 230,000 employees and business partners across 65 countries, we deliver on the promise of helping our customers, colleagues, and communities thrive in an ever-changing world. For additional information, visit us at www.wipro.com.
Job Description:
Job Description Job Title: AI Researcher (SFT, RLHF, RL Environments & Model Evaluation)
About the Role We are seeking an AI Researcher with strong hands-on experience in Supervised Fine-Tuning (SFT),Reinforcement Learning from Human Feedback (RLHF),RL environments (gyms), and model evaluation. The role focuses on training, aligning, and evaluating models-particularly for STEM, coding, robotics, reasoning, and real-world problem-solving capabilities.You will help build systems that not only perform well on benchmarks, but also reason effectively, generalize to real-world scenarios, and align with human intent.
Key Responsibilities Design and implement SFT pipelines for training models on STEM subjects, coding tasks, robotics concepts, logical reasoning, and real-world problem-solving
Develop and execute RLHF workflows**, including preference data collection, reward modeling, and policy optimization Create and maintain** RL environments / gyms for reasoning tasks, coding challenges, robotics simulations, and applied real-world scenarios Train models to improve step-by-step reasoning, tool use, and structured problem solving
Design and run model evaluation frameworks
covering: STEM and mathematical reasoning Code correctness, efficiency, and robustness Robotics task success and planning Real-world decision-making and generalization Perform error analysis to identify reasoning failures, hallucinations, or misalignment Collaborate with engineers, educators, and domain experts to curate high-quality training and evaluation datasets Translate research insights into scalable, production-ready training and evaluation systems Document experiments, results, and best practices with strong reproducibility standards Required Qualifications Strong background inmachine learning, reinforcement learning, or AI research
Hands-on experience with SFT and RLHF**, especially for reasoning-intensive tasks Experience building or using** RL gyms / environments**, including task-driven or simulation-based setups Solid understanding of** model evaluation**, including automated metrics and human-in-the-loop evaluation Proficiency in** Python and ML frameworks such as Py Torch
Ability to reason deeply about model behavior, generalization, and alignment
Experience training or evaluating models on STEM, coding, or real-world problem domains
Preferred / Nice-to-Have Experience with LLMs, multimodal models, or foundation models
Background in robotics, simulation environments, or embodied AI
Familiarity with program synthesis, code evaluation, or formal reasoning
Experience with large-scale or distributed training Interest or experience in AI safety, alignment, or robustness
Publications, open-source contributions, or applied research experience What We Offer Opportunity to work on cutting-edge AI reasoning and alignment challenges
Direct impact on real-world AI capabilities in STEM, coding, and robotics Collaborative, research-driven environment Competitive compensation and benefits
DO:
- At least 15 years of experience in selling IT Services in Tier-1 or Tier-2 competitive organizations.
- Strong knowledge of global delivery model (GDM) and methodologies. Should be familiar with cross selling various service lines for customers
- Ability to present and interact at all levels, and have consultative sales capability.
- Ability to work and collaborate across other teams in various service lines and anchor together for the account.
- Exposure to delivery, sales or pre-sales roles will be required
- Should have managed a multi-million USD account, across various geos.
- Strong Account Management - building and managing client relationships at the all levels.
- Carry targets on revenue, bookings and OM.
- Get involved in resolving any people management issue within Wipro teams
- Generating leads by interacting with the customers in various lines of business to expand our footprint.
- Presenting and publishing the proposals (proactive ones as well as responses to RFP/RFIs)
- Interacting with Procurement and Supplier relationship team from customer organization and maintain smoother flow of contracts, invoices and payments.
- Work closely with senior customer team (CIO, VPs and Directors) to suggest, advice, evaluate, and prime business growth
Ã,Â
Expected annual pay for this role ranges from $200,000.00 to $280,000.00. Based on the position, the role is also eligible for Wipro's standard benefits including a full range of medical and dental benefits options, disability insurance, paid time off (inclusive of sick leave), other paid and unpaid leave options
Reinvent your world. We are building a modern Wipro. We are an end-to-end digital transformation partner with the boldest ambitions. To realize them, we need people inspired by reinvention. Of yourself, your career, and your skills. We want to see the constant evolution of our business and our industry. It has always been in our DNA - as the world around us changes, so do we. Join a business powered by purpose and a place that empowers you to design your own reinvention.
Applications from people with disabilities are explicitly welcome.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

LV HSG/TML Engineer
Aptiv · Ulsan, Republic of Korea

AML Project Delivery Senior Analyst
Deloitte · Arlington, VA; Atlanta, GA; Baltimore, MD; Boca Raton, FL; Boston, MA; Buffalo, NY; Charlotte, NC; Darien, CT; Hartford, CT; Jacksonville, FL; Jericho, NY; Jersey City, NJ; McLean, VA; Miami, FL; Morristown, NJ; New York, NY; Philadelphia, PA; Pittsburgh, PA; Princeton, NJ; Raleigh, NC; Richmond, VA; Rochester, MA; Tallahassee, FL; Tampa, FL; Washington, DC

AI Engineer for the Analytics Plateau
Airbus · Manching

AI Engineering Leader
Trane Technologies · Davidson, North Carolina, United States

Machine Learning Engineer
eBay · Bengaluru, India
About Wipro
Reviews
3.4
4 reviews
Work Life Balance
1.5
Compensation
2.0
Culture
1.5
Career
2.0
Management
1.5
15%
Recommend to a Friend
Pros
Good for resume/brand name
Broad technical experience
Exposure to multiple tech stacks
Cons
Poor management quality
Low compensation
Toxic work environment
Salary Ranges
41,395 data points
Mid/L4
Mid/L4 · Analyst - Business Process L2
1 reports
$128,283
total / year
Base
$111,550
Stock
-
Bonus
-
$128,283
$128,283
Interview Experience
5 interviews
Difficulty
2.0
/ 5
Duration
14-28 weeks
Offer Rate
40%
Experience
Positive 100%
Neutral 0%
Negative 0%
Interview Process
1
Application Review
2
Online Assessment/Aptitude Test
3
Technical Interview
4
HR Interview
5
Offer
Common Questions
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
Past Experience
Culture Fit
News & Buzz
Wipro Launches a New Operating Model for Enterprise Functions, Combining Advisory, AI, and Enterprise Transformation Services - Wipro
Source: Wipro
News
·
4w ago
Wipro earthian Awards 2025 Felicitate Excellence in Sustainability Education - Wipro
Source: Wipro
News
·
5w ago
Wipro GE Healthcare unveils SIGNA™ Prime Elite — a Made-in-India, AI-powered breakthrough in MR imaging - TheWire.in
Source: TheWire.in
News
·
5w ago
Wipro Earthian Awards 2025 Spotlight Sustainability Innovation in Indian Education - TipRanks
Source: TipRanks
News
·
5w ago
