Jobs

Software Engineer III -Gen AI Inferencing
Addison, TX; Charlotte, NC; Kennesaw, GA; Newark, DE
·
On-site
·
Full-time
·
1mo ago
Benefits & Perks
•Top Tier compensation with equity
•Health, dental, and vision coverage
•Flexible PTO policy
•Learning and development stipend
Required Skills
PyTorch
Airflow
TensorFlow
About Us
Bank of America provides people, companies, and institutional investors with industry-leading financial products and services.
Size: 10000+ employees
Industry: Financial Services
Job Description:
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
Position Summary
Join a groundbreaking team at Bank of America, at the forefront of innovation in AI. We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what's possible with AI.
This position is focused on design, build, and operate of reusable toolkits for Gen AI RAG capabilities.
This job is responsible for developing and delivering complex requirements to accomplish business goals. Key responsibilities of the job include ensuring that software is developed to meet functional, non-functional and compliance requirements, and solutions are well designed with maintainability/ease of integration and testing built-in from the outset. Job expectations include a strong knowledge of development and testing practices common to the industry and design and architectural patterns.
Responsibilities:
- Codes solutions and unit test to deliver a requirement/story per the defined acceptance criteria and compliance requirements
- Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
- Mentors other software engineers and coach team on Continuous Integration and Continuous Development (CI-CD) practices and automating tool stack
- Executes story refinement, definition of requirements, and estimating work necessary to realize a story through the delivery lifecycle
- Performs spike/proof of concept as necessary to mitigate risk or implement new ideas
- Automates manual release activities
- Designs, develops, and maintains automated test suites (integration, regression, performance)
- Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
- Manage multiple priorities, and simultaneously engage with multiple teams.
- Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle.
- Be vocal and actively participate in all session with business stakeholders and agile teams.
- Collaborate with product teams, data analysts and data scientists to design and build solutions.
Required qualifications:
- 5+ years OOP in Python/Scala/Java programming experience with expert level development skills
- Experience with AI/ML/GenAI Lifecycle Management and Development and its Ecosystem. Hands on experience building frameworks using MLOps, Fine
- Tuning techniques, Inference Frameworks
- Experience with deploying models using vLLM/Triton Inference Server in containers in production with automation. Performs Continuous Integration and Continuous Development (CI-CD) activities. Performance Tuning those models and deployment to provide higher throughput.
- Track record of maintaining large scale Python/Unix based systems.
- Hands on experience and knowledge generative AI RAG process for various use cases, including chunking, embedding, retrieval, reranking and summarization.
- Hands-on experience in application development in one or more areas MongoDB, Redis, Angular/React Frameworks, Containerization, Building API based application leveraging FAST API services, JWT Integration, API Gateway
- Develop efficient utilities, automation frameworks, data science platforms that can be utilized across multiple Data Science teams for AI/ML and GenAI work.
- Working in large sized teams that collaboratively develop on a shared multi-repo codebase using IDEs (e.g. VS Code rather than Jupyter Notebooks), Continuous Integration (CI), Continuous Deployment (CD) and Continuous Testing
- Strong automation, scripting, and Python development skills. Hands-on DevOps experience with one or more of the following enterprise development tools: Version Control (GIT/Bitbucket), Build Orchestration (Jenkins), Code Quality (Sonar Qube and pytest Unit Testing), Artifact Management (Artifactory) and Deployment (Ansible)
Email Address
Send me The Muse newsletters for the best in career advice and job search tips.
Get jobs!
Desired Qualifications:
- Experience building & deploying Gen AI inferencing platform with open-source toolsets, building inferencing & servicing capabilities (AI Gateway, Policy store, Observability) for RAG/ MCP use cases etc.
- Hands on experience on driving and maintaining a culture of quality, innovation, and experimentation.
- Research on new tools and capabilities for better UI and UX for advanced analytics platform, quick prototype and demonstrate the features and capabilities, and participate on various user forums.
Skills:
- Application Development
- Automation
- Influence
- Solution Design
- Technical Strategy Development
- Architecture
- Business Acumen
- DevOps Practices
- Result Orientation
- Solution Delivery Process
- Analytical Thinking
- Collaboration
- Data Management
- Risk Management
- Test Engineering
Shift:
1st shift (United States of America)
Hours Per Week:
40
Client-provided location(s): Charlotte, NC, Newark, DE, Kennesaw, GA, Addison, TX
Job ID: Bank OfAmerica-JR-25032986
Employment Type: FULL_TIME
Posted: 2025-12-22T18:52:57
Apply on company site
Perks and Benefits
Health and Wellness
- FSA
- HSA
- Health Insurance
- Dental Insurance
- Vision Insurance
- Life Insurance
- Short-Term Disability
- Long-Term Disability
- Pet Insurance
- Mental Health Benefits
Parental Benefits
- Non-Birth Parent or Paternity Leave
- Birth Parent or Maternity Leave
- Adoption Assistance Program
- Adoption Leave
- Family Support Resources
Work Flexibility
Office Life and Perks
Vacation and Time Off
- Leave of Absence
- Personal/Sick Days
- Paid Holidays
- Paid Vacation
- Sabbatical
- Volunteer Time Off
Financial and Retirement
- 401(K) With Company Matching
- 401(K)
- Financial Counseling
Professional Development
- Tuition Reimbursement
- Internship Program
- Associate or Rotational Training Program
- Mentor Program
- Access to Online Courses
Diversity and Inclusion
Apply on company site
Similar Jobs
Suggested Searches
senior jobsBank of America jobsAll jobs
Search Additional Jobs
Software Engineer Jobs in Charlotte, NCSoftware Engineer Jobs in Newark, DESoftware Engineer Jobs in Kennesaw, GAJobs in Charlotte, NCJobs in Newark, DEJobs in Kennesaw, GA
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Entry Level AI Software Engineer
IBM · Abilene, TX; Research Triangle Park, NC

Sr. Automation/AI Engineer - HR Technology
General Motors · Flexible / Remote

Senior Machine Learning Engineer
General Motors · Flexible / Remote; Mountain View, CA; Sunnyvale, CA

Staff/Sr Staff AI Engineer Scientist
Palo Alto Networks · Santa Clara, CA

Digital & AI Strategy Senior Associate
PwC · Atlanta, GA; Boston, MA; Chicago, IL; Dallas, TX; Florham Park, NJ; Fort Worth, TX; Gunnison, CO; Los Angeles, CA; New York, NY; Rosemont, IL; San Francisco, CA; Silicon Valley, CA
About Bank of America

Bank of America
PublicA financial institution that offers credit cards, home loans, and auto loan services.
10,001+
Employees
Charlotte
Headquarters
$316B
Valuation
Reviews
3.7
10 reviews
Work Life Balance
4.0
Compensation
4.2
Culture
4.1
Career
4.0
Management
2.8
75%
Recommend to a Friend
Pros
Great benefits
Good work-life balance and flexible schedule
Positive team environment and culture
Cons
Micromanagement issues
Communication problems with management
High pressure to meet goals and sell products
Salary Ranges
16,604 data points
L4
L5
L6
Mid/L4
Senior/L5
Staff/L6
VP
L4 ·
0 reports
$222,365
total / year
Base
-
Stock
-
Bonus
-
$189,010
$255,720
Interview Experience
8 interviews
Difficulty
2.9
/ 5
Duration
21-35 weeks
Experience
Positive 0%
Neutral 88%
Negative 12%
Interview Process
1
Application Review
2
HireVue Video Interview
3
Recruiter/HR Screen
4
Technical Phone Screen
5
Final Round/Super Day
6
Team/Location Matching
Common Questions
Technical Knowledge
Behavioral/STAR
Coding/Algorithm
Past Experience
Culture Fit
News & Buzz
Bank of America Securities reiterates a buy rating on Ferrari (RACE) - MSN
Source: MSN
News
·
5w ago
AlphaQuest LLC Purchases New Stake in Bank of America Corporation $BAC - MarketBeat
Source: MarketBeat
News
·
5w ago
BofA Awards $500,000 Grant to FIND Regional Food Bank - PR Newswire
Source: PR Newswire
News
·
5w ago
Judge greenlights Epstein victims' sex-trafficking lawsuit against Bank of America - Business Insider
Source: Business Insider
News
·
5w ago