
Solutions Architect, Model Builder - LATAM
About the role
Join NVIDIA as a Solutions Architect to help LATAM build culturally-nuanced LLMs and empower local developers to build and deploy next-generation agentic AI applications. Collaborate with premier startups, research labs and ISVs to develop the next generation components of the AI-native systems. By mastering NVIDIA’s core technologies—NIM, Ne Mo Framework, Dynamo, and Nemo Agent Toolkit—you will guide partners through the complexities of performance optimization and production-grade deployment. As a trusted advisor, you’ll transform raw LLM capabilities into high-performance, industry-focused enterprise agents. At NVIDIA, we work as a unified front. You will collaborate daily with our Account Managers, Dev Rel leads, and Marketing experts to turn bold AI visions into regional realities.
What you'll be doing:
-
Localize the future: Fine-tune LLMs to speak the authentic language of specific regions and industries.
-
Develop and optimize training and inference workflows with partners and collaborate with internal NVIDIA development teams to improve our software stack
-
Build sophisticated agentic systems featuring multi-agent coordination, long-horizon reasoning, and sophisticated planning frameworks.
-
Develop full-scale solutions, including domain-specific enterprise agents and high-performance retrieval pipelines (RAG) spanning various data sources.
-
Optimize inference performance by bringing to bear GPU-accelerated frameworks and the full NVIDIA AI infrastructure stack.
-
Build hands-on Po Cs and reference architectures that serve as the blueprint for production-grade generative AI pipelines.
-
Partner with high-growth startups and Enterprise ISVs to embed NVIDIA’s software stack into their core platforms, slashing the time to market for production-grade AI.
-
Fuel partner innovation through hands-on developer enablement and thorough architectural reviews, turning sophisticated AI visions into production realities.
-
Scale global expertise by crafting reusable assets and documentation that help field teams deploy agentic AI at scale.
What we need to see:
-
BS/MS/PhD in Computer Science, Electrical Engineering, AI/ML, or equivalent experience.
-
5+ years of experience in deep learning, machine learning, or distributed AI systems.
-
Strong programming and debugging experience in Python, C/C++, and Linux environments.
-
Background in using deep learning libraries like Py Torch or Tensor Flow.
-
Hands-on experience building LLM and generative AI applications.
-
Experience working with agentic or multi-agent AI systems employing frameworks such as: Lang Graph, Llama Index, CrewAI, Lang Chain, or OpenAI Agents SDK or similar orchestration frameworks
-
Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.
-
Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.
-
Excellent interpersonal skills and the ability to collaborate with engineering teams, partners, and executive collaborators.
Ways to Stand Out from the Crowd:
-
Experience working with NVIDIA GPUs and AI software, such as NVIDIA NIM, Ne Mo Framework, Ne Mo Retriever, and Ne Mo Agent Toolkit.
-
Experience with LLM evaluation frameworks, benchmarking systems, and safety guardrails for agentic workflows.
-
Experience optimizing reasoning-focused LLMs through timely engineering, quantization, or benchmarking.
-
Familiarity with Kubernetes/Open Shift, CI/CD automation, and cloud-native deployment patterns for AI systems.
-
Experience with parallel or distributed computing environments and AI workloads optimized for GPUs.
About NVIDIA
2 Locations
Headquarters