AI Infrastructure Engineer, DGXC Lepton job opportunity at NVIDIA.



bot
NVIDIA AI Infrastructure Engineer, DGXC Lepton
Requires: 12-years - XP
Pattern: full-time
apply Apply Now
Salary:
Status:
Copy Link Report
Bachelor's (B.A.)
CA, Santa Clara, United States Of America
CA, Santa Clar..........United States Of America

Develop infrastructure #software and tools for large-scale #AI, LLM, and GenAI infrastructure. __ Develop and optimize tools to improve infrastructure efficiency and resiliency. __ Root cause and analyze and triage failures from the application level to the #hardware level __ Enhance infrastructure and #products underpinning #NVIDIA's AI platforms. __ Co-design and implement APIs for integration with #NVIDIA's resiliency stacks. __ Define meaningful and actionable reliability metrics to track and improve #system and service reliability. __ Skilled in problem-solving, root cause analysis, and optimization.

Other Ai Matches

Senior Software Technical Program Manager - GPU Communication Libraries Applicants are expected to have a solid experience in handling Manager related tasks
Network RDMA Algorithms Architect Applicants are expected to have a solid experience in handling Architect related tasks
AI and ML Infra Software Engineer, GPU Clusters - New College Grad 2025 Applicants are expected to have a solid experience in handling Engineer related tasks
Linux for Edge System Software Engineer (RDSS intern) Applicants are expected to have a solid experience in handling Gaming related tasks
Software Engineer, DOCA Applicants are expected to have a solid experience in handling Engineering related tasks
Director, APAC Tax Applicants are expected to have a solid experience in handling Finance related tasks
GPU Architecture Engineer - New College Grad 2025 Applicants are expected to have a solid experience in handling Engineering related tasks
OEM Sales Operations Analyst Applicants are expected to have a solid experience in handling Analyst related tasks
PhD Research Intern, Gaming Research - 2026 Applicants are expected to have a solid experience in handling Intern related tasks
SWAQ Tools Development Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Switch Firmware Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Analog and Mixed Signal Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Developer Relations Manager, Edge AI and Functional Safety Applicants are expected to have a solid experience in handling Manager related tasks
Senior Technical Program Manager - Autonomous Vehicles Platform Applicants are expected to have a solid experience in handling Manager related tasks
HPC and AI Software Architect Applicants are expected to have a solid experience in handling Software related tasks
Software Engineer - Backend Applicants are expected to have a solid experience in handling Engineering related tasks
Manager, ML Engineering Applicants are expected to have a solid experience in handling Manager related tasks
Senior Design Verification Engineer - Hardware Applicants are expected to have a solid experience in handling Engineering related tasks
AI Computing Performance Architect, Perf Analysis and Kernel Dev Applicants are expected to have a solid experience in handling Data Professional related tasks
Senior Software Engineer, Cloud Functions Applicants are expected to have a solid experience in handling Engineer related tasks
GPU Driver Profiler Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
NPI System Product Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Custom Timing Engineer - Circuits Applicants are expected to have a solid experience in handling Engineering related tasks