Principal On-Device Model Inference Optimization Engineer job opportunity at NVIDIA.



bot
NVIDIA Principal On-Device Model Inference Optimization Engineer
Requires: 15-years - XP
Pattern: full-time
apply Apply Now
Salary:
Status:
Copy Link Report
Master's (M.Sc.)
China, Shanghai, China
China, Shangha..........China

Develop and implement strategies to optimize #AI model inference for on-device deployment. __ Employ techniques like pruning, quantization, and knowledge distillation to minimize model size and #computational demands. __ Optimize performance-critical components using CUDA and C++. __ Collaborate with multi-functional teams to align optimization efforts with hardware capabilities and deployment needs. __ Benchmark inference #performance, identify bottlenecks, and implement solutions. __ Research and apply innovative methods for inference optimization. __ Adapt models for diverse hardware platforms and operating systems with varying capabilities. __ Create tools to validate the accuracy and latency of deployed models at scale with minimal friction. __ Recommend and implement model architecture changes to improve the accuracy-latency balance.

Other Ai Matches

Systems Software Engineer - AI and Cloud Applicants are expected to have a solid experience in handling Engineer related tasks
Product Program Manager Applicants are expected to have a solid experience in handling Manager related tasks
ASIC Design Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Server Firmware Bringup Engineer Applicants are expected to have a solid experience in handling Firmware related tasks
Senior Performance and Development Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Account Leader, Automotive Applicants are expected to have a solid experience in handling Account Leader related tasks
Mechanical and Thermal Program Manager Applicants are expected to have a solid experience in handling Thermal related tasks
Software Design Engineer - SONiC Group Applicants are expected to have a solid experience in handling Engineering related tasks
PCB Library Engineer Applicants are expected to have a solid experience in handling Management related tasks
Senior SMTA Engineer Applicants are expected to have a solid experience in handling Manufacturing Engineering related tasks
Senior Architecture Energy Modeling Engineer Applicants are expected to have a solid experience in handling Administration related tasks
Senior System Level Product Engineer Applicants are expected to have a solid experience in handling Product Engineer related tasks
GPU Driver Profiler Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior Solutions Architect Applicants are expected to have a solid experience in handling Technology related tasks
Principal Firmware Engineer - Data Center Server Management Applicants are expected to have a solid experience in handling Engineering related tasks
Interconnect HW Design Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Global Trade Classification Analyst Applicants are expected to have a solid experience in handling Engineering related tasks
Developer Relations Manager, Quantum Computing Applicants are expected to have a solid experience in handling Development related tasks
Principal On-Device Model Inference Optimization Engineer Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Site Reliability Engineer - Observability and Telemetry Platform Applicants are expected to have a solid experience in handling Engineer related tasks
Chip Design Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Senior AI Training Performance Engineer Applicants are expected to have a solid experience in handling Engineering related tasks
Consumer Channel Promotion Manager Applicants are expected to have a solid experience in handling PROMOTIONS LEAD related tasks