post ads here
DatePosted 30+ Days Ago
Senior Technical Marketing Engineer - AI Inference at Scale
Modern data centers are transforming into AI factories, and NVIDIA accelerated computing is the engine of artificial intelligence. Our data center platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketi......
Hiring In US, CA, Santa Clara
full-time Sourced OND 7-years NVIDIA United States Of America
DatePosted 30+ Days Ago
Senior Software Engineer, AI Inference Systems
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale w......
Hiring In Canada, Toronto
full-time Sourced OND 7-years NVIDIA Canada
DatePosted 15 Days Ago
Principal Software Engineer - AI Inference
NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVIDIA GPUs and systems.......
Hiring In US, CA, Santa Clara
full-time Sourced High School (S.S.C.E) 15-years NVIDIA United States Of America
DatePosted 15 Days Ago
Senior Compiler Engineer, AI Inference Platforms
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-d......
Hiring In US, CA, Santa Clara
full-time Sourced General 3-years NVIDIA United States Of America
DatePosted 14 Days Ago
Senior Compiler Engineer, AI Inference Performance
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-d......
Hiring In US, CA, Santa Clara
full-time Sourced General 3-years NVIDIA United States Of America
DatePosted 14 Days Ago
Senior Software Engineer, AI Inference Systems
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale w......
Hiring In US, CA, Santa Clara
full-time Sourced OND 7-years NVIDIA United States Of America
DatePosted 14 Days Ago
Senior AI Inference Compiler Engineer
NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-d......
Hiring In US, CA, Santa Clara
full-time Sourced Bachelor's (B.A.) 3-years NVIDIA United States Of America
DatePosted 5 Days Ago
AI Inference Performance Engineer - New College Grad 2026
We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads. We work directly within TensorRT-LLM, SGLang, and vLLM, building the tools that evaluate serving performance at sca......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 2-years NVIDIA United States Of America
DatePosted Yesterday
AI Inference Performance Engineer
We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads. We work directly within TensorRT-LLM, SGLang, and vLLM, building the tools that evaluate serving performance at sca......
Hiring In US, CA, Santa Clara
full-time Sourced PhD 5-years NVIDIA United States Of America
DatePosted 30+ Days Ago
Senior Engineer-AI Inference
Job Description: At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day. Being a Great Place to Work is......
Hiring In Addison
full-time Sourced OND 8-years Bank Of A... United States Of America
DatePosted 20 Days Ago
Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)
The vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer . In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform ( LLM-D , and vLLM ) and our customers' ......
Hiring In Remote US WA
Remote Sourced Associate General Red Hat, ... United States Of America
DatePosted 6 Days Ago
Senior Principal MLOps Engineer, AI Inference
At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM ......
Hiring In Boston
full-time Sourced Associate 10-years Red Hat, ... United States Of America
post ads here