Technical Specialist - Infrastructure & Systems for Large Models job opportunity at Huawei.



bot
Huawei Technical Specialist - Infrastructure & Systems for Large Models
Requires: 2-years - XP
Pattern: full-time
apply Apply Now
Salary:
Status:
Copy Link Report
Bachelor's (B.Sc.)
Markham, Ontario, Canada
Markham, Ontar..........Canada

Join a team that maintain the core #infrastructure powering large-scale #AI training. Contribute to #data loading, training workflows, and checkpointing systems for distributed model training. Help improve tools that manage training jobs across compute clusters (e.g., GPUs, TPUs, multi-node setups). __ Work on monitoring and logging tools to make long-running jobs reliable and observable. Support optimization efforts (e.g., mixed precision, sharding) to make model training faster and more efficient. Collaborate closely with machine learning engineers and researchers on new training methods and experiments. Learn to scale systems, #debug complex workloads, and make training pipelines reproducible. Be part of a team that bridges research and infrastructure to accelerate AI development.

Other Ai Matches

Technical Specialist - Infrastructure & Systems for Large Models Applicants are expected to have a solid experience in handling Specialist related tasks
Numerical Computing Researcher / Expert – AI Infrastructure Applicants are expected to have a solid experience in handling Software/Computing Researcher related tasks
Storage Engineer / IT Network Engineer - Graduate Program Applicants are expected to have a solid experience in handling Engineering related tasks
Intern Assistant Engineer – LLM Applicants are expected to have a solid experience in handling Engineering related tasks
Engineer - Game Engine System Applicants are expected to have a solid experience in handling Engineer related tasks
Senior Engineer - Ray Applicants are expected to have a solid experience in handling IT related tasks
Software Engineer Applicants are expected to have a solid experience in handling Software Engineering related tasks
Algorithm Expert - Multimodal LLM Applicants are expected to have a solid experience in handling Specialist related tasks
Research Engineer - Computer Network and Protocol Applicants are expected to have a solid experience in handling Researcher related tasks
Senior Engineer - Rendering System (Android) Applicants are expected to have a solid experience in handling Accounting related tasks
Machine Learning Software Engineer - GPU/NPU Applicants are expected to have a solid experience in handling Legal related tasks
Senior Research Engineer Applicants are expected to have a solid experience in handling RESEARCH related tasks
Researcher - Power System and Grid Interconnection Applicants are expected to have a solid experience in handling Programmer related tasks
Intern Researcher - Robot/Embodied AI Applicants are expected to have a solid experience in handling Researcher related tasks
Research Engineer – Optical Networking Applicants are expected to have a solid experience in handling Manager related tasks
Algorithm Expert - NLP Large Model Applicants are expected to have a solid experience in handling Specialist related tasks
Senior Technical Expert - Distributed Data Systems Applicants are expected to have a solid experience in handling Manager related tasks
Researcher - Reinforcement Learning Research Synthesis Applicants are expected to have a solid experience in handling Analyst related tasks
Co-op Researcher - Web & AI Applicants are expected to have a solid experience in handling Gaming related tasks
DSP Research Engineer Applicants are expected to have a solid experience in handling Research related tasks
AI for optical networking researcher Applicants are expected to have a solid experience in handling RESEARCH related tasks
Co-op Researcher - LLMs Reasoning Applicants are expected to have a solid experience in handling IT related tasks
Full-time Junior position for graduates Applicants are expected to have a solid experience in handling Technician related tasks