[Remote] Machine Learning Systems Engineer
Note: This is a remote position open to candidates in the USA.

Motional is a driverless technology company focused on making autonomous vehicles a safe and reliable reality. They are seeking a Machine Learning Systems Engineer to join their ML Acceleration team, responsible for optimizing systems that enable large-scale model training with an emphasis on speed, cost, reliability, and throughput.

Responsibilities
- Utilize profiling tools (e.g., Nsight, PyTorch Profiler) to identify bottlenecks in data loading, gradient computation, and communication. Implement optimizations like kernel fusion, sharding, and tiling to improve step time
- Optimize distributed training pipelines using frameworks such as PyTorch Distributed
- Design and maintain high-performance GPU kernels in Triton or CUDA for state-of-the-art ML workloads
- Optimize robust data loading pipelines that maximize training throughput

Skills
- Bachelor's degree, Master's degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline
- Strong proficiency in Python
- Extensive hands-on experience with PyTorch
- Experience optimizing machine learning model execution during training and inference, alongside a strong understanding of fundamental machine learning concepts, architectures, and processes
- Exceptional analytical and problem-solving skills, with a bias for action and a data-driven approach to technical challenges

Benefits
Candidates for certain positions are eligible to participate in Motional’s benefits program. Motional’s benefits include but are not limited to medical, dental, vision, 401k with a company match, health savings accounts, life insurance, pet insurance, and more.

Company Overview
Motional offers an autonomous driving platform for robotaxi providers, fleet operators, and automotive manufacturers. It is a sub-organization of Hyundai Motor Group. It was founded in 2020 and is headquartered in Boston, Massachusetts, USA, with a workforce of 501-1000 employees. Its website is https://motional.com.