[Remote] Senior AI Compiler Engineer - Applied Research
Note: The job is a remote job and is open to candidates in USA. NVIDIA is a leading company in AI infrastructure, and they are seeking a Senior AI Compiler Engineer to join their team. The role focuses on developing innovative AI compiler solutions and optimizing low-level GPU programming through AI-based technologies.ResponsibilitiesHelp trailblaze company efforts in applying AI within conventional compilation pipelinesDesign and implement AI-based technology addressing core problems of low-level GPU programmingBuild training pipelines for supervised fine-tuning and reinforcement learning (RL/RLHF-style or policy optimization variants)Define model inputs/outputs over compiler low level compiler representationsDevelop evaluation frameworks to measure code quality, runtime, compile-time overhead, and correctnessIntelligent (domain` task based) prompt engineeringCollaborate with compiler engineers to integrate learned policies into production toolchainsPrototype and iterate on model architectures, prompts, and fine-tuning strategies for scheduling and allocation tasksCreate datasets from compiler traces, optimization passes, and target-specific performance signalsApply RL techniques to optimize for downstream objectives (performance, spill reduction, instruction-level parallelism, etc.) and run rigorous experiments, ablations, and benchmarking across workloads and hardware targetsSkillsM.S./PhD degree in Computer Engineering, Computer Science related technical field (or equivalent experience)5+ years of experience building AI/ML systemsStrong software engineering skills in Python and at least one systems language (C++ preferred)Hands-on experience training/fine-tuning large models (Transformers, PEFT/LoRA, distributed training)Solid understanding of machine learning fundamentals and experimentation best practicesExperience with reinforcement learning (e.g., policy gradients, actor-critic, offline RL, bandit-style optimization)Knowledge of prompt-engineering techniquesAbility to work across research and engineering, from prototype to productionDistributed training/inference at scaleExperience working with the NVIDIA NeMo frameworkUnderstanding of GPU performance, experience with benchmarking suites and performance profiling toolsFormal methods or static analysis familiarity for correctness guaranteesCUDA programming experienceBenefitsEquityBenefitsCompany OverviewNVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993, and is headquartered in Santa Clara, California, USA, with a workforce of 10001+ employees. Its website is https://www.nvidia.com.Company H1B SponsorshipNVIDIA has a track record of offering H1B sponsorships, with 448 in 2026, 1872 in 2025, 1354 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.