Infrastructure and Build Systems Engineer - New College Grad 2026
NVIDIA is seeking a New College Grad for the Infrastructure and Build Systems Engineer position within their AI TensorRT-LLM team. The role involves taking ownership of critical systems that enhance engineering innovation, managing CI/CD pipelines, and collaborating with cross-functional teams to improve deployment efficiency and reliability.ResponsibilitiesBuilding and maintaining infrastructure from first principles needed to deliver TensorRT LLMMaintain CI/CD pipelines to automate the build, test, and deployment process and build improvements on the bottlenecksManaging tools and enabling automations for redundant manual workflows via Github Actions, Gitlab, Terraform, etcEnable performing scans and handling of security CVEs for infrastructure componentsImprove the modularity of our build systems using CMakeUse AI to help build automated triaging workflowsExtensive collaboration with cross-functional teams to integrate pipelines from deep learning frameworks and components is essential to ensuring seamless deployment and inference of deep learning models on our platformSkillsMasters degree or equivalent experienceExperience in Computer Science, computer architecture, or related fieldAbility to work in a fast-paced, agile team environmentExcellent Bash, CI/CD, Python programming and software design skills, including debugging, performance analysis, and test designExperience with CMakeBackground with Security best practices for releasing librariesExperience in administering, monitoring, and deploying systems and services on GitHub and cloud platformsSupport other technical teams in monitoring operating efficiencies of the platform, and responding as needs ariseHighly skilled in Kubernetes and Docker/containerdAutomation expert with hands-on skills in frameworks like Ansible & TerraformExperience in AWS, Azure or GCPExperience contributing to a large open-source deep learning community - use of GitHub, bug tracking, branching and merging code, OSS licensing issues handling patches, etcExperience in defining and leading the DevOps strategy (design patterns, reliability and scaling) for a team or organizationExperience driving efficiencies in software architecture, creating metrics, implementing infrastructure as code and other automation improvementsDeep understanding of test automation infrastructure, framework and test analysisExcellent problem solving abilities spanning multiple software (storage systems, kernels and containers) as well as collaborating within an agile team environment to prioritize deep learning-specific features and capabilities within Triton Inference Server, employing advanced troubleshooting and debugging techniques to resolve complex technical issuesBenefitsEquityBenefitsCompany OverviewNVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993, and is headquartered in Santa Clara, California, USA, with a workforce of 10001+ employees. Its website is https://www.nvidia.com.Company H1B SponsorshipNVIDIA has a track record of offering H1B sponsorships, with 1877 in 2025, 1355 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.
Apply To This Job
Apply To This Job