[Remote] Senior Software Engineer, AI Agent Runtime and Open Source Infrastructure
Note: The job is a remote job and is open to candidates in USA. NVIDIA AI is at the forefront of AI innovation, transforming computing with cutting-edge technology. They are seeking a Senior Software Engineer to develop AI-powered software development tools and infrastructure, focusing on agentic AI and runtime security.ResponsibilitiesBuild and implement production-grade features across NemoClaw, focusing on onboarding flows, policy controls, inference routing, and sandbox lifecycleDevelop and sustain secure agent runtime infrastructure, ensuring strong network policy administration, credential management, and failure recoveryEngage in daily open-source workflows: author pull requests, conduct technical reviews, address issues, write tests, and contribute to documentationUse AI-assisted development tools to improve the engineering loop, while applying rigorous verification and security measuresDevelop tools, test harnesses, automation scripts, and CI/CD workflows to boost team efficiencyDiagnose complex failures across various platforms and environments, including TypeScript/Node.js, containers, Linux, macOS, WSL, and GPU-backed systemsCollaborate with internal teams and external communities, including OpenShell and AI platform partnersSkillsBS, MS, or equivalent experience in Computer Science, Software Engineering, or a related technical fieldOver 12+ years of experience in developing and managing production software systems, developer infrastructure, or open-source platformsStrong systems engineering fundamentals with a proven track record of solving multifaceted problemsSkilled in at least one prominent programming language and capable of rapidly learning TypeScript, JavaScript, Node.js, and RustComfort working in large codebases, with experience in reading unfamiliar code, conducting detailed reviews, and improving maintainabilityDemonstrated experience with open-source practices, including managing tasks, pull requests, code reviews, and public technical discussionsExperience with AI-supported development tools and a solid understanding of validating generated codeSecurity-conscious engineering approaches, particularly concerning secrets management, sandboxing, and network policy enforcementSolid testing, continuous integration and delivery, and debugging abilities, with the capability to replicate failures, determine root causes, and clearly convey resultsExcellent written and verbal communication skills, capable of explaining technical concepts to diverse audiencesContributions to open-source developer infrastructure, AI tooling, or large public software projectsHands-on experience with AI coding agents, workflow automation, or multi-agent systemsExperience with containers and Linux isolation technologies including Docker, Kubernetes, and network policy managementDemonstrated experience in developing dependable CI, comprehensive validation, and test infrastructure for dynamic softwareFamiliarity with LLM inference, GPU-backed workloads, or performance-sensitive AI infrastructure as well as demonstrated ability to elevate the engineering standards through thoughtful reviews, clear documentation, and effective mentoringBenefitsYou will also be eligible for equity and benefits.Company OverviewExplore the latest breakthroughs made possible with AI. It was founded in undefined, and is headquartered in Santa Clara, CA, US, with a workforce of 10001+ employees. Its website is https://developer.nvidia.com/blog/.