[Remote] AI Infrastructure Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. vCluster is a venture-backed tech startup pioneering Kubernetes virtualization for the AI era. As an AI Infrastructure Engineer, you will work directly with customers to drive technical deployments and optimize GPU infrastructure, ensuring a smooth transition to production-ready environments.ResponsibilitiesLead Technical Deployments: Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated vCluster environmentInfrastructure Optimization: Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBandValidation: Deploy and validate Kubernetes and vCluster to provide GPU-powered managed K8sKnowledge Transfer: Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independentlyScaling through Documentation: Document reusable playbooks and deployment architectures so your learnings become the next customer's head startFeedback Loop: Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback loop from the field into the roadmapStrategic Partnering: Join Sales in the pre-sales process where deep infrastructure work is required to achieve a meaningful proof of valueSkills5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environmentsPractical knowledge of NVIDIA GPU Operators, CUDA tooling, and systems-level configuration for GPU nodesDeep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environmentsExperience with persistent volume configuration, CSI drivers, and distributed systems like Ceph, Rook, Weka, or LonghornComfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real timeYou thrive in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal servicesExperience writing automation scripts with Bash, Python, or GoRelevant certifications such as CKA (Certified Kubernetes Administrator) or experience writing Kubernetes OperatorsExperience with inference serving, GPU scheduling, and the tooling around LLM deploymentExperience building AI Automation in documentation to contribute to a shared knowledge baseBenefitsOffers EquityOffers BonusHealth, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country)Flexible Working Schedule: You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day.Workplace Flexibility: We’re very flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way.Company OverviewvCluster helps companies build flexible infrastructure tenancy for GPU and AI infra as well as for K8s in private, public and hybrid clouds. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 51-200 employees. Its website is https://vcluster.com/.Company H1B SponsorshipvCluster has a track record of offering H1B sponsorships, with 1 in 2024. Please note that this does not guarantee sponsorship for this specific role.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Revenue Operations Manager

Remote

Director of Talent Acquisitions

Remote

Teen Online Jobs Near Me: Earn Money & Gain Experience

Remote

Analyst Customer Success, Measurement US

Remote

Remote Customer Service Representative – Flexible Schedule – Full‑Time Position at careerzynith

Remote

**Experienced Data Entry Specialist – Remote Opportunity with arenaflex**

Remote

Customer Service Call Center - Work From Home!

Remote

Remote Senior Medical Facility Planner

Remote

CUSTOMER SUPPORT REPRESENTATIVE (Work from anywhere in the world)

Remote

Senior Software Development Engineer, Campaign Creation Core

Remote
← Back