VP of Product, Research and Training Infrastructure

Remote Full-time
About the position

As CoreWeave continues to solidify its position as the Essential Cloud for AI, we are seeking a visionary VP of Research Training Infrastructure. This executive leader will own the product strategy and engineering execution for the services that power the most ambitious AI research labs in the world. You will bridge the gap between "the metal" and the researcher, delivering a seamless, high-performance environment where frontier models are born.
The Role: Architect of the AI Factory
You will lead the product strategy of our Research Training Stack, focusing on the specialized orchestration, evaluation, and iteration tools required for massive-scale pre-training and post-training. This is a mission-critical role at the intersection of high-performance computing (HPC) and cloud-native agility.
In 2026, CoreWeave is the foundation of the largest infrastructure buildout in human history. We are building AI Factories, not just data centers.

Responsibilities
• Frontier Orchestration: Oversee the evolution of SUNK (Slurm on Kubernetes) to provide researchers with deterministic, bare-metal performance through a cloud-native interface.
• Holistic Training Services: Beyond Slurm, drive the development of next-generation orchestrators and automated training-based evaluation frameworks that ensure model quality throughout the lifecycle.
• Post-Training Excellence: Build the infrastructure required for sophisticated Reinforcement Learning (RL) and RLHF pipelines, enabling labs to refine foundation models with maximum efficiency.
• Customer Advocacy: Act as the primary technical partner for lead researchers at global AI labs, translating their "future-state" requirements into actionable product roadmaps.

Requirements
• Proven Leadership: 15+ years of experience in engineering leadership, including at least 5 years managing large-scale infrastructure at a top-tier research lab or an AI-native cloud provider.
• Domain Expertise: Deep, hands-on knowledge of Slurm, Kubernetes, and the specific networking requirements (InfiniBand/RDMA) for distributed training clusters.
• Research Mindset: You likely come from a background supporting frontier model research (pre-training and post-training) and understand the "pain points" of a research scientist.
• Scaling Experience: A track record of delivering mission-critical services on multi-thousand-GPU clusters (H100/Blackwell/Rubin architectures).
• Strategic Vision: Ability to define "what's next" in the AI stack, from automated RL loops to specialized sandbox environments.

Benefits
• Medical, dental, and vision insurance - 100% paid for by CoreWeave
• Company-paid Life Insurance
• Voluntary supplemental life insurance
• Short and long-term disability insurance
• Flexible Spending Account
• Health Savings Account
• Tuition Reimbursement
• Ability to Participate in Employee Stock Purchase Program (ESPP)
• Mental Wellness Benefits through Spring Health
• Family-Forming support provided by Carrot
• Paid Parental Leave
• Flexible, full-service childcare support with Kinside
• 401(k) with a generous employer match
• Flexible PTO
• Catered lunch each day in our office and data center locations
• A casual work environment
• A work culture focused on innovative disruption

