Software Engineer, Inference AI/ML

Remote Full-time
CoreWeave is The Essential Cloud for AI™, providing a platform for innovators to build and scale AI. The role involves joining the Inference team to implement features that enhance model serving on the GPU platform, focusing on improving latency, reliability, and cost.ResponsibilitiesImplement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve)Write tests, code comments, and short design docs; participate in code reviewsAdd basic metrics and dashboards; assist with alarms and runbooksFollow on-call runbooks and learn incident response in a guided rotationContribute to performance experiments (e.g., request batching, concurrency, caching) with guidanceSkillsBS/MS in CS, EE, or related field, or equivalent practical experienceFoundations in data structures, algorithms, and networked servicesExperience with Python or Go (C++ a plus) and Linux fundamentals; Git/CI basicsExposure to containers and Kubernetes (coursework or projects welcome)Curiosity about GPU inference concepts (micro-batching, KV cache, streaming)Internship or project that deployed a microservice or ML inference demoCoursework/research with PyTorch or TensorFlow; simple CUDA projects a plusFamiliarity with Grafana/Prometheus/OpenTelemetry or similar toolingBenefitsMedical, dental, and vision insurance - 100% paid for by CoreWeaveCompany-paid Life InsuranceVoluntary supplemental life insuranceShort and long-term disability insuranceFlexible Spending AccountHealth Savings AccountTuition ReimbursementAbility to Participate in Employee Stock Purchase Program (ESPP)Mental Wellness Benefits through Spring HealthFamily-Forming support provided by CarrotPaid Parental LeaveFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our office and data center locationsA casual work environmentA work culture focused on innovative disruptionCompany OverviewCoreWeave is a cloud-based AI infrastructure company offering GPU cloud services to simplify AI and machine learning workloads. It was founded in 2017, and is headquartered in Livingston, New Jersey, USA, with a workforce of 1001-5000 employees. Its website is https://www.coreweave.com.



Apply Now
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Virtual Registered Dietitian

Remote

Go-to-Market Engineer - St. Petersburg, FL, USA

Remote

Content Marketing Manager — Creator & Influencer

Remote

**Executive Team Leader GM & Grocery (Assistant Manager General Merchandise & Food Sales) - A Leadership Role that Drives Sales Growth and Guest Satisfaction**

Remote

**Experienced Full Stack Customer Success Technical Specialist – Z Transaction Processing Software Platform**

Remote

Remote Technical Support Specialist

Remote

Clinical Data Manager, Sr. (remote)

Remote

Travel and Insurance Advisor

Remote

[Remote] Work From Home Sales - No Experience Needed

Remote

Sales Rep Remote - good pay start today

Remote
← Back