Sr. Engineering Manager, Inference - Weights & Biases

Remote Full-time
About the position

CoreWeave, the AI Hyperscaler™, acquired Weights & Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave’s industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights & Biases, we’re setting a new standard for how AI is built, trained, and scaled.

The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we’re combining forces to serve the full AI lifecycle — all in one seamless platform.

Weights & Biases has long been trusted by over 1,500 organizations — including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square,Toyota, and Wayve — to build better models, AI agents and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises.

As we unite under one vision, we’re looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team.
What You’ll Do:
As a Senior Engineering Manager for the W&B Inference team, you will lead the group responsible for productizing and operating our inference offering. In partnership with CoreWeave AI platform team, you will ensure the service becomes a polished, reliable, developer-friendly product within the W&B platform.
You will guide engineers working on areas such as service reliability, observability, operational excellence, packaging, developer-facing tooling, and application-layer enhancements. You will partner closely with Product, CoreWeave engineering teams, Design, Support, and GTM stakeholders to ensure the inference experience meets the needs of practitioners deploying and scaling real-world AI workloads. Your work will directly support end-users, as well as other products powered by this service, like W&B Training.
You will combine strong engineering leadership with clarity in execution, helping the team deliver improvements that raise reliability, accelerate development workflows, and strengthen the overall inference experience for W&B users.

Responsibilities
• Lead and grow the engineering team responsible for evolving and operating the W&B Inference product, focusing on service reliability, orchestration, operational maturity, and developer experience.
• Drive execution on roadmap initiatives in close partnership with Product, ensuring that platform capabilities are delivered predictably, robustly, and with measurable customer impact.
• Own the engineering processes for the inference productization layer, including incident response, operational readiness, release management, observability, and engineering quality.
• Partner with CoreWeave’s infrastructure teams to integrate capabilities from the underlying inference platform into a cohesive, user-facing product.
• Guide the design and delivery of application-layer enhancements such as tracing and tool call handling.
• Ensure the inference offering meets high standards of reliability, usability, compliance, and performance, balancing trade-offs across cost, operational load, and architectural constraints.
• Create clarity in complex, cross-functional environments, ensuring strong communication, aligned priorities, and smooth execution across multiple teams.
• Build a culture of ownership, technical excellence, and continuous improvement within the engineering team.

Requirements
• 7+ years of experience in software engineering, including 3+ years managing or leading engineering teams responsible for distributed systems, developer platforms, or large-scale services.
• Fluent in concepts related to distributed compute services, including observability, autoscaling patterns, reliability engineering, API design, and service operations.
• Comfortable with competing priorities and making principled trade-offs among latency, reliability, cost, and development velocity.
• Skilled at leading engineering teams through ambiguous, multi-stakeholder projects with strong communication and alignment.
• Deep empathy for ML practitioners and platform developers, with a drive to improve reliability, reduce friction, and elevate the developer experience.

Nice-to-haves
• Background in high-scale systems, real-time APIs, or cloud infrastructure, ideally with exposure to model-serving or inference-adjacent domains.
• Background in related platform domains such as IAM, billing/metering, observability systems, or deployment tooling.

Benefits
• Medical, dental, and vision insurance - 100% paid for by CoreWeave
• Company-paid Life Insurance
• Voluntary supplemental life insurance
• Short and long-term disability insurance
• Flexible Spending Account
• Health Savings Account
• Tuition Reimbursement
• Ability to Participate in Employee Stock Purchase Program (ESPP)
• Mental Wellness Benefits through Spring Health
• Family-Forming support provided by Carrot
• Paid Parental Leave
• Flexible, full-service childcare support with Kinside
• 401(k) with a generous employer match
• Flexible PTO
• Catered lunch each day in our office and data center locations
• A casual work environment
• A work culture focused on innovative disruption

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Law Firm Office Manager/Executive Assistant (Remote) No Calls Please

Remote

Graphic & Web Designer (Temporary, with potential for Full-Time Transition)

Remote

Experienced Customer Sales Representative for Dynamic Remote Opportunities – Driving Business Growth through Exceptional Customer Experiences

Remote

**Experienced Part-Time Data Entry Specialist – Remote Work Opportunity at arenaflex**

Remote

Mobile Application Tester - Easy Work from Anywhere (No Experience)

Remote

Security Operations Center Engineer

Remote

Remote Data Entry Clerk and Typist for National and Local Paid Focus Groups, Clinical Trials, and Phone Interviews at blithequark

Remote

**Experienced Online Data Entry Assistant (Teen Years Old) – Kickstart Your Career with blithequark**

Remote

Area Sales Manager - South West

Remote

**Experienced Remote Data Entry Specialist – Flexible Work Schedule and Competitive Pay**

Remote
← Back