Senior Technical Product Manager, Observability

Remote Full-time
Who We Are
Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. With 33 global cloud data center locations, Vultr is trusted by hundreds of thousands of active customers across 185 countries for its flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. In December 2024 Vultr announced an equity financing at a $3.5 billion valuation. Founded by David Aninowsky and self-funded for over a decade, Vultr has grown to become the world’s largest privately-held cloud infrastructure company.

Vultr Cares
100% company-paid insurance premiums for employee medical, dental and vision plans.

401(k) plan that matches 100% up to 4%, with immediate vesting

Professional Development Reimbursement of $2,500 each year

11 Holidays + Paid Time Off Accrual + Rollover Plan

Commitment matters to Vultr! Increased PTO at 3 year and 10 year anniversary + 1 month paid sabbatical every 5 years + Anniversary Bonus each year

$500 stipend for remote office setup in first year + $400 each following year

Internet reimbursement up to $75 per month

Gym membership reimbursement up to $50 per month

Company paid Wellable subscription

Join Vultr
Vultr is seeking a highly skilled and experienced Senior Technical Product Manager to own the Observability Platform — the system that provides telemetry ingestion, querying, visualization, alerting, and retention for large-scale GPU clusters and multi-tenant cloud environments. The ideal candidate brings deep technical fluency in observability infrastructure, distributed systems monitoring, and cloud-native telemetry, combined with a strong product instinct for developer and operator experiences. This is a highly visible role in a high-growth technology company, which will require close partnership with Compute, Networking, and Platform teams to ensure every new infrastructure launch is observable by design. This is your opportunity to join our fast growing team and leave your mark on Vultr and the future of AI Infrastructure.

Key Responsibilities
Own the end-to-end Observability Platform roadmap across telemetry ingestion, querying, visualization, alerting, and retention for large-scale GPU clusters and multi-tenant cloud environments

Define Vultr's observability strategy across bare metal, VMs, Kubernetes, and managed services, aligned to infrastructure roadmap, reliability goals, and customer experience

Drive the customer-facing observability surface across dashboards, APIs, telemetry pipelines, and topology-aware insights

Translate low-level signals across GPU, CPU, memory, storage, and network into actionable health views, alerts, and debugging workflows for customers

Work closely with engineering on technical tradeoffs across metrics agents, collectors, data models, telemetry pipelines, APIs, and retention architecture

Build products for distributed AI environments by understanding how training and inference workloads behave across nodes, clusters, schedulers, and network fabrics

Define health models that help customers quickly identify degraded nodes, performance anomalies, and cluster bottlenecks at fleet scale

Ensure new infrastructure and platform launches are observable by design through strong partnership with compute, network, and platform teams

Stay current on modern observability stacks and AI infrastructure trends, including how GPU workloads change performance analysis, cost attribution, and operational workflows

Qualifications
7+ years of product management experience in cloud infrastructure, observability, monitoring, or developer platforms

Deep understanding of observability and monitoring systems, including metrics, logging, tracing, alerting, and telemetry pipeline architecture

Experience defining product strategy and roadmaps for platform or infrastructure products at scale

Strong technical background — ability to engage with engineering on telemetry agents, data models, query engines, retention, and distributed systems

Experience with GPU, AI/ML, or HPC infrastructure monitoring and the unique observability challenges of training and inference workloads

Track record of shipping developer- and operator-facing products with measurable impact on reliability, time-to-detect, or operational efficiency

Experience working across cross-functional teams (engineering, design, marketing, sales) in a fast-paced environment

Excellent written and verbal communication skills, with the ability to translate complex technical concepts for diverse audiences

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)

Compensation
$130,000 - $165,000
Final compensation will vary depending on years of experience, background/skill set, location, and applicable laws.
Inclusion & Privacy
We are an equal opportunity employer and are committed to creating an inclusive environment for all employees. We welcome applications from individuals of all backgrounds and experiences, and we prohibit discrimination based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected status under applicable laws. Vultr will consider qualified applicants with arrest or conviction records in accordance with applicable laws and will not conduct a background check until after an offer of employment has been extended and accepted.
We also take your privacy seriously. We handle personal information responsibly and follow applicable laws, including U.S. privacy rules and India’s Digital Personal Data Protection Act, 2023. Your data is used only for legitimate business purposes and is protected with proper security measures.
Where allowed by law, applicants may request details about the data we collect, access or delete their information, withdraw consent for its use, and opt out of nonessential communications. For more details, please see our Privacy Policy.
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Remote Loan Servicing Customer Service Representative - $20-25 p/hr max 30 hrs

Remote

**Experienced Full Stack Delivery Specialist – Flexible Scheduling and Competitive Earnings**

Remote

[Remote] Cyber Governance, Risk, and Compliance (GRC) Analyst

Remote

Customer Service Representative I - NCC - 994043 ** work from home within the Tri-County area, upon completion of onsite training **

Remote

Experienced Customer Support Representative for Email and Chat Channels – Full Remote Opportunity with blithequark

Remote

Benefit & Well-Being Educator - Cigna Healthcare - Remote

Remote

Inside Sales Representative - REMOTE

Remote

Experienced Part-Time Online Remote Customer Service Representative – Delivering Exceptional Support from Home for arenaflex

Remote

Immediate Hiring: Live Chat Remote Support - Contract to Hire

Remote

Typing Jobs from Home

Remote
← Back