Staff Software Engineer - Observability

Remote Full-time
We're seeking an exceptional Staff Software Engineer to join our Observability team at Pinterest. This role combines deep technical expertise in distributed systems and data engineering with a product-oriented mindset to build world-class observability solutions that empower our engineering organization. As a Staff Engineer on the Observability team, you'll be responsible for designing and building the infrastructure and tools that provide visibility into Pinterest's large-scale distributed systems, helping thousands of engineers understand, debug, and optimize their services.

What you'll do:
• Define and execute the observability roadmap, treating it as a product. Understand engineering team needs and translate them into technical solutions with measurable impact.
• Architect, build, and scale distributed observability infrastructure (metrics, logs, traces) to handle massive volumes across Pinterest's distributed systems.
• Build high-performance data pipelines and storage for real-time and historical telemetry analysis at Pinterest scale.
• Champion Best Practices: Establish observability standards and patterns across the organization, making it easy for teams to instrument their services and gain actionable insights.
• Technical Leadership: Mentor engineers, lead architectural reviews, and influence technical decisions across teams to improve overall system reliability and performance.
• Cross-functional Collaboration: Partner with SRE, Infrastructure, Product Engineering, and other teams to understand pain points and deliver solutions that improve developer productivity and system reliability.
• Innovation: Stay current with observability trends and technologies, evaluating and adopting cutting-edge tools and techniques to keep Pinterest at the forefront.

What we’re looking for:
• Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
• Product Mindset: Demonstrated ability to work backwards from customer needs: understanding what users need, prioritizing features, measuring success, and iterating based on feedback. Experience building internal platforms or tools with strong adoption.
• Distributed Systems Expertise: 7+ years of experience designing and operating large-scale distributed systems with deep understanding of consistency, availability, scalability, and failure modes.
• Data Engineering Skills: Strong background in building data pipelines, working with time-series databases, columnar storage, stream processing (Kafka, Flink, etc.), and data modeling at scale.
• Observability Domain Knowledge: Hands-on experience with modern observability tools and practices including metrics, logging, tracing, and profiling. Familiarity with OpenTelemetry, Prometheus, Grafana, or similar technologies.
• Programming Proficiency: Expert-level coding skills in languages like Java, Python, Go, or Scala with ability to write production-quality code.
• Systems Thinking: Ability to see the big picture while managing complex technical details, balancing trade-offs between cost, performance, and reliability.
• Experience building observability platforms from the ground up or significantly scaling existing solutions.
• Familiarity with cloud-native architectures and technologies (Kubernetes, service mesh, etc.).
• Track record of driving adoption of internal platforms through excellent documentation, UX, and developer advocacy.
• Experience with machine learning or anomaly detection applied to observability use cases.
• Strong communication skills with ability to influence stakeholders at all levels.
• Contributions to open-source observability projects are a plus.

In-Office Requirement Statement:
• We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
• This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.

Relocation Statement:
• This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.
