[Remote] Manager, Software Engineering - Observability
Note: The job is a remote job and is open to candidates in USA. Figma is a company dedicated to making design accessible to all, and they are seeking an Engineering Manager for their Observability team. The role involves leading a team of engineers to enhance the visibility and efficiency of Figma's systems, focusing on core observability platforms and driving initiatives for cost transparency and optimization.ResponsibilitiesLead and grow a team of engineers responsible for the reliability, scalability, and evolution of Figma’s observability and cost engineering platformsOwn and operate Figma’s core observability stack, including vendor platforms such as Datadog, ensuring high availability, strong data quality, and effective signal-to-noise across metrics, logs, and tracesDefine and drive the technical strategy for instrumentation standards, observability libraries, agents, and operators used to monitor internal and external facing servicesExplore and implement innovative, AI-driven approaches to anomaly detection, root cause analysis, signal correlation, and operational automationEstablish clear frameworks for cost attribution, budgeting, forecasting, and alerting across infrastructure and observability spend, enabling teams to make informed tradeoffsPartner with infrastructure, product engineering, finance, and security teams to improve visibility into system health and cost efficiency at scaleLead initiatives to optimize observability footprint and spend, balancing depth of insight with performance and cost considerationsCoach and mentor engineers through career development, performance feedback, and technical leadership, fostering a culture of ownership, collaboration, and high quality executionSkills4+ years of experience leading infrastructure, observability, or platform engineering teams, with a track record of delivering highly reliable production systemsDeep hands-on experience with modern observability platforms (e.g., Datadog, OpenTelemetry) across metrics, logs, and distributed tracingStrong understanding of distributed systems, instrumentation best practices, SLO design, and incident response workflowsExperience driving cost transparency and accountability initiatives, including cost attribution, budgeting, forecasting, and alerting in cloud environmentsDemonstrated ability to set technical direction, drive cross-functional alignment (Engineering, Finance, Security), and make sound architectural decisions in complex environmentsExperience designing or evolving company-wide observability standards, shared libraries, and agent/operator-based integrationsBackground in cost optimization for infrastructure or observability tooling, including vendor negotiations and usage modelingExperience applying AI or machine learning techniques to anomaly detection, root cause analysis, or operational automationFamiliarity with OpenTelemetry and modern instrumentation frameworks across multiple programming languagesExperience scaling and mentoring high-performing engineering teams through platform expansion or significant architectural changeBenefitsFigma offers equity to employeesHealth, dental & visionRetirement with company contributionParental leave & reproductive or family planning supportMental health & wellness benefitsGenerous PTOCompany recharge daysA learning & development stipendA work from home stipendCell phone reimbursementSales incentive pay for most sales rolesAn annual bonus plan for eligible non-sales rolesCompany OverviewFigma is a collaborative design tool that enables teams to create, prototype, and test digital products on one platform. It was founded in 2012, and is headquartered in San Francisco, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.figma.com.Company H1B SponsorshipFigma has a track record of offering H1B sponsorships, with 12 in 2026, 47 in 2025, 27 in 2024, 32 in 2023, 35 in 2022, 16 in 2021, 6 in 2020. Please note that this does not guarantee sponsorship for this specific role.