Senior Site Reliability Engineer (SRE)

Remote Full-time
About Fable Global enterprises work with Fable to make products more accessible for over one billion people who live with disabilities. Our customers include global leaders like Walmart, Slack, and Shopify. Fable was featured on the Forbes Accessibility 100 list in 2025, awarded Fast Company’s Most Innovative Companies in Design, and has received accolades from global entities like the World Summit Awards and the UN-endorsed Zero Project. About the role As a Senior Site Reliability Engineer at Fable, you will play a critical role in ensuring the reliability, scalability, and efficiency of our platform as we continue to grow. Fable’s products support organizations in building more accessible digital experiences, and the reliability of our infrastructure is essential to delivering that impact. You will work across our platform and product systems to ensure they are stable, performant, and cost-efficient, while enabling teams to move quickly and safely. As AI-powered capabilities increasingly become part of modern product experiences, you will also help ensure Fable’s infrastructure is ready to support AI workloads—balancing reliability, performance, and cost while enabling teams to safely experiment and scale new capabilities. Reporting to the Director of Technical Operations, this role works closely with teams across Engineering and Product. It is ideal for someone who enjoys hands-on technical work while taking ownership of system health, tooling, and operational excellence, and who is excited to help shape Fable’s approach to infrastructure, reliability, and platform engineering over time. Responsibilities Reliability, Infrastructure & Platform Design, build, and maintain reliable, scalable, and secure infrastructure for Fable’s product services Improve system observability, monitoring, and alerting to ensure high availability and fast incident response Contribute to and evolve SRE practices, including SLIs/SLOs, incident management, and postmortems Support and improve CI/CD pipelines and deployment processes Identify and reduce operational complexity across systems and tooling Work across infrastructure and application layers to diagnose and resolve reliability and performance issues, including making targeted improvements to application code when needed Support infrastructure and platform capabilities required for AI/ML-powered features, including scaling, performance, and reliability considerations Cost Efficiency & Performance Monitor and optimize infrastructure costs across cloud environments Contribute to capacity planning and cost forecasting for infrastructure and services Identify opportunities to improve performance and efficiency at the system level Evaluate and optimize the cost and performance of compute-intensive workloads (e.g., AI/ML services), ensuring efficient resource usage and scalability Vendor & Tooling Ownership Work with third-party vendors and tools that support Fable’s infrastructure and operations Help evaluate, select, and manage tools and services to support platform reliability and scalability Support vendor-related troubleshooting and ongoing service improvements Cross-functional Collaboration Partner with Engineering teams to improve reliability, performance, and operational readiness of new features Partner with application engineering teams to improve service architecture, performance, and observability, and help define best practices for building reliable, scalable systems Act as a point of support and escalation for production issues Collaborate across teams to manage dependencies and ensure smooth system operations Team & Practice Development Contribute to building strong SRE and operational practices across the organization Share knowledge through documentation, pairing, and technical discussions Help onboard and support more junior team members as the team grows Contribute to improving ways of working within the team and across Engineering
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

**Experienced Virtual Customer Support Representative – Remote Customer Experience Expert (Multiple Locations)**

Remote

Warehouse Part Time Overnight

Remote

Account Manager

Remote

IT Security Manager

Remote

**Experienced Email Customer Service Representative – Remote Opportunity with arenaflex**

Remote

**Job Title:** Experienced Call Center Representative – Data Entry Work At Home Opportunity with blithequark's Remote Jobs Program

Remote

Data/Sales Entry Agent - Remote – No Experience

Remote

**Experienced Full Stack Data Entry Specialist – Remote Data Management and Analysis**

Remote

**Experienced Full Stack Data Entry Specialist – Remote Opportunity with arenaflex**

Remote

SQL DBA & User Support Remote Part Time

Remote
← Back