Site Reliability Engineer – SRE

Remote Full-time
Job Description: • Serve as first responder for production incidents during U.S. operating hours (±2h EST). • Lead triage during outages, analyzing logs, metrics, and traces to identify root causes. • Drive incident postmortems and follow-ups to prevent recurrence. • Communicate clearly and quickly during incidents to internal stakeholders. • Own reliability outcomes across all OpenFX systems, with a focus on uptime, latency, and error budgets. • Enhance observability through logging, metrics, alerting, and dashboards. • Optimize on-call processes and ensure smooth handoffs across IST, EST, and PST coverage. • Partner with DevOps and engineering pods to implement fixes or approve production changes. • Proactively identify systemic reliability risks and propose improvements. • Contribute automation and tooling to reduce manual incident handling. • Champion best practices in reliability engineering and operational excellence. Requirements: • 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering. • Proven experience leading incident response, running postmortems, and communicating during outages. • Strong background with cloud infrastructure (AWS preferred), container orchestration (Kubernetes, ECS), and Infrastructure-as-Code (Terraform, CloudFormation). • Familiarity with observability stacks (e.g., Prometheus, Grafana, Datadog, ELK, OpenTelemetry). • Ability to triage errors at both the infrastructure and application level, and escalate effectively when deeper intervention is required. • Ownership mindset with strong communication skills in high-pressure situations. Benefits: • Competitive salary and benefits package. • Equity in a rapidly growing company. • Opportunity to work on mission-critical infrastructure in fintech. • A collaborative team culture with a bias toward ownership and outcomes. • The chance to make a direct impact on the resilience of global financial infrastructure. Apply tot his job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Satellite Operations Lead (Orbitworks - UAE)

Remote

**Experienced Data Entry Specialist – Remote Opportunity with arenaflex**

Remote

Loan Processor - (Remote)

Remote

HEDIS - Quality Practice Advisor

Remote

Business Development Manager - Freight Brokerage

Remote

Associate SEO Specialist

Remote

Information Systems Security Officer (Remote Part-time)

Remote

[Remote] Compliance Officer (Advisory), Remote

Remote

[Remote] Learning Operations Coordinator - Contractor (Part-Time, 25–30 hours per week)

Remote

Inbound Toll Collections Processing Agent

Remote
← Back