Senior Site Reliability Engineer

Remote Full-time
About Appspace:
At Appspace, we’re passionate about creating better work experiences for people everywhere, and we’re looking for people that feel the same way. Our global office locations and flexible work culture help you work wherever and however you’re at your best. Plus, we take the time to help you enjoy your work, build lasting connections, and grow your role. Join the Appspace team and be a part of a culture that’s helping people everywhere love where they work.
Your Role as a Senior Site Reliability Engineer:
Our Cloud Operations team seeks a Senior Site Reliability Engineer who will play a critical role in ensuring the reliability, performance, and scalability of our SaaS applications. The ideal candidate is a problem-solver and an automator. In this Senior SRE role, you will be responsible for designing, implementing, and maintaining robust systems and processes to prevent outages, minimize downtime, optimize our infrastructure, and build scalable automation that allows us to be more efficient as the business grows. You will work closely with our development, quality assurance, product, and IT teams to meet the goals of this role. On-call coverage will be required weekly.
A Day in the Life of a Senior Site Reliability Engineer:
In this role, you will play a key part in maintaining our cloud platform, which includes Kubernetes, microservices, MongoDB, RabbitMQ, MySQL, Windows Server VM infrastructure, orchestration engines, CI/CD, and monitoring platforms.

Executing projects that roll out new platform maintenance features, automate tasks, or make other big-picture changes to improve our customers’ experience on our Cloud Platform.
Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must.
Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product.
Mentoring others toward technical and procedural success and providing daily operational support to our DevOps team members.
Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases.
Monitoring and maintaining our platform’s uptime, resiliency, and performance, looking for improvement opportunities, and proactively taking action to reverse any negative trends before they become issues.
Leading, participating in, or executing within the incident management process when alerts fire, quickly ascertaining root cause, resolving the issue, and finding new and creative solutions to prevent recurrence.
Configuring, monitoring, researching, and evaluating workload performance on both Google Cloud Platform and Microsoft Azure.
Working closely with security teams to ensure adherence to security best practices and compliance standards.
Collaborating with our Development and Quality Assurance teams to address issues in the product and platform, particularly around recurring problems.
Documenting new processes and procedures, or updating existing ones, to share knowledge and improve standardized approaches to solutions.

What You’ll Need:

Must have a passion for life-long learning.
Must communicate well and adapt to working well with others across different countries and cultures.
A strong background in containers, Kubernetes, Helm, Linux, and Python coding, plus some experience with Windows Server OS and macOS, is a must.
Experience with Google Cloud Platform and Microsoft Azure required.
Expert-level troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (e.g., spending time following log trails).
Experience with administering MySQL & MongoDB preferred.
Experience with administering message brokering systems like RabbitMQ preferred.
Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base!).
No travel required for this role.

Nice to Haves:

Experience with build pipeline tools and the Atlassian suite (Jira, Confluence, Bitbucket/Git, Azure DevOps, Bamboo, Octopus).
Experience with monitoring and alerting platforms, especially StackDriver.
Experience with HashiCorp Terraform.
Experience with IIS.

The Perks of Working for Appspace:
For all our US-based team members, we offer a variety of benefits, including competitive salaries, medical, dental, and vision coverage, disability coverage, employer-paid life insurance, mental health resources, a 401(k) plan, and a fully paid parental leave program.
Additional perks include:

Generous PTO
Flexible work schedules
Remote work opportunities
Paid company holidays
Appspace Quiet Fridays (No non-essential internal meetings scheduled)
A casual dress work environment
Disclaimer:
Appspace is committed to equitable compensation practices and complies with all applicable local, state, and federal regulations. For jurisdictions that require pay scale disclosure, a general compensation range may be provided during the initial stages of the interview process. Final compensation will be based on multiple factors including experience, skills, certifications, and overall fit for the role.
If you are located in a jurisdiction with specific pay transparency requirements, we will be happy to discuss the relevant range during your application process.


