Site Reliability Engineer

Remote Full-time
Job Summary




As a Site Reliability Engineer, you will play a critical role in ensuring the availability and performance of our customer-facing platform. You will work closely with DevOps, DBA, and Development teams to provision and maintain infrastructure, deploy and monitor our applications, and automate workflows. Your contributions will have a direct impact on customer satisfaction and overall user experience.









Responsibilities and Deliverables





Manage, monitor, and maintain highly available systems (Windows and Linux).






Analyze metrics and trends to ensure performance and rapid scalability.






Address routine service requests while identifying ways to automate and simplify.






Create infrastructure as code using Terraform, ARM templates, or CloudFormation.






Maintain data backups and disaster recovery plans.






Adhere to security best practices through all stages of the software development lifecycle.






Follow and champion ITIL best practices and standards.







Organizational Alignment





Reports to the Senior SRE Manager.






This role involves close collaboration with DevOps, DBA, and security teams.







Technical Proficiencies





Hands-on experience with AWS is a must-have.






Proficiency in analyzing application, IIS, system, and security logs, as well as CloudTrail events.






Experience with CI/CD tools such as Jenkins and GitHub Actions.






Experience maintaining and administering Windows, Linux, and Kubernetes.






Experience in automation using scripting languages such as PowerShell, Bash, or Python.






Good understanding of networking concepts (VPC, subnet, private link, peering).






Familiarity with configuration management using Ansible, Azure Automation, or similar.






Familiarity with observability tools such as New Relic, AppDynamics, or Datadog.






Experience





3+ years of experience in an SRE or System Administration role.






Demonstrated ability to build and support high-availability Windows/Linux servers.






2+ years of experience working with cloud technologies, including AWS and Azure.






Comfortable using Scrum, Kanban, or Lean methodologies.










Education





Bachelor’s Degree or College Diploma in Computer Science, Information Systems, or equivalent experience.