[Remote] Sr. Site Reliability Engineer

Remote Full-time
Note: The job is a remote job and is open to candidates in USA. PayNearMe is on a mission to simplify payment processes with innovative technology. As a Site Reliability Engineer, you will design and maintain systems and infrastructure to ensure application reliability and performance, while automating processes to enhance operational efficiency.ResponsibilitiesInfrastructure Management: Design, implement, and maintain scalable and resilient infrastructure using Terraform for infrastructure as code, ensuring high availability and performanceKubernetes and Containers: Deploy, manage, and optimize Kubernetes clusters and containerized applications using Docker. Implement best practices for container orchestration and managementSystems and Application Monitoring/Observability: Develop and maintain comprehensive monitoring and observability solutions using Datadog. Ensure detailed visibility into system performance and application healthSLOs and SLA Management: Define, monitor, and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure reliable and consistent service deliveryIncident Response and Troubleshooting: Respond to incidents, perform root cause analysis, and implement solutions to prevent recurrence. Participate in post-incident reviews and contribute to blameless postmortemsReliability and Production Environment Management: Ensure the reliability and stability of our production environments. Continuously assess and improve system reliability, identifying and addressing potential points of failureAutomation and Scripting: Develop automation scripts and tools to reduce manual intervention and improve system reliability using Python, Bash, or Go. Implement and improve CI/CD pipelinesCI/CD Pipeline Management: Enhance and maintain continuous integration and continuous deployment pipelines using GitLab CI. Ensure seamless and reliable deployment processesCapacity Planning and Scaling: Assist in capacity planning and ensure that systems are scalable to meet future demands. Implement auto-scaling strategies where applicableSecurity and Compliance: Implement security best practices and ensure compliance with industry standards. Regularly review and update security policies and proceduresCollaboration and Support: Work closely with development teams to ensure reliability and scalability of new features and services. Provide technical support and guidance on infrastructure-related issuesSoftware Engineering for Operations: Develop and maintain internal tools and services that enhance the efficiency and reliability of our operationsOn-Call Rotation: Participate in an on-call rotation to address production issues and collaborate in incident response effortsSkills+3 years of experience in SRE, DevOps, or a related roleCloud Platform Experience: Proficient with cloud platforms such as AWS, GCP, or Azure Experience with EC2, RDS, VPCs, and security groups is essentialKubernetes and Containers: Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applicationsInfrastructure as Code: Expert in using Terraform for infrastructure as code. Proficient with configuration management tools such as Ansible, Puppet, or ChefMonitoring and Observability: Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systemsSLOs and SLA Management: Proven ability to define, monitor, and maintain SLOs and SLAs to ensure reliable service deliveryScripting and Automation: Strong skills in scripting languages like Python, Bash, or Go. Experience automating repetitive tasks and processesCI/CD Practices: Familiarity with GitLab CI or similar tool for continuous integration and deployment. Experience in setting up and managing pipelinesProduction Environments: Experience supporting production environments running Go or Ruby/Rails applicationsTool Development: Ability to write and update tools to support infrastructure and application management, demonstrating the principle that 'SRE is what happens when you ask a software engineer to design an operations team'DevOps Best Practices: Deep understanding of DevOps principles, practices, and tools to drive continuous improvement in the software development lifecycleSoft Skills: Strong organizational skills, attention to detail, and the ability to work collaboratively in a team environment. Excellent documentation skills to ensure accurate and detailed recordsProblem-Solving Ability: Excellent analytical and problem-solving skills to diagnose and resolve complex system issues quickly and effectivelyBenefitsCompetitive salary and benefits with growth-company options grantStock options with standard startup vesting - 1 year cliff; 4 years total$50 monthly communication expense stipend to go towards your phone/internet bill$250 stipend to enhance your WFH setupReimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200)Premium medical benefits including vision and dental (100% coverage for employees)Company-sponsored life and disability insurancePaid parental bonding leavePaid sick leave, jury duty, bereavement401k planFlexible Time Off (our team members typically take off ~3-4 weeks per year)Volunteer Time Off13 scheduled holidaysCompany OverviewPayNearMe provides a web and mobile-based cash payments platform designed to facilitate online purchases and bill payments. It was founded in 2009, and is headquartered in Santa Clara, California, USA, with a workforce of 201-500 employees. Its website is https://home.paynearme.com.Company H1B SponsorshipPayNearMe has a track record of offering H1B sponsorships, with 3 in 2026, 3 in 2025, 3 in 2024, 4 in 2023. Please note that this does not guarantee sponsorship for this specific role.

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Immediate Hiring: Require Math Instructor / Tutor in Mount

Remote

Field Account Manager - North Central Regional Territory

Remote

[Remote-Position] Driver Non CDL

Remote

District Asset Protection Leader (Travel)

Remote

Join Today: Disney Remote Data Entry $22/Hour - DPS

Remote

Job Title:** Experienced Medical Assistant - Virtual Healthcare Services Representative (Remote Opportunity) at careerzynith

Remote

[Remote] Senior Director -Marketing Technology

Remote

Amazon Customer Support Specialist (Remote) Up to $30 An Hour ...

Remote

Part‑Time / Casual Customer Service & Sales Associate – Retail, Visual Merchandising, Technology & Loss Prevention (Multiple Locations) – careerzynith

Remote

Live Online SAT Instructor

Remote
← Back