[Remote] Sr. Site Reliability Engineer

Remote Full-time
Note: This is a remote position open to candidates in the USA.

PayNearMe is on a mission to simplify payments through innovative technology. As a Site Reliability Engineer, you will design, build, and maintain systems and infrastructure, ensuring their reliability, scalability, and performance while automating processes to support business needs.

Responsibilities

Infrastructure Management: Design, implement, and maintain scalable and resilient infrastructure using Terraform for infrastructure as code, ensuring high availability and performance.
Kubernetes and Containers: Deploy, manage, and optimize Kubernetes clusters and containerized applications using Docker. Implement best practices for container orchestration and management.
Systems and Application Monitoring/Observability: Develop and maintain comprehensive monitoring and observability solutions using Datadog. Ensure detailed visibility into system performance and application health.
SLOs and SLA Management: Define, monitor, and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure reliable and consistent service delivery.
Incident Response and Troubleshooting: Respond to incidents, perform root cause analysis, and implement solutions to prevent recurrence. Participate in post-incident reviews and contribute to blameless postmortems.
Reliability and Production Environment Management: Ensure the reliability and stability of our production environments. Continuously assess and improve system reliability, identifying and addressing potential points of failure.
Automation and Scripting: Develop automation scripts and tools to reduce manual intervention and improve system reliability using Python, Bash, or Go. Implement and improve CI/CD pipelines.
CI/CD Pipeline Management: Enhance and maintain continuous integration and continuous deployment pipelines using GitLab CI. Ensure seamless and reliable deployment processes.
Capacity Planning and Scaling: Assist in capacity planning and ensure that systems are scalable to meet future demands. Implement auto-scaling strategies where applicable.
Security and Compliance: Implement security best practices and ensure compliance with industry standards. Regularly review and update security policies and procedures.
Collaboration and Support: Work closely with development teams to ensure reliability and scalability of new features and services. Provide technical support and guidance on infrastructure-related issues.
Software Engineering for Operations: Develop and maintain internal tools and services that enhance the efficiency and reliability of our operations.
On-Call Rotation: Participate in an on-call rotation to address production issues and collaborate in incident response efforts.

Skills

3+ years of experience in SRE, DevOps, or a related role.
Proficient with cloud platforms such as AWS, GCP, or Azure. Experience with EC2, RDS, VPCs, and security groups is essential.
Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applications.
Expert in using Terraform for infrastructure as code. Proficient with configuration management tools such as Ansible, Puppet, or Chef.
Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systems.
Proven ability to define, monitor, and maintain SLOs and SLAs to ensure reliable service delivery.
Strong skills in scripting languages like Python, Bash, or Go. Experience automating repetitive tasks and processes.
Familiarity with GitLab CI or similar tools for continuous integration and deployment. Experience in setting up and managing pipelines.
Experience supporting production environments running Go or Ruby/Rails applications.
Ability to write and update tools to support infrastructure and application management, demonstrating the principle that 'SRE is what happens when you ask a software engineer to design an operations team'.
Deep understanding of DevOps principles, practices, and tools to drive continuous improvement in the software development lifecycle.
Strong organizational skills, attention to detail, and the ability to work collaboratively in a team environment. Excellent documentation skills to ensure accurate and detailed records.
Excellent analytical and problem-solving skills to diagnose and resolve complex system issues quickly and effectively.

Benefits

Competitive salary and benefits with growth-company options grant
Fast-paced and professional work culture
Stock options with standard startup vesting: 1-year cliff, 4 years total
$50 monthly communication expense stipend to go towards your phone/internet bill
$250 stipend to enhance your WFH setup
Reimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200)
Premium medical benefits including vision and dental (100% coverage for employees)
Company-sponsored life and disability insurance
Paid parental bonding leave
Paid sick leave, jury duty, bereavement
401(k) plan
Flexible Time Off (our team members typically take off ~3-4 weeks per year)
Volunteer Time Off
13 scheduled holidays

Company Overview

PayNearMe provides a web and mobile-based cash payments platform designed to facilitate online purchases and bill payments. It was founded in 2009 and is headquartered in Santa Clara, California, USA, with a workforce of 201-500 employees. Its website is https://home.paynearme.com.