Senior Site Reliability Engineer

Remote Full-time

About Juniper Square

Our mission is to unlock the full potential of private markets. Privately owned assets like commercial real estate, private equity, and venture capital make up half of our financial ecosystem yet remain inaccessible to most people. We are digitizing these markets and, as a result, bringing efficiency, transparency, and access to one of the most productive corners of our financial ecosystem. If you care about making the world a better place by making markets work better through technology, all while contributing as a member of a values-driven organization, we want to hear from you.

Juniper Square offers employees a variety of ways to work, ranging from a fully remote experience to working full-time in one of our physical offices. We invest heavily in digital-first operations, allowing our teams to collaborate effectively across 27 U.S. states, 2 Canadian provinces, India, Luxembourg, and England. We also have physical offices in San Francisco, New York City, Mumbai, and Bangalore for employees who prefer to work in an office some or all of the time.

About your role

We are looking for a Senior Site Reliability Engineer (SRE) to join our team and help scale, secure, and improve our cloud infrastructure. In this role, you will work with modern cloud-native technologies, automate infrastructure management, and enhance system reliability. You will collaborate closely with software engineers and the platform team to build and maintain self-service tools that empower development teams while ensuring the reliability and scalability of our services.

This role requires a high degree of ownership, a bias for action, and a problem-solving mindset. If you are someone who naturally seeks out inefficiencies, takes the initiative to fix them, and enjoys building scalable systems, we want to hear from you.

What you'll do

- Own reliability and scalability initiatives: identify, prioritize, and implement solutions before issues escalate.
- Participate in an on-call rotation, responding to incidents, performing root cause analysis, and driving long-term fixes.
- Design, deploy, and manage Kubernetes clusters using Helm charts, Cilium, and Karpenter to optimize performance and cost.
- Architect and maintain AWS infrastructure with a focus on RDS/Aurora PostgreSQL, networking, and scaling best practices.
- Implement GitHub Actions CI/CD pipelines, integrating security best practices and automation.
- Define and enforce policy-based security for Kubernetes using Kyverno.
- Automate infrastructure provisioning with Crossplane and Terraform to ensure consistency and scalability.
- Enhance observability and monitoring using Datadog to proactively detect and resolve issues.
- Improve security and reliability by identifying risks in CI/CD, cloud environments, and Kubernetes, then implementing necessary safeguards.
- Lead post-incident reviews, drive lessons learned into long-term improvements, and document best practices in Confluence.

Qualifications

Technical Skills

- 5+ years of experience in SRE, DevOps, or Infrastructure Engineering with a proven track record of ownership and initiative.
- Strong experience with Kubernetes, Helm, and CNIs, including networking and security.
- Proficiency in AWS services such as RDS, Aurora, IAM, VPC, EKS, and EC2.
- Experience in PostgreSQL administration, including performance tuning and high availability in RDS/Aurora.
- Hands-on experience with GitHub Actions and ArgoCD for secure and scalable CI/CD automation.
- Strong background in Infrastructure as Code (IaC) with Crossplane and Terraform.
- Deep understanding of observability and monitoring with Datadog.
- Experience with Kyverno for Kubernetes policy-based security enforcement.
- Proficiency in Python and Bash scripting for automation and system management.
- Strong understanding of CI/CD security best practices and ability to implement controls for securing deployments.

Soft Skills

- Self-starter mentality: actively seeks out and fixes problems without waiting for assignments.
- High ownership and accountability: takes initiative in driving improvements and following through to resolution.
- Strong problem-solving mindset: identifies bottlenecks, inefficiencies, and risks, then delivers scalable solutions.
- Excellent communication skills: documents processes in Confluence, collaborates cross-functionally, and influences engineering teams toward operational excellence.

Preferred Qualifications

- Deep experience with GitHub Actions for CI/CD automation, with a focus on security best practices.
- Extensive knowledge of Helm charts for managing Kubernetes applications.
- Strong experience in PostgreSQL, including optimization and high availability in RDS/Aurora.
- Experience with NoSQL databases and best practices for scaling and performance.
- Proven ability to influence engineering culture toward automation, self-service, and operational excellence.
- Experience with Karpenter for Kubernetes autoscaling.
- Previous experience with cost optimization strategies in AWS environments.
- Experience with Atlassian tools (Jira, Confluence) for tracking incidents and documentation.
- Strong experience with and a passion for expanding AI into the SRE and DevOps world.

Compensation

Compensation for this position includes a base salary, equity, and a variety of benefits. The U.S. base salary range for this role is $140,000 - $185,000 USD. Actual base salaries will be based on candidate-specific factors, including experience, skillset, and location, and local minimum pay requirements as applicable.

Benefits include:

- Health, dental, and vision care for you and your family
- Life insurance
- Mental wellness coverage
- Fertility and growing family support
- Flex Time Off in addition to company paid holidays
- Paid family leave, medical leave, and bereavement leave policies
- Retirement savings plans
- Allowance to customize your work and technology setup at home
- Annual professional development stipend

Your recruiter can provide additional details about compensation and benefits.

