Site Reliability Engineer

Remote Full-time
About Nscale

Nscale is the GPU cloud engineered for AI—purpose-built to deliver high-performance, cost-efficient infrastructure for AI-native startups and global enterprises. We enable organizations to accelerate innovation, reduce the complexity of AI development, and achieve meaningful business outcomes through scalable, sustainable compute.

Our culture is defined by ownership, accountability, and rapid innovation. We operate with urgency and transparency, and every team member contributes to building the infrastructure powering the future of AI.

What You’ll Be Doing

Help build and improve automation, tooling, and infrastructure that supports AI workloads

Support the development of operational systems and platform services

Assist in defining and maintaining basic SLOs/SLIs and monitoring dashboards

Participate in incident response, troubleshooting, and post-incident reviews

Investigate and help resolve performance and reliability issues across systems

Collaborate with Engineering, Networking, and Infrastructure teams to improve system stability

Contribute to improving availability, scalability, and operational efficiency

Learn from senior engineers and grow your expertise in reliability engineering

What You Bring

2–5 years of experience in Site Reliability Engineering, Systems Engineering, or Software Engineering in a data center environment

2+ years of programming experience (e.g., Python, Go, or similar), with an interest in automation and tooling

Working knowledge of Linux systems, networking concepts, and distributed systems

Experience troubleshooting system or application issues in production environments

Familiarity with monitoring or observability tools (e.g., logs, metrics, dashboards)

Strong willingness to learn and improve reliability and operational practices

Ability to work in fast-paced environments and collaborate across teams



Preferred Experience

Exposure to cloud platforms, Kubernetes, or virtualized/bare-metal environments

Experience in AI, GPU workloads, or high-performance computing (HPC)

Basic understanding of high-performance networking concepts (e.g., InfiniBand, RDMA)

Exposure to production monitoring or alerting systems at small or medium scale



What We Can Offer You

At Nscale, you'll find a collaborative, supportive, and innovative environment where your contributions spark real impact. We're building something extraordinary, and we want you at the core.

Highly competitive package (base + equity) with reviews every 12 months.

Join the fastest-growing tech startup. This is your chance to push boundaries, collaborate with brilliant minds, and make your mark on cutting-edge AI. ✨

Expect a dynamic progression plan tailored to your ambitions. Grow by trying new things, leading, challenging the status quo, and owning your impact, always with our full support.

Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.

Equal Opportunities Statement

We strongly encourage applications from people of color, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds.

If there’s anything we can do to accommodate your specific situation, please let us know.

The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.


The range below reflects the base salary for the position. Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. Nscale may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan participation.

Salary Range
$100,000–$170,000 USD

For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice.