Site Reliability Engineer (SRE)

Remote, Full-time
The cloud is broken: It's wasteful, slow, awfully expensive, and burdened with legacy tech that wasn't built for today's workloads. At Unikraft we're building a generational, truly millisecond-native, extremely scalable cloud platform that provides exponentially higher efficiency. Are you bored with your current job? Want to push the boundaries of what's possible in the cloud to the absolute limit?
Our team consists of some of the best systems, performance, and security geeks out there, and is backed by top investors with category leaders as our customers. We believe a focused team of exceptional people, moving fast with conviction, can rebuild the cloud from first principles and make extreme efficiency (e.g., millions of users on a few servers) available to everyone.
What you'll do and why it's career defining
This is a rare opportunity to work at the very foundation of a generational cloud platform -- one that's rewriting the rules of infrastructure performance. As an SRE at Unikraft, you won't just keep the lights on. You'll be building the reliability and deployment machinery that underpins a product developers love and trust with their most demanding workloads.
You'll work closely with world-class systems engineers and have direct ownership over production environments, deployment pipelines, and observability infrastructure. If you care deeply about reliability, love automation, and want your work to have a measurable impact on a fast-growing platform, this is your role.
What You'll Own
Deployments & Reliability
Maintain and operate customer on-prem and cloud deployments of our platform, ensuring reliability and rapid troubleshooting of technical issues.

Plan, package, and roll out software updates both internally and to customers, including testing and validation.

Collaborate with engineering to ensure quality deployments and maintain a high standard of product reliability.

Deploy, manage, and troubleshoot Kubernetes clusters for reliable, scalable infrastructure.

Observability & Automation
Set up and manage monitoring systems to proactively detect and resolve issues in production environments.

Write scripts and automation for deployment, infrastructure management, and CI/CD workflows.

Build tooling and automation to streamline deployment and platform integration.

Contribute to continuous integration pipelines that catch regressions across components and system integrations.

Documentation
Create and maintain clear documentation for systems, processes, and tools to support team effectiveness.

What We're Looking For
At least 2 years of experience working in high-pressure production environments.

Proven experience in Linux system administration, software packaging, and delivery.

Solid understanding of Linux networking fundamentals, including firewalls, DNS, proxies, and best practices.

Experience managing and troubleshooting Kubernetes clusters in production.

Good understanding of the CNCF/cloud-native landscape and associated tools.

Familiarity with observability tools such as Prometheus and Grafana.

Basic scripting skills (e.g., Bash, Python).

Familiarity with cloud platforms (e.g., AWS, GCP, Azure).

Interest in automation tools like Ansible, Terraform, or similar.

Exposure to CI/CD pipelines (e.g., GitHub Actions, Jenkins, GitLab CI).

Familiarity with microservice architectures, serverless, and DevOps best practices.

[BONUS] Familiarity with virtualization solutions like QEMU/KVM -- micro-VMMs like Cloud-Hypervisor or Firecracker are a big plus.

Why you will love this team
Elite founders, real access: Work directly with globally recognized deep-tech founders who've spent their careers at the frontier of systems and cloud research. You'll learn more here in a year than most people do in five.

World-class product: A category-defining technology that sparks genuine excitement with developers.

Zero bureaucracy: Founder-led, product-obsessed, and deeply technical.

Fully Remote, Fully Flexible: Work from your favorite place, at your most productive times.

Retreats, Game Nights and More: Fun-focused team retreats and other events to recharge and build great relationships.

The Standard Stuff: Competitive salary, 6 weeks of vacation, development opportunities.

