Deployment DevOps Engineer

Remote Full-time
About The Team

Adaptive ML is a frontier AI startup building a Reinforcement Learning Operations (RLOps) platform that enables enterprises to specialize and deploy LLMs into production with measurable impact.

We provide the core infrastructure to tune, evaluate, and serve specialized models at scale — pioneering task-specific LLM development and running production-ready workflows that serve millions of requests while optimizing for both cost and performance across distributed systems.

Our tightly-knit team was previously involved in the creation of state-of-the-art open-access large language models. We raised a $20M seed led by Index Ventures and ICONIQ in early 2024, and we're already live in production with customers including Manulife, AT&T, Deloitte, across travel and financial services — with much more to be announced soon.

Our Product Staff is responsible for packaging the technology we build and turning it into an exceptional product that directly answers pain points companies encounter in their generative AI deployments. We strive to build a high-quality, intuitive, and robust experience for our customers.

About The Role

As a DevOps Engineer in our Product Staff, you will help package our technology and turn it into an exceptional product. Our products help companies build more singular generative AI experience, by enabling deeper personalization with reinforcement learning. Importantly, the technology in our products has to be transparent, and to directly answer pain points companies face—not add another layer of inextricable complexity. We place tremendous importance on the quality of our product, with particular emphasis on ease-of-deployment/use, robustness, and scalability, which can only be achieved with tremendous care on the devops side.

You will work on all DevOps aspects of our product, from systematic deployment to scaling production databases, as well as support internal workloads. Challenges you may face are likely to arise from coordinating complex GPU infrastructure and scaling the storage of user interactions to trillions of records in a robust manner. We are looking for self-driven, intense individuals, interested in contributing to a highly-technical product with challenges regarding robustness, accessibility, and responsiveness. As this is an early role, you can expect to be able to directly shape our product team as we grow.

This role is ideally in-person at our New York or Toronto office, but we are also open to fully remote work.

Your Responsibilities
• Build systematic K8s workflows to deploy our product, either client-side on a variety of infrastructures, or internally for our cloud platform;
• you'll work closely with sales and customer success to assist product deployment, pre and post sales (POC, demos, production), both on-prem, in cloud and in our SaaS.
• you'll be part of our first line of support for customer escalation, including bugs, security escalations, special events support (scale-outs, large workshops, load tests)
• Contribute to deployment support, in particular on North America time zones. This involves both personally joining customer calls to assist onboarding and troubleshooting, but also leading and contributing to our support mechanisms such as ticket triaging and oncall process.
• Contribute to our product roadmap, by coordinating between the needs of the Commercial Staff and latest developments from our Technical Staff;
• Report clearly on your work to a distributed collaborative team, with a bias for asynchronous written communication.

Your (ideal) background

The background below is only suggestive of a few pointers we believe could be relevant; we welcome applications from candidates with diverse backgrounds, do not hesitate to get in touch if you think you could be a great fit even if the below doesn't fully describe you.
• Significant experience (>6-8 years) in DevOps, DevSecOps, Tech Support or SRE.
• You have experience working in high-pressure roles such as on-call support, security operations or SRE.
• You enjoy the responsibility of maintaining and troubleshooting high-uptime SLA services.
• You are comfortable with live interactions with Fortune-500 customers, whether to present architecture ideas or to troubleshoot incidents on a call.
• Strong experience in Kubernetes, containers and associated ecosystem (Helm, ArgoCD, EKS/AKS/GKE, etc)
• Expertise in networking (DNS, proxys, WAFs)
• Expertise in authentication and identity federation (OIDC, SSO)
• Expertise in Postgres
• Strong baseline in security (incident detection & containment, network security, Linux hardening)
• Ideally, some experience with software development in Rust;
• A deep concern for user experience;
• Passionate about the future of generative AI, and eager to build foundational technology to help machines deliver more singular experiences.

Benefits
• Comprehensive medical (health, dental, and vision) insurance;
• 401(k) plan with 4% matching (or equivalent);
• Unlimited PTO — we strongly encourage at least 5 weeks each year;
• Mental health, wellness, and personal development stipends;
• Visa sponsorship if you wish to relocate to New York or Paris.

Apply Now

Apply Now
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Lead Manufacturing Recruiter/HR Coordinator

Remote

[PART_TIME Remote] Senior Support Engineer

Remote

9th-12th Grade Paraprofessional Contract Jobs - Peterborough, New Hampshire

Remote

American Express Jobs At Home, American Express Part Time Remote Jobs

Remote

Program Director - Counseling; Remote- Michigan Resident

Remote

Account Manager

Remote

Experienced Customer Service Manager for Remote Full-Time Position – Career Growth and Development Opportunities with arenaflex

Remote

Experienced Part-Time Data Entry and Customer Service Representative – Remote Work Opportunity with Flexible Hours and Competitive Salary

Remote

Casualty Claims Examiner ($2,500 Sign-On Bonus)

Remote

Experienced Customer Service Representative for Dynamic Telecommunications Environment – Delivering Exceptional Support and Driving Customer Satisfaction

Remote
← Back