RL Environment Engineer (ML Engineer)

Remote, Full-time
Requirements
• Master’s degree in Computer Science, AI, ML, or a related technical field,
• Strong Python skills with production-quality engineering standards,
• Experience designing or working with RL environments or training pipelines,
• Solid understanding of modern LLMs and their limitations,
• Ability to work quickly, iterate reliably, and respond to feedback,
• Advanced English proficiency (C1/C2),
• (Desirable) Deep knowledge of transformer internals or LLM training/inference,
• (Desirable) Experience with inference libraries such as vLLM or SGLang,
• (Desirable) CUDA or custom kernel optimization experience (e.g. Pallas),
• (Desirable) Research experience with publications or high-quality open-source work,
• (Desirable) Experience building complex or open-ended RL-based learning systems

What the job involves
• Design and build reinforcement learning environments for training and evaluating LLMs,
• Translate modern ML and AI research into structured RL problems,
• Implement reliable, debuggable, and scalable training environments in Python,
• Collaborate with researchers and engineers to improve model learning quality,
• Complete an average of two well-scoped tasks per week,
• Iterate quickly based on feedback and evaluation results

Apply Now

