LLM & RL Engineer / Agentic Engineering

Remote, Full-time
About YC Bench

YC Bench is a live benchmark designed to forecast the top-performing Y Combinator startups at Demo Day. We combine real-world startup data with advanced AI to predict which early-stage companies will outperform their batch peers using short-term execution signals. Our mission is to make startup success measurable in months rather than years.

The Role

We are looking for a talented LLM & RL Engineer to help build and optimize the AI systems that power our forecasting platform. You will work at the intersection of large language models and reinforcement learning to create agentic systems capable of long-horizon reasoning, decision-making, and accurate prediction in uncertain environments.

Responsibilities

- Fine-tune, optimize, and deploy large language models (LLMs) for complex reasoning and forecasting tasks
- Design, implement, and scale reinforcement learning (RL) algorithms, including RLHF, RL from AI feedback, and agentic RL frameworks
- Build and improve LLM-based agents for simulation, planning, and multi-step decision making
- Develop robust machine learning pipelines for training, evaluation, and inference at scale
- Experiment with hybrid LLM + RL approaches to enhance predictive accuracy and long-term performance
- Collaborate with the team to integrate models into the YC Bench platform and forecasting engine
- Stay up-to-date with the latest advancements in LLMs, RL, and agentic AI systems

Requirements

- Strong experience working with Large Language Models (fine-tuning, prompting, evaluation, and optimization)
- Solid background in Reinforcement Learning (policy optimization, value-based methods, actor-critic, RLHF, etc.)
- Proficiency in Python and modern ML frameworks (PyTorch, Hugging Face Transformers, vLLM, DeepSpeed, RL libraries such as TRL, Stable Baselines, or Ray RLlib)
- Experience building production-grade ML pipelines and handling large-scale training/inference
- Familiarity with agentic systems, long-horizon planning, or simulation environments is a big plus
- Passion for AI forecasting, decision-making under uncertainty, and real-world impact

Nice-to-Haves

- Experience with predictive modeling or time-series forecasting
- Background in startup analysis, venture capital, or early-stage company evaluation
- Publications or open-source contributions in LLMs or RL
- Comfort working in a fast-moving, early-stage environment

If you love pushing the boundaries of what LLMs and RL can do together, and want to apply cutting-edge AI to one of the most exciting prediction problems in tech, we'd love to hear from you.
