Senior/Staff Research Scientist (Frontier Benchmarks)

Remote Full-time
Requirements
• Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design — especially around measuring the impact of training and evaluation data on model behavior,
• Exceptional communication skills — able to present complex technical findings clearly to both technical and non-technical audiences,
• Comfort operating in a fast-moving, cross-functional environment with ambiguous problem spaces,
• Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services,
• Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered

What the job involves
• We're looking for a Staff or Senior Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets,
• This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market,
• You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production,
• You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community,
• This role reports directly to the Head of Research and is ideal for someone who is energized by cross-functional work and wants to understand how startups operate across research, data operations, and commercial teams,
• Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships,
• Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert-curated data for customer-facing presentations, technical reports, and go-to-market materials,
• Work cross-functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap,
• Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows,
• Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data-centric AI

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Repair Supplier Manager

Remote

Product Owner, Order Management

Remote

Experienced Technical Support Specialist – Chat Support Associate for Veterinary Professionals at blithequark

Remote

**Experienced Part-Time Remote Data Entry Clerk – Thriving in a Dynamic arenaflex Community**

Remote

Jr Data Entry Clerk - Remote Opportunity with Competitive Pay and Career Growth at blithequark

Remote

Lead Consultant - SAP Functional - SAP MM

Remote

[Remote-Position] Looking for Machesney Park Managerial

Remote

Amazon Key for Business Regional Account Executive

Remote

Insurance Sales Opportunity – Work From Home

Remote

Financial Staff Auditor (State Staff Auditor)

Remote
← Back