Senior/Staff Research Scientist, Frontier Benchmarks

Remote Full-time
ABOUT THE ROLE

We're looking for a Staff or Senior Research Scientist to collaborate with partners and lead the development of the next frontier benchmarks and datasets. This is a highly visible, customer-facing role at the intersection of research, company strategy, and go-to-market. You'll design datasets taking into account frontier model performance and work with our academic partners, and then partner with delivery, product and go-to-market to scale out production. You will also serve as a credible technical partner for our customers, prospects, and drive results that impact the broader research community.

This role reports directly to the Head of Research and is ideal for someone who is energized by cross-functional work and wants to understand how startups operate across research, data operations, and commercial teams.

MAIN RESPONSIBILITIES
• Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships
• Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert-curated data for customer-facing presentations, technical reports, and go-to-market materials.
• Work cross-functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap.
• Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows
• Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data-centric AI

PREFERRED QUALIFICATIONS
• Strong research background in AI/ML evaluation, NLP, or related fields, with a track record of rigorous experimental design - especially around measuring the impact of training and evaluation data on model behavior.
• Exceptional communication skills - able to present complex technical findings clearly to both technical and non-technical audiences
• Comfort operating in a fast-moving, cross-functional environment with ambiguous problem spaces
• Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services.
• Ph.D. in machine learning, NLP, or a related field preferred; equivalent industry or research lab experience considered.

Salary Range

$220,000-$320,000 USD

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

[Work From Home] Want Tutors in Schaumburg, IL

Remote

Experienced Remote Data Entry Specialist – Work from Home Opportunity with Competitive Salary and Comprehensive Benefits

Remote

Senior Print Producer, Associate Director

Remote

Full Stack Engineer 4 – SOps Workflows & Tools

Remote

Personal Assistant – Office Hero

Remote

Experienced Data Entry Clerk for Energy Services Programs – Remote Work Opportunity with a Dynamic Non-Profit Organization

Remote

Opening Shift Team Member

Remote

[Remote] Technical Business Analyst

Remote

No open jobs catch your eye?

Remote

[Hiring] Quality Compliance Auditor @Jobgether

Remote
← Back