AI product eval/ applied LLM eval/Human-in-the-Loop Evaluation / Annotation Advisor(1-3hr pre week/potential co founder)

Remote Full-time
Looking for a Part-Time Evaluation Advisor (human annotation/llm eval) The direction method is already validated by multiple ai leaders experts and ux researcher, there’s early buyer interest /pre pilot from big tech ai team , and the immediate goal is to turn the current build into a small, stable, pilot-ready product. We’re building an early evaluation and workflow product for AI teams. The current focus is helping teams structure and operationalize real-world failure cases, especially in scenarios where an assistant recommends too early, becomes overconfident, or fails to verify what matters before responding. The short-term wedge is a lightweight regression and review workflow. The longer-term opportunity is much bigger: infrastructure for how AI systems are tested, reviewed, and controlled in production decision flows. Looking for someone who can help with: • human annotation design • label guideline writing • evaluation schema • translating Best fit: someone with experience in human eval, annotation design, ranking/review quality, AI eval, or related areas. Bonus if you’ve worked on shopping, marketplace, trust, search, ranking, or agent systems. • Part-time / advisor to start. Best fit: someone with experience in human eval, annotation design, ranking/review quality, AI eval, or related areas. Bonus if you’ve worked on shopping, marketplace, trust, search, ranking, or agent systems. • Part-time / advisor to start. offer equity or cash
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Chief Technology Officer, Provider Market – Optum Insight

Remote

Experienced Data Entry Specialist for Apple - Remote Opportunity for Career Growth

Remote

**Experienced Overnight Live Chat Support Representative – Thriving in a Remote Environment with Late Night Shifts at blithequark**

Remote

**Experienced Part-Time Customer Service Agent - ICT (Part-Time) – blithequark Store**

Remote

Insurance Producer - Denver Metro, Colorado

Remote

Accounting & Auditing AI Trainer, $90-$110/hour

Remote

Experienced Remote Data Entry Administrative Assistant – Flexible Work from Home Opportunity with blithequark

Remote

Python Engineer - RPA Developer

Remote

FULL TIME Youtube Moderator $30/hour At Home | Careermilard

Remote

Lead Teller - Monkey Junction

Remote
← Back