Data Scientist (AI Quality & Evaluation)

Remote Full-time
About the Role

We're looking for a Data Scientist to own the quality, reliability, and trustworthiness of our clinical AI outputs. You'll build the systems that ensure our AI "knows what it doesn't know" — developing evaluation frameworks, calibrated confidence scoring, and automated quality assurance that physicians can actually trust.

What You'll Do
• Design and implement automated evaluation pipelines that assess AI output quality, accuracy, and safety at scale
• Develop uncertainty quantification systems where confidence scores meaningfully correlate with accuracy
• Build comprehensive evaluation frameworks combining automated assessment with clinician-validated test cases
• Implement feedback loops that continuously improve model outputs based on validation signals
• Establish scalable quality gates that catch errors before they reach end users
• Contribute to model alignment and fine-tuning efforts

Qualifications
Required
• Strong foundation in deep learning frameworks (PyTorch) and LLM architectures
• Experience with model evaluation, benchmarking, and quality metrics
• Proficiency in Python and modern ML development tools
• Strong statistical foundations
• Ability to read, implement, and extend research papers
• Excellent communication skills

Preferred
• Master's degree in Computer Science, Machine Learning, Statistics, or related quantitative field (PhD preferred)
• Publications in top ML/AI venues (NeurIPS, ICML, ICLR, ACL)
• Experience with RLHF, DPO, or preference optimization techniques
• Background in healthcare AI or regulated industries
• Experience building evaluation systems for production LLM applications

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Experienced Remote Customer Support Representative – Delivering Exceptional Service and Solutions from the Comfort of Your Home Office at arenaflex

Remote

AWS DevOps Engineer

Remote

Executive Administrative Assistant - Emory College of Arts and Sciences

Remote

Remote Delta Live Chat Agent Jobs (Work From Anywhere)

Remote

Appointment Setter & Lead Generation Specialist for an Insurance Agent in the US (Home Based Part Time)

Remote

Endoscopy Pre-Screening Registered Nurse, Remote MO

Remote

Experienced Bilingual Customer Service Representative – Remote Opportunity with careerzynith

Remote

Experienced External Support Engineer – Content Tools and Workflow Optimization for External Content Creation Teams

Remote

Certified Pharmacy Technician in Boston, MA

Remote

Experienced Remote Data Entry Specialist – Flexible Work from Home Opportunity with arenaflex

Remote
← Back