Data Scientist

Remote Full-time
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.). Statistical Failure Analysis : Identify patterns in AI agent failures across task components (prompts, rubrics, templates, file types, tags) Root Cause Analysis : Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations Dimension Analysis : Analyze performance variations across finance sub-domains, file types, and task categories Reporting & Visualization : Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities Quality Framework : Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings Stakeholder Communication : Present insights to data labeling experts and technical teams Qualifications Statistical Expertise : Strong foundation in statistical analysis, hypothesis testing, and pattern recognition Programming : Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis Data Analysis : Experience with exploratory data analysis and creating actionable insights from complex datasets AI/ML Familiarity : Understanding of LLM evaluation methods and quality metrics Tools : Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL Requirements Experience with AI/ML model evaluation or quality assurance Background in finance or willingness to learn finance domain concepts Experience with multi-dimensional failure analysis Familiarity with benchmark datasets and evaluation frameworks 2-4 years of relevant experience
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

**Experienced Customer Sales Representative – Remote Opportunity at arenaflex**

Remote

**Experienced Remote Customer Service Specialist – Delivering Exceptional Arenaflex Experiences**

Remote

Retail Sales- Valley Fair

Remote

**Experienced Bilingual Customer Service Representative – Temporary Assignment with blithequark**

Remote

Healthcare Compliance Analyst (Remote with travel)

Remote

**Experienced Remote Customer Service Representative – Amazon's Customer Experience Team**

Remote

Compliance Analyst needed for SOC2, PCI DSS and ISO27001

Remote

Senior Manager-Site Analytics Optimization

Remote

**Experienced Overnight Online Chat Consultant | Provide Expert Assistance During Night Hours | Earn $25-$35/HR**

Remote

[Remote] Auto Telephone Claims Adjuster Trainee

Remote
← Back