Lead QA Engineer (AI Agent Quality & Evaluation)

Remote Full-time
As a Lead QA Engineer, you’ll own quality strategy for AI-powered systems where correctness is probabilistic, outputs are structured (JSON), and evaluation requires real measurement (accuracy, cost, latency, edge-case handling, regression detection).

You’ll build automated evaluation harnesses, and partner closely with Engineering and Product to prevent silent quality regressions as the system evolves.

High autonomy, high leverage, and direct impact on the core product.

Ideal Profile
• Lead QA engineer who has moved beyond manual testing into automation, tooling, and quality systems.
• Comfortable testing systems where “expected output” is not always deterministic — and knows how to create evaluation strategies anyway.
• Strong Python + data mindset: can build repeatable harnesses, metrics pipelines, and regression suites.
• Product-minded and skeptical in the best way: notices failure modes, ambiguous cases, and risks early.
• Comfortable collaborating with engineers and shipping quality gates, not just filing bugs.
• Hands-on experience with AI developer / agent tooling (e.g., Claude Code, GitHub Copilot or similar) and building agents that amplify inputs and orchestrate multi-step workflows (prompt engineering, tool integration).

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Tax Manager - Top 100 CPA firm (Up to 180K+)

Remote

Experienced Overnight Remote Customer Service Representative – Provide Exceptional Support and Earn Competitive Hourly Rates from the Comfort of Your Own Home at blithequark

Remote

Senior Developer Experience Advocate

Remote

Sr Cyber Incident Response Analyst- Remote or Onsite in MN or DC

Remote

Experienced Customer Service Monitoring Representative – Alarm Response and Dispatch Specialist for Residential and Commercial Security Systems

Remote

Director, Global Strategic Market Insights (Remote)

Remote

Remote Email Marketing Specialist - Jamaica

Remote

**Experienced Full Stack Data Entry Specialist – Remote Work Opportunity with arenaflex**

Remote

Experienced Remote Chat Support Agent for blithequark - Elevating Client Experience in Public Relations

Remote

Coder I

Remote
← Back