Evaluation Scenario Writer - AI Agent Testing Specialist

Remote Full-time
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.

At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.

What we do

The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.

About the Role

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define the gold-standard behavior against which agent actions are compared. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Although every project is unique, your work might typically involve:
Designing structured test scenarios based on real-world tasks
Defining the golden path and acceptable agent behavior
Annotating task steps, expected outputs, and edge cases
Working with developers to test your scenarios and improve clarity
Reviewing agent outputs and adapting tests accordingly

How to get started

Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements

You have a Bachelor's or Master's degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems, or another related field.
You have 3+ years of experience.
Your level of English is advanced (C1) or above.
You are ready to learn new methods, able to switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines.
This freelance role is fully remote, so you just need a laptop, an internet connection, available time, and enthusiasm to take on a challenge.

Benefits

Why this freelance opportunity might be a great fit for you:
Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.
Work on advanced AI projects and gain valuable experience that enhances your portfolio.
Influence how future AI models understand and communicate in your field of expertise.

Originally posted on Himalayas

