Machine Learning Evaluation Specialist (Remote)

Remote Full-time
This a Full Remote job, the offer is available from: United States, Turkey, Greece, Estonia, Latvia, North Macedonia, Hungary, Bulgaria, Albania, Poland, Romania, Kosovo, Portugal, Spain, Malta, United Kingdom, Austria, Belgium, Germany, Ireland, France, Slovakia, Czechia, Italy, Montenegro, Bosnia and Herzegovina, Paraguay, Uruguay, Brazil, Dominican Republic, Venezuela, Ecuador, Colombia, Argentina, Chile, Bolivia, Peru, Mexico, Puerto Rico, Canada, Serbia, Arizona (USA), California (USA), Colorado (USA), Delaware (USA), District of Columbia (USA), Florida (USA), Georgia (USA), Idaho (USA), Illinois (USA), Indiana (USA), Louisiana (USA), Nevada (USA), North Carolina (USA), Ohio (USA), Oklahoma (USA), Pennsylvania (USA), Tennessee (USA), Texas (USA), Virginia (USA), Washington (USA) Machine Learning Evaluation Specialist (Remote) List of accepted countries and locations Important for US applicants: This is a 1099 independent contractor role and is not compatible with F-1 OPT, STEM OPT, or other visa statuses that require W-2 employment, guaranteed hours, or employer sponsorship. We are unable to provide offer letters or employment verification for this role. Help design the hardest ML problems state-of-the-art AI hasn't solved yet. We're hiring domain experts to build evaluation tasks that challenge the frontier of AI. This is not an ML engineering role β€” it's a research role. You'll use deep expertise in your field to create problems that general ML knowledge can't touch. What you'll do β€’ Propose and frame original, research-grade ML problems rooted in your domain β€’ Design evaluation tasks that require specialized knowledge well beyond standard pipelines β€’ Assess AI-generated solutions for correctness, creativity, and methodological rigor β€” and explain exactly where and why they fall short β€’ Document problem difficulty, required domain knowledge, and expected failure modes What you need β€’ Graduate-level expertise (MS or PhD preferred) in a scientific or technical domain that intersects with ML β€’ Strong working knowledge of ML methods β€” model selection, feature engineering, evaluation metrics β€’ Deep familiarity with active research problems in your field β€” you know where general ML knowledge runs out β€’ Excellent written communication β€” you can articulate complex problems clearly and precisely. This cannot be overstated. β€’ Self-motivated and comfortable working independently on intellectually demanding tasks What you don't need β€’ No prior AI training or RLHF experience required β€’ No software engineering background needed β€” domain expertise and research instincts are what matter Domains we're especially looking for β€’ Computational Biology / Bioinformatics β€’ Genomics / Molecular Biology β€’ Physics / Astrophysics / Signal Processing β€’ Climate / Environmental Modeling β€’ Healthcare / Medical Imaging β€’ Neuroscience / Brain-Computer Interfaces β€’ Materials Science / Chemistry β€’ Finance / Quantitative Modeling β€’ Robotics / Control Systems / Reinforcement Learning β€’ Advanced NLP (specialized domains) β€’ Mathematics / Statistics (applied) Logistics β€’ Fully remote β€” work from anywhere β€’ $200–$400/hr depending on domain and seniority β€’ 10–40 hrs/week, hourly contract β€’ Assessment required β€” paid if approved β€’ Independent contractor (1099) β€” not compatible with F-1 OPT, STEM OPT, or visa statuses requiring W-2 employment or employer sponsorship ⚠️ This is a project-based, freelance opportunity with no guaranteed hours. We recommend keeping other work options open while waiting for project assignment. This offer from "G2i Inc." has been enriched by Jobgether.com and got a 75% flex score.
Apply Now β†’

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Manager, Sales - DC, France (Virtual, Other, France)

Remote

**Experienced Full Stack Customer Support Specialist – Live Chat and Construction Industry Expertise**

Remote

Commission Analyst

Remote

Disney Careers , Disney Virtual , Disney Online…

Remote

Experienced Customer Support Specialist – Technical Expert and Advisor for Innovative Product Solutions at blithequark

Remote

**Experienced Customer Retention Specialist – Pet Insurance Industry Expert**

Remote

Internship - Robert W. Straub Fellowships - Public Service - Finance and Investm

Remote

Costco Jobs Northridge, Costco Jobs, What Are Data Entry Jobs

Remote

Experienced Remote Data Entry Specialist – Join arenaflex for a Dynamic Work-from-Home Opportunity in E-commerce and Technology

Remote

Physical Therapist

Remote
← Back