[Remote] Remote | Machine Learning Systems Evaluation Engineer — Up to $90/hour

Remote Full-time

Note: The job is a remote job and is open to candidates in USA. 24-MAG is offering a specialized remote consulting opportunity for experienced machine learning engineers. The role focuses on evaluating complex machine learning and AI engineering implementations, supporting workflows related to ML system evaluation, and providing structured feedback on MLOps and deployment processes.ResponsibilitiesUse modern coding agents to complete and evaluate complex machine learning and AI engineering tasksReview generated implementations involving model training, inference systems, MLOps workflows, LLM applications, and AI-powered product featuresAssess technical outputs for correctness, quality, maintainability, performance, reliability, and production-readinessApply professional machine learning engineering judgment to realistic technical scenariosEvaluate ML system workflows involving model deployment, inference infrastructure, monitoring, testing, and production integrationReview implementation choices related to scalability, latency, data flow, model serving, reliability, and system maintainabilityIdentify bugs, edge cases, performance issues, failure modes, and weak assumptions in ML engineering outputsProvide structured feedback on MLOps design, deployment patterns, and production ML system qualityCompare outputs from multiple coding agents and assess their strengths, weaknesses, accuracy, and practical usefulnessIdentify where generated solutions succeed, where they fail, and where additional ML engineering judgment is requiredEvaluate whether generated machine learning implementations reflect real-world engineering standardsDocument technical review findings clearly for project teams and quality evaluation workflowsProduce clear, structured evaluations of machine learning engineering tasks and generated outputsExplain reasoning around model training, inference systems, deployment infrastructure, LLM applications, performance, and architectural trade-offsSupport technical assessment workflows by documenting accepted work, improvement areas, and practical engineering conclusionsHelp ensure outputs reflect production-scale machine learning engineering expectationsSkills2+ years of professional machine learning engineering experienceHands-on experience building production ML systems, model deployment infrastructure, LLM applications, or AI-powered productsRegular use of AI coding agents such as Cursor, Claude Code, Codex, Windsurf, Gemini CLI, or comparable toolsAbility to evaluate generated machine learning implementations and identify technical trade-offs, bugs, edge cases, and performance issuesStrong understanding of model training, inference workflows, MLOps, data pipelines, evaluation methods, deployment patterns, and system reliabilityClear written communication skills and comfort documenting technical reasoning in a remote, project-based environmentA degree in Computer Science, Machine Learning, Artificial Intelligence, Data Science, Software Engineering, Computer Engineering, Statistics, Mathematics, or a related technical field is helpfulEquivalent professional experience in machine learning engineering, applied AI, MLOps, LLM applications, or production ML systems is also highly relevantExperience deploying ML systems to production is strongly preferredExperience with Python, PyTorch, TensorFlow, scikit-learn, Hugging Face, LangChain, LlamaIndex, MLflow, Ray, or comparable ML toolsFamiliarity with model serving, feature pipelines, vector databases, embeddings, retrieval systems, LLM application architecture, or evaluation frameworksExperience with cloud platforms, Docker, Kubernetes, CI/CD pipelines, observability tooling, or production deployment workflowsBackground in technical code review, ML architecture review, model performance evaluation, or large-scale AI product engineeringStrong comfort working in sprint-based project environments with focused technical assessment windowsBenefitsRemote consulting work aligned with machine learning engineering, coding agent, and technical evaluation expertiseFully remote and flexible schedulingSprint-based, project-based availabilityPayments are made weekly via Stripe or Wise based on services renderedSome projects may use accepted-task compensation depending on the specific workflowCompany OverviewAt 24-MAG, we support emerging AI and consulting platforms by sourcing and connecting qualified professionals with remote, contract-based opportunities. It was founded in undefined, and is headquartered in Sheridan, Wyoming, US, with a workforce of 2-10 employees. Its website is https://24-mag.com/.

Apply Now →

[Remote] Remote | Machine Learning Systems Evaluation Engineer — Up to $90/hour

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Director of CRM & Lifecycle Marketing

[Remote] Senior Program Manager, Product & Marketing Operations

Escalations Associate

Experienced Remote Customer Service Representative – Deliver Exceptional Support Experiences for arenaflex Customers

Virtual English LanguageTeacher

(US) Account Executive - Physician Groups

Experienced Remote Data Entry Specialist - Work from Home Opportunity with Flexible Schedule and Competitive Pay

Hiring Now: Looking for Chemistry Tutor in North Canton, OH

Sourcer, Operations, Remote Job

Part-time Customer Support Representative – Chat

[Remote] Remote | Machine Learning Systems Evaluation Engineer — Up to $90/hour

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Director of CRM & Lifecycle Marketing

[Remote] Senior Program Manager, Product & Marketing Operations

Escalations Associate

**Experienced Remote Customer Service Representative – Deliver Exceptional Support Experiences for arenaflex Customers**

Virtual English LanguageTeacher

(US) Account Executive - Physician Groups

**Experienced Remote Data Entry Specialist - Work from Home Opportunity with Flexible Schedule and Competitive Pay**

Hiring Now: Looking for Chemistry Tutor in North Canton, OH

Sourcer, Operations, Remote Job

Part-time Customer Support Representative – Chat

Experienced Remote Customer Service Representative – Deliver Exceptional Support Experiences for arenaflex Customers

Experienced Remote Data Entry Specialist - Work from Home Opportunity with Flexible Schedule and Competitive Pay