Swift Engineer (5+ YOE) – AI / LLM Code Evaluation (Remote, Contract)

Remote Full-time
Company: Mercor. Type: Contract (Full-time or Part-time). Location: Remote (Worldwide). Language: Professional English required. Compensation: - USD $30 – $90/hour (depending on experience & evaluation performance). - Weekly payments via Stripe or Wise. - Flexible workload (project-based, scalable hours). Mission: - Work directly with leading AI teams to improve how large language models reason about code, systems design, and technical problem-solving. - You will evaluate and refine AI-generated responses, making them more accurate, reliable, and aligned with real-world engineering standards. Responsibilities: - Evaluate AI-generated answers to coding and system design problems. - Execute and validate code outputs. - Identify bugs, inefficiencies, and incorrect reasoning. - Assess code quality & readability. - Assess algorithmic correctness. - Assess system design logic. - Annotate responses with structured, actionable feedback. - Follow defined evaluation frameworks and quality benchmarks. Required Skills: Core: - Swift (expert level). - Software Engineering (5+ years). - Data Structures & Algorithms. - Systems Design. - Debugging & Code Review. - Problem Solving (Medium–Hard level). Technical: - Code Execution & Testing. - API Design & Backend Logic. - Performance Optimization. - Version Control (Git). AI / Evaluation Context: - Experience using LLMs in development workflows. - Ability to evaluate reasoning, not just outputs. Nice-to-Have Skills: - RLHF / AI Model Evaluation. - Competitive Programming. - Open-source contributions (merged PRs). - Multi-language experience (Python, JS, etc.). - Technical writing / explaining complex concepts. Ideal Candidate: - Degree in Computer Science or related field (BS/MS/PhD). - Strong real-world engineering background. - Detail-oriented and highly analytical. - Comfortable identifying subtle logic flaws and edge cases. - Able to work independently in async environments. What You Will Achieve: - Improve the quality and reasoning of AI-generated code. - Influence how AI systems assist developers globally. - Deliver high-quality evaluation outputs that directly impact model performance. Location: Remote - Anywhere Skills required for this job: • AI model evaluation • API design • Algorithm development • Code review • Data science • Data structures • Debugging • Git • JavaScript • Large language model (LLM) • Performance optimization • Python • Reinforcement learning from human feedback (RLHF) • Software engineering • Swift • Systems design • Technical writing • Testing
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Amazon Flex Delivery Jobs – Part-Time, Flexible Hours

Remote

Medical Science Liaison, US remote

Remote

Experienced Virtual Customer Service Representative – Remote Work Opportunity with blithequark for Delivering Exceptional Customer Experiences

Remote

Experienced Remote Customer Service Representative – Delivering Exceptional Support from the Comfort of Your Own Home

Remote

Cybersecurity Engineer III

Remote

Sr National Account Manager – Walmart, Sam’s Club

Remote

**Experienced Remote Wayfair Chat Service Agent – Customer Experience Specialist with Excellent Communication Skills**

Remote

Experienced Remote Data Entry Clerk – Healthcare Administration with blithequark

Remote

High Risk OB Nurse Care Manager – RN / BSN – Hybrid Role NYC in New York City, NY

Remote

**Experienced Part-Time Remote Chat Support Specialist for Moms - Flexible Work Arrangement with Competitive Hourly Rate**

Remote
← Back