Swift Engineer (5+ YOE) – AI / LLM Code Evaluation (Remote, Contract)

Remote Full-time
Company: Mercor. Type: Contract (Full-time or Part-time). Location: Remote (Worldwide). Language: Professional English required. Compensation: - USD $30 – $90/hour (depending on experience & evaluation performance). - Weekly payments via Stripe or Wise. - Flexible workload (project-based, scalable hours). Mission: - Work directly with leading AI teams to improve how large language models reason about code, systems design, and technical problem-solving. - You will evaluate and refine AI-generated responses, making them more accurate, reliable, and aligned with real-world engineering standards. Responsibilities: - Evaluate AI-generated answers to coding and system design problems. - Execute and validate code outputs. - Identify bugs, inefficiencies, and incorrect reasoning. - Assess code quality & readability. - Assess algorithmic correctness. - Assess system design logic. - Annotate responses with structured, actionable feedback. - Follow defined evaluation frameworks and quality benchmarks. Required Skills: Core: - Swift (expert level). - Software Engineering (5+ years). - Data Structures & Algorithms. - Systems Design. - Debugging & Code Review. - Problem Solving (Medium–Hard level). Technical: - Code Execution & Testing. - API Design & Backend Logic. - Performance Optimization. - Version Control (Git). AI / Evaluation Context: - Experience using LLMs in development workflows. - Ability to evaluate reasoning, not just outputs. Nice-to-Have Skills: - RLHF / AI Model Evaluation. - Competitive Programming. - Open-source contributions (merged PRs). - Multi-language experience (Python, JS, etc.). - Technical writing / explaining complex concepts. Ideal Candidate: - Degree in Computer Science or related field (BS/MS/PhD). - Strong real-world engineering background. - Detail-oriented and highly analytical. - Comfortable identifying subtle logic flaws and edge cases. - Able to work independently in async environments. What You Will Achieve: - Improve the quality and reasoning of AI-generated code. - Influence how AI systems assist developers globally. - Deliver high-quality evaluation outputs that directly impact model performance. Location: Remote - Anywhere Skills required for this job: • AI model evaluation • API design • Algorithm development • Code review • Data science • Data structures • Debugging • Git • JavaScript • Large language model (LLM) • Performance optimization • Python • Reinforcement learning from human feedback (RLHF) • Software engineering • Swift • Systems design • Technical writing • Testing
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Home-Based Pediatric LPN; CLARION

Remote

DISTINGUISHED ENGINEER - NETWORK SECURITY (, TX, United States)

Remote

Pharmacy Rebate Specialist Remote

Remote

Business Solution Analyst – Vendor Management Systems

Remote

**Part-Time Remote Customer Service Representative – Work From Home Opportunity at arenaflex**

Remote

External Hiring|SME|International|31072026

Remote

Apply Now: ULTA Beauty No Experience Jobs $25/Hr

Remote

Remote Community Safety Chat Moderator – Protect Online Spaces, Flexible Hours, $25‑$35/hr, Work‑From‑Home Opportunity

Remote

Hepatitis Epidemiologist

Remote

Leadership Coach | Work Remotely | Flexible

Remote
← Back