Remote :: Senior Software Engineer, LLM Evaluation :: AI-generated @ US, Western Europe

Remote Full-time
Title: Senior Software Engineer, LLM Evaluation
Duration: Long term (depends on candidate's performance)
Work Type: Remote (hybrid or onsite depending on candidate's location)
Multiple openings
Key skills: Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go

Project Overview:
As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?
• Work on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
• Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.
• Collaborate with cross-functional teams to improve AI-driven coding solutions, measured against industry performance benchmarks.
• Build agents that can verify the quality of the code and identify error patterns.
• Form hypotheses about the steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on each of them.
• Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills:
• 5+ years of software engineering experience, including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
• Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
• Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
• Excellent oral and written communication skills for clear, structured evaluation rationales.
