REMOTE::Senior Software Engineer LLM Evaluation :: AI-generated @ US ,Western Europe

Remote Full-time
Title: Senior Software Engineer LLM Evaluation
Duration: Long term ( depends on candidates performance)

Work Type: Remote ( hybrid or onsite depending on candidate s location)

Multiple openings

Key skills: Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go

Project Overview:

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?
• Working on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
• Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.
• Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
• Build agents that can verify the quality of the code and identify error patterns.
• Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
• Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills:
• Several years of software engineering experience (+5 years), including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
• Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
• Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
• Excellent oral and written communication skills for clear, structured evaluation rationales.

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Senior DevOps Engineer (Infrastructure & MLOps)

Remote

Web Chat Officer - Work from home

Remote

**Experienced Remote Customer Service Representative – Deliver Exceptional Customer Experiences from the Comfort of Your Own Home at arenaflex (Up to $35/hr)**

Remote

Hope R. Edison and Julian I. Edison Curator of Prints, Drawings, and Photographs

Remote

.NET Full Stack Developer

Remote

Vendor Manager Lead (Remote in US)

Remote

Experienced Customer-Focused Remote Web Chat Support Representative – Delivering Exceptional Technical Support and Customer Service Experience

Remote

Digital Business Partner – Remote Career Transition for Teachers

Remote

Manager, Digital Production

Remote

Remote Data Entry/Order Management - Must Reside In Indianapolis

Remote
← Back