REMOTE::Senior Software Engineer LLM Evaluation :: AI-generated @ US ,Western Europe

Remote Full-time
Title: Senior Software Engineer LLM Evaluation
Duration: Long term ( depends on candidates performance)

Work Type: Remote ( hybrid or onsite depending on candidate s location)

Multiple openings

Key skills: Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go

Project Overview:

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?
• Working on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
• Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.
• Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.
• Build agents that can verify the quality of the code and identify error patterns.
• Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
• Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills:
• Several years of software engineering experience (+5 years), including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
• Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.
• Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.
• Excellent oral and written communication skills for clear, structured evaluation rationales.

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Pre-Licensed Customer Service Representativ

Remote

**Experienced Data Entry Specialist – Night Shift Remote Opportunity at blithequark**

Remote

Data Entry Specialist - Medical Records (Remote)

Remote

**Experienced Entry-Level Data Entry Specialist – Remote Opportunity with arenaflex**

Remote

Part-Time Virtual Administrative Assistant - Flexible Hours, Client Management, and Professional Growth with MOD Assistants

Remote

Financial Analyst, Senior (Hybrid Remote)

Remote

Document Controller

Remote

IT Helpdesk Technician (Lake Mary, FL)

Remote

**Experienced Data Entry Specialist – Remote Work Opportunity with arenaflex**

Remote

Experienced Remote Data Entry Specialist – Home-Based Opportunity for Detail-Oriented Professionals

Remote
← Back