Ara Gamma – Data Engineer (LLM Data & Prompt Engineering) - English language

Remote Full-time

Welo Data works with technology companies to provide datasets that are high-quality, ethically sourced, relevant, diverse, and scalable to supercharge their AI models. As a Welocalize brand, WeloData leverages over 25 years of experience in partnering with the world’s most innovative companies and brings together a curated global community of over 500,000 AI training and domain experts to offer services that span:

ANNOTATION & LABELLING: Transcription, summarization, image and video classification and labeling.

ENHANCING LLMs: Prompt engineering, SFT, RLHF, red teaming and adversarial model training, model output ranking.

DATA COLLECTION & GENERATION: From institutional languages to remote field audio collection.

RELEVANCE & INTENT: Culturally nuanced and aware, ranking, relevance, and evaluation to train models for search, ads, and LLM output.

Want to join our Welo Data team? We bring practical, applied AI expertise to projects. We have both strong academic experience and a deep working knowledge of state-of-the-art AI tools, frameworks, and best practices. Help us elevate our clients' Data at Welo Data.

About the Role

We are looking for Data Engineers to support the development and refinement of high-quality datasets used to train Large Language Models (LLMs).

In this role, you will design and evaluate complex prompts modeled after real customer support ticket journeys, ensuring model outputs align with customer expectations.

You will work closely with engineering teams to generate, annotate, validate, and QA task data used in AI training workflows.

Project Details & Commitment

-Location: US / North America

-Language Requirement: English (Native or C1/C2)

-Contract Type: Freelance, Project-based

-Contract Duration: January 30th – February 13th (possibility of extension)

-Work Schedule (choose one): Minimum 4 hours per day, Monday to Friday

-Commitment: Reliable and consistent availability is mandatory

- Hourly rate: 100 USD

- Start date: Friday, January 30th. Only candidates who are able to start on this date will be considered.

Please note: This opportunity is only available for candidates located in the United States.

Key Responsibilities
• Query & Prompt Generation: Design complex LLM prompts that accurately represent real customer journeys and service interactions.
• Data Shaping & Collaboration: Partner with Field Engineers to transform raw data into structured, high-quality tasks for model training.
• Annotation & Evaluation: Annotate and review tasks to ensure strict quality standards and alignment with expected customer outcomes.
• Quality Assurance: Validate and assess model responses to ensure accuracy, relevance, and confidence in outputs.

Required Skills & Qualifications
• Language: Native or professional fluency (C1/C2) in English
• LLM & Prompting Knowledge: Understanding of LLM behavior and prompt engineering principles
• Analytical Skills: Strong attention to detail, critical thinking, and comfort working with ambiguous scenarios
• Technical Skills:
• SQL for data extraction
• Python (Pandas, NumPy) for data manipulation
• Experience with annotation tools (e.g., Labelbox, Prodigy, or similar platforms)
• Advanced proficiency in Google Sheets/Drive
• Familiarity with version control tools (GitHub)
• AI/ML Tools: Experience working with playground environments and prompt debugging
• Communication: Excellent technical writing skills and ability to clearly explain data requirements
• Nice to Have:
• Prior experience in data labeling, technical support analysis, or AI model evaluation
• Background or exposure to AI-related projects

Ready to Join?

If you’re excited about working hands-on with cutting-edge AI models and shaping how LLMs understand real customer journeys, we’d love to hear from you.

This is a great opportunity to collaborate with experienced engineering teams, apply your technical and analytical skills to real-world AI challenges, and make a direct impact on model quality and performance.

Apply now and be part of building the next generation of AI-powered solutions.

Apply Now

Apply Now

Apply Now →

Ara Gamma – Data Engineer (LLM Data & Prompt Engineering) - English language

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Enterprise Security Engineer

[Remote] Conveyance & Racking Field Project Manager

IC5 – Staff Engineer

Senior SIEM Engineer

Remote Mental Health Therapist (LPC, LMHC, LCSW, LMFT)

Experienced Customer Service Representative – Call Center Operations

Science Lab Teacher Trainer - Volunteer Position - Onsite in Liberia

[Hiring] Customer Service Manager @The Tucker Real Estate

RN and LPN Indeed Virtual Hiring Event for Johnson State Prison (76037)

Experienced Full Stack Data Entry Clerk – Remote Opportunity with arenaflex

Ara Gamma – Data Engineer (LLM Data & Prompt Engineering) - English language

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

USPS Office Helper

Enterprise Security Engineer

[Remote] Conveyance & Racking Field Project Manager

IC5 – Staff Engineer

Senior SIEM Engineer

Remote Mental Health Therapist (LPC, LMHC, LCSW, LMFT)

**Experienced Customer Service Representative – Call Center Operations**

Science Lab Teacher Trainer - Volunteer Position - Onsite in Liberia

[Hiring] Customer Service Manager @The Tucker Real Estate

RN and LPN Indeed Virtual Hiring Event for Johnson State Prison (76037)

**Experienced Full Stack Data Entry Clerk – Remote Opportunity with arenaflex**

Experienced Customer Service Representative – Call Center Operations

Experienced Full Stack Data Entry Clerk – Remote Opportunity with arenaflex