QA Engineer – AI Systems

Remote Full-time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Everest Technologies, is seeking the following. Apply via Dice today!

We are seeking a QA Engineer with a strong background in
API testing
and
LLM fine-tuning/evaluation
. You will be responsible for the quality assurance of our Agent Mesh infrastructure, ensuring that the correctly translate enterprise business logic into machine-readable actions. Your goal is to ensure that AI agents interact with our reliably, securely, and without hallucinating tool calls.

Key Responsibilities
• AI Tool Validation: Test the accuracy of by verifying that LLMs correctly interpret OpenAPI specifications and trigger the right C#/.NET backend logic.
• Fine-Tuning Data Preparation: Curate and clean high-quality datasets (JSON/JSONL) in Python to fine-tune models for specific domain tasks and tool-calling accuracy.
• Prompt Regression Testing: Develop automated test suites to ensure that updates to underlying APIs or MCP servers do not break the reasoning or planning capabilities of the AI agents.
• Security & Auth QA: Validate that in Gravitee correctly enforce OAuth 2.1 and OpenFGA, preventing unauthorized data leakage through agent conversations.
• Performance Testing: Use to measure latency in the agent-to-API loop and identify bottlenecks in MCP server responses.

Technical Qualifications
• API Testing Mastery: Expert knowledge of REST, OpenAPI, and tools like Postman or Insomnia.
• Scripting: Proficiency in Python (for data processing and eval frameworks) and familiarity with C# (to understand backend MCP implementation).
• LLM Evaluation: Experience with frameworks like DeepEval, Ragas, or LangSmith to measure model performance (faithfulness, relevancy, and tool-call precision).
• API Management: Hands-on experience with or similar gateways to monitor and intercept traffic.
• Model Context Protocol: Understanding of and how it standardizes the way LLMs access external data.

Preferred Skills
• Experience with Red Teaming AI agents to identify prompt injection vulnerabilities.
• Knowledge of Vector Databases and how RAG (Retrieval-Augmented Generation) interacts with live API tools.
• Familiarity with GitHub Actions for CI/CD integration of AI evaluation pipelines.

Apply tot his job

Apply To this Job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

[Entry Level Jobs] Delta Airlines Remote Jobs - $25/H

Remote

[Remote] Payroll Support 3

Remote

**Experienced Virtual Customer Care Representative – Delivering Exceptional Service from the Comfort of Your Home**

Remote

Senior Motion Designer (Remote, PA, US)

Remote

**Experienced Data Entry Clerk – Work From Home Opportunity with blithequark**

Remote

Word Processor/Transcriptionist- NATION-WIDE(remote)

Remote

Bilingual Customer Service Representative - Healthcare Support and Member Services in English and Spanish

Remote

Account Executive — Quantum Neuron Inc. (B2B SaaS | $130k OTE + Equity)

Remote

Marketing Assistant & Events Coordinator

Remote

Dental Director

Remote
← Back