Senior LLM Engineer

Remote, Full-time
Tasks:
• As a Senior Engineer, you will design and ship agentic AI systems that plan, call tools, and execute reliably inside production workflows. You’ll own the end-to-end delivery of GenAI capabilities—from model adaptation and retrieval to orchestration, evaluation, and operational excellence.
• Build agentic systems: design supervisor/planner/executor patterns, routing, memory/context strategies, tool/function calling, and robust failure handling.
• LLM adaptation & deployment: fine-tune or parameter-efficiently adapt open-source LLMs; optimize inference (latency/cost) and ship safely to production.
• Retrieval-augmented generation (RAG): implement embedding, retrieval, re-ranking, and grounding patterns; optimize for quality, speed, and cost.
• Structured and reliable generation: enforce schemas/structured outputs, guardrails, and post-processing; reduce hallucinations and brittleness.
• Evaluation & quality: build automated evaluation harnesses for agents/LLMs (offline benchmarks + online monitoring), regression tests, and prompt/model versioning.
• Production engineering: ship containerized services and APIs; implement CI/CD, observability, and reliability practices (SLOs, alerting, incident readiness).
• Cross-functional delivery: collaborate with product, platform, and data teams to integrate GenAI features into user-facing and internal workflows; mentor others.

Requirements:
• 5+ years building production ML/AI systems; 2+ years at senior/lead level.
• Strong Python engineering (testing, packaging, code quality, performance profiling).
• Hands-on experience with LLMs and agentic AI in real systems (tool calling, orchestration, workflow integration).
• Experience adapting LLMs (LoRA/QLoRA/PEFT or equivalent) and evaluating quality/safety.
• Experience implementing RAG and operating retrieval components in production.
• Strong MLOps fundamentals: containers, CI/CD, model/service versioning, monitoring.
• API/service development: REST/gRPC, auth, rate limits, error handling, resilience patterns.
• Comfortable operating in cloud environments (AWS/GCP/Azure) with production constraints.

Nice to have:
• Inference optimization: quantization, batching/caching, GPU serving (e.g., vLLM/TGI or similar).
• Agent safety engineering: prompt injection defenses, tool security, sandboxing, red teaming.
• Advanced evaluation: LLM-as-judge, preference testing, rubric-based grading, A/B testing.
• Vector database operations/tuning and retrieval performance engineering.
• Event-driven or workflow orchestration experience (e.g., Temporal, Airflow, n8n, or equivalents).
• Multi-lingual GenAI experience and robust internationalization practices.
