AI Engineer, Prompt Engineering, Python

Remote, Full-time
Description:
• Design and iterate on prompts (system, tool/function-calling, task prompts) to boost voice AI agent success, reliability, and tone.
• Build co-pilots that help customers author their own prompts: meta-prompted assistants that suggest structures, lint for risks, autocomplete tool schemas, critique drafts, and generate eval cases.
• Work directly with customer feedback and conversation logs to identify failure modes; translate them into prompt changes, guardrails, and data improvements.
• Build eval datasets (success labels, rubrics, edge cases, regressions) and run offline/online evaluations (A/B tests, canaries) to quantify impact.
• Create Python utilities/services for prompt versioning, config-as-code, rollout/rollback, and guardrails (policies, refusals, redaction).
• Partner with PM/Success to define success metrics (task completion, first-pass accuracy, cost, latency) and instrument dashboards/alerts.
• Own LLM integration details: function/tool schemas, output parsing/validation (pydantic), retrieval-aware prompting, and fallback strategies.
• Ensure privacy and compliance (PII handling, anonymization, regional data boundaries) in datasets and logs.
• Share learnings via concise docs, playbooks, and internal demos.
• Run a tight feedback loop with customers: turn real conversations into better prompts and eval datasets, and ship changes that measurably improve agent outcomes.

Requirements:
• Python: 3+ years writing clean, tested, production code (typing, pytest, profiling); experience building small services/APIs (FastAPI preferred).
• Prompt Engineering: Hands-on experience designing system/tool prompts, meta-prompting, rubric graders, and iterative prompt tuning based on real user data.
• LLM Integration: Comfortable with major APIs (OpenAI/Anthropic/Google/Mistral), function/tool calling, streaming, and robust output handling.
• Evaluation Mindset: Ability to define measurable success, create labeled datasets, and run methodical experiments and A/B tests.
• Product Sense: Comfortable talking with customers and turning qualitative feedback into shipped improvements.
• Data Hygiene: Practical experience cleaning, labeling, and balancing datasets; awareness of privacy/PII constraints.

Nice-to-haves:
• Experience building prompt-authoring UIs/SDKs or internal tooling for prompt versioning and governance.
• Agentic frameworks & tooling: DSPy, MCP, LangGraph, LlamaIndex, Rasa; experience with agent/tool schemas and orchestration.
• Observability & eval tooling: Langfuse, LangSmith, Braintrust; building eval harnesses and experiment dashboards.
• RAG & vector stores: Qdrant/Weaviate/Pinecone and retrieval-aware prompting.
• Experimentation workflows: A/B testing, prompt diffing/versioning.
• Infra & analytics: light SQL/log analysis, metrics & tracing, simple Grafana/OTel dashboards.
• Writing public blog posts or giving talks about applied LLM techniques.