AI/ML Consultant – LLM Deployment

Remote Full-time
to set up and configure a self-hosted large language model (Llama 3.1) on our Linux server infrastructure for automated report generation. Primary Objective: Deploy and configure Llama 3.1 (or equivalent) 8B on our hosted Linux server (CPU-only) and create an API service that our SafetyNet Platform can call for AI-powered report generation. Specific Deliverables: Server Environment Setup Configure Linux (Ubuntu 22.04) server environment Install Python 3.11, dependencies, and required libraries Set up virtual environment and security configurations AI Model Installation & Configuration Download and install Llama 3.1 8B Instruct model Optimize model configuration for CPU-only inference Implement quantization if needed for performance Test model functionality and response quality API Service Development (Nice to have) Create REST API service (Flask/FastAPI) for report generation Implement secure endpoints for our SafetyNet Platform to call Add error handling, logging, and health check endpoints Configure service to auto-start on server reboot (systemd) Security & Performance Configure firewall rules (allow only our application server) Implement authentication/API key system Optimize for 30-60 second response times Set up monitoring and logging Documentation & Training Comprehensive setup documentation API usage guide with examples Troubleshooting guide 2-hour knowledge transfer session with our development team Testing & Validation Generate 10+ test reports with sample data Validate output quality and format Performance testing under load Integration testing with our platform (we'll provide API endpoints) Technical Requirements Must Have: 3+ years experience with Python and machine learning frameworks (PyTorch, Transformers) Experience deploying and running large language models (Llama, GPT, Mistral, etc.) Strong Linux system administration skills (Ubuntu/Debian) Experience with API development (Flask, FastAPI, or similar) Understanding of CPU-based ML inference and optimization Experience with Hugging Face model hub Knowledge of systemd service configuration Security best practices for production systems Nice to Have: Experience with model quantization and optimization (bitsandbytes, ONNX) DevOps experience (Docker, monitoring tools) Previous work with government or healthcare systems (HIPAA/FERPA compliance) Experience with justice system or social services applications Apply tot his job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

**Experienced Customer Support Representative – Remote Team at blithequark**

Remote

Mental Health Therapist- Remote Opportunity with Salary, Benefits and more!

Remote

Experienced Service Desk Specialist and Live Chat Agent - Remote Opportunity in Colorado with blithequark

Remote

Python Developer - Remote Contract Job at Fusion Solutions, LLC in Plano

Remote

Remote Customer Support Specialist for Innovative Technology Leader - blithequark - Work from Home Opportunity

Remote

Experienced Remote Call Center Customer Service Representative – Delivering Exceptional Support and Exceeding Customer Expectations at blithequark

Remote

Experienced Arts Learning Coordinator – Arts Education and Community Development Specialist

Remote

System Support Analyst I - Medical Imaging Systems / Linux

Remote

Corporate Treasury Senior Manager – Remote in Washington

Remote

**Experienced Part-Time Virtual Customer Care Representative – Remote Opportunity with arenaflex**

Remote
← Back