[Remote] AI Software Engineer, Senior
Note: The job is a remote job and is open to candidates in USA. Cayuse Holdings is looking for a Senior AI Software Engineer to advance enterprise AI initiatives by transforming proof-of-concept solutions into scalable web applications. The role involves developing production-grade AI/ML services, collaborating with cross-functional teams, and ensuring compliance with enterprise standards.ResponsibilitiesDesign, develop, and maintain productionâgrade AI/ML services and web applications that extend existing POC solutions into scalable, secure, and reliable enterprise platformsImplement and optimize AI/ML workflows for: Model ingestion and lifecycle management, Automated quantity extraction from plans and documents, Plan conformance and rulesâbased checks, Computer visionâbased asset detection and inspection, NLP/LLMâbased plan review automation and document analysisBuild secure, userâfriendly web interfaces and APIs that enable engineering and business users to leverage AI capabilities within their dayâtoâday workflowsArchitect, implement, and manage CI/CD pipelines to support rapid, reliable deployment of AI/ML models and related servicesDeploy and manage AI/ML workloads across one or more major cloud platforms (AWS, Azure, GCP, OCI), leveraging native AI/ML services as appropriateImplement MLOps best practices, including experiment tracking, model registry, feature stores, monitoring, and automated retraining where appropriateOptimize model performance and cost through techniques such as quantization, pruning, distillation, and efficient distributed trainingIntegrate and operationalize LLM and NLP solutions (e.g., transformers, RAG systems) to support text understanding, summarization, Q&A, and other intelligent automation use casesCollaborate with data engineers, cloud engineers, and domain experts to design robust data pipelines and architectures for AI/ML workloads, including timeâseries, image/video, and text dataEnsure that all solutions adhere to security, compliance, and governance standards, especially when working with sensitive or regulated dataProvide technical leadership, mentorship, and guidance to junior engineers and peers, promoting best practices in AI/ML engineering, DevOps, and software craftsmanshipProduce highâquality technical documentation, including architecture diagrams, API specifications, deployment runbooks, and user guidesParticipate in technical planning, backlog grooming, and estimation; contribute to roadmap development for AI/ML capabilitiesSkills8+ years of professional software engineering experience, with substantial work in AI/ML and cloudânative developmentExperience with at least one major cloud platform (AWS, Azure, GCP, or OCI) for deploying and managing ML workloadsHandsâon experience with cloud AI/ML services such as Azure AI, AWS SageMaker/Bedrock, GCP Vertex AI, or OCI AI ServicesStrong DevOps background, including: Ansible for configuration management and automation, Docker for containerization, Kubernetes for container orchestration, CI/CD best practices for automated build, test, and deploymentProficiency with relational and nonârelational databases, including: SQL (PostgreSQL, MySQL), NoSQL and vector databases for similarity search and embeddingâbased retrievalStrong scripting skills in both: Bash, PowerShellProven experience designing and maintaining CI/CD pipelines using: Azure DevOps, GitHub Actions, Jenkins, or similar automation tools3â5+ years of productionâlevel Python development (primary implementation language)3+ years of experience with NLP and LLMs, including: Transformer models (BERT, GPT, T5, etc.), RAG (RetrievalâAugmented Generation) systems, Fineâtuning and prompt engineering, Building LLMâbased applications3+ years of experience with timeâseries data, including: Forecasting models, Anomaly detection, Sequential data modeling, Realâtime monitoring systems3+ years of experience building recommender systems, such as: Collaborative filtering, Ranking models, Personalization engines, Content recommendation pipelinesProduction experience with MLOps tools and platforms, such as: MLflow, Weights & Biases, Kubeflow, Airflow, or similar systems for orchestration, tracking, and model lifecycle managementExperience with distributed training, including: Largeâscale model training, MultiâGPU and/or multiânode setups, Data/model parallelism and performance optimizationProduction computer vision experience using: PyTorch and/or TensorFlow, OpenCV, YOLO or similar frameworks for object detection and segmentation, Realâtime inference and deployment workflowsExperience with feature stores (e.g., Feast, Tecton) and/or advanced feature engineering techniquesHandsâon experience with model optimization techniques: Quantization, Pruning, Knowledge distillationExperience working with LLM ecosystems such as: Ollama, Hugging Face, Other nonâfrontier / openâweight modelsDemonstrated AI/ML production track record: Built and deployed at least 2â3+ ML models serving real users (beyond experimental or researchâonly projects)Must be able to pass a background check. May require additional background checks as required by projects and/or clients at any time during employmentExceptional interpersonal skills with the ability to communicate in a clear, professional, and articulate mannerExceptional verbal and written communication skillsExcellent organizational, analytical, and problem-solving skills with high-level attention to detailProven ability to multitask and prioritize in a fast past environment with changing priorities; adaptable to change and a quick learnerMust be self-motivated and able to work well independently as well as on a multi-functional teamAbility to handle sensitive and confidential information appropriatelyProficient in MS Office, Word, Outlook, PowerPoint, and Excel1+ year of experience with Geospatial Information Systems (GIS) and analyzing or modeling spatial dataPrior experience in one or more of the following domains: Transportation, Logistics, Smart city or urban infrastructureBackground applying computer vision to infrastructure or vehicular data, including: Object detection, Image segmentation, Video or sensor data analysisFamiliarity with public sector data compliance, security, and governance, such as: Data classification and handling, Access control and audit requirements, Regulatory and policy constraints for government dataExperience with Unreal Engine in the context of: Realâworld digital twinning, Simulation or immersive visualization of physical environmentsExperience integrating or building solutions with: Google Maps APIs, Cesium or similar 3D mapping/geospatial visualization platformsExperience with Polygonflow Dash and its capabilities for: 3D workflows, Visualization pipelines, Automation of complex modeling or simulation tasksBenefitsMedical, Dental and Vision Insurance; Wellness ProgramFlexible Spending Accounts (Healthcare, Dependent Care, Commuter)Short-Term and Long-Term Disability optionsBasic Life and AD&D Insurance (Company Provided)Voluntary Life and AD&D options401(k) Retirement Savings Plan with matching after one yearPaid Time OffCompany OverviewCayuse Holdings is an economic enterprise that specializes in providing sourcing and diversity solutions. It was founded in 2018, and is headquartered in Pendleton, Oregon, USA, with a workforce of 501-1000 employees. Its website is https://www.cayuseholdings.com/.