[Remote] Sr. Engineer, Development Operations - Archimedes
Note: This is a remote position open to candidates in the USA. Navitus Health Solutions is an industry leader in specialty drug management solutions. The Sr. Engineer, Development Operations is responsible for architecting and advancing the organization's cloud platform, providing strategic technical leadership for Azure-based applications and infrastructure while ensuring secure and scalable technology solutions across the enterprise.

Responsibilities
- Serve as the senior technical authority for DevOps, Platform Engineering, Infrastructure-as-Code, CI/CD, DataOps, MLOps, and cloud automation practices
- Define and maintain enterprise standards, reference architectures, reusable patterns, and governance frameworks for Azure cloud platforms and software delivery pipelines
- Lead architecture reviews for cloud infrastructure, deployment automation, platform services, integrations, and enterprise modernization initiatives
- Provide technical leadership and mentorship to software engineers, cloud engineers, DevOps engineers, and data engineers on cloud-native engineering practices
- Establish and govern Infrastructure-as-Code standards, Terraform module strategy, pipeline standards, branching strategies, and release management processes across the organization
- Drive adoption of platform engineering principles, self-service infrastructure, internal developer platforms, and automation-first operating models
- Lead cloud platform modernization initiatives, application modernization efforts, migration programs, and operational transformation projects
- Partner with Enterprise Architecture and Security teams to ensure cloud platforms align with cybersecurity, compliance, governance, and resiliency requirements
- Define observability standards, SLOs, SLIs, operational metrics, and platform reliability objectives
- Lead root cause analysis efforts for critical incidents and establish corrective actions to improve platform resilience and operational maturity
- Evaluate emerging technologies, cloud-native
services, AI-enabled automation platforms, and engineering tools to improve delivery capabilities and reduce operational complexity
- Participate in technology roadmap development and provide recommendations regarding platform investments, automation opportunities, cloud strategy, and engineering best practices
- Act as the highest level technical escalation point for complex platform, integration, automation, infrastructure, deployment, and cloud engineering issues
- Support vendor evaluations, architecture reviews, proof-of-concepts, and technical due diligence activities
- Collaborate closely with application development teams to optimize build, test, release, and deployment workflows, reducing cycle time and increasing release frequency
- Architect, implement, and optimize multi-stage CI/CD pipelines using Azure DevOps, YAML, environments, templates, deployment groups, release gates, and GitHub Actions
- Build and maintain CI/CD pipelines supporting applications, APIs, microservices, infrastructure, data pipelines, Databricks workloads, analytics platforms, and machine learning deployments
- Serve as a technical owner for Infrastructure-as-Code practices, ensuring cloud infrastructure, networking, security controls, and platform services are provisioned, managed, and governed through Terraform and version-controlled automation
- Develop reusable Terraform modules, Bicep templates, ARM templates, shared pipeline templates, and platform blueprints to standardize deployment patterns across environments
- Automate infrastructure provisioning using Terraform, Bicep, ARM templates, Azure CLI, and PowerShell, enforcing modular reusable patterns, policy-as-code, and enterprise guardrails
- Implement policy-as-code, compliance-as-code, and governance automation using Azure Policy, Management Groups, RBAC, Defender for Cloud, and related Azure governance capabilities
- Manage and optimize package repositories including Azure Artifacts, GitHub Packages, NuGet, and npm, ensuring artifact integrity,
traceability, and lifecycle control
- Support IT governance of the code release lifecycle, including semantic versioning, automated tagging, changelog generation, approval workflows, and deployment auditability
- Evaluate, integrate, and manage the DevOps toolchain, including build servers, artifact repositories, dependency scanners, secrets managers, testing tools, and security scanning tools
- Design and deliver secure, scalable cloud-native architectures using Azure App Services, AKS, Azure SQL, Azure Functions, API Management, Key Vault, Storage Accounts, and related Azure services
- Design, implement, and support enterprise integration patterns utilizing REST APIs, GraphQL APIs, SOAP services, webhooks, message queues, event-driven architecture, and managed integration services
- Develop and maintain API automation, service connections, authentication integrations, and system-to-system connectivity using Azure API Management, Service Bus, Event Grid, Event Hubs, Logic Apps, Azure Functions, and related services
- Support integrations between SaaS platforms, internal applications, third-party vendors, cloud services, data platforms, automation tools, and enterprise systems
- Partner with software engineering teams to establish API lifecycle management, API security controls, automated testing, monitoring, and deployment patterns
- Partner with software and data analytics teams to support deployment of data pipelines, Azure Data Factory, Databricks, Synapse, Data Lake, Event Grid, Event Hub, and related data platform integrations
- Design, automate, and support DataOps processes across Azure Databricks, Delta Lake, Azure Data Lake Storage Gen2, Azure Data Factory, Synapse Analytics, and related analytics platforms
- Build and maintain automated deployment pipelines for Databricks workspaces, notebooks, jobs, clusters, Unity Catalog, libraries, data products, and supporting cloud infrastructure
- Enable infrastructure and DevOps support for machine learning, ETL, analytics, data lake, and
lakehouse environments, ensuring secure service access, RBAC, observability, and cost controls
- Support MLOps platform capabilities, including model training, validation, deployment, monitoring, model registry integration, and governance automation
- Partner with data scientists and data engineers to operationalize machine learning, AI, and advanced analytics workloads using Azure Machine Learning, Databricks ML, MLflow, and CI/CD automation
- Support enterprise automation initiatives utilizing robotic process automation, workflow orchestration, AI-powered automation, and low-code/no-code automation platforms
- Support integrations with automation platforms such as Microsoft Power Automate, Azure Logic Apps, UiPath, Automation Anywhere, and similar technologies
- Use pipeline analytics and deployment telemetry to identify bottlenecks, reduce build/test times, improve developer feedback loops, and increase delivery reliability
- Enable full-stack observability using Azure Monitor, Log Analytics, Application Insights, KQL dashboards, SLA/SLO instrumentation, synthetic monitoring, and real-time alerting
- Implement shift-left testing strategies, including unit, integration, regression, infrastructure, and performance testing automation directly within CI/CD pipelines
- Apply DevSecOps practices within the CI/CD lifecycle, including SAST, DAST, secrets scanning, dependency scanning, code signing, artifact provenance validation, and container security
- Integrate security tools such as Snyk, CredScan, OWASP ZAP, Aqua Security, Defender for Cloud, or similar platforms into the software delivery lifecycle
- Integrate infrastructure validation into the delivery process using tools such as Pester, InSpec, Terratest, or similar testing frameworks
- Lead containerization and deployment strategies using Docker, Helm, Azure Container Registry, and AKS, supporting secure cluster design, ingress, network isolation, pod security, and autoscaling
- Implement blue-green, canary, and ring deployments for applications
and APIs using traffic routing, feature flags, deployment slots, and progressive release practices
- Build self-service platforms and reusable DevOps tooling to streamline developer onboarding, environment provisioning, service connection governance, and deployment configuration
- Maintain golden pipeline templates, DevOps starter kits, Terraform modules, automation scripts, and reusable deployment patterns to standardize delivery across teams
- Conduct failure mode analysis, root cause analysis, and resilience engineering using Azure Chaos Studio, Availability Zones, health probes, auto-healing logic, and operational telemetry
- Maintain architectural runbooks, deployment documentation, operational SOPs, environment diagrams, and audit evidence supporting change management and compliance requirements
- Support self-service enablement for developers and data teams through pipeline templates, service connection governance, shared agent pools, reusable modules, and automation frameworks
- Serve as a Tier 2/3 escalation point for platform, deployment, automation, pipeline, integration, and cloud infrastructure issues
- Operate within an ITSM framework, contributing to incident, change, release, and problem management practices
- Use Jira Service Management to triage, track, and respond to infrastructure-related work items, automation requests, deployment support, integration requests, and platform incidents
- Participate in, adhere to, and support compliance, people and culture, and learning programs
- Perform other duties as assigned

Skills
- Bachelor's degree or equivalent work experience required
- 8+ years of experience in DevOps, Platform Engineering, Cloud Engineering, Site Reliability Engineering (SRE), or related disciplines, including at least 5 years of hands-on experience designing, implementing, and supporting Azure cloud solutions in enterprise environments required
- Demonstrated experience leading cloud modernization, platform engineering, Infrastructure-as-Code (IaC), DevSecOps, CI/CD
automation, operational excellence, and cloud transformation initiatives required
- Advanced expertise with Azure DevOps, GitHub Actions, Terraform, Bicep, ARM templates, Azure CLI, and enterprise CI/CD architectures supporting application, infrastructure, API, data, and machine learning deployments required
- Experience designing, implementing, and governing enterprise-scale Infrastructure-as-Code frameworks, reusable Terraform modules, platform blueprints, automation standards, and policy-as-code practices required
- Proficient in scripting and automation development using PowerShell, Python, Bash, and related tooling required
- Strong knowledge of Azure-native services including App Services, AKS, Azure SQL, Azure Functions, Key Vault, Storage Accounts, Application Gateway, Load Balancers, VNETs, NSGs, Private Link, Azure DNS, API Management, and related cloud services required
- Hands-on experience with containerization and orchestration using Docker, Kubernetes (AKS), Helm, Azure Container Registry (ACR), ingress controllers, networking, cluster security, and horizontal pod autoscaling required
- Strong understanding of Microsoft Entra ID (Azure Active Directory), RBAC, Managed Identities, Privileged Identity Management (PIM), service principals, and secure identity and access management practices required
- Expertise in monitoring, observability, diagnostics, and reliability engineering utilizing Azure Monitor, Application Insights, Log Analytics, Kusto Query Language (KQL), dashboards, alerting, and operational telemetry required
- Certifications such as Microsoft Certified DevOps Engineer Expert, Azure Solutions Architect Expert, or Azure Security Engineer Associate preferred
- Experience supporting automation, CI/CD, infrastructure, and deployment patterns for AI, machine learning, generative AI, Azure OpenAI, Azure Machine Learning, Databricks ML, MLflow, and related AI platform operations preferred
- Experience establishing observability standards, reliability engineering practices,
service level objectives (SLOs), service level indicators (SLIs), cloud governance models, and operational excellence programs preferred
- Experience with Azure DevOps Extensions, Release Gates, deployment approvals, environment controls, pipeline security scanning, and software supply chain security preferred
- Familiarity with serverless architectures, event-driven design patterns, and Azure services including Functions, Logic Apps, Event Grid, Event Hub, and Service Bus preferred
- Experience supporting enterprise integrations utilizing REST APIs, GraphQL, SOAP services, webhooks, messaging platforms, API Management, and system-to-system integration architectures preferred
- Experience supporting Azure Databricks, Azure Data Factory (ADF), Synapse Analytics, Delta Lake, Data Lake Storage Gen2, DataOps, analytics engineering, and modern data platform architectures preferred
- Experience supporting machine learning platforms, MLOps practices, AI workloads, Azure Machine Learning, Databricks ML, MLflow, and model deployment automation preferred
- Experience mentoring engineers, providing technical leadership, conducting architecture reviews, and guiding engineering best practices across cross-functional teams preferred
- Experience working within regulated environments supporting HIPAA, HITRUST, SOC 2, ISO 27001, NIST, or similar compliance frameworks preferred
- Knowledge of Microsoft's Cloud Adoption Framework, Well-Architected Framework, Zero Trust principles, and cloud governance best practices preferred

Benefits
- Top of the industry benefits for Health, Dental, and Vision insurance
- 4 weeks paid parental leave
- 9 paid holidays
- 401K company match of up to 5% - No vesting requirement
- Adoption Assistance Program
- Flexible Spending Account
- Educational Assistance Plan and Professional Membership assistance
- Referral Bonus Program – up to $750!

Company Overview
Navitus Health Solutions LLC is a full service, URAC-accredited pharmacy benefit management company.
It was founded in 2003, and is headquartered in Appleton, Wisconsin, USA, with a workforce of 1001-5000 employees. Its website is https://www.navitus.com/.