Quick Reminder - DataOps & Build Engineer - Remote - USA
Role: DataOps & Build Engineer Data Analytics Location: Remote - USA Project Duration: 6 to 9 Months of contract We are seeking an experienced and visionary DataOps & Build Engineer to lead the architecture and optimization of a next-generation data platform. This critical role requires 8+ years of expertise to drive technical direction, mentor teams, and automate complex CI/CD pipelines in a fast-paced environment. You will be instrumental in bridging development and operations to ensure a scalable, high-performance data lifecycle that powers enterprise-level decision-making. Key Responsibilities: β’ Establish DataOps Framework: Define, document, and champion the organizational framework and guidelines for DataOps-including release management processes, environment promotion strategy, and data quality standards. β’ Best Practice Dissemination: Create and enforce standard operating procedures (SOPs) for data pipeline development, CI/CD, and testing across the engineering teams, ensuring consistency and adherence to architectural standards β’ Data Pipeline Automation: Design and implement robust continuous integration and continuous delivery (CI/CD) pipelines for data code and infrastructure β’ Workflow Orchestration Implementation: Configure, optimize, and manage the deployment of data workflows using orchestrators such as Dagster or Talend, focusing on automated testing and deployment steps. β’ Version Control & Repository Management: Enforce best practices for source code management (e.g., Gitflow), branching strategies, and repository organization across all data projects. β’ Infrastructure as Code (IaC): Work with Infrastructure teams to automate provisioning and management of data platform resources efficiently within AWS. β’ Resilience and Failure Recovery: Design and implement automated rollback and self-healing mechanisms within pipelines to quickly recover from transient failures. β’ Monitoring and Logging: Set up comprehensive monitoring, logging, and alerting using Cloud native tools, or other tools to ensure visibility into pipeline performance and quickly identify and resolve issues β’ Security and Compliance: Ensure data security and compliance by implementing IAM policies, encryption, and other security measures in AWS, adhering to best practices for handling sensitive data β’ Testing Frameworks: Implement automated testing strategies across the data lifecycle, including unit tests, integration tests, and data quality validation checks (e.g., column integrity, schema drift) to ensure data reliability before deployment β’ Resource and Cost Optimization: Implement automated policies and monitoring to track and control cloud resource consumption, ensuring that pipelines run efficiently and cost-effectively Candidate Profile: β’ 8+ years of hands-on experience in Data Engineering, DevOps, or a dedicated DataOps role, focused heavily on automation and operational excellence β’ Proven experience implementing CI/CD practices specifically for data pipelines and data infrastructure β’ Strong conceptual understanding of data warehousing, ETL/ELT methodologies, and cloud-native architecture. β’ Automation First Mindset: A strong drive to automate repetitive tasks and eliminate manual intervention in the data lifecycle β’ Collaboration: Excellent communication skills, capable of working effectively with Data Engineers, Data Scientists, and Infrastructure teams β’ Insurance industry experience preferred but not mandatory β’ Tools: β’ Cloud Environment: AWS (S3, IAM, VPC, etc.) β’ Pipeline Build: Dagster or Talend β’ Ingest & Transform: dbt Core, AWS Glue, or Flexter β’ Streaming/Integration: Confluent or AWS Streaming Services