[Remote] GCP Data Engineer
Note: The job is a remote job and is open to candidates in USA. CoSourcing Partners is an Enterprise-AI and IT Services Company seeking a highly skilled GCP Data Engineer. The role involves designing, developing, and optimizing scalable data solutions on Google Cloud Platform, focusing on building robust data pipelines and ensuring data quality for analytics and machine learning use cases.ResponsibilitiesDesign, build, and optimize scalable batch and real-time (streaming) data pipelines using GCPnative servicesDevelop and maintain data ingestion frameworks leveraging tools such as Pub/Sub, Dataflow, and Cloud StorageImplement data transformation pipelines using BigQuery, dbt, and Python-based workflowsEnsure efficient handling of large-scale structured and unstructured datasetsDesign and implement high-performance data models for cloud-based data lakes, data warehouses, and analytics platformsOptimize data schemas and partitioning strategies in BigQuery for performance and cost efficiencySupport modern architectures such as medallion (bronze/silver/gold) layers and lakehouse patternsWrite advanced SQL queries for transformation, validation, and analyticsDevelop scalable data processing logic using Python and/or Apache BeamBuild reusable, modular, and maintainable code for data workflowsImplement and maintain data quality checks, validation rules, and anomaly detection frameworksEnable data observability through monitoring, logging, and alerting mechanismsEnsure highly reliable data pipelines with fault tolerance and error handling strategiesSupport migration and modernization efforts from legacy ETL tools (e.g., Talend) to GCP-native ELT frameworks (dbt)Optimize existing pipelines for performance, scalability, and maintainability in cloud environmentsDrive adoption of ELT best practices using BigQuery as the compute engineCollaborate with data architects, business analysts, and machine learning teams to deliver trusted datasetsTranslate business requirements into scalable data solutionsProvide technical guidance and support for downstream analytics and reporting use casesDrive adoption of best practices in cloud data engineering, CI/CD, and DevOpsImplement secure data access controls using IAM roles, policies, and governance frameworksFollow standards for code quality, version control (Git), and automated deploymentsSkillsBachelor's or Master's degree in Computer Science, Engineering, or related field4+ years of experience in data engineering or data platform developmentHands-on experience with Google Cloud Platform (GCP) services: BigQuery, Dataflow, Pub/Sub, Cloud StorageStrong proficiency in SQL and PythonExperience with dbt (Data Build Tool) or similar ELT frameworksExperience building batch and streaming data pipelinesExperience with Apache Beam or SparkFamiliarity with Talend or other ETL tools and migration to cloud-native solutionsKnowledge of data lakehouse architectures and modern data stackExperience with CI/CD tools (e.g., GitHub Actions, Cloud Build, Jenkins)Understanding of data security, governance, and compliance standardsExposure to machine learning data pipelines and feature engineeringCompany OverviewWe are a Chicago-based Enterprise-AI and IT Services Company. It was founded in 2011, and is headquartered in Westmont, Illinois, USA, with a workforce of 51-200 employees. Its website is https://cosourcingpartners.com/.Company H1B SponsorshipCoSourcing Partners - Enterprise-AI and IT Services Company has a track record of offering H1B sponsorships, with 2 in 2023, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.