[Remote] GCP Data Engineer
Note: The job is a remote job and is open to candidates in USA. Rivago Infotech Inc is seeking a highly skilled Senior GCP Data Engineer to design, build, and optimize scalable, cloud-native data pipelines on Google Cloud Platform (GCP). This role will act as the primary technical owner for implementing data pipelines and driving engineering best practices across the data ecosystem.ResponsibilitiesDesign, develop, and maintain scalable data pipelines on GCP for batch and real-time processingImplement data ingestion frameworks (push/pull) using services like Dataflow, DataStream, Pub/Sub, and Cloud RunBuild and optimize data transformation pipelines using BigQuery, Dataform, and Spark-based frameworksAct as the technical lead/owner ensuring alignment with EA-approved data architectureDrive best practices in data engineering, including CI/CD, testing, monitoring, and cost optimizationCollaborate with cross-functional teams to deliver end-to-end data solutionsEnsure data quality, reliability, governance, and performance across pipelinesSkills6+ years of experience in data engineering, with strong focus on Google Cloud Platform (GCP)GCP Professional Data Engineer certification - must haveDesign, develop, and maintain scalable data pipelines on GCP for batch and real-time processingImplement data ingestion frameworks (push/pull) using services like Dataflow, DataStream, Pub/Sub, and Cloud RunBuild and optimize data transformation pipelines using BigQuery, Dataform, and Spark-based frameworksAct as the technical lead/owner ensuring alignment with EA-approved data architectureDrive best practices in data engineering, including CI/CD, testing, monitoring, and cost optimizationCollaborate with cross-functional teams to deliver end-to-end data solutionsEnsure data quality, reliability, governance, and performance across pipelinesStrong hands-on experience with Medallion architecture (Centralized Hub/Spoke model)Strong hands-on experience with BigQuery (data warehousing & optimization)Strong hands-on experience with Dataform (SQL-based transformations)Strong hands-on experience with Dataflow (batch & streaming pipelines)Strong hands-on experience with Datastream (CDC ingestion)Strong hands-on experience with Cloud Composer (Airflow) (orchestration)Strong hands-on experience with Dataproc (Spark/PySpark)Strong hands-on experience with Cloud RunStrong hands-on experience with Cloud StorageStrong hands-on experience with Pub/SubAdvanced proficiency in PythonAdvanced proficiency in PySpark / Apache SparkAdvanced proficiency in SQL (complex transformations, performance tuning)Deep understanding of data modeling techniquesDeep understanding of Medallion architecture (Bronze/Silver/Gold layers)Deep understanding of Centralized Hub/Spoke data platform designStrong ownership and accountabilityAbility to work across distributed teams and vendorsExcellent communication with technical and non-technical stakeholdersExperience with real-time streaming architecturesFamiliarity with CI/CD pipelines (Terraform, Cloud Build, GitOps)Strong understanding of data governance, security, and complianceExperience working in large enterprise or utility/regulated environmentsCompany OverviewRivago Infotech is a global professional services and staffing organization, delivering exceptional talent and technical solutions across IT, Non-IT, Healthcare, and Engineering sectors. It was founded in 2018, and is headquartered in Wilmington, Delaware, USA, with a workforce of 51-200 employees. Its website is https://www.rivagoinfotech.com.