Senior Data Engineer

Remote Full-time
Who Are We?
Welcome to Welltech, where health meets innovation! As a global leader in the Health & Fitness industry, we've crossed over 200 million installs with three life-changing apps, all designed to boost well-being for millions. Our mission? To transform lives through intuitive nutrition trackers, powerful fitness solutions, and personalized wellness journeys, all powered by a diverse team of over 700 passionate professionals with a presence across 5 hubs.

Why Welltech?
Imagine joining a team where your impact on global health and wellness is felt daily. At Welltech, we strive to be proactive wellness partners for our users, while continually evolving ourselves.

What We're Looking For
As a Senior Data Engineer, you will play a crucial role in building and maintaining the foundation of our data ecosystem. You'll work alongside data engineers, analysts, and product teams to create robust, scalable, and high-performance data pipelines and models. Your work will directly impact how we deliver insights, power product features, and enable data-driven decision-making across the company.

This role is perfect for someone who combines deep technical skills with a proactive mindset and thrives on solving complex data challenges in a collaborative environment.

Challenges You'll Meet:
Pipeline Development and Optimization: Build and maintain reliable, scalable ETL/ELT pipelines using modern tools and best practices, ensuring efficient data flow for analytics and insights.
Data Modeling and Transformation: Design and implement effective data models that support business needs, enabling high-quality reporting and downstream analytics.
Collaboration Across Teams: Work closely with data analysts, product managers, and other engineers to understand data requirements and deliver solutions that meet the needs of the business.
Ensuring Data Quality: Develop and apply data quality checks, validation frameworks, and monitoring to ensure the consistency, accuracy, and reliability of data.
Performance and Efficiency: Identify and address performance issues in pipelines, queries, and data storage. Suggest and implement optimizations that enhance speed and reliability.
Security and Compliance: Follow data security best practices and ensure pipelines are built to meet data privacy and compliance standards.
Innovation and Continuous Improvement: Test new tools and approaches by building Proofs of Concept (PoCs) and conducting performance benchmarks to find the best solutions.
Automation and CI/CD Practices: Contribute to the development of robust CI/CD pipelines (GitLab CI or similar) for data workflows, supporting automated testing and deployment.

You Should Have:
4+ years of experience in data engineering or backend development, with a strong focus on building production-grade data pipelines.
Solid experience working with AWS services (Redshift, Spectrum, S3, RDS, Glue, Lambda, Kinesis, SQS).
Proficiency in Python and SQL for data transformation and automation.
Experience with dbt for data modeling and transformation.
Good understanding of streaming architectures and micro-batching for real-time data needs.
Experience with CI/CD pipelines for data workflows (preferably GitLab CI).
Familiarity with event schema validation tools/solutions (Snowplow, Schema Registry).
Excellent communication and collaboration skills.
Strong problem-solving skills: able to dig into data issues, propose solutions, and deliver clean, reliable outcomes.
A growth mindset: enthusiastic about learning new tools, sharing knowledge, and improving team practices.

Tech Stack You'll Work With:
Cloud: AWS (Redshift, Spectrum, S3, RDS, Lambda, Kinesis, SQS, Glue, MWAA)
Languages: Python, SQL
Orchestration: Airflow (MWAA)
Modeling: dbt
CI/CD: GitLab CI (including GitLab administration)
Monitoring: Datadog, Grafana, Graylog
Event validation process: Iglu schema registry
APIs & Integrations: REST, OAuth, webhook ingestion
Infra-as-code (optional): Terraform

Bonus Points / Nice to Have:
Experience with additional AWS services: EMR, EKS, Athena, EC2.
Hands-on knowledge of alternative data warehouses like Snowflake or others.
Experience with PySpark for big data processing.
Familiarity with event data collection tools (Snowplow, Rudderstack, etc.).
Interest in or exposure to customer data platforms (CDPs) and real-time data workflows.

Originally posted on Himalayas

