Data Engineer

Remote Full-time
About Partao

Partao is building the global marketplace for heavy-machinery parts; a single platform that connects buyers and sellers across the industry. With 2,000+ brands and more than 5 million SKUs, we make sourcing faster, smarter, and more reliable by helping people find the exact part that fits their machine.

Weโ€™re focused on doing the right things well... first. That means building something useful, something real, with impact that lasts. If you want to be part of a fast-moving team shaping a marketplace at global scale, Partao is the place to do it.

About the Role

We are looking for a talented and analytically minded Data Engineer to join our growing team. In this role, you will be at the heart of our data ecosystem; designing and maintaining robust data pipelines, managing our cloud-based data lake on AWS, and transforming raw data into clean, reliable datasets that power business-critical reporting and analytics.

Key Responsibilities

Design, build, and maintain scalable ETL/ELT pipelines using orchestration tools such as Apache Airflow, dbt, or equivalent frameworks.
Extract data from diverse sources (APIs, databases, streaming systems) into our AWS-based data lake.
Transform raw, unstructured data into clean, well-modelled datasets ready for analytics and reporting.
Own and evolve our data lake architecture, including multi-zone S3 storage and AWS Glue cataloguing.
Manage relational and non-relational databases, ensuring optimal schema design, indexing, and query performance.
Leverage AWS services (S3, Redshift, Glue, Lambda, EMR) to build scalable, cost-efficient data solutions.
Process and analyse large-scale datasets using big data technologies such as Apache Spark.
Collaborate with analysts and business stakeholders to translate reporting requirements into reliable data models.
Contribute to the design and evolution of our overall data architecture, data governance, and quality standards.
Document systems, data flows, and architectural decisions to a high standard.

Requirements

Proven experience as a Data Engineer or in a similar data-focused engineering role.
Strong proficiency in Python for data engineering tasks; experience with Rust is a significant advantage.
Hands-on experience building and maintaining ETL/ELT pipelines, ideally using Apache Airflow
Deep knowledge of database management - both relational and non-relational.
Solid experience with AWS cloud services relevant to data engineering (S3, Glue, Redshift, EMR, Lambda, IAM).
Experience working with big data platforms and distributed computing frameworks (e.g. Apache Spark).
Strong understanding of data lake architecture, including storage layers, partitioning, and data cataloguing.
Excellent analytical thinking and problem-solving ability - you enjoy digging into complex data challenges.
Ability to communicate technical concepts clearly to non-technical stakeholders.

Bonus Points

Experience with Rust for performance-critical data processing.

Why Join Us?

Shape the data backbone of a fast-scaling platform.
Work with real-world data that matters to customers every day.
Join a small, sharp, and globally distributed team.
Own critical decisions and make an impact from Day 1.

What We Offer

Opportunity to shape the data architecture of a growing organisation from an early stage.
Opportunity to be a thought leader with a wide span of control in a fast-growing startup with experienced mentors
A challenging and rewarding environment where you can directly impact the future of the company and the industry

Apply To This Job
Apply Now โ†’

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

[Remote-Position] Remote Data Entry Clerk / Part-time at Easy

Remote

**Remote Customer Service Specialist - Part-Time at arenaflex**

Remote

Corporate Counsel, Technology & AI

Remote

Coord Geoscience Apps - 008398

Remote

Experienced Data Entry Operator โ€“ Entry Level Position with Remote Work Opportunities at careerzynith

Remote

Learning Systems Specialist

Remote

Logistics Forklift Operator

Remote

UPS Store Center Associate ย– The UPS Store Jersey City #0368 ย– Jersey City, NJ

Remote

**Experienced Student Success Coach โ€“ Remote Customer Service Representative**

Remote

**Experienced Remote Data Entry Specialist โ€“ Flexible Schedule and Competitive Compensation**

Remote
โ† Back