Data Engineer

Remote Full-time
About RipjarRipjar specialises in the development of software and data products that help governments and organisations combat serious financial crime. Our technology is used to identify criminal activity such as money laundering and terrorist financing, enabling organisations to enforce sanctions at scale to help combat rogue entities and state actors.Data infuses everything Ripjar does. We work with a wide variety of datasets of all scales, including an ever-growing archive of billions of news articles covering most languages going back over 30 years, sanctions and watchlist data provided by governments, and vast organisation and ownership datasets.About the RoleWe see a Data Engineer as a software engineer who specialises in distributed data systems. You’ll join the Data Engineering team, whose prime responsibility is the development and operation of the Data Collection Hub, a platform that ingests data from many sources, processes/enriches it, and distributes it to multiple downstream systems.We’re looking for someone with 2+ years of industry experience building and operating production software who enjoys working across data pipelines, distributed systems, and operational reliability.What you’ll doEngineer distributed ingestion services that reliably pull data from diverse sources, handle messy real-world edge cases, and deliver clean, well-structured outputs to multiple downstream products.Build high-throughput processing components (batch and/or near-real-time) with a focus on performance, scalability, and predictable cost, using strong profiling and measurement practices.Design and evolve data contracts (schemas, validation rules, versioning, backward compatibility) so downstream teams can build with confidence.Own production quality: write maintainable code, strong unit/integration tests, and add the observability you need (metrics/logs/tracing) to diagnose issues quickly.Improve platform reliability by hardening pipelines against partial failures, retries, rate limits, data drift, and infrastructure issues—then codify those learnings into better tooling and guardrails.Contribute to CI/CD and developer experience: faster builds, better test signal, safer releases, and automated operational checks.Participate in design reviews, code reviews, incident retrospectives, and iterative delivery—making pragmatic trade-offs and documenting them clearly.Technology Stack Languages: Predominantly Python and Node.jsDistributed/data platforms: HDFS, HBase, Spark, plus increasing use of Kubernetes and cloud servicesStorage/search: MongoDB, OpenSearchOrchestration: Airflow, Dagster, NiFiTooling: GitHub, GitHub Actions, Rundeck, Jira, ConfluenceDeployment/config: Ansible (physical), Terraform / Argo CD / Helm (Kubernetes)Development environment: MacBook (typical)RequirementsEssential:2+ years building and operating production software systemsFluency in at least one programming language (Python/Node.js a plus)Experience debugging moderately complex systems and improving reliability/performanceStrong fundamentals: data structures, testing, version control, Linux basicsNice to have:Spark/PySpark experienceHadoop ecosystem exposure (HDFS/HBase)Workflow orchestration (Airflow/Dagster/NiFi)Search/indexing (OpenSearch, MongoDB)Kubernetes and infrastructure-as-codeDegree in Computer Science or numerical degreeBenefitsCompetitive salary DOE25 days annual leave + your birthday off, in addition to bank holidays, rising to 30 days after 5 years of service.Remote workingPrivate Family Healthcare.35 hour working week.Employee Assistance Programme.Company contributions to your pension.Pension salary sacrifice.Enhanced maternity/paternity pay.The latest tech including a top of the range MacBook Pro.

Apply Now

Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

Reservation Agent Part-Time Evenings/Weekends

Remote

Case Manager, Registered Nurse - Remote

Remote

Experienced Customer Service Representative – Part-Time Remote Opportunity with arenaflex for Delivering Exceptional Travel Experiences

Remote

QA Lead – Test Manager, Eligibility Systems

Remote

Remote Customer Service Amazon Jobs Hiring For ...

Remote

Technical Engineer - Shelton, CT

Remote

Data Entry / Admin Assistant - Remote (UK-wide)

Remote

[Remote] Data Engineer, Web Scraping

Remote

Conference Event Planner II

Remote

Amazon Data Entry Part-Time Remote Jobs Hiring (URGENT) - No Experience/ Entry Level

Remote
← Back