[Remote] Data Engineer (Remote, US)
Note: This is a remote position open to candidates in the USA.

Sayari is a company focused on trustworthy AI for economic security and commercial risk. The Data Engineer role involves building and scaling data pipelines to transform vast amounts of records into actionable intelligence, collaborating with cross-functional teams to implement new features, and contributing to a robust engineering culture.

Responsibilities
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow to support our core data acquisition and entity resolution engines
Collaborate cross-functionally with AI/ML and Product teams to implement new features and AI-native products
Proactively identify and resolve bottlenecks in our complex ETL processes, bringing a fresh perspective to refine and optimize our existing codebase
Contribute to a robust engineering culture through rigorous code reviews, unit testing, and clear communication of design decisions
Own the end-to-end delivery of roadmap tasks within two-week sprints, ensuring work meets high standards for quality, documentation, and performance
Participate in roadmap planning and story refinement, eventually taking ownership of major epics that drive our long-term product defensibility

Skills
Professional proficiency in Python and experience contributing to shared codebases using Git (branching, PRs, code reviews)
3+ years of experience working in Data Engineering
Demonstrated experience working with relational databases (PostgreSQL/BigQuery) and an interest in or familiarity with graph databases
Familiarity with distributed computing (Spark) or a strong desire to master it
Strong collaborative skills and the ability to work effectively in an Agile, sprint-based environment
A 'self-directed' orientation: ability to move tasks from 'assigned' to 'complete' with high autonomy and clear communication
Experience with Django, Scala, or Scrapy
Hands-on experience with workflow orchestration tools like Airflow
Experience or strong interest in LLM tuning, deployment, and AI engineering best practices
Experience working with international or non-English datasets
Prior experience working with high-scale, complex data pipelines

Benefits
100% fully paid medical, vision, and dental for employees and their dependents
Generous time off: we observe all US federal holidays, close our office for a winter break (12/24-12/31), and grant 18 PTO days and 10 sick days
Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions
A strong commitment to diversity, equity, and inclusion
Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage), and parental leave
A collaborative and positive culture - your team will be as smart and driven as you
Limitless growth and learning opportunities

Company Overview
Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. It was founded in 2015 and is headquartered in Washington, District of Columbia, USA, with a workforce of 201-500 employees. Its website is https://sayari.com.

Company H1B Sponsorship
Sayari has a track record of offering H1B sponsorships, with 1 in 2024, 2 in 2023, and 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.