[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. Arva is a company focused on ecosystem modeling and measurement, and they are seeking a Data Engineer to build and scale their data infrastructure. The role involves designing and maintaining data pipelines, ensuring data quality, and collaborating with Data Science teams to support modeling and analytics needs.ResponsibilitiesDesign, implement, and maintain scalable data pipelines supporting ecosystem and biogeochemical modelingBuild reproducible workflows that generate standardized model inputs and manage outputs across space, time, and scenario analysisIntegrate heterogeneous datasets, including field data, management data, soil data, and weather data, into modeling pipelinesDevelop and maintain cloud-based infrastructure to support modeling pipelines and optimization workflowsImplement data storage solutions using relational, spatial, and object-based databasesSupport efficient data access and processing using platforms such as PostgreSQL, PostGIS, and cloud object storageEnsure data quality, versioning, traceability, and auditability to support measurement, reporting, and verification requirementsImplement validation and monitoring processes to ensure reliability of model inputs and outputsSupport transparent, repeatable workflows suitable for regulatory and credit market reviewWrite clean, modular, and well-documented production code that supports maintainable and scalable data systemsApply software engineering best practices including testing, version control, and documentationCollaborate closely with Data Science and Technology teams to align data infrastructure with modeling, analytics, and production needsSkills3+ years demonstrated experience building and maintaining data pipelines for large, complex, and heterogeneous datasetsStrong proficiency in Python and modern data engineering tools, with experience writing production-grade, testable codeExperience working with cloud platforms, with AWS strongly preferredFamiliarity with containerization tools such as Docker and version control systems such as GitHubExperience with relational and spatial databases, including PostgreSQL and PostGISExperience working with geospatial data formats and spatial data processingBachelor's or Master's degree or equivalent experience in Data Engineering, Computer Science, Environmental Informatics, or a related fieldExperience supporting scientific or ecosystem modeling workflowsFamiliarity with workflow orchestration tools such as Airflow or PrefectCompany OverviewArva Intelligence brings farmers the power of machine learning inside the farm gate to significantly improve economics and soil health. It was founded in 2018, and is headquartered in Salt Lake City, Utah, USA, with a workforce of 11-50 employees. Its website is https://arva.com.