[Remote] Data Engineer
Note: This is a remote position open to candidates in the USA.

BigCircle Ventures is an expanding Digital Health Start-Up seeking a Data Engineer. The successful candidate will develop scalable data management architectures and work closely with Data Science and Application Development teams to leverage IoT, sensor, and AI technologies for improving patient care.

Responsibilities
- Develop scalable data management and data processing architectures
- Manage data acquisition from API, batch, event or streaming sources
- Develop processes for data aggregation
- Design and develop data pre- and post-processing stages
- Plan and design for data governance, security, provenance and the overall data lifecycle
- Leverage best-in-class cloud technologies to cater for OLTP and OLAP business needs
- Integrate ML models and analytic components into the workflows (including MLOps)
- Work closely with Data Science and Application Development teams in an agile development process

Skills
- B.Sc., B.Eng. or higher in Computer Science, Computer / Electronic / Systems Engineering, or similar disciplines
- Proven experience as a Data Engineer
- Experience with structured, semi-structured and unstructured data (e.g., relational, JSON, schema-less)
- Experience with creating, cleaning and curating datasets and databases such as MySQL, PostgreSQL, MongoDB, Redis, Bigtable, time-series databases or similar
- Serverless/distributed processing experience, e.g., multiprocessing, containers, Lambda or similar
- Know-how in scheduling workflows, e.g., DAGs with Apache Airflow
- Well versed in various ETL approaches
- Exposure to classical and deep learning-based ML methods (e.g., CNNs, DL auto-encoders, etc.)
- Knowledge and experience of relevant data, analytics, visualization and ML languages and libraries is important (e.g., Julia/Python, Boto3/Apache Airflow, Parquet, SciPy/NumPy, Pandas/Matplotlib, Keras/TensorFlow, PyTorch, etc.)
- Experience with model deployment / MLOps is desirable. Edge-based inference is also of interest
- Experience with AWS (Fargate, RDS, EC2, SageMaker, Timestream, EMR, Kinesis, MWAA, etc.), Docker, IaC (Terraform), CI/CD, monitoring and related tooling
- Experience with time-series data is a bonus
- Effective communication in an interdisciplinary environment (AI/ML, product management, regulatory, clinical)
- Practical experience with ETL, data pipelines and cloud deployments
- Experience in designing and building data solutions while ensuring confidentiality, integrity, and availability
- A strong engineering interest in ML and data science
- Business-level proficiency in English (spoken and written)

Benefits
The role offers a competitive salary and, most importantly, the chance to be a central player in the future of healthcare.

Company Overview
BigCircle Ventures is a cleantech venture builder engaged in transforming early-stage deeptech research into scalable startups. It was founded in 2024 and is headquartered in Amsterdam, Noord-Holland, NLD, with a workforce of 2-10 employees. Its website is https://bigcircle.ventures/.