[Remote] Big Data Engineer
Note: This is a remote job open to candidates in the USA.

AdvanSix plays a critical role in global supply chains, innovating and delivering essential products for various end markets. They are seeking a Big Data Engineer to build and operate their enterprise Unified Data Layer, delivering data products that support multiple corporate functions and ensure trusted data governance.

Responsibilities
Build ingestion pipelines (batch, CDC, streaming) from S/4HANA/DataSphere, PHD/historian, LIMS, TMS, HSE, and other sources into landing → curated → semantic layers
Implement data contracts, schema/versioning, SCD handling, partitioning, and performance tuning (file formats, clustering, caching)
Develop dimensional/semantic models that back certified Power BI datasets and APIs for apps/agents
Integrate OT data via OPC UA/MQTT, broker/DMZ patterns, read-only historian feeds, and event/batch frames, with no control-net reads
Collaborate with plant controls on change control, signal quality, and downtime windows
Embed data quality rules, unit/integration tests, and validation checks (freshness, completeness, drift/PSI)
Instrument lineage and end-to-end monitoring; build alerting and on-call runbooks to minimize MTTR
Enforce RBAC, secrets management, PII/HSE classifications, and retention aligned to Governance/MDM policies
Automate build/test/deploy with Git-based CI/CD (environments, approvals, blue/green)
Track and optimize cost/performance (cluster sizing, autoscaling, cache strategy); contribute to FinOps reviews
Partner with Reporting & BI on semantic model contracts, RLS, and performance SLAs; avoid direct system scraping
Produce “readme” docs, data dictionaries, runbooks, and post-incident reviews; support knowledge transfer with vendors

Skills
Minimum 5 years' experience in data engineering, building production pipelines at scale (batch/CDC/streaming)
Hands-on with the Azure data stack: Databricks or Fabric/Synapse, ADF/Pipelines, ADLS/OneLake, Azure SQL/SQL MI, Key Vault
Strong SQL and Python/PySpark; comfort with Spark Structured Streaming and performance tuning
Experience implementing tests/observability (freshness, schema, expectations) and Git-based CI/CD
Familiarity with SAP S/4HANA structures and SAP DataSphere semantic modeling
OT concepts: historians (PHD/PI), OPC UA/MQTT, event/batch frames, ISA-95/99 basics
Understanding of Power BI consumption (semantic models, RLS) and APIs for downstream AI/ML apps/agents
Time-series/data-quality tooling (e.g., Great Expectations or equivalent patterns), feature/metric stores
MDM concepts (keys, survivorship), lineage/catalog tooling
TMS/WMS, LIMS, Historian, HSE domain exposure; Lean/Six Sigma mindset; FinOps awareness

Benefits
Paid holidays
Paid time off, including vacation
Eligibility to purchase company stock
Tuition reimbursement
401K with a competitive company match
Discretionary financial benefits such as incentive pay, equity awards, and participation in a deferred compensation plan
Medical, dental, and vision insurance
Flexible spending and health savings account eligibility
Employer-provided short-term disability benefits
Eligibility to purchase long-term disability benefits
Employer-provided basic life insurance
Eligibility to purchase voluntary life coverage

Company Overview
AdvanSix is a company in the chemical industry. It was founded in 2016 and is headquartered in New Jersey, with a workforce of 1001-5000 employees. Its website is https://www.advansix.com/.

Company H1B Sponsorship
AdvanSix has a track record of offering H1B sponsorships, with 2 in 2023, 1 in 2022, 1 in 2021, and 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.