[Remote] Senior Data / AI Engineer
Note: This is a remote job open to candidates in the USA.

IntelliTech is a dynamic and forward-thinking small business specializing in Full Stack Engineering, Data Analytics, Cloud Solutions, and DevSecOps services. They are seeking a Senior Data / AI Engineer to support a Department of War program focused on operationalizing a Government-owned digital twin application for ammunition industrial base readiness. The role involves owning the data lifecycle end-to-end and designing AI-enabled decision-support layers for supply chain simulation solutions.

Responsibilities
- Design and implement governed ingestion pipelines for complex defense supply chain datasets, including Bills of Materials (BOM), demand and order backlogs, facility and production line capacity, supplier risk, and acquisition planning data
- Build validation services that enforce schema conformance, referential integrity across linked datasets, circular reference detection, and business-rule validation with actionable row- and column-level feedback
- Implement raw data preservation in object storage such as Amazon S3, including metadata capture for source type, upload timestamp, uploader identity, file checksum, and dataset version
- Develop canonical data transformation workflows that convert validated source inputs into normalized, run-ready artifacts aligned to the simulation engine's entity model
- Implement dataset versioning and lineage tracking so each scenario run is tied to explicit input versions and assumptions
- Work with Government stakeholders and source-system owners to identify, prioritize, and implement automated or semi-automated data refresh paths
- Participate in Technical Exchange Meetings (TEMs) to help define data contracts, including source format, semantics, refresh cadence, and validation requirements
- Implement approved connection patterns such as scheduled file landing, secure file exchange (SFTP), API-based retrieval, and cloud-to-cloud transfer mechanisms
- Maintain hardened controlled upload workflows in parallel so mission operations are not dependent solely on external integrations or approvals
- Build the AI integration layer within the FastAPI backend to broker access to Government-approved hosted LLM endpoints
- Implement scoped retrieval logic that constrains AI context to approved run artifacts, simulation outputs, and post-processed analytics
- Develop natural-language Q&A capabilities that allow analysts to query scenario results such as bottlenecks, supplier risks, and differences between runs
- Build guided scenario generation workflows that translate analyst intent into structured JSON scenario configurations for user review and approval before execution
- Implement AI-assisted comparison summaries and brief-ready output generation
- Enable function calling and tool-use patterns so the model can dynamically query backend APIs for scenario comparison, bottleneck analysis, production planning, and supply chain risk
- Ensure all AI interactions are audit-logged, role-scoped, and grounded in explicit scenario artifacts
- Extend existing comparison capabilities to generate structured side-by-side scenario outputs with standardized metrics and deltas
- Build reusable templates for brief-ready outputs that reduce analyst time-to-brief
- Generate reproducible comparison artifacts and store them as part of the scenario run record
- Implement data quality monitoring and dashboards for ingestion success rates, validation outcomes, and overall pipeline health
- Optimize data preparation and post-processing workflows to reduce end-to-end scenario runtime
- Design and implement version-bounded caching strategies for validated inputs, normalized data products, and reusable post-processing summaries

Skills
- Bachelor's degree in Computer Science, Data Science, Engineering, Information Systems, or a related technical discipline and 8+ years of relevant experience; or Master's degree in a related field and 6+ years of relevant experience
- Active DoD Secret clearance
- 7+ years of professional experience in data engineering or data / AI engineering roles
- Strong hands-on Python development experience, including Pandas, NumPy, ETL/ELT design, data pipeline development, and asynchronous programming patterns
- Experience building data validation and quality frameworks, including schema enforcement, referential integrity, data contracts, and validation feedback mechanisms
- Experience integrating LLM APIs such as OpenAI, Anthropic, or equivalent platforms, including function calling, tool use, scoped retrieval, and prompt engineering for structured outputs
- Experience with MongoDB or other document-oriented databases, including data modeling and aggregation pipelines for analytics workloads
- Experience with Amazon S3 or other cloud object storage services, including raw, normalized, and curated data layering approaches
- Experience supporting DoD or federal Government programs
- Strong communication skills and the ability to work directly with technical and non-technical stakeholders in mission environments
- Experience with defense supply chain, logistics, manufacturing, or industrial base data
- Familiarity with Databricks, data mesh, or medallion architecture patterns such as bronze/silver/gold
- Familiarity with SimPy or discrete-event simulation data inputs and outputs
- Experience with Advana, WDP (War Data Platform), or other DoD enterprise data platforms
- Experience establishing data-sharing agreements and supporting Technical Exchange Meetings with Government source-system owners
- Knowledge of munitions-related data structures such as NIIN, CAGE, Bill of Material hierarchies, and production line capacity models
- Experience with Redis or other caching layers supporting analytics applications
- Experience with FastAPI or Flask backend development
- Prior experience supporting Army Cloud Environments

Benefits
- Health insurance
- Dental insurance
- Vision insurance
- 401(k)
- Paid time off
- Professional development opportunities
- Flexible work arrangements to support work-life balance

Company Overview
IntelliTech is a dynamic and forward-thinking small disadvantaged minority-owned business specializing in Full Stack Engineering, Data Analytics, Cloud Solutions, DevSecOps services, and AI & ML. It was founded in 2023 and is headquartered in Washington, District of Columbia, US, with a workforce of 2-10 employees. Its website is https://intellitech.co.