Databricks SRE and Support Engineer

Remote Full-time
Job Title: Databricks SRE and Support Engineer Work Location: Hopkins, MN / Remote Contract duration: 6 months Request ID: 116475-1 Job Details: • Must Have Skills: Expert-level proficiency in Databricks, Python and SQL • Nice to have skills: Expert-level proficiency in Databricks, Python and SQL Detailed Job Description: As Databricks SRE and Support Engineer, you will work on operations related to AI Dojo (AI/ML upskilling program developed by Client on Databricks. This individual contributor (IC) role requires experience on working on large-scale AI/ML platforms guaranteeing stability, reliability, scalability, and performance. Experience with modern Infrastructure and DevOps tools and paradigms, as well as proven hands-on knowledge with Databricks is a must. Primary Responsibilities: • Continuous support: Provide continuous SRE support to thousands of geographically distributed users on the AI Dojo Databricks platform: respond to tickets, triage support, liaise with customers. • Automation & DevOps: Improve existing Infrastructure as Code (IaC) according to best DevOps practices. • Systems Monitoring: Develop and maintain monitoring frameworks to timely respond to outages and other service interruptions. • Security & Compliance: Collaborate with internal cybersecurity teams to ensure all systems and operations comply with industry standards and are secure against evolving threats. • Capacity Planning & Cost Optimization: Forecast and manage capacity requirements for the AI/ML training environment, while identifying opportunities to reduce costs without compromising performance. Required Qualifications: • Bachelor's degree in computer science, information technology, or a related field. • 6+ years of infrastructure experience: Proven experience working on large-scale, cloud-based, enterprise-level software platforms and deep understanding of Databricks environment. In particular: • Experience building Github Actions pipelines including composite actions, OIDC federation for cloud provider identity acquisition, and use of environments and deployment controls • Experience building Databricks Asset Bundle and Terraform pipelines to manage and deploy Databricks Platform and Workspace resources • Fluency in Python, experience with the Databricks Python SDK to perform Workspace operations, and familiarity with PySpark and Delta Lake. • Deep familiarity with Databricks APIs, and use of the Databricks CLI for use provisioning Workspace identities, filesystem resources, and the querying of account and workspace level Users, Groups, and Service Principals • Strong understanding of security best practices and experience ensuring compliance with relevant regulatory frameworks. • 3+ years of practical experience in Infrastructure-as-Code and CI/CD tools like Terraform, Git Actions and alike. • 3+ years of experience working in support teams that are geographically distributed. Apply tot his job
Apply Now →

Similar Jobs

Experienced Registered Behavior Technician for In-Home ABA Therapy - Atlanta, GA

Remote

Immediate Hiring: Experienced Registered Behavioral Technician (RBT) for Clinic-Based ABA Therapy Services

Remote

Experienced Registered Behavioral Technician (RBT) - ABA Therapy for Children with Autism Spectrum Disorder

Remote

Experienced Registered Nurse - Telehealth: Providing Remote Care Coordination and Patient Support

Remote

Experienced Substitute Teacher for Riverside County Schools - Join Scoot Education's Innovative Team

Remote

Experienced Substitute Teacher for San Bernardino County - Flexible Schedules & Competitive Pay

Remote

Experienced School Year Instructional Coach for High-Dosage Tutoring Programs in Edgewater Park, NJ

Remote

Experienced School Year Tutor for K-8 Students in Math and Literacy - Mickleton, NJ

Remote

Experienced Secondary Social Studies Teacher for Kansas - Flexible Hybrid Remote Arrangement

Remote

USPS Office Helper

Remote

**Experienced Part-Time Customer Service Specialist – Insurance Industry Expertise (California-Based)**

Remote

Dynamic Part-Time Remote Career Opportunities with Netflix - Join Our Team of Innovative Professionals and Shape the Future of Entertainment

Remote

Manufacturing Production Planner, Corporate Manufacturing - Lakeland

Remote

Experienced Full Stack Data Engineer – Cloud Computing, Data Warehousing, and ETL Pipeline Development for Global Sales and Finance Operations at Blithequark

Remote

Experienced Remote Customer Service Representative – Delivering Exceptional Health Insurance Support and Guidance

Remote

Experienced Home-Based Customer Service Chat Support Representative for Blithequark - Flexible Scheduling and Competitive Pay for Entry-Level and Experienced Professionals

Remote

Logistics Analyst - Inventory and Property Management Remote / Telecommute Jobs

Remote

Sr Solutions Architect, AWSI MEGS

Remote

Customer Success Associate

Remote

Lead, Risk Management Consultant

Remote
← Back