[Remote] Sr. Site Reliability Engineer (Storage Platform)
Note: This is a remote position open to candidates in the USA.

Dice is looking for a Senior Site Reliability Engineer specializing in Storage Platforms to join their team. The role involves managing enterprise storage and Kubernetes platforms, ensuring the reliability and efficiency of mission-critical production environments.

Skills
- 6+ years of experience managing enterprise storage and Kubernetes platforms on Linux
- Strong hands-on experience with SDS solutions (Ceph, Longhorn) and storage migrations from legacy systems
- Experience with block, file, and object storage, including Fibre Channel and IP-based protocols
- Experience with NVMe-oF or iSCSI fabrics
- Expert knowledge of Kubernetes and Linux systems (Ubuntu, RHEL/CentOS)
- Proficiency with Infrastructure-as-Code (IaC) tools (Ansible, Terraform)
- Strong scripting skills in Python and Bash (Go a plus)
- Strong working knowledge of enterprise DNS and its integration with Kubernetes
- Experience operating 24x7 mission-critical production environments
- Hands-on experience with KVM hypervisors (SUSE Harvester, OpenStack)
- Strong written and verbal communication skills
- Proficiency with Git, CI/CD pipelines, and automated testing frameworks
- OpenStack Cinder multi-backend administration
- Backup platforms (Rubrik)
- Understanding of CIS/NIST security and infrastructure lifecycle management
- ITIL Foundation/advanced certifications in support of ITSM standard methodology
- CNCF Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS), or Red Hat specialist in Ceph Storage Administrator (EX125) certifications

Company Overview
Dice is a job-search platform for technology professionals and a sub-organization of DHI Group. It was founded in 1990 and is headquartered in Santa Clara, California, USA, with a workforce of 201-500 employees. Its website is http://www.dice.com.