[Remote] Site Reliability Engineer (Dynatrace, AWS and Kubernetes)
Note: The job is a remote job and is open to candidates in USA. Solugenix is assisting a client in their search for a Site Reliability Engineer specializing in Dynatrace, AWS, and Kubernetes. This role involves managing application performance monitoring, automation development, and high-availability design to enhance system reliability and performance.ResponsibilitiesMinimum 10 years of IT experienceDynatrace platform administration and advanced configuration aligned to best practicesApplication and infrastructure onboarding, including APM, RUM, tracing, and dependency mappingAlerting, events, and problem management design to reduce noise and improve signal qualityDevelopment of automation using Dynatrace APIs, Terraform, Ansible, and scripting (Python, PowerShell, shell)Standardized monitoring intake and lifecycle processes for new systems and applicationsDashboarding and reporting across applications, infrastructure, cloud, and key platforms (including database, storage, network, and SAP workloads)High‑availability design and monitoring for Dynatrace agents, extensions, and synthetic testsProactive application and performance analysis including stack traces, RUM insights, and network flowsDefinition of observability standards, access models, and operating proceduresEnablement of client teams through documentation, working sessions, and periodic value reviewsShould be willing to work in EST timingsSkillsMinimum 10 years of IT experienceDynatrace platform administration and advanced configuration aligned to best practicesApplication and infrastructure onboarding, including APM, RUM, tracing, and dependency mappingAlerting, events, and problem management design to reduce noise and improve signal qualityDevelopment of automation using Dynatrace APIs, Terraform, Ansible, and scripting (Python, PowerShell, shell)Standardized monitoring intake and lifecycle processes for new systems and applicationsDashboarding and reporting across applications, infrastructure, cloud, and key platforms (including database, storage, network, and SAP workloads)High‑availability design and monitoring for Dynatrace agents, extensions, and synthetic testsProactive application and performance analysis including stack traces, RUM insights, and network flowsDefinition of observability standards, access models, and operating proceduresEnablement of client teams through documentation, working sessions, and periodic value reviewsShould be willing to work in EST timingsCompany OverviewSolugenix is a leading IT services and staffing firm providing IT service management, support center services and more. It was founded in 1969, and is headquartered in Irvine, California, USA, with a workforce of 1001-5000 employees. Its website is https://www.solugenix.com/.