Level 2 (L2) Cloud Operations Engineer

Department: Operations
Reports To: NOC Manager
Location: Hybrid, Woburn, MA

Available Shifts: Overnight (10 PM to 6 AM) and First (6 AM to 2 PM)
Classification: Full-Time, Exempt
Estimated Annual Salary: $105,000 – $135,000

About Knox

Knox runs the largest Federal managed cloud, building and operating secure cloud and AI environments that support the U.S. government’s most critical missions — from national security and public safety to essential public services. Our customers rely on Knox to deploy production systems that meet the highest standards for security, reliability, and compliance.

Work at Knox is high-impact and purpose-driven. The problems we solve are high-stakes, expectations are high, and results are visible. Speed, rigor, and trust matter here because the environments we secure cannot fail. Your expertise is relied upon, and the impact of your work is immediate and measurable. We operate at federal scale, securing some of the most sensitive government environments in the country.

Role Overview
The Cloud Operations Engineer (L2) is responsible for advanced troubleshooting, system administration, and application environment support across Knox’s cloud infrastructure. This role bridges operations, automation, and development support: maintaining system stability, executing changes, and ensuring compliance within FedRAMP Moderate, High, and IL4 environments. The ideal candidate has hands-on experience operating compliance-controlled cloud environments in a NOC/SOC setting, deep familiarity with cloud infrastructure services, and experience responding to real-time alerts, incidents, and escalations in production.

This is a shift-based operations role within a 24x7 Network / Cloud Operations environment.
Team members are required to work assigned shifts, clock in for scheduled hours, and maintain continuous operational coverage. The role includes participation in a rotating on-call schedule for after-hours incidents and holiday coverage. This position is customer-facing and requires professional interaction with customers during incident response, including answering support phone calls and attending customer meetings via Zoom or other collaboration tools.

Key Responsibilities
• Perform advanced troubleshooting for infrastructure, OS, and application issues.
• Analyze system logs, metrics, and telemetry from monitoring platforms (Grafana, Datadog, Wiz, CrowdStrike).
• Coordinate with Platform/DevOps Engineers on root cause analysis and long-term remediation.
• Ensure timely resolution of escalated incidents in accordance with SLAs.
• Manage and maintain AWS, Azure, and hybrid environments in accordance with NIST 800-53 controls.
• Execute system patching, upgrades, and configuration changes via automation or scripts.
• Perform health checks, deployment validations, and post-change verifications.
• Maintain infrastructure documentation and system configuration inventories.
• Perform advanced application troubleshooting for web-based applications and common application architectures.
• Troubleshoot app-layer issues such as API failures, integration errors, or misconfigurations.
• Work with DevOps/Platform teams to optimize CI/CD deployment workflows and rollback plans.
• Ensure adherence to change management and deployment authorization processes.
• Create or modify automation scripts (Bash, Python, PowerShell) for maintenance and reporting tasks.
• Leverage Terraform, Ansible, or cloud-native tools for provisioning and environment consistency.
• Proactively identify opportunities to automate recurring operational processes.
• Document system changes and incident response details for FedRAMP audits.
• Support Continuous Monitoring (ConMon) activities through vulnerability reporting and patch compliance tracking.
• Assist in maintaining logs, baselines, and access control evidence.

Qualifications
• 3–5 years of experience in cloud operations, system administration, or infrastructure support.
• Hands-on experience with CrowdStrike Falcon endpoint protection, including analyzing detections, reviewing IOM/IOA telemetry, assessing endpoint vulnerability exposure, and executing or supporting SOAR-based automated response actions.
• Hands-on experience using Grafana or Datadog for operational monitoring and incident response, including building and maintaining dashboards, analyzing time-series metrics, and correlating alerts to identify performance degradation, availability issues, and system failures in production environments.
• Proficiency in command-line troubleshooting.
• Strong working knowledge of AWS and/or Azure infrastructure services.
• Familiarity with CI/CD pipelines and deployment automation tools.
• Understanding of advanced application troubleshooting techniques for web-based applications and common application architectures.
• Experience writing and maintaining scripts (Bash, Python, PowerShell).
• Familiarity with FedRAMP, NIST 800-53, or similar compliance environments.

Required Certifications:
• AWS SysOps Administrator, Microsoft Azure Administrator, CompTIA Security+

Hiring Requirement: Due to the nature of our work with federal government clients and compliance with applicable regulations, this position requires U.S. citizenship. Candidates must be able to provide documentation verifying U.S. citizenship status as part of the background check process.

Any offer of employment is contingent upon the successful completion of all required pre-employment screenings, including a background check, in accordance with applicable laws and government contract requirements.

Benefits & Perks

Knox offers a competitive employee benefits package including Medical, Dental, Vision, Life & Disability, unlimited PTO, and an employee-funded 401(k) plan. Please note, benefits are subject to change.

We are an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Employment decisions are made without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other legally protected status.
