[Remote] Senior/Staff/Principal Full-stack, Infra, Platform, Backend, Storage SWEs
Note: The job is a remote job and is open to candidates in USA. BRK Tech, a Berkshire Hathaway Group Company, is seeking Senior, Staff & Principal Software Engineers across various domains. The roles focus on designing, building, and operating technology platforms to support large-scale systems across multiple industries.ResponsibilitiesDesign, build, and operate modern technology platforms that run at massive scale and support long‑term, real‑world impactBuild and operate enterprise messaging platformsOperate and scale large distributed production systemsBuild and operate enterprise observability platformsSkillsBachelor's degree or equivalent practical experience (4+ additional years)Typically 6–8+ years of relevant experience with strong hands‑on ownership for Senior EngineersTypically 10+ years with deep technical leadership and architecture ownership for Principal Engineers6+ years of professional software development experienceStrong proficiency in at least one major language (Java, Go, Python, C#, or similar)Hands‑on experience building distributed systems and platform servicesBackend and platform development: microservices, APIs, service‑to‑service integrationsMessaging and event‑driven systems: Kafka, RabbitMQ, or similarData platforms: SQL, NoSQL, Graph databases, caching systemsCI/CD pipelines, Git workflows, and modern DevOps practicesContainers, Kubernetes, and/or serverless runtimesFamiliarity with observability stacks (metrics, logging, tracing, alerting)Hands‑on experience with Ceph‑based SDS (object, block, file)Storage protocols: NVMe‑oF (TCP), S3, NFS, SMB/CIFSDesigning and scaling high‑availability, multi‑tenant storage platformsData resiliency and recovery models (N+2 / N+3, erasure coding, DR)Storage integration with Kubernetes, virtualization, and bare metalStrong Linux systems expertiseAutomation using Python, Go, Bash, C/C++Experience with OpenStack, Nutanix, VMware, KubeVirtArchitecture of high‑performance storage fabricsExpertise in NVMe‑oF, RDMA, QoSDeep routing knowledge: BGP, OSPFExperience with Cisco, Palo Alto, NVIDIA/MellanoxDesigning and validating N+2 / N+3 resilient architecturesLinux networking and open‑source ecosystemsStrong grounding in network securityPerformance tuning for distributed storage (Ceph/Swift) at scaleKubernetes and virtualization optimization (KubeVirt a plus)Advanced Linux tuning: kernel parameters, eBPF, I/O profilingHigh‑performance hardware & networking (PCIe Gen5, NVMe‑oF/TCP)NUMA, CPU pinning, cgroups, and resource isolationQuantitative modeling of latency and throughputOperating and scaling large distributed production systemsStrong foundation in SRE principlesDeep experience with Kubernetes, Linux internals, and automationInfrastructure tooling using Go, Python, or JavaObservability across metrics, logging, tracing, and alertingHigh‑availability and data resiliency architecturesLeadership during high‑severity production incidentsOwnership of platforms with 24/7 operational responsibilityBuilding and operating enterprise messaging platformsDeep expertise with Kafka and/or RabbitMQ, including multi‑DCMessaging architecture: topic/queue design, durability, HA, scalingEvent‑driven and integration‑heavy environmentsKubernetes‑based platforms (on‑prem and/or cloud)API‑driven integrations and distributed systems fundamentalsStrong production ownership mindsetBuilding and operating enterprise observability platformsStrong open‑source backgroundExpertise across logging, metrics, distributed tracing, and alertingKubernetes telemetry pipelines and centralized ingestionCode‑level visibility into latency, errors, and service dependenciesAlerting driven by SLOs, SLAs, and latency thresholdsPublic cloud exposure (AWS, Azure, GCP)Company Overviewbrk tech is a new kind of technology organization within Berkshire Hathaway—built to empower world-class enterprises with adoption of tech and artificial intelligence enabling each business to accelerate their digital transformation.. It was founded in 2026, and is headquartered in San Francisco, California, US, with a workforce of 11-50 employees. Its website is .