Distinguished Site Reliability Engineer – Cloud

Remote · USA · Full-time

Job Description:

Lead, design, implement and support the operational and reliability aspects of large-scale Kubernetes clusters, with a focus on performance at scale, real-time monitoring, logging and alerting
Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement
Support services before they go live through activities such as system design consulting, developing software tools, platforms and frameworks, capacity management and launch reviews
Maintain services once they are live by measuring and monitoring availability, latency and overall system health
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
Practice sustainable incident response and blameless postmortems
Be part of an on-call rotation to support production systems

Requirements:

BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience
16+ years of experience with infrastructure automation and distributed systems design, including designing and developing tools for running large-scale private or public cloud systems in production
Experience in one or more of the following: Python, Go, Perl or Ruby
In-depth knowledge of Linux, networking and containers

Benefits:

equity
benefits
