Staff Site Reliability Engineer - Ursus, Inc.
San Francisco, CA
About the Job
JOB TITLE: Staff SRE
**TOP 3 SKILLS:**
- GoLang
- Kubernetes
- Ruby
LOCATION: Remote
DURATION: Direct Hire
RATE RANGE: $160-180K
SUMMARY:
We're looking for a driven software engineer who cares deeply about their craft, and who wants to use their skills to bring about positive change in the world while working in a high performing organization using modern software development approaches. Someone who is comfortable in the rapidly changing nature of a startup environment but also adept at moving relentlessly forward: doing what needs to be done to unblock projects that truly deliver value to our users.
RESPONSIBILITIES
- Site Reliability Engineer - Collaborate with engineers and cross-functional teams to proactively identify and mitigate risks, ensuring timely and effective solutions. Advocate for reducing complexity and focus on empowering others across the tech stack to drive excellence and innovation.
- Lean and Agile Owner - Collaborate with cross-functional teams to distill and synthesize non-functional requirements into discreet and meaningful iterations that can be quickly implemented. Leverage Lean Startup and Agile methodologies along with Continuous Integration and Continuous Deployment infrastructure to rapidly prototype and validate ideas.
- Operational Maintainer - As an SRE, you will be responsible for managing the on-call rotation for the engineering squad. It is expected that when not actively triaging or responding to an incident, that you will spend the balance of your time building process, procedures, and technology that result in service level indicators (SLI) that align with our service level objectives.
- Problem Solver - Be ready, willing, and able to dive into logs, statsd, and other various platform telemetries to identify potential performance, scale, and stability issues before they become bottlenecks.
QUALIFICATIONS:
- 5+ years engineering experience, at least part of which is in a startup environment
- Recent and relevant experience with compliance and security regulations and processes
- Alignment with BetterUp mission of enabling self-driven behavior change
- Succeeded in a remote work environment
- Advanced level experience with Infrastructure as Code (e.g, Terraform, CloudFormation)
- Willingness to participate in a 24x7x365 on-call rotation
- Experience identifying and establishing meaningful service level indicators and objectives
- Experience with Kubernetes
- Experience developing in a high-level programming language (e.g, Python, Ruby, JavaScript)
- Experienced in Agile product development processes (SCRUM, Kanban, Lean Startup, etc)
- Strong verbal and written communication
- Impressive track record of maintaining a high bar of quality, stability, and availability
Source : Ursus, Inc.