Site Reliability Engineering - Forhyre
United States, NY 00000
About the Job
Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape.
To be successful in this role
- You'll have the opportunity to design and implement major infrastructure components, systems, and developer-friendly capabilities to improve the availability, scalability, latency, and efficiency of our services
- You will provide technical leadership to cross-functional engineering, infrastructure, and product teams, and evangelize cloud best practices while building a culture of reliability and observability
- Engage in and improve the end to end lifecycle of software development--from inception and design, through deployment, operation and refinement of a highly distributed system running in public cloud
- Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles
- Scale systems sustainably through automation to improve reliability and velocity
- Assist with all aspects of operational security and compliance
- Run software performance analysis and system tuning
- Design and implement tools to collect data from various sources and provide actionable insights
- Participate in critical incident management and timely post-mortems of production incidents to drive practices around blameless analysis, resolution, and continuous improvement work with cross-functional teams Develop the rest of the team by conducting code reviews, providing mentorship, pairing, and training opportunities
Qualification & Skills
- We are looking for Principal SRE with proven experience in running distributed systems at scale, in production
- You have 15+ years of experience in relevant skills gained and developed in the same or similar role
- Strong knowledge of container orchestration, preferably Kubernetes and networking technology
- Hands-on experience in one or more languages, such as Node JS, Python, Go, Perl, Ruby, and Bash
- Experience with SOA, Microservices architecture, API Management & Enterprise system Integrations
- Strong production experience with cloud infrastructure, AWS, Azure & Google Cloud
- Strong sense of ownership, and an ability to drive tasks to completion
- Experience developing and monitoring distributed systems
- Experience working in an Agile Environment with great collaboration skills
Source : Forhyre