Site Reliability Engineering - Forhyre

United States, NY 00000

About the Job

Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape.

To be successful in this role

You'll have the opportunity to design and implement major infrastructure components, systems, and developer-friendly capabilities to improve the availability, scalability, latency, and efficiency of our services
You will provide technical leadership to cross-functional engineering, infrastructure, and product teams, and evangelize cloud best practices while building a culture of reliability and observability
Engage in and improve the end to end lifecycle of software development--from inception and design, through deployment, operation and refinement of a highly distributed system running in public cloud
Serve as subject matter expert in an SRE mindset, best practices, and cloud-native principles
Scale systems sustainably through automation to improve reliability and velocity
Assist with all aspects of operational security and compliance
Run software performance analysis and system tuning
Design and implement tools to collect data from various sources and provide actionable insights
Participate in critical incident management and timely post-mortems of production incidents to drive practices around blameless analysis, resolution, and continuous improvement work with cross-functional teams Develop the rest of the team by conducting code reviews, providing mentorship, pairing, and training opportunities

Qualification & Skills

We are looking for Principal SRE with proven experience in running distributed systems at scale, in production
You have 15+ years of experience in relevant skills gained and developed in the same or similar role
Strong knowledge of container orchestration, preferably Kubernetes and networking technology
Hands-on experience in one or more languages, such as Node JS, Python, Go, Perl, Ruby, and Bash
Experience with SOA, Microservices architecture, API Management & Enterprise system Integrations
Strong production experience with cloud infrastructure, AWS, Azure & Google Cloud
Strong sense of ownership, and an ability to drive tasks to completion
Experience developing and monitoring distributed systems
Experience working in an Agile Environment with great collaboration skills

Source : Forhyre

Site Reliability Engineering - Forhyre

United States, NY 00000

About the Job

Popular Job Categories

Popular Job Titles

Popular Job Locations

Popular Companies