Site Reliability Engineer - Farm Credit Bank Texas
Austin, TX
About the Job
Who we are:
Farm Credit Bank of Texas is a $38.2 billion wholesale bank that has been financing agriculture and rural America for over 100 years. Headquartered in Austin, Texas, we provide funding and services to rural lending associations in five states, and we are active in the nation's capital markets.
While you may not be familiar with our name, Farm Credit Bank of Texas plays a critical role in supporting the businesses that make it possible for America to maintain access to an affordable and safe food supply, an industry which is one of the most innovative and evolving of our time. And while you help us deliver on our mission, we deliver on our commitment to you as a valued employee by providing competitive compensation, generous health and wellness benefits packages and an attractive hybrid workplace located along the bluffs of the Colorado River just minutes west of downtown Austin.
We seek out top talent in their fields, whether it be technology, finance, accounting, credit, human resources, or other administrative functions, and welcome you to join us in our mission to feed the world.
Position Description:
We are searching for a Site Reliability Engineer Lead. The Site Reliability Engineer is part of a team that is responsible for the overall coordination and control of all infrastructure systems required to keep the business operational. The Site Reliability Engineer Lead is responsible for developing software systems and automated solutions for operational aspects. They will assist in driving the technology roadmap of the team and ensuring all systems run as efficiently as possible.
Site Reliability Engineering revolves around monitoring, alerting, and automating. A Site Reliability Engineer IV (Lead) monitor's and helps stabilize services in production, sets and maintains acceptable performance and availability thresholds, and writes code that automates repetitive tasks. Additionally, SRE's will keep an eye on our systems capacity and performance. Development efforts will focus on optimizing existing systems, building infrastructure-as-code, and eliminating work through automation.
Day-to Day-Duties and Responsibilities:
- Mentors and trains other team members.
- Leads architectural and roadmap initiatives from conception to execution
- Leads initiatives across any specifically assigned discipline noted above from conception to execution
- Evaluations of all existing services to improve design, security, stability, performance, or operational efficiency.
- Architecture development, planning, and implementation.
- Research technology trends and vendor products. Testing of new solutions and assisting with roadmap development.
- Monitor health of all servers and infrastructure components; perform break\fix troubleshooting and periodic preventive maintenance.
- Develop and maintain technical documentation.
- Ability to independently find problems, diagnose and design solutions.
- Track all demand in the bank's ITSM tool
- Communicate changes to management and end-users as appropriate
What You Bring to the Team:
Our ideal candidate lives within a commutable distance from our office in Austin, Texas and willing to work a hybrid schedule.
It is an important role that covers many skills. This position requires:
Bachelor's degree or equivalent and six (6) plus years' experience as a Site Reliability Engineer, DevOps Engineer or Systems Engineer or a minimum of eight (8) years' experience as a Site Reliability Engineer, DevOps Engineer or Systems Engineer. Must hold two professional level certifications in at least one discipline (operations or development) or hold one certification in one discipline plus three (3) additional years' experience.
Possesses expert knowledge of systems reliability engineer programs. Ability to communicate/coordinate effectively with all levels of management as well as with outside vendors; Ability to exercise sound judgment, use discretion and maintain confidentiality. A moderate degree of creativity and latitude is expected. Ability to work with minimal supervision and to travel on a moderate basis. Skill in oral and written communication. Intermediate to advanced skill level in Microsoft Office applications. Possesses analytical thinking and mathematical skills. Ability to work as part of a team. May coach or instruct lower-level colleagues. Handles the most complex issues and problems and refers only the most complex issues to higher-level staff. Provides leadership, coaching, and/or mentoring to a subordinate group. May act as a "lead" or first-level supervisor.
Expert level knowledge in at least four of the following disciplines and working level knowledge in all of them:
- Design, implementation and support of monitoring and al