Platform Reliability Engineer - Sustainable Talent
Hillsboro, OR
About the Job
Join Sustainable Talent as an Engineering Technician (Platform Reliability Engineer) supporting Nvidia and their IPP Platform Group (Infrastructure, Planning and Process)! This is a W-2 full-time contract with openings in Hillsboro, OR and Austin, TX. We offer competitive pay $35-45/hourly based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture!
IPP works with various groups within NVIDIA Software Engineering and includes Graphics Processing, Deep/Machine Learning, Artificial Intelligence, and Autonomous Vehicles. IPP Platform supplies infrastructure and provides a Platform for executing nearly half a million automated jobs per day on thousands of servers improving the efficiency of thousands of NVIDIA's Software Engineers worldwide!
In this role, you will be faced with the challenge of maintaining the availability/reliability of the Platform developers use to develop and test software for NVIDIA hardware. Additionally, you will collaborate with Infrastructure Engineers and Architects, install/configure and maintain Windows, Linux, and MAC platforms, and be responsible for inventory and asset management. We expect things to break in the lab, as the Platform is extremely complex, the software developed is mostly low-level device drivers, and BUGS do break boards/GPUs. We seek to catch problems early in our labs rather than having them discovered in end-user devices. Our labs run more than 100,000 automated tests per day and are part of a sophisticated DevOps pipeline requiring constant supervision, BUG-fixes, and improvements all while maintaining a very high-level of availability and reliability.
What you'll be doing:
- Deploy, configure, and onboard Servers, Desktops, and GPUs in our datacenters.
- End-to-end servicing of systems inclusive of networking, storage, desktop servers, and OS installation and configuration.
- Active lifecycle management including inventory and asset management.
- Maintain reliability and availability of all hardware.
What we need to see:
- Associates or bachelor's degree in a Tech related Major or 4+ years of equivalent experience in a Lab or Datacenter environment.
- Ability to perform well at work without requiring constant manager supervision.
- Ability to do deploy and cable servers and test equipment.
- Basic understanding of UNIX/Windows operating systems, networking and storage.
- Ability to work in a fast-paced environment.
- Comfortable multitasking and iterating over repetitive tasks.
- To be successful in this position, you should have a love of working with close-knit, multi-disciplinary teams, and enjoy hands-on work with pioneering platforms.
Ways to stand out of the crowd:
- PC/MAC hobbyist/enthusiast; Gamer; Knows Nvidia.
- Hardware interest/focus; NVIDIA GPU knowledge a PLUS!
- Experience with scripting (shell, python) and can read code (TCL, Java).
- Mechanically inclined and capable of physical/manual labor; Can handle a screw driver and other such tools.
- Personable and team focused.
- Capable/quick learner.
Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.