Network Engineer, HPC Systems Network Strategy - Meta
Menlo Park, CA
About the Job
Network Engineers at Meta are hybrid software/network engineers who ensure that Meta's network and related services run smoothly and have the capacity for future growth. Vendor and Community Management, Data Analytics, network (re)design, and cost modeling are keys to meeting our demands; you will be responsible for conceiving, developing, and deploying software, hardware and network systems and tools that improve reliability and efficiency in our global network. Do you want to work on one of the most dynamic, fast-paced networks in the world? Do you want to develop innovative solutions to our challenges and ship them into production? Then a role on one of our network engineering teams is for you!
RESPONSIBILITIES
Network Engineer, HPC Systems Network Strategy Responsibilities:
MINIMUM QUALIFICATIONS
Minimum Qualifications:
PREFERRED QUALIFICATIONS
Preferred Qualifications:
RESPONSIBILITIES
Network Engineer, HPC Systems Network Strategy Responsibilities:
- Design, deploy, manage and maintain multi-vendor, multi-protocol data center and high performance compute networks.
- Draft and review system architecture and migration plans working closely with internal and external development teams.
- Define and develop optimized network hardware (and optics) systems.
- Together with your engineering team, you will share an on-call rotation and be an escalation contact for service incidents.
- Analyze data to diagnose and identify root causes to network issues.
- Partner alongside the best engineers in the industry on the coolest stuff around - the code and systems you work on will be in production and used by billions of users all around the world.
MINIMUM QUALIFICATIONS
Minimum Qualifications:
- 10+ years of work experience in operating and improving production networks.
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- Experience understanding and mitigating network hardware and topology failures.
- Experience coding in higher-level languages (e.g., Python, C++, Go, C, etc.).
- Experience in configuration and maintenance of network fabrics and NMS systems, or high performance compute clusters.
- Experience developing and understanding network device configuration for at least one vendor (NVidia/Mellanox, Arista/Broadcom, etc.).
- Experience learning software, frameworks and APIs.
PREFERRED QUALIFICATIONS
Preferred Qualifications:
- Knowledge of RoCE and Infiniband protocols.
- Understanding and development of Ethernet and Infiniband HPC Architectures.
- Hardware and software development experience integrating optics and ethernet switching ASICs to build network systems.
Source : Meta