NOC Network Administrator (Shift Work) - Peraton
Home, TX
About the Job
Responsibilities
We are hiring a NOC Network Administrator for a 100% remote position, with the opportunity for work travel 10% of the year. Provides ongoing network support, addresses availability issues, overcomes performance problems and provides level 2/3 support. Installs, maintains and coordinates the use of network systems. Manages network performance, diagnoses performance problems and undertakes tuning activities to ensure network performance meets contract requirements. Troubleshoots and resolves problems to ensure quality transmission service on the network. Reports on overall network performance including frequent problems, underlying issues and plans to improve network availability, performance and problem resolution. Trains users and creates, maintains and deletes user accounts. Maintains and updates system security plans on all platforms. Implements and coordinates network policies, procedures and standards. Enforces network security procedures.
Monitors critical network elements and applications using various network monitoring systems in a 24x7x365 Network Operations Center (NOC). Troubleshoots, repairs and maintains telecommunications networks in a NOC environment. Tests new and already installed hardware and software. Uses computerized network management systems to check for system alarms when parts of the network are not working properly and are jeopardizing the level of communication services available to customers. Follows procedures to validate, troubleshoot and escalate issues. May work second and third shifts and overtime in the event of network outages.
What you'll do:
- Incident management for the NOC team: first to triage incident tickets, communicate status to the customer, resolve incidents and publish after-action reviews.
- Provides Tier 2-3 network support, addresses availability issues and overcomes performance problems to ensure quality of service on the network.
- Monitors critical network elements and applications using various network monitoring systems in a Network Operations Center (NOC) staffed 8am-8pm ET, 5 days a week, with rotating on-call support on weekends and holidays.
- Manages network performance, diagnoses performance problems and undertakes tuning activities to ensure network performance meets contract requirements.
- Reports on overall network performance including frequent problems, underlying issues and plans to improve network availability, performance and problem resolution.
- Implements and coordinates network policies, procedures and standards. Enforces network security procedures.
- Troubleshoots, repairs and maintains telecommunications networks in a NOC environment. Tests new and already installed hardware and software. Works with Engineering on site changes and/or new site installations.
- Works with external partners and service providers to troubleshoot source to destination network communication issues involving various applications and/or systems.
- Uses computerized network management systems to check for system alarms when parts of the network are not working properly and are jeopardizing the level of communication services available to customers. Follows procedures to validate, troubleshoot and escalate issues. May work second and third shifts and overtime in the event of network outages.
Specific job tasks include:
- Perform triage and troubleshooting for network/systems issues, document the steps taken, and escalate to the appropriate service support groups.
- Responsible for monitoring, updating, incident management, troubleshooting, and event log review, interpretation, and prioritization for networks and network elements: LANs, WANs, intranet, routers, switches, servers, secure cloud gateways and wireless systems.
- Acts as the first tier in alert response, incident management, and resolution by identifying incidents and issues related to networks and network elements (system, application, or performance degradation or unavailability) and suggesting ways and means of improvement.
- Employs production monitoring tools to ensure that all systems & applications are running and available and operating on the latest operating systems and versions of software and firmware.
- Perform installation, configuration, test, maintenance and upgrade of network management tools and corresponding operating systems.
- Conduct routine tasks such as script execution, network connectivity tests, patching operating systems and applications, and troubleshooting systems.
- Perform remedial maintenance inspections and services. Add, modify, move, patch, and delete as needed on applications, systems, and infrastructure services. Ensure proper records of such activities are maintained.
- Make recommendations on where improvements can be made within SolarWinds, Azure Monitor, Aruba AirWave, and other monitoring tools to add to the vNOC's capability.
- Provide best practices recommendations to help filter out false positive alerts.
- Use all vendor and internal resources to isolate and fix issues as per performance metrics, service level agreements (SLAs) and expectations of clients.
- Collect, assess, and report on the performance of network elements and applications. Work with technical and other teams to generate and update knowledge base articles.
- Assist in developing metrics-based reporting for the NOC.
- Develop, update, maintain and follow NOC monitoring operating processes and procedures.
- Update and maintain Standard Operating Procedures (SOP) documentation.
- Coordinate communication between the various IT service operations groups and organizations that have been affected by an outage.
- Take note of performance issues that occur regularly and communicate with the affected organizations' staff.
- Monitors and makes regular use of team collaboration tools such as Microsoft Teams and email in multiple domains on a day-to-day basis, and ensures responses are provided in a timely manner.
Qualifications
What you'll need:
- Current/Active DoD Secret clearance
- B.S. degree and 5 years of applicable experience; 9 years of applicable experience will be accepted in place of a degree.
- DoD 8570.01-M IAT Level II certification (CompTIA CySA+, CompTIA Security+, CCNA Security, GICSP, GSEC, or SSCP).
- Strong understanding of universal network services (DNS, HTTP, LDAP, DHCP).
- Strong understanding of typical infrastructure services (e.g. MySQL or other DBs, OpenSSH).
- Must be proficient with all Microsoft O365 applications/services/tools.
- Must have knowledge of routing & routing protocols e.g. OSPF, BGP, policy-based routing.
- Must have protocol level understanding of Switching technologies – VLAN, STP (all types), LACP, Virtual switching – Aruba VSX or Cisco VSS / VSP or equivalent, spine-leaf architecture.
- Expert level knowledge of IP networking, switching & routing.
- Must have packet-level understanding of ARP, DHCP, UDP, TCP, ICMP, SNMP.
- Must have good understanding of IP Multicast & Multicast routing protocols.
- Minimum of four (4) years of experience in customer-facing remote or onsite technical support, preferably with 3+ years in a Tier 3 escalation role.
- 3+ years of experience with or knowledge of Network Management/Monitoring Suites (Zabbix, Nagios, Aruba AirWave) and Network Access Control technologies (Microsoft NPS, Cisco ISE, Aruba ClearPass).
- 4+ years of experience in configuration & troubleshooting of multi-vendor data networks (Aruba, Cisco, Juniper, Arista); datacenter experience is a plus.
- Experience in working with Tier III/IV support and handling the incident management workflow.
- Experience performing network monitoring in a virtual environment.
- Ability to work independently and as a member of a team to deliver high quality work.
- Excellent oral and written communication skills are a must.
- Ability to communicate clearly, both verbally and in writing, to IT and non-IT staff.
- Strong analytical and problem-solving skills.
Preferred:
- Microsoft Certified Azure Fundamentals
- Experience with Azure Monitor and Sentinel
- Previous experience with documentation and content management and development of process documentation
- Experience with SAFe methodology
- Intermediate-level programming and automation experience (Python, Java, JavaScript, C++, C#, Ruby, PHP)
- Intermediate-level knowledge of Linux, Bash, REST APIs
- Working knowledge of JSON, YAML data formats used for log management
- Working knowledge of network automation & analysis tools - Ansible, StackStorm, Grafana, Humio
- Experience working in an enterprise cloud operations environment.
- Knowledge of DoD networking and IA policies and regulations.
- ITIL 4 Foundation Certification
- Experience with IT Service Management tools and continuous service improvement
Peraton Overview
Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, we deliver trusted, highly differentiated solutions and technologies to protect our nation and allies. Peraton operates at the critical nexus between traditional and nontraditional threats across all domains: land, sea, space, air, and cyberspace. The company serves as a valued partner to essential government agencies and supports every branch of the U.S. armed forces. Each day, our employees do the can’t be done by solving the most daunting challenges facing our customers. Visit peraton.com to learn how we’re keeping people around the world safe and secure.