SYSTEMS RELIBILITY ENGINEER (IPN, DOCSIS and PONs) - CEDENT
Atlanta, GA
About the Job
SYSTEMS RELIABILITY ENGINEER REQUIREMENTS:
SYSTEMS RELIABILITY ENGINEER RESPONSIBILITIES:
The primary responsibility of this Engineer will be to Interface with stakeholders and present on the systems reliability capabilities/opportunities.
- Senior level (10 Years) Experience working with telecom infrastructure. IP networking, back-bone routers, DOCSIS and PONs
- Tenured career in Systems Engineering, Reliability Engineering, or a related role within the Telecommunications industry
- Experience defining and coordinating enterprise-wide reliability improvement initiatives, with a focus on system and network infrastructure
- Experience with system technologies, including network architecture, server platforms, and software applications
- Strong analytical and problem-solving skills, with the ability to conduct root cause analysis and drive effective solutions
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams and senior executives
- Experience with reliability engineering tools and methodologies, such as FMEA, RCM, and fault tree analysis
- Experience within Telecommunications environment versus cable
- Certifications:CISSP, ITIL, or Six Sigma Black Belt
- Experience coming from large fortune 500 Telecommunicaitons enterprise environment
SYSTEMS RELIABILITY ENGINEER RESPONSIBILITIES:
The primary responsibility of this Engineer will be to Interface with stakeholders and present on the systems reliability capabilities/opportunities.
- Reliability Strategy: Develop and execute a comprehensive reliability strategy for our enterprise systems, including network infrastructure, server platforms, and software applications
- Enterprise Coordination: Collaborate with cross-functional teams to prioritize and implement reliability improvement efforts across the organization, ensuring alignment with business objectives and industry standards
- Root Cause Analysis: Define scalable processes for root cause analysis investigations for system failures and performance degradation, identifying underlying issues and implementing corrective actions to prevent recurrence
- Risk Management: Assess potential risks to system reliability, such as hardware failures, software bugs, and configuration errors. Develop and implement risk mitigation strategies to enhance system resilience
- Performance Monitoring: Establish robust monitoring systems to track system performance metrics and reliability indicators. Analyze data to identify trends, anticipate potential issues, and proactively address reliability challenges.
- Continuous Improvement: Drive a culture of continuous improvement by identifying opportunities for process optimization, automation, and efficiency gains in reliability enhancement initiatives
- Vendor Management: Collaborate with vendors and suppliers to ensure the reliability of system components and software. Evaluate vendor performance and provide feedback to drive product improvements
- Documentation and Reporting: Maintain comprehensive documentation of reliability improvement efforts, including procedures, policies, and incident reports. Prepare regular reports and presentations for senior leadership, highlighting progress and key performance metrics.
Source : CEDENT