Enterprise Monitoring Manager - Apex Systems
Cincinnati, OH 45249
About the Job
Job#: 2050017
Job Description:
What you'll own
You will lead a team of system administrators and engineers responsible for designing, implementing, and maintaining system and cloud observability and log management solutions. These solutions ensure that our infrastructure and applications are fully observable, enabling proactive monitoring, real-time analytics, and timely incident response.
The Manager will play a critical role in developing strategies and implementing best practices in observability and log management for on-premises and cloud environments. As a leader in this capacity, your responsibilities may include:
- Lead, mentor, and manage a team of observability and log management engineers, providing technical guidance, training, coaching, and professional development opportunities, fostering a culture of continuous learning and growth within the organization.
- Define and implement an observability strategy across cloud and on-premises systems, focusing on reliability, scalability, and proactive alerting, aligning technology initiatives with business goals and objectives.
- Develop, implement, and maintain an end-to-end observability and log management framework that supports business needs and ensures compliance with internal and regulatory requirements.
- Oversee the planning, execution, and completion of infrastructure observability and log management projects, ensuring they are delivered on time, within budget, and to the required quality standards.
- Lead the design and implementation of centralized log management and monitoring solutions to enhance the visibility of system and application performance.
- Collaborate with IT infrastructure, DevOps, and application teams to build and manage dashboards, alerts, and logging solutions that provide a holistic view of system health.
- Drive the implementation and adoption of best practices for system metrics, distributed tracing, log aggregation, and visualization.
- Stay current with new tools and technologies in the observability and log management space, evaluating their applicability to the environment.
Where you'll own it
While we’re based in North America, we have collaborative spaces and regular opportunities to celebrate with each other in person.
What you bring
Experience
- Several years of experience in IT Operations, with a focus on monitoring, observability, and log management.
- Hands-on experience with observability and monitoring tools, primarily Splunk and Zabbix; others include Datadog, Dynatrace, Prometheus, Grafana, and New Relic.
- Proven experience with cloud platforms (e.g., AWS, Azure, GCP), including managing cloud-based monitoring and logging solutions.
- Strong understanding of log management best practices, including centralized logging, data retention, and privacy requirements.
- Experience in designing and implementing system health dashboards, alerting mechanisms, and automated incident response processes.
Leadership and management skills:
- Strong leadership qualities, including the ability to inspire, motivate, and guide a team of IT professionals, are crucial. Experience in team management, mentoring, performance evaluation, and conflict resolution is important.
Technical expertise:
- In-depth knowledge of ESXi, Windows Server, Linux operating systems, virtualization technologies, networking, storage, and security concepts is essential. Hands-on experience with system architecture, design, and implementation is valuable.
Project management:
- Experience in project management, including planning, budgeting, resource allocation, and stakeholder communication, is important for overseeing technology initiatives and ensuring successful project delivery.
Added bonus if you bring
- Education: A bachelor's degree or greater in computer science, information technology, or a related field. Practical experience in the role can be used in place of formal education.
- Certifications: Technical certifications in cloud and virtualization technologies are highly valued. Certifications such as AWS, Azure, MCSE, Red Hat, VMware Certified Professional (VCP), VMware Certified Advanced Professional (VCAP), Citrix Certified Associate - Virtualization (CCA-V), Datadog, Dynatrace, or Splunk are viewed favorably.
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at [email protected] or 844-463-6178.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing® in Talent Satisfaction in the United States and Great Place to Work® in the United Kingdom and Mexico.