Expert monitoring
Context
The consultant works within the Monitoring & Observability service, a strategic transversal service covering all information systems.
They operate in a multi-technology, multi-domain, and high-criticality environment, with challenges regarding standardization, tool rationalization, and long-term transformation (hypervision, open source, cloud).
Main Mission
Ensure technical leadership, strategic vision, and the evolution of monitoring and observability at the enterprise level, while guaranteeing operational continuity and the upskilling of internal teams.
Key Responsibilities
- Technical lead and advanced expertise
Act as the main technical reference for all monitoring and observability solutions: SCOM, SolarWinds, Azure Monitor, and related solutions
Participate in the design and validation of complex hybrid monitoring architectures (on-prem / cloud / SaaS)
Define advanced best practices regarding:
- metrics collection;
- intelligent alerting;
- reduction of false alerts;
-
event correlation
-
Strategic consulting and governance
Support new projects in the selection and integration of monitoring/observability solutions
Act as an expert advisor to technical teams and management
Participate in defining the strategy for rationalizing monitoring tools
Contribute to the governance of enterprise standards in observability
- Analysis and technological transformation
Conduct complex opportunity studies on the evolution of the tools landscape:
- comparative analysis of commercial vs open source solutions;
- assessment of technical, organizational, and financial impacts;
- in-depth analysis of open source solutions (e.g.: Zabbix) as strategic alternatives;
-
support decision-making on long-term structural choices
-
Advanced automation
-
Design and implement advanced automation mechanisms around monitoring and observability:
- automation of deployment and configuration of tools;
- automation of alert management (enrichment, deduplication, remediation)
-
Reduce operational risk and improve system reliability through automation
-
Expert support and critical situation management
-
Occasional and targeted intervention in the management of complex or critical incidents related to monitoring and observability tools
- Expert-level support (L3) on high-impact or highly complex issues
-
Contribute to the analysis of major incidents (post-incident, continuous improvement)
-
Participation/leadership of the Hypervision program
-
Technical lead of the enterprise hypervision project:
- centralization of alerts from multiple heterogeneous tools;
- design of event correlation mechanisms;
- normalization of alert flows;
- Direct contribution to improving transversal supervision of information systems
-
Interaction with multiple teams (infrastructure, application, security, operations)
-
Knowledge management & upskilling
-
Define and implement a knowledge transfer strategy
- Advanced training of internal team members
- Structure critical documentation (architectures, standards, best practices)
- Contribute to reducing individual dependency through organized knowledge sharing
Expected level of expertise
- Confirmed expertise (senior / expert) in large-scale monitoring & observability
- Ability to operate on complex and critical systems
- Global vision, beyond a specific tool
- Strong autonomy, decision-making ability, and technical leadership
Additional information:
The mission may be extended for a maximum duration (initial period included) of: 880 business days.
Apply for this Job
This position was originally posted on Pro Unity.
It is publicly accessible, and we recommend applying directly through the Pro Unity website instead of going through third party recruiters.
Search jobs by category
- AI Engineer
- Application Support Analyst
- Business Analyst
- Business Intelligence Analyst
- CRM Developer
- Cybersecurity Analyst
- Data Analyst
- Database Administrator
- Data Engineer
- Data Scientist
- Developer
- DevOps Engineer
- Embedded Systems Engineer
- ERP Consultant
gofreelance
© 2026 gofreelance.be