Electronic International Standard Serial Number (EISSN)
1573-7705
abstract
Failures are unavoidable in communication networks, so their detection and identification are vital for the reliable operation of the networks. The existing fault diagnosis techniques are based on many paradigms derived from different areas (e.g., mathematical theories, machine learning, statistical analysis) and with different purposes, such as, obtaining a representation model of the network for fault localization, selecting optimal probe sets for monitoring network devices, reducing fault detection time, and detection of faulty components in the network. Nevertheless, there are still challenges to be faced because those techniques are invasive on account of they increase network traffic and the control overhead. Also, they intensify the internal processes of the network through expanding management processes or monitoring agents on almost all networking devices. This paper introduces a non-invasive fault detection approach based on the observation of symptoms of internal network failures in gateway routers (called peripheral elements). We developed a link failure induction experiment in an emulated network that evidenced the existence of the fault propagation phenomenon to a peripheral level, which demonstrates the feasibility of our approach. Our results foster the use of learning techniques which do not require a complete dependency model of the network and could continuously diagnose the failure symptoms while being resilient to the dynamic changes of the network.