What is fault management? | Definition from TechTarget (2024)

What is fault management?

Fault management is the component of network management that detects, isolates and fixes problems. When properly implemented, network fault management can keep connectivity, applications and services running at an optimal level, provide fault tolerance and minimize downtime. Fault management systems are platforms or tools designed specifically for this purpose.

Faults result from malfunctions or events that interfere with, degrade or obstruct service delivery. Examples of faults include hardware failure, connectivity loss or port status change. Once the fault management platform detects a fault, it notifies the administrator -- and any additional authorized or designated parties -- via an alarm or alert.

Network administrators can view these notifications in the fault management system's GUI, and many platforms can forward these alerts via email, text or a mobile app. Network administrators can also configure fault management systems to automatically fix or prevent certain events using programs and scripts.

Fault management is one component of FCAPS (fault management, configuration, accounting, performance and security), which is a network management framework established by the International Organization for Standardization (ISO).

What is fault management? | Definition from TechTarget (2)

Important functions of fault management

Network fault management comprises a variety of functions to keep the network operational. Fault management systems perform the following actions:

  • Defines thresholds for potential failure conditions.
  • Constantly monitors system status and usage levels.
  • Continuously scans for threats, such as viruses and Trojans.
  • Provides general diagnostics.
  • Remotely controls system elements, including workstations and servers, from a single location.
  • Notifies administrators and users of impending and actual malfunctions.
  • Traces the locations of potential and actual malfunctions.
  • Automatically corrects potential problem-causing conditions.
  • Automatically fixes malfunctions.
  • Comprehensively logs system status and actions taken.

Types of fault management

There are two types of network fault management: active and passive.

Active fault management

Active fault management uses various tools, such as ping or Transmission Control Protocol/User Datagram Protocol port checks, to continually query devices and determine their status. This is akin to a person asking, "How are you?" to everyone in a room at repeated intervals. This enables the fault management system to identify and rectify potential issues in real time, sometimes before they even become problems. The tradeoff, however, is more network chatter.

Passive fault management

Passive fault management systems monitor network environments for events that indicate a fault or failure has occurred. This information might come from error logs or Simple Network Management Protocol traps, among other sources. This is akin to a person who quietly listens until someone calls out for help. While passive fault management is more conservative in its resource use, its drawback is that it might not discover faults until too late.

Fault management process

The fault management process used in commercial platforms might vary slightly among different vendors, but fault management systems typically follow the same lifecycle:

  1. Fault detection. The system discovers that service delivery has been interrupted or its performance has degraded.
  2. Fault diagnosis and isolation. The system identifies the source of the fault, such as a component failure or power outage, and its location in the network topology.
  3. Event correlation and aggregation. Because a single fault can cause multiple alarms, fault management systems often group related events for administrators and provide a root cause analysis.
  4. Restoration of service. The network management system automatically executes any preconfigured scripts or programs to get services up and running as soon as possible.
  5. Problem resolution. The system corrects, repairs or replaces the source of the fault. In some cases, manual intervention might be necessary based on the cause.

Editor's note: This article was reformatted to improve the reader experience.

This was last updated in May 2023

See Also
Earthquakes

Continue Reading About fault management

Related Terms

baseboard management controller (BMC)
A baseboard management controller (BMC) is a specialized service processor that remotely monitors the physical state of a host ... Seecompletedefinition
network orchestration
Network orchestration is the use of a software-defined network controller that facilitates the creation of network and network ... Seecompletedefinition
Wireshark
Wireshark is a widely used network protocol analyzer that lets users capture and view the details of network traffic in real time... Seecompletedefinition

Dig Deeper on Network management and monitoring

What is fault management? | Definition from TechTarget (2024)
Top Articles
Bread 65 Recipe - Bread Snack Street Style in 10 Mins
Bhel puri of Jammu | Jammu style bhel puri recipe | Jammu bhel puri
2022 Basketball 247
[Re-Usable] - SSNSonicHD - Expanded & Enhanced
799: The Lives of Others - This American Life
Sproutieeee
Munsif Epaper Urdu Daily Online Today
Chevrolet Colorado - Infos, Preise, Alternativen
Log in or sign up to view
Jsmainnn
Nashville Tranny
Swap Shop Elberton Ga
KMS ver. 1.2.355 – Haste & Tactical Relay
Estate Sales Net Grand Rapids
Who is Harriet Hageman, the Trump-backed candidate who beat Liz Cheney?
Trinket Of Advanced Weaponry
Red Dead Redemption 2 Legendary Fish Locations Guide (“A Fisher of Fish”)
C.J. Stroud und Bryce Young: Zwei völlig unterschiedliche Geschichten
Oriellys Bad Axe
Haverhill, MA Obituaries | Driscoll Funeral Home and Cremation Service
Claims Adjuster: Definition, Job Duties, How To Become One
Chlamydia - Chlamydia - MSD Manual Profi-Ausgabe
Katonah Train Times
Lexington Park Craigslist
11 Shows Your Mom Loved That You Should Probably Revisit
Bearpaws Tropical Weather
Devotion Showtimes Near Amc Classic Shiloh 14
Ratchet & Clank Rift Apart: Trofea - lista | GRYOnline.pl
Sas Majors
Promiseb Discontinued
co*cker Spaniel For Sale Craigslist
The Front Porch Self Service
Hcpss Staff Hub Workday
Soul of the Brine King PoE Pantheon 3.14 Upgrade
craigslist: northern MI jobs, apartments, for sale, services, community, and events
Elemental Showtimes Near Regal White Oak
eUprava - About eUprava portal
Palindromic Sony Console For Short Crossword Clue 6 Letters: Composer Of
Lvc Final Exam Schedule
9044906381
Sentara Reference Lab Solutions Bill Pay
Hubspot Community
Texas State Final Grades
Tapana Telugu Movie Download Kuttymovies
charleston rooms & shares - craigslist
Huskersillustrated Husker Board
Delta Incoming Flights Msp
Traftarım 24
My Vidant Chart
Sparkle Nails Phillipsburg
Function Calculator - eMathHelp
The t33n leak 5-17: Understanding the Impact and Implications - Mole Removal Service
Latest Posts
Article information

Author: Manual Maggio

Last Updated:

Views: 6586

Rating: 4.9 / 5 (69 voted)

Reviews: 84% of readers found this page helpful

Author information

Name: Manual Maggio

Birthday: 1998-01-20

Address: 359 Kelvin Stream, Lake Eldonview, MT 33517-1242

Phone: +577037762465

Job: Product Hospitality Supervisor

Hobby: Gardening, Web surfing, Video gaming, Amateur radio, Flag Football, Reading, Table tennis

Introduction: My name is Manual Maggio, I am a thankful, tender, adventurous, delightful, fantastic, proud, graceful person who loves writing and wants to share my knowledge and understanding with you.