Common mode failure

Common mode failure

A common mode failure occurs when events are not statistically independent. That is, one event causes multiple systems to fail.

An example is when all of the pumps for a fire sprinkler system are located in one room. If the room becomes too hot for the pumps to operate, they will all fail at essentially the same time, from one cause (the heat in the room).

The "principle of redundancy" states that, when events of failure of a component are statistically independent, the probabilities of their joint occurrence multiply. Thus, for instance, if the probability of failure of a component of a system is one in one thousand per year, the probability of the joint failure of two of them is one in one million per year, provided that the two events are statistically independent. This principle favors the strategy of the redundancy of components. One place this strategy is implemented is in RAID 1, where two hard disks store a computer's data redundantly.

But even so there can be many common modes: consider a RAID1 where two disks are purchased online and are installed in a computer, there can be many common modes:

* The disks are likely to be from the same manufacturer and of the same model, therefore they share the same design flaws.
* The disks are likely to have similar serial numbers, thus they may share any manufacturing flaws affecting production of the same batch.
* The disks are likely to have been shipped at the same time, thus they may are likely to have suffered from the same transportation damage.
* As installed both disks are attached to the same power supply, making them vulnerable to the same power supply issues.
* As installed both disks are in the same case, making them vulnerable to the same overheating events.
* They will be both attached to the same card or motherboard, and driven by the same software, which may have the same bugs.
* Because of the very nature of RAID1, both disks will be subjected to the same workload and very closely similar access patterns, stressing them in the same way.

Also, if the events of failure of two components are maximally statistically dependent, the probability of the joint failure of both is identical to the probability of failure of them individually. In such a case, the advantages of redundancy are negated. Strategies for the avoidance of common mode failures include keeping redundant components physically isolated.

A prime example of redundancy with isolation is a nuclear power plant. The new ABWR has three divisions of Emergency Core Cooling Systems, each with its own generators and pumps and each isolated from the others. The new European Pressurized Reactor has two containment buildings, one inside the other. However, even here it is not impossible for a common mode failure to occur (for example, caused by a highly-unlikely Richter 10 earthquake).

ee also

*Nuclear safety
*Probabilistic risk assessment


Wikimedia Foundation. 2010.

Игры ⚽ Поможем сделать НИР

Look at other dictionaries:

  • Common Mode Failure — (CMF; deutsch Gleichartige Ausfälle[1]) ist ein Begriff aus der EN ISO 12100 1:2003 und bezeichnet in der Risikoanalyse das Versagen von mehreren Komponenten oder Systemen, deren Versagen einem einheitlichen Ablauf folgt. Gelegentlich findet …   Deutsch Wikipedia

  • common-mode failure — noun (nuclear eng) The failure of two or more supposedly independent parts of a system (eg a reactor) from a common external cause or from interaction between the parts • • • Main Entry: ↑common …   Useful english dictionary

  • Common mode — is a term in engineering with at least two independent meanings. Of electrical signals, Common mode rejection ratio, the ratio of rejection of common mode signals to differential signals Common mode interference, interference that appears on both …   Wikipedia

  • Common-cause and special-cause — Type of variation Synonyms Common cause Chance cause Non assignable cause Noise Natural pattern Special cause Assignable cause Signal Unnatural pattern Common and special causes are the two distinct origins of variation in a process, as defined… …   Wikipedia

  • Common Cause (disambiguation) — To make common cause is an idiom meaning to form an alliance, to cooperate towards a common goal . As a proper noun, Common Cause may refer to: Common Cause (U.S. lobbying group), the non partisan lobbying group, or its magazine Common Cause… …   Wikipedia

  • Failure rate — is the frequency with which an engineered system or component fails, expressed for example in failures per hour. It is often denoted by the Greek letter λ (lambda) and is important in reliability engineering. The failure rate of a system usually… …   Wikipedia

  • Mode (statistics) — In statistics, the mode is the value that occurs most frequently in a data set or a probability distribution.[1] In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score.[2] Like the… …   Wikipedia

  • Common Access Card — An example DoD Common Access Card The Common Access Card (CAC) is a United States Department of Defense (DoD) smart card issued as standard identification for active duty military personnel, reserve personnel, civilian employees, other non DoD… …   Wikipedia

  • Diastolic heart failure — Diastolic dysfunction Classification and external resources ICD 9 428.3 Diastolic heart failure or diastolic dysfunction refers to decline in performance of one or both ventricles of the heart during the time phase of diastole. Diastole is that… …   Wikipedia

  • Out of Box Failure — The term Out of Box Failure usually refers to computer hardware. It describes a negative experience a user has when installing and/or performing initial configuration on a piece of hardware that exhibits an immediate failure mode.DefinitionsOut… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”