Intermittent failures and no fault found (NFF) phenomena are a concern in electronic systems because of their unpredictable nature and irregular occurrence. They can impose significant costs for companies, damage the reputation of a company, or be catastrophic in systems such as nuclear plants or avionics. Intermittent failures in systems can be attributed to hardware failures or software failures. In order to diagnose and mitigate the intermittent failures in systems, the nature and the root cause of these failures have to be understood. In this paper we have reviewed the current literature concerning intermittent failures and have a comprehensive study on how these failures happen, how to detect them and how to mitigate them.

References

1.
Thomas
,
D. A.
,
Ayers
,
K.
, and
Pecht
,
M.
,
2002
, “
The ‘Trouble Not Identified’ Phenomenon in Automotive Electronics
,”
Microelectron. Reliab.
,
42
(
4–5
), pp.
641
651
.10.1016/S0026-2714(02)00040-9
2.
James
,
I.
,
Lumbard
,
D.
,
Willis
,
I.
, and
Goble
,
J.
,
2003
, “
Investigating No Fault Found in the Aerospace Industry
,”
Annual Reliability and Maintainability Symposium
, Tampa, FL, January 27–30, pp.
441
446
.10.1109/RAMS.2003.1182029
3.
Söderholm
,
P.
,
2007
, “
A System View of the No Fault Found (NFF) Phenomenon
,”
Reliab. Eng. Syst. Saf.
,
92
(
1
), pp.
1
14
.10.1016/j.ress.2005.11.004
4.
Steadman
,
B.
,
Pombo
,
T.
,
Madison
,
I.
,
Shively
,
J.
, and
Kirkland
,
L.
,
2002
, “
Reducing No Fault Found Using Statistical Processing and an Expert System
,”
IEEE AUTOTESTCON 2002
, Huntsville, AL, October 15–17, pp.
872
878
.10.1109/AUTEST.2002.1047966
5.
WDS White Paper
,
2006
, “
No Fault Found Returns Cost the Mobile Industry $4.5 Billion per Year
,” http://www.wds.co/news/whitepapers/20060717/MediaBulletinNFF.pdf
6.
Maul
,
C.
,
McBride
,
J. W.
, and
Swingler
,
J.
,
2001
, “
Intermittency Phenomena in Electrical Connectors
,”
IEEE Trans. Compon. Packag. Technol.
,
24
(
3
), pp.
370
377
.10.1109/6144.946482
7.
Schafft
,
H. A.
,
1973
, “
Failure Analysis of Wire Bonds
,”
11th Annual Reliability Physics Symposium
, Las Vegas, NV, April 3–5, pp.
98
104
.10.1109/IRPS.1973.362578
8.
McCullough
,
R. E.
,
1972
, “
Screening Techniques for Intermittent Shorts
,”
10th Annual Reliability Physics Symposium
, Las Vegas, NV, April 5–7, pp.
19
22
.10.1109/IRPS.1972.362523
9.
Koch
,
T.
,
Richliug
,
W.
,
Whitlock
,
J.
, and
Hall
,
D.
,
1986
, “
A Bond Failure Mechanism
,”
24th Annual Reliability Physics Symposium
, Anaheim, CA, April 1–3, pp.
55
60
.10.1109/IRPS.1986.362112
10.
Minzari
,
D.
,
Jellesen
,
M. S.
,
Møller
,
P.
, and
Ambat
,
R.
,
2011
, “
On the Electrochemical Migration Mechanism of Tin in Electronics
,”
Corros. Sci.
,
53
(
10
), pp.
3366
3379
.10.1016/j.corsci.2011.06.015
11.
Minzari
,
D.
,
Grumsen
,
F. B.
,
Jellesen
,
M. S.
,
Møller
,
P.
, and
Ambat
,
R.
,
2011
, “
Electrochemical Migration of Tin in Electronics and Microstructure of the Dendrites
,”
Corros. Sci.
,
53
(
5
), pp.
1659
1669
.10.1016/j.corsci.2011.01.009
12.
Reid
,
M.
,
Punch
,
J.
,
Grace
,
G.
,
Garfias
,
L. F.
, and
Belochapkine
,
S.
,
2006
, “
Corrosion Resistance of Copper-Coated Contacts
,”
J. Electrochem. Soc.
,
153
(
12
), pp.
B513
B517
.10.1149/1.2352042
13.
Pan
,
S.
,
Hu
,
Y.
, and
Li
,
X.
,
2010
, “
IVF: Characterizing the Vulnerability of Microprocessor Structures to Intermittent Faults
,”
Design, Automation & Test in Europe Conference Exhibition
(
DATE
), Dresden, Germany, March 8–12, pp.
238
243
.10.1109/DATE.2010.5457206
14.
Constantinescu
,
C.
,
2008
, “
Intermittent Faults and Effects on Reliability of Integrated Circuits
,”
Annual Reliability and Maintainability Symposium
(
RAMS 2008
), Las Vegas, NV, January 28–31, pp.
370
374
.10.1109/RAMS.2008.4925824
15.
Blaauw
,
D. T.
,
Oh
,
C.
,
Zolotov
,
V.
, and
Dasgupta
,
A.
,
2003
, “
Static Electromigration Analysis for On-Chip Signal Interconnects
,”
IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst.
,
22
(
1
), pp.
39
48
.10.1109/TCAD.2002.805728
16.
Kothawade
,
S.
,
Chakraborty
,
K.
,
Roy
,
S.
, and
Han
,
Y.
,
2012
, “
Analysis of Intermittent Timing Fault Vulnerability
,”
Microelectron. Reliab.
,
52
(
7
), pp.
1515
1522
.10.1016/j.microrel.2012.03.003
17.
Mathew
,
S.
,
Das
,
D.
,
Rossenberger
,
R.
, and
Pecht
,
M.
,
2008
, “
Failure Mechanisms Based Prognostics
,”
International Conference on Prognostics and Health Management
(
PHM 2008
), Denver, CO, October 6–9, pp.
1
6
.10.1109/PHM.2008.4711438
18.
Kirkland
,
L. V.
,
2011
, “
When Should Intermittent Failure Detection Routines be Part of the Legacy Re-Host TPS?
,”
IEEE AUTOTESTCON 2011
, Baltimore, MD, September 12–15, pp.
54
59
.10.1109/AUTEST.2011.6058722
19.
Qi
,
H.
,
Ganesan
,
S.
, and
Pecht
,
M.
,
2008
, “
No-Fault-Found and Intermittent Failures in Electronic Products
,”
Microelectron. Reliab.
,
48
(
5
), pp.
663
674
.10.1016/j.microrel.2008.02.003
20.
Steadman
,
B.
,
Berghout
,
F.
,
Olsen
,
N.
, and
Sorensen
,
B.
,
2008
, “
Intermittent Fault Detection and Isolation System
,”
IEEE AUTOTESTCON 2008
, Salt Lake City, UT, September 8–11, pp.
37
40
.10.1109/AUTEST.2008.4662580
21.
Gracia
,
J.
,
Saiz
,
L.
,
Baraza
,
J. C.
,
Gil
,
D.
, and
Gil
,
P.
,
2008
, “
Analysis of the Influence of Intermittent Faults in a Microcontroller
,” 11th IEEE Workshop on Design and Diagnostics of Electronic Circuits and Systems (
DDECS 2008
), Bratislava, Slovakia, April 16–18, pp.
80
85
.10.1109/DDECS.2008.4538761
22.
Syed
,
R. A.
,
Robinson
,
B.
, and
Williams
,
L.
,
2010
, “
Does Hardware Configuration and Processor Load Impact Software Fault Observability?
,”
Third International Conference on Software Testing, Verification and Validation
(
ICST
), Paris, April 6–10, pp.
285
294
.10.1109/ICST.2010.55
23.
Wei
,
J.
,
Rashid
,
L.
,
Pattabiraman
,
K.
, and
Gopalakrishnan
,
S.
,
2011
, “
Comparing the Effects of Intermittent and Transient Hardware Faults on Programs
,”
2011 IEEE/IFIP 41st International Conference on Dependable Systems and Networks Workshops
(
DSN-W
), Hong Kong, June 27–30, pp.
53
58
.10.1109/DSNW.2011.5958835
24.
Anderson
,
T.
, and
Knight
,
J. C.
,
1983
, “
A Framework for Software Fault Tolerance in Real-Time Systems
,”
IEEE Trans. Software Eng.
,
SE-9
(
3
), pp.
355
364
.10.1109/TSE.1983.237017
25.
Lyu
,
M. R.
,
1995
,
Software Fault Tolerance
,
John Wiley & Sons, Inc.
,
New York
.
26.
Randell
,
B.
,
1975
, “
System Structure for Software Fault Tolerance
,”
International Conference on Reliable Software
,
Los Angeles, CA
, April 21–23, pp.
437
449
.10.1109/TSE.1975.6312842
27.
Avizienis
,
A.
,
1985
, “
The N-Version Approach to Fault-Tolerant Software
,”
IEEE Trans. Software Eng.
,
SE-11
(
12
), pp.
1491
1501
.10.1109/TSE.1985.231893
28.
Blum
,
M.
,
Luby
,
M.
, and
Rubinfeld
,
R.
,
1993
, “
Self-Testing/Correcting With Applications to Numerical Problems
,”
J. Comput. Syst. Sci.
,
47
(
3
), pp.
549
595
.10.1016/0022-0000(93)90044-W
You do not currently have access to this content.