
Reliable Fault Diagnosis with Few Tests

Published online by Cambridge University Press: 01 September 1998

ANDRZEJ PELC
Affiliation:
Département d'Informatique, Université du Québec à Hull, Hull, Québec J8X 3X7, Canada (e-mail: pelc@uqah.uquebec.ca)
ELI UPFAL
Affiliation:
IBM Almaden Research Center, San Jose, CA 95120, USA, and Department of Applied Mathematics, The Weizmann Institute of Science, Rehovot, Israel (e-mail: eli@wisdom.weizmann.ac.il)

Abstract

We consider the problem of fault diagnosis in multiprocessor systems. Processors perform tests on one another: fault-free testers correctly identify the fault status of tested processors, while faulty testers can give arbitrary test results. Processors fail independently with constant probability p < 1/2, and the goal is to identify correctly the status of all processors, based on the set of test results. For 0 < q < 1, q-diagnosis is a fault diagnosis algorithm whose probability of error does not exceed q. We show that the minimum number of tests to perform q-diagnosis for n processors is Θ(n log(1/q)) in the nonadaptive case and n + Θ(log(1/q)) in the adaptive case. We also investigate q-diagnosis algorithms that minimize the maximum number of tests performed by, and performed on, processors in the system, constructing testing schemes in which each processor is involved in very few tests. Our results demonstrate that the flexibility yielded by adaptive testing permits a significant saving in the number of tests for the same reliability of diagnosis.
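
The testing model described in the abstract can be made concrete with a small simulation. The sketch below is an illustration only and is not the authors' testing scheme: it assumes a simple nonadaptive layout in which each processor is tested by the next k processors on a ring and is diagnosed by majority vote, and it models the "arbitrary" answers of faulty testers as fair coin flips (the paper allows any behaviour). The names `simulate_faults`, `run_test`, `majority_diagnosis` and the parameter `k` are hypothetical.

```python
import random


def simulate_faults(n, p, rng):
    """Each of the n processors fails independently with probability p."""
    return [rng.random() < p for _ in range(n)]


def run_test(tester, tested, faulty, rng):
    """Return True if the tester reports the tested processor as faulty.

    A fault-free tester reports the true status. A faulty tester may answer
    arbitrarily; here that is modelled (an assumption) as a fair coin flip.
    """
    if faulty[tester]:
        return rng.random() < 0.5
    return faulty[tested]


def majority_diagnosis(n, k, faulty, rng):
    """Nonadaptive sketch: processor v is tested by the next k processors
    on a ring (requires k < n) and declared faulty on a strict majority.

    Returns the diagnosis list and the total number of tests, which is n * k.
    """
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < n")
    diagnosis = []
    tests = 0
    for v in range(n):
        votes = 0
        for j in range(1, k + 1):
            tester = (v + j) % n
            votes += run_test(tester, v, faulty, rng)
            tests += 1
        diagnosis.append(2 * votes > k)
    return diagnosis, tests


if __name__ == "__main__":
    rng = random.Random(0)
    n, p, k = 50, 0.1, 7
    faulty = simulate_faults(n, p, rng)
    diagnosis, tests = majority_diagnosis(n, k, faulty, rng)
    errors = sum(d != f for d, f in zip(diagnosis, faulty))
    print(f"tests used: {tests}, misdiagnosed processors: {errors}")
```

This fixed layout always uses n·k tests, since every test is chosen in advance. An adaptive scheme, by contrast, may choose each test based on earlier results, which is the flexibility the abstract credits for the smaller n + Θ(log(1/q)) bound.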

Type
Research Article
Copyright
1998 Cambridge University Press
