Fano inequality
From Scholarpedia
| Robert Mario Fano (2008), Scholarpedia, 3(10):6648. | revision #50378 [link to/cite this article] | |||||||||||||||||||
Curator: Dr. Robert Mario Fano, Ford Professor of Engineering, Emeritus, EECS Department, MIT, Cambridge, MA
The inequality that became known as the Fano inequality pertains to a model of communications system in which a message selected from a set of
possible messages is encoded into an input signal for transmission through a noisy channel and the resulting output signal is decoded into one of the same set of possible messages. Successive input messages are assumed to be statistically independent.
Contents |
Origin
Let
be the set of input messages,
the corresponding set of output messages and
be the joint probability of message
being transmitted and message
being received. An error occurs when the output message differs from the input message, that is when
. Let
be the probability of this event and
be the associated binary entropy.
The conditional entropy
represents the average amount of information lost because of the channel noise and is referred to as the “equivocation”. It is the average value of
- (1)
which is the average amount of information still needed to specify the input message when a particular output message
is specified.
The Fano inequality is
- (2)
It first appeared as Eq. 4.35 in the 1953 edition of the lecture notes distributed to M.I.T. students in the graduate subject Statistical Theory of Information, and, later on, as Eq, 6.16 in the final textbook published in 1961 (Fano, 1961).
The inequality resulted from an early attempt to relate the equivocation, the information theory measure of the effect of channel noise, to the probability of error, the traditional practical measure. It was originally based on the following heuristic argument.
The equivocation, because of its very definition, cannot exceed the information provided about the input messages by any procedure capable of correcting output errors. The first step in correcting output errors is to specify for each input message whether the corresponding output message is correct or incorrect. The amount of information provided by this specification is given by the binary entropy
, the first term on the right hand side of (2). Then, for each incorrect output message the correct input message must be identified. The number of possible input messages is
because the output message is known to be incorrect. Thus the amount of information necessary to identify the input message cannot exceed
, and the corresponding average amount of information to be provided cannot exceed
, the second term on the right hand side of (2).
Derivation
The inequality may be formally derived as follows. Let
be the probability of error when
is the output message, that is, when the input message is any
with
. Then,
is the probability of the output message being correct, that is, of the input message being
, the one corresponding to the given output message
. The corresponding binary entropy is
The probability of message
being the correct input message when it is known that the output message
is incorrect is
Now, (1) can be rewritten in the form
where
is the entropy of an ensemble of
elements whose value cannot exceed
. Thus,
which, when averaged over the
ensemble, yields
which, in turn, yields (2), since
.
References
- Fano, RM. “Transmission of Information”, the M.I.T. Press and John Wiley and Sons, New York & London, 1961.
Internal references
- Tomasz Downarowicz (2007) Entropy. Scholarpedia, 2(11):3901.
Recommended Reading
- Cover, T.M. and Thomas, J.A. (1991). Elements of information theory. John Wiley & Sons, New York, NY.
External Links
See Also
| Robert Mario Fano (2008) Fano inequality. Scholarpedia, 3(10):6648, (go to the first approved version) Created: 20 February 2008, reviewed: 18 October 2008, accepted: 23 October 2008 |

