List-decoding - AbsoluteAstronomy.com

Computer science

Computer science or computing science is the study of the theoretical foundations of information and computation and of practical techniques for their implementation and application in computer systems...

, particularly in coding theory

Coding theory

Coding theory is the study of the properties of codes and their fitness for a specific application. Codes are used for data compression, cryptography, error-correction and more recently also for network coding...

, list decoding is an alternative to unique decoding of error-correcting codes for large error rates. The notion was proposed by Elias

Peter Elias

Peter Elias was a pioneer in the field of information theory. Born in New Brunswick, New Jersey, he was a member of the Massachusetts Institute of Technology faculty from 1953 to 1991....

in the 1950s. The main idea behind list decoding is that the decoding algorithm instead of outputting a single possible message outputs a list of possibilities one of which is correct. This allows for handling a greater number of errors than that allowed by unique decoding.

The unique decoding model in coding theory

Coding theory

, which is constrained to output a single valid codeword from the received word could not tolerate greater fraction of errors. This resulted in a gap between the error-correction performance for stochastic

Stochastic

Stochastic refers to systems whose behaviour is intrinsically non-deterministic. A stochastic process is one whose behavior is non-deterministic, in that a system's subsequent state is determined both by the process's predictable actions and by a random element. However, according to M. Kac and E...

noise models (proposed by Shannon) and the adversarial noise model (considered by Richard Hamming

Richard Hamming

Richard Wesley Hamming was an American mathematician whose work had many implications for computer science and telecommunications...

). Since the mid 90s, significant algorithmic progress by the coding theory community has bridged this gap. Much of this progress is based on a relaxed error-correction model called list decoding, wherein the decoder outputs a list of codewords for worst-case pathological error patterns where the actual transmitted codeword is included in the output list. In case of typical error patterns though, the decoder outputs a unique single codeword, given a received word, which is almost always the case (However, this is not known to be true for all codes). The improvement here is significant in that the error-correction performance doubles. This is because now the decoder is not confined by the half-the-minimum distance barrier. This model is very appealing because having a list of codewords is certainly better than just giving up. The notion of list-decoding has many interesting applications in complexity theory

Computational complexity theory

Computational complexity theory is a branch of the theory of computation in theoretical computer science and mathematics that focuses on classifying computational problems according to their inherent difficulty, and relating those classes to each other...

.

The way the channel noise is modeled plays a crucial role in that it governs the rate at which reliable communication is possible. There are two main schools of thought in modeling the channel behavior 1) Probabilistic noise model studied by Shannon in which the channel noise is modeled precisely in the sense that the probabilistic behavior of the channel is well known and the probability of occurrence of too many or too few errors is low 2) Worst-case or adversarial noise model considered by Hamming in which the channel acts as an adversary that arbitrarily corrupts the codeword subject to a bound on the total number of errors.

The highlight of list-decoding is that even under adversarial noise conditions, it is possible to achieve the information-theoretic optimal trade-off between rate and fraction of errors that can be corrected. Hence, in a sense this is like improving the error-correction performance to that possible in case of a weaker, stochastic noise model.

Mathematical formulation

Let

be a

error-correcting code; in other words,

is a code of length

, dimension

and minimum distance

over an alphabet

of size

. The list-decoding problem can now be formulated as follows:

Input: Received word

, error bound

Output: A list of all codewords

whose hamming distance

Hamming distance

In information theory, the Hamming distance between two strings of equal length is the number of positions at which the corresponding symbols are different...

from

is at most

Motivation for list decoding

Given a received word

, which is a noisy version of some transmitted codeword

, the decoder tries to output the transmitted codeword by placing its bet on a codeword that is “nearest” to the received word. The Hamming distance between two codewords is used as a metric in finding the nearest codeword, given the received word by the decoder. If

is the minimum Hamming distance of a code

, then there exists two codewords

and

that differ in exactly

positions. Now, in the case where the received word

is equidistant from the codewords

and

, unambiguous decoding becomes impossible as the decoder cannot decide which one of

and

to output as the original transmitted codeword. As a result, the half-the minimum distance acts as a combinatorial barrier beyond which unambiguous error-correction is impossible, if we only insist on unique decoding. However, received words such as

considered above occur only in the worst-case and if one looks at the way Hamming balls are packed in high-dimensional space, even for error patterns

beyond half-the minimum distance, there is only a single codeword

within Hamming distance

from the received word. This claim has been shown to hold with high probability for a random code picked from a natural ensemble and more so for the case of Reed-Solomon codes which is well studied and quite ubiquitous in the real world applications. In fact, Shannon’s proof of the capacity theorem for q-ary symmetric channels can be viewed in light of the above claim for random codes.

Under the mandate of list-decoding, for worst-case errors, the decoder is allowed to output a small list of codewords. With some context specific or side information, it may be possible to prune the list and recover the original transmitted codeword. Hence, in general, this seems to be a stronger error-recovery model than unique decoding.

List-decoding potential

For a polynomial-time list-decoding algorithm to exist, we need the combinatorial guarantee that any Hamming ball of radius

around a received word

(where

is the fraction of errors in terms of the block length

) has a small number of codewords. This is because the list size itself is clearly a lower bound on the running time of the algorithm. Hence, we require the list size to be a polynomial in the block length

of the code. A combinatorial consequence of this requirement is that it imposes an upper bound on the rate of a code. List decoding promises to meet this upper bound. It has been shown non-constructively that codes of rate

exist that can be list decoded up to a fraction of errors approaching

. The quantity

is referred to as the list-decoding capacity in the literature. This is a substantial gain compared to the unique decoding model as we now have the potential to correct twice as many errors. Naturally, we need to have at least a fraction

of the transmitted symbols to be correct in order to recover the message. This is an information-theoretic lower bound on the number of correct symbols required to perform decoding and with list decoding, we can potentially achieve reach this information-theoretic limit. However, to realize this potential, we need explicit codes (codes that can be constructed in polynomial time) and efficient algorithms to perform encoding and decoding.

(p, L)-list-decodability

For any error fraction

and an integer

, a code

is said to be list decodable up to a fraction

of errors with list size at most

. In other words, if for every

, the number of codewords

within Hamming distance

from

is at most

, then the code

is said to be (

)-list-decodable.

Combinatorics of list decoding

The relation between list decodability of a code and other fundamental parameters such as minimum distance and rate have been fairly well studied. It has been shown that every code can be list decoded using small lists beyond half the minimum distance up to a bound called the Johnson radius. This is quite significant because it proves the existence of (

)-list-decodable codes of good rate with a list-decoding radius much larger than

. In other words, the Johnson bound rules out the possibility of having a large number of codewords in a Hamming ball of radius slightly greater than

which means that it is possible to correct far more errors with list decoding.

List-decoding capacity

Below,

is the q-ary entropy function (defined for

and extended by continuity to

Theorem (list-decoding capacity)

Let

, and

, then the following two statements hold for large enough block length

.

i) If

, then there exists a (

)-list decodable code.

ii) If

, then every (

)-list-decodable code has

.

What this means is that for rates approaching the channel capacity, there exists list decodable codes with polynomial sized lists enabling efficient decoding algorithms whereas for rates exceeding the channel capacity, the list size becomes exponential which rules out the existence of efficient decoding algorithms.

The proof for list-decoding capacity is a significant one in that it exactly matches the capacity of a

-ary symmetric channel

. In fact, the term "list-decoding capacity" should actually be read as the capacity of an adversarial channel under list decoding. Also, the proof for list-decoding capacity is an important result that pin points the optimal trade-off between rate of a code and the fraction of errors that can be corrected under list decoding.

Sketch of proof

The idea behind the proof is similar to that of Shannon's proof for capacity of the binary symmetric channel

Binary symmetric channel

A binary symmetric channel is a common communications channel model used in coding theory and information theory. In this model, a transmitter wishes to send a bit , and the receiver receives a bit. It is assumed that the bit is usually transmitted correctly, but that it will be "flipped" with a...

where a random code is picked and showing that it is (

)-list-decodable with high probability as long as the rate

. For rates exceeding the above quantity, it can be shown that the list size

becomes super-polynomially large.

A "bad" event is defined as one in which, given a received word

and

messages

, it so happens that

, for every

where

is the fraction of errors that we wish to correct and

is the Hamming ball of radius

with the received word

as the center.

Now, the probability that a codeword

associated with a fixed message

lies in a Hamming ball

is given by

where the quantity

is the volume of a Hamming ball of radius

with the received word

as the center. The inequality in the above relation follows from the upper bound on the volume of a Hamming ball. The quantity

gives a very good estimate on the volume of a Hamming ball of radius

centered around any word in

. Put another way, the volume of a Hamming ball is translation invariant. To continue with the proof sketch, we conjure the union bound in probability theory which tells us that the probability of a bad event happening for a given

is upper bounded by the quantity

.

With the above in mind, the probability of "any" bad event happening can be shown to be less than

. To show this, we work our way over all possible received words

and every possible subset of

messages in

.

Now turning to the proof of part (ii), we need to show that there are super-polynomially many codewords around every

when the rate exceeds the list-decoding capacity. We need to show that

is super-polynomially large if the rate

. Fix a codeword

. Now, for every

picked at random, we have

since Hamming balls are translation invariant. From the definition of the volume of a Hamming ball and the fact that

is chosen uniformly at random from

we also have

Let us now define an indicator variable

such that

0 otherwise.

Taking the expectation of the volume of a Hamming ball we have

Therefore, by the probabilistic method, we have shown that if the rate exceeds the list-decoding capacity, then the list size becomes super-polynomially large. This completes the proof sketch for the list-decoding capacity.

List-decoding algorithms

In the recent past, there has been a slew of efficient algorithms developed by the coding theory community towards the goal of achieving list-decoding capacity. List-decoding algorithms for Reed-Solomon codes that can decode up to the Johnson radius which is

have been developed where

is the normalised distance or relative distance. However, for Reed-Solomon codes,

which means a fraction

of errors can be corrected. Some of the most prominent list-decoding algorithms are the following:

Sudan '95 - The first known non-trivial list-decoding algorithm for Reed-Solomon codes that achieved efficient list decoding up to errors developed by Madhu Sudan
Madhu Sudan
Madhu Sudan is an Indian computer scientist, professor of computer science at the Massachusetts Institute of Technology and a member of MIT Computer Science and Artificial Intelligence Laboratory.-Career:...

.

Guruswami-Sudan '98 - An improvement on the above algorithm for list decoding Reed-Solomon codes up to errors by Madhu Sudan and his then doctoral student Venkatesan Guruswami
Venkatesan Guruswami
Venkatesan Guruswami is a Computer Scientist at Carnegie Mellon University in Pittsburgh, USA. He ranked second All India in IIT-JEE in 1993. He completed his undergraduate in Computer Science from IIT Madras and his doctorate from Massachusetts Institute of Technology under the supervision of...

.

Parvaresh-Vardy '05 - In a breakthrough paper, Farzad Parvaresh and Alexander Vardy presented codes that can be list decoded beyond the radius for low rates . Their codes are variants of Reed-Solomon codes which are obtained by evaluating correlated polynomials instead of just as in the case of usual Reed-Solomon codes.

Guruswami-Rudra '06 - In yet another breakthrough, Venkatesan Guruswami and Atri Rudra give explicit codes that achieve list-decoding capacity, that is, they can be list decoded up to the radius for any . In other words, this is error-correction with optimal redundancy. This answered a question that had been open for about 50 years. This work has been invited to the Research Highlights section of the Communications of the ACM (which is “devoted to the most important research results published in Computer Science in recent years”) and was mentioned in an article titled “Coding and Computing Join Forces” in the Sep 21, 2007 issue of the Science magazine. The codes that they are give are called folded Reed-Solomon codes which are nothing but plain Reed-Solomon codes but viewed as a code over a larger alphabet by careful bundling of codeword symbols.

Because of their ubiquity and the nice algebraic properties they possess, list-decoding algorithms for Reed-Solomon codes have been the main focus of researchers. The list-decoding problem for Reed-Solomon codes can be formulated as follows:

Input: For an

Reed-Solomon code, we are given the pair

for

, where

is the

th bit of the received word and the

's are distinct points in the finite field

and an error parameter

.

Output: The goal is to find all the polynomials

of degree at most

which is the message length such that

for at least

values of

. Here, we would like to have

as small as possible so that greater number of errors can be tolerated.

With the above formulation, the general structure of list-decoding algorithms for Reed-Solomon codes is as follows:

Step 1: (Interpolation) Find a non-zero bivariate polynomial

such that

for

.

Step 2: (Root finding/Factorization) Output all degree

polynomials

such that

is a factor of

i.e.

. For each of these polynomials, check if

for at least

values of

. If so, include such a polynomial

in the output list.

Given the fact that bivariate polynomials can be factored efficiently, the above algorithm runs in polynomial time.

Applications in complexity theory and cryptography

Algorithms developed for list decoding of several interesting code families have found interesting applications in computational complexity

Computational Complexity

Computational Complexity may refer to:*Computational complexity theory*Computational Complexity...

and the field of cryptography

Cryptography

Cryptography is the practice and study of techniques for secure communication in the presence of third parties...

. Following is a sample list of applications outside of coding theory:

Construction of hard-core predicate
Hard-core predicate
In cryptography, a hard-core predicate of a one-way function f is a predicate b which is easy to compute given x but is hard to compute given f...

s from one-way permutations
One-way function
In computer science, a one-way function is a function that is easy to compute on every input, but hard to invert given the image of a random input. Here "easy" and "hard" are to be understood in the sense of computational complexity theory, specifically the theory of polynomial time problems...

.
Predicting witnesses for NP-search problems.
Amplifying hardness of Boolean functions.
Average case hardness of permanent
Permanent
The permanent of a square matrix in linear algebra, is a function of the matrix similar to the determinant. The permanent, as well as the determinant, is a polynomial in the entries of the matrix...

of random matrices.
Extractors and Pseudorandom generator
Pseudorandom generator
In theoretical computer science, a pseudorandom generator is a deterministic procedure that produces a pseudorandom distribution from a short uniform input, known as a random seed.-Definition:...

s.
Efficient traitor tracing.

External links

A Survey on list decoding by Madhu Sudan
Madhu Sudan
Madhu Sudan is an Indian computer scientist, professor of computer science at the Massachusetts Institute of Technology and a member of MIT Computer Science and Artificial Intelligence Laboratory.-Career:...
Notes from a course taught by Madhu Sudan
Notes from a course taught by Luca Trevisan
Luca Trevisan
Luca Trevisan is a professor of computer science at the University of California, Berkeley. As of winter 2010, he is on leave and an acting professor of computer science at Stanford University....
Notes from a course taught by Venkatesan Guruswami
Venkatesan Guruswami
Venkatesan Guruswami is a Computer Scientist at Carnegie Mellon University in Pittsburgh, USA. He ranked second All India in IIT-JEE in 1993. He completed his undergraduate in Computer Science from IIT Madras and his doctorate from Massachusetts Institute of Technology under the supervision of...
Notes from a course taught by Atri Rudra
P. Elias, "List decoding for noisy channels," Technical Report 335, Research Laboratory of Electronics, MIT, 1957.
P. Elias, "Error-correcting codes for list decoding," IEEE Transactions on Information Theory, vol. 37, pp. 5–12, 1991.
J. M. Wozencraft, "List decoding," Quarterly Progress Report, Research Laboratory of Electronics, MIT, vol. 48, pp. 90–95, 1958.
Venkatesan Guruswami
Venkatesan Guruswami
Venkatesan Guruswami is a Computer Scientist at Carnegie Mellon University in Pittsburgh, USA. He ranked second All India in IIT-JEE in 1993. He completed his undergraduate in Computer Science from IIT Madras and his doctorate from Massachusetts Institute of Technology under the supervision of...

's PhD thesis
Algorithmic Results in List Decoding

The source of this article is wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.