Misplaced Pages

Gilbert–Varshamov bound for linear codes

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
This article may be too technical for most readers to understand. Please help improve it to make it understandable to non-experts, without removing the technical details. (January 2019) (Learn how and when to remove this message)

The Gilbert–Varshamov bound for linear codes is related to the general Gilbert–Varshamov bound, which gives a lower bound on the maximal number of elements in an error-correcting code of a given block length and minimum Hamming weight over a field F q {\displaystyle \mathbb {F} _{q}} . This may be translated into a statement about the maximum rate of a code with given length and minimum distance. The Gilbert–Varshamov bound for linear codes asserts the existence of q-ary linear codes for any relative minimum distance less than the given bound that simultaneously have high rate. The existence proof uses the probabilistic method, and thus is not constructive. The Gilbert–Varshamov bound is the best known in terms of relative distance for codes over alphabets of size less than 49. For larger alphabets, algebraic geometry codes sometimes achieve an asymptotically better rate vs. distance tradeoff than is given by the Gilbert–Varshamov bound.

Gilbert–Varshamov bound theorem

Theorem: Let q 2 {\displaystyle q\geqslant 2} . For every 0 δ < 1 1 q {\displaystyle 0\leqslant \delta <1-{\tfrac {1}{q}}} and 0 < ε 1 H q ( δ ) , {\displaystyle 0<\varepsilon \leqslant 1-H_{q}(\delta ),} there exists a q {\displaystyle q} -ary linear code with rate R 1 H q ( δ ) ε {\displaystyle R\geqslant 1-H_{q}(\delta )-\varepsilon } and relative distance δ . {\displaystyle \delta .}

Here H q {\displaystyle H_{q}} is the q-ary entropy function defined as follows:

H q ( x ) = x log q ( q 1 ) x log q x ( 1 x ) log q ( 1 x ) . {\displaystyle H_{q}(x)=x\log _{q}(q-1)-x\log _{q}x-(1-x)\log _{q}(1-x).}

The above result was proved by Edgar Gilbert for general codes using the greedy method. Rom Varshamov refined the result to show the existence of a linear code. The proof uses the probabilistic method.

High-level proof:

To show the existence of the linear code that satisfies those constraints, the probabilistic method is used to construct the random linear code. Specifically, the linear code is chosen by picking a generator matrix G F q k × n {\displaystyle G\in \mathbb {F} _{q}^{k\times n}} whose entries are randomly chosen elements of F q {\displaystyle \mathbb {F} _{q}} . The minimum Hamming distance of a linear code is equal to the minimum weight of a nonzero codeword, so in order to prove that the code generated by G {\displaystyle G} has minimum distance d {\displaystyle d} , it suffices to show that for any m F q k { 0 } , wt ( m G ) d {\displaystyle m\in \mathbb {F} _{q}^{k}\smallsetminus \left\{0\right\},\operatorname {wt} (mG)\geq d} . We will prove that the probability that there exists a nonzero codeword of weight less than d {\displaystyle d} is exponentially small in n {\displaystyle n} . Then by the probabilistic method, there exists a linear code satisfying the theorem.

Formal proof:

By using the probabilistic method, to show that there exists a linear code that has a Hamming distance greater than d {\displaystyle d} , we will show that the probability that the random linear code having the distance less than d {\displaystyle d} is exponentially small in n {\displaystyle n} .

The linear code is defined by its generator matrix, which we choose to be a random k × n {\displaystyle k\times n} generator matrix; that is, a matrix of k n {\displaystyle kn} elements which are chosen independently and uniformly over the field F q {\displaystyle \mathbb {F} _{q}} .

Recall that in a linear code, the distance equals the minimum weight of a nonzero codeword. Let wt ( y ) {\displaystyle \operatorname {wt} (y)} be the weight of the codeword y {\displaystyle y} . So

P = Pr random  G ( linear code generated by  G  has distance < d ) = Pr random  G ( there exists a non-zero codeword  y  in a linear code generated by  G  such that  wt ( y ) < d ) = Pr random  G ( there exists  0 m F q k  such that  wt ( m G ) < d ) {\displaystyle {\begin{aligned}P&=\Pr _{{\text{random }}G}({\text{linear code generated by }}G{\text{ has distance}}<d)\\&=\Pr _{{\text{random }}G}({\text{there exists a non-zero codeword }}y{\text{ in a linear code generated by }}G{\text{ such that }}\operatorname {wt} (y)<d)\\&=\Pr _{{\text{random }}G}\left({\text{there exists }}0\neq m\in \mathbb {F} _{q}^{k}{\text{ such that }}\operatorname {wt} (mG)<d\right)\end{aligned}}}

The last equality follows from the definition: if a codeword y {\displaystyle y} belongs to a linear code generated by G {\displaystyle G} , then y = m G {\displaystyle y=mG} for some vector m F q k {\displaystyle m\in \mathbb {F} _{q}^{k}} .

By Boole's inequality, we have:

P 0 m F q k Pr random  G ( wt ( m G ) < d ) {\displaystyle P\leqslant \sum _{0\neq m\in \mathbb {F} _{q}^{k}}\Pr _{{\text{random }}G}(\operatorname {wt} (mG)<d)}

Now for a given message 0 m F q k , {\displaystyle 0\neq m\in \mathbb {F} _{q}^{k},} we want to compute

W = Pr random  G ( wt ( m G ) < d ) . {\displaystyle W=\Pr _{{\text{random }}G}(\operatorname {wt} (mG)<d).}

Let Δ ( m 1 , m 2 ) {\displaystyle \Delta (m_{1},m_{2})} be a Hamming distance of two messages m 1 {\displaystyle m_{1}} and m 2 {\displaystyle m_{2}} . Then for any message m {\displaystyle m} , we have: wt ( m ) = Δ ( 0 , m ) {\displaystyle \operatorname {wt} (m)=\Delta (0,m)} . Therefore:

W = { y F q n | Δ ( 0 , y ) d 1 } Pr random  G ( m G = y ) {\displaystyle W=\sum _{\{y\in \mathbb {F} _{q}^{n}|\Delta (0,y)\leqslant d-1\}}\Pr _{{\text{random }}G}(mG=y)}

Due to the randomness of G {\displaystyle G} , m G {\displaystyle mG} is a uniformly random vector from F q n {\displaystyle \mathbb {F} _{q}^{n}} . So

Pr random  G ( m G = y ) = q n {\displaystyle \Pr _{{\text{random }}G}(mG=y)=q^{-n}}

Let Vol q ( r , n ) {\displaystyle \operatorname {Vol} _{q}(r,n)} be the volume of a Hamming ball with the radius r {\displaystyle r} . Then:

P q k W = q k ( Vol q ( d 1 , n ) q n ) q k ( q n H q ( δ ) q n ) = q k q n ( 1 H q ( δ ) ) {\displaystyle P\leqslant q^{k}W=q^{k}\left({\frac {\operatorname {Vol} _{q}(d-1,n)}{q^{n}}}\right)\leqslant q^{k}\left({\frac {q^{nH_{q}(\delta )}}{q^{n}}}\right)=q^{k}q^{-n(1-H_{q}(\delta ))}}

By choosing k = ( 1 H q ( δ ) ε ) n {\displaystyle k=(1-H_{q}(\delta )-\varepsilon )n} , the above inequality becomes

P q ε n {\displaystyle P\leqslant q^{-\varepsilon n}}

Finally q ε n 1 {\displaystyle q^{-\varepsilon n}\ll 1} , which is exponentially small in n, that is what we want before. Then by the probabilistic method, there exists a linear code C {\displaystyle C} with relative distance δ {\displaystyle \delta } and rate R {\displaystyle R} at least ( 1 H q ( δ ) ε ) {\displaystyle (1-H_{q}(\delta )-\varepsilon )} , which completes the proof.

Comments

  1. The Varshamov construction above is not explicit; that is, it does not specify the deterministic method to construct the linear code that satisfies the Gilbert–Varshamov bound. A naive approach is to search over all generator matrices G {\displaystyle G} of size k n {\displaystyle kn} over the field F q {\displaystyle \mathbb {F} _{q}} to check if the linear code associated to G {\displaystyle G} achieves the predicted Hamming distance. This exhaustive search requires exponential runtime in the worst case.
  2. There also exists a Las Vegas construction that takes a random linear code and checks if this code has good Hamming distance, but this construction also has an exponential runtime.
  3. For sufficiently large non-prime q and for certain ranges of the variable δ, the Gilbert–Varshamov bound is surpassed by the Tsfasman–Vladut–Zink bound.

See also

References

  1. Tsfasman, M.A.; Vladut, S.G.; Zink, T. (1982). "Modular curves, Shimura curves, and Goppa codes better than the Varshamov-Gilbert bound". Mathematische Nachrichten. 104.
  2. The later inequality comes from the upper bound of the Volume of Hamming ball Archived 2013-11-08 at the Wayback Machine
  3. Stichtenoth, H. (2006). "Transitive and self-dual codes attaining the Tsfasman-Vla/spl breve/dut$80-Zink bound". IEEE Transactions on Information Theory. 52 (5): 2218–2224. doi:10.1109/TIT.2006.872986. ISSN 0018-9448. S2CID 11982763.
Category: