6b/8b encoding

In telecommunications, 6b/8b is a line code that expands 6-bit codes to 8-bit symbols for the purposes of maintaining DC-balance in a communications system.[1]

The 6b/8b encoding is a balanced code -- each 8-bit output symbol contains 4 zero bits and 4 one bits. So the code can, like a parity bit, detect all single-bit errors.

The number of 8-bit patterns with 4 bits set is the binomial coefficient = 70. Further excluding the patterns 11110000 and 00001111, this allows 68 coded patterns: 64 data codes, plus 4 additional control codes.

Coding rules

The 64 possible 6-bit input codes can be classified according to their disparity, the number of 1 bits minus the number of 0 bits:

OnesZerosDisparityNumber
06−61
15−46
24−215
33020
42+215
51+46
60+61

The 6-bit input codes are mapped to 8-bit output symbols as follows:

  • The 20 6-bit codes with disparity 0 are prefixed with 10
    Example: 000111 → 10000111
    Example: 101010 → 10101010
  • The 15 6-bit codes with disparity +2, other than 001111, are prefixed with 00
    Example: 010111 → 00010111
  • The 15 6-bit codes with disparity −2, other than 110000, are prefixed with 11
    Example: 101000 → 11101000
  • The remaining 20 codes: 12 with disparity ±4, 2 with disparity ±6, 001111, 110000, and the 4 control codes, are assigned to codes beginning with 01 as follows:
TypeInputOutput TypeInputOutput Complement
−600000001011001 +611111101100110 01_xx__x
−4 00000101110001 +4 11111001001110 01xx____
00001001110010 11110101001101
00010001100101 11101101011010 01x____x
00100001101001 11011101010110
01000001010011 10111101101100 01_____xx
10000001100011 01111101011100
−211000001110100 +200111101001011 01____x__
Control K 00011101000111 Control K 11100001111000
K 01010101010101 K 10101001101010

Obviously, no data symbol contains more than four consecutive matching bits, and because the patterns 11110000 and 00001111 are excluded, no data symbol begins or ends with more than three identical bits. Thus, the longest run of identical bits that will be produced is 6. (I.e. this is a (0,5) RLL code, with a worst-case running disparity of +3 to −3.)

Any occurrence of 6 consecutive identical bits constitutes a comma sequence or sync mark or syncword; it identifies the symbol boundaries precisely. Those 6 bits straddle the inter-symbol boundary with exactly 3 of those identical bits at the end of one symbol, and 3 of those identical bits at the start of the following next symbol.

See also

References

  1. Kees A. Schouhamer Immink (November 2004). Codes for Mass Data Storage Systems (Second fully revised ed.). Eindhoven, The Netherlands: Shannon Foundation Publishers. ISBN 90-74249-27-2. Retrieved 2015-08-23.


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.