Bit

From Wikipedia, the free encyclopedia

Jump to: navigation, search
Multiples of bits
SI prefixes Binary prefixes
Name
(Symbol)
Standard
SI
Binary
usage
Name
(Symbol)
Value
kilobit (kbit) 103 210 kibibit (Kibit) 210
megabit (Mbit) 106 220 mebibit (Mibit) 220
gigabit (Gbit) 109 230 gibibit (Gibit) 230
terabit (Tbit) 1012 240 tebibit (Tibit) 240
petabit (Pbit) 1015 250 pebibit (Pibit) 250
exabit (Ebit) 1018 260 exbibit (Eibit) 260
zettabit (Zbit) 1021 270 zebibit (Zibit) 270
yottabit (Ybit) 1024 280 yobibit (Yibit) 280

A bit is a binary digit, taking a value of either 0 or 1. Binary digits are a basic unit of information storage and communication in digital computing and digital information theory. Information theory also often uses the natural digit, called either a nit or a nat. Quantum computing also uses qubits, a single piece of information with a probability of being true.

The bit is also a unit of measurement, the information capacity of one binary digit. It has the symbol bit or b, the latter recommended by IEEE 1541-2002.

Contents

[edit] Binary digit

Claude E. Shannon first used the word bit in his 1948 paper A Mathematical Theory of Communication. He attributed its origin to John W. Tukey, who had written a Bell Labs memo on 9 January 1947 in which he contracted "binary digit" to simply "bit". Interestingly, Vannevar Bush had written in 1936 of "bits of information" that could be stored on the punch cards used in the mechanical computers of that time. [1]

A bit of storage can be either on (1) or off (0). A single bit is a one or a zero, a true or a false, a "flag" which is "on" or "off", or in general, the quantity of information required to distinguish two mutually exclusive equally probable states from each other. Gregory Bateson defined a bit as "a difference which makes a difference".[2]

[edit] Representation

[edit] Transmission

Bits can be implemented in many forms depending on context. For example, in digital circuitry in most computing devices as well as flash memories, a bit is an electrical pulse generated by the internal clock in the control unit or data register. For devices using positive logic, a logical 1 (true value) is represented by up to 5 volts, while a logical 0 (false value) is represented by 0 volt.

[edit] Storage

Bits are manipulated in the volatile memory of a computer, and can further be kept in a persistent manner on a magnetic storage device such as magnetic tape or disc, as well as on optical discs or an non-volatile flash memory.

[edit] Unit

It is important to differentiate between the use of "bit" in referring to a discrete storage unit and the use of "bit" in referring to a statistical unit of information. The bit, as a discrete storage unit, can by definition store only 0 or 1. A statistical bit is the amount of information that, on average[citation needed], can be stored in a discrete bit. It is thus the amount of information carried by a choice between two equally likely outcomes. One bit corresponds to about 0.693 nats (ln(2)), or 0.301 hartleys (log10(2)).

Consider, for example, a computer file with one thousand 0s and 1s which can be losslessly compressed to a file of five hundred 0s and 1s (on average, over all files of that kind). The original file, although having 1,000 bits of storage, has at most 500 bits of information entropy, since information is not destroyed by lossless compression. A file can have no more information theoretical bits than it has storage bits. If these two ideas need to be distinguished, sometimes the name bit is used when discussing data storage while shannon is used for the statistical bit. However, most of the time, the meaning is clear from the context.

[edit] Abbreviation and symbol

IEEE 1541-2002 specifies "b" to be the unit symbol for bit and "B" to be that for byte. This convention is also widely used in computing.

SATA-IO uses Gb/s.[3]

The relevant ISO/IEC standard is IEC 80000-13:2008 which is not publicly available. ISO says: "This standard cancels and replaces subclauses 3.8 and 3.9 of IEC 60027-2:2005. The only significant change is the addition of explicit definitions for some quantities."[4]

These subclauses were related to information theory and prefixes for binary multiples.

The International Electrotechnical Commission's IEC 60027, specifies that the bit should have the symbol bit, used in all multiples, such as "kbit" (for kilobit). In the same documents, the symbols "o" and "B" are specified for the byte.

NIST in their "Guide for the Use of the International System of Units Edition 2008" recommends "bit" while referring to obsolete ISO 31 and IEC 60027.[5]

[edit] Multiple bits

Several naming conventions exist for collections or groups of bits. The byte, although historically differing in size depending on computer hardware architecture, is today almost always eight bits. However, 8-bit bytes are also known specifically as octets. These can represent 256 (28, 0–255) values. A 4-bit quantity is known as a nibble, and can represent 16 (24, 0–15) values.

"Word" is a term for a slightly larger group of bits, but it has no standard size. It represents the size of one register in a Computer-CPU. In the IA-32 architecture more commonly known as x86-32, 16 bits constitute a word (with 32 bits being a double-word or dword), but other architectures have word sizes of 8, 32, 64, 80 bits or others.

Terms for large quantities of bits can be formed using the standard range of SI prefixes, e.g., kilobit (kbit), megabit (Mbit) and gigabit (Gbit), or using any of the binary prefixes. Much confusion exists regarding these units and their abbreviations, due to the historical usage of SI-prefixes for binary multiples (1024-radix) and attempts to define a consistent standard.

When a bit within a group of bits such as a byte or word is to be referred to, it is usually specified by a number from 0 (not 1) upwards corresponding to its position within the byte or word. However, 0 can refer to either the most significant bit or to the least significant bit depending on the context, so the convention of use must be known.

Certain bitwise computer processor instructions (such as bit set) operate at the level of manipulating bits rather than manipulating data interpreted as an aggregate of bits.

Telecommunications or computer network transfer rates are usually described in terms of bits per second (bit/s), not to be confused with baud.

[edit] Uncommon names for groups of bits

Similarly to the well-known terms byte and nibble, other terms of bit groups of varying sizes have been used over time.[6] All of these are jargon, are obsolete, or are not very common.

  • 1 bit: sniff
  • 2 bits: lick, crumb, quad, quarter, tayste, tydbit
  • 4 bits: nibble, nybble
  • 5 bits: nickel, nyckle
  • 10 bits: deckle, dyme bag
  • 16 bits: plate, playte, chomp, chawmp (on a 32-bit machine)
  • 18 bits: chomp, chawmp (on a 36-bit machine)
  • 32 bits: dinner, dynner, gawble (on a 32-bit machine)
  • 48 bits: gobble, gawble (under circumstances that remain obscure)

[edit] See also

[edit] Notes

  1. ^ Darwin among the machines: the evolution of global intelligence, George Dyson, 1997. ISBN 0-201-40649-7
  2. ^ Social Systems
  3. ^ http://www.sata-io.org/6gbnamingguidelines.asp]
  4. ^ http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=31898
  5. ^ http://physics.nist.gov/cuu/pdf/sp811.pdf
  6. ^ nybble reference.com sourced from Jargon File 4.2.0, accessed 2007-08-12
Personal tools