One is interested here in the means of formalizing information in order to be able to handle it (mainly to transmit it). One will thus not be interested in the contents but only in the form.

Alphabet, word, languages

Definitions

One defines a alphabet as a nonempty whole of symbols, for example:
  • has = {has, B, C,…, Z}, the Latin alphabet;
  • has = {0,1,2,…, 9}, the figures known as Arab
  • has = {0,1,…, 9, has, B,…, F}, the hexadecimal figures.
  • has = {0,1}, the alphabet of the Boolean Logique.
  • has = {has, T, G, C}, the bases of DNA which code our Génome (this alphabet is the principal subject of the Bio-data processing).

One names lettre an element of a alphabet.
One names mot a finished continuation of lettres.
The continuation of 0 letter is named the mot vide, noted ε.
One names langage a whole of mots associated with some rules of interpretation (without this last restriction, any table of random values could be named language ). In the case of the DNA, these rules are contained in the Ribosome, in the natural languages, they are contained in their Lexique, on a computer, they are present in the circuits of the Central processing unit.

Operations

That is to say an alphabet A and a natural entirety n.
One notes A^n the whole of all the words length n on A and A^* the whole of all the words of A.
One has: A^* = \ bigcup_ {N \ geq 0} ^ {\ infty} A^n (Closing of Kleene).
The operation of concatenation is defined \ cdot: A^* \ times A^* \ rightarrow A^* which with (U, v) associates a word w which is consisted of the continuation of letters of u then that of v.
Example: “marc” \ cdot “and Sophie” = “marc and Sophie” (the quotation marks are used to delimit the symbols, they are not elements of A).
  • Properties :
    • \ cdot is associative: \ forall U, v, W \ in A^*, (U \ cdot v) \ cdot W = U \ cdot (v \ cdot W)
    • \ cdot admits ε like neutral element: \ forall U \ in A^*, U \ cdot \ epsilon = \ epsilon \ cdot U = u
    • \ cdot is not not commutative.

Codings and codes

Coding

That is to say L and M two langages.
A codage C of L in M is a morphism (for the operation \ cdot) injective. In other words, it is a correspondence between the words of L and those of M, where with any word of L a single word of M is associated and such as coding with concaténée either equal to concaténée with codings. ( \ forall U, v \ in L, C (u.v) = C (U) .c (v) ).

Code

A language L on an alphabet has is a code if and only if there do not exist two factorizations different from the words A^* with words of L.

Applications, examples

Random links:Piano Collections Final Fantasy IV | Jean Gagnepain | Tensta | Tournament of Wimbledon 2004 | Gary Stevens