One is interested here in the means of formalizing information in order to be able to handle it (mainly to transmit it). One will thus not be interested in the contents but only in the form.
Alphabet, word, languages
Definitions
One defines a
alphabet as a nonempty whole of symbols, for example:
- has = {has, B, C,…, Z}, the Latin alphabet;
- has = {0,1,2,…, 9}, the figures known as Arab
- has = {0,1,…, 9, has, B,…, F}, the hexadecimal figures.
- has = {0,1}, the alphabet of the Boolean Logique.
- has = {has, T, G, C}, the bases of DNA which code our Génome (this alphabet is the principal subject of the Bio-data processing).
One names lettre an element of a alphabet.
One names mot a finished continuation of lettres.
The continuation of 0 letter is named the mot vide, noted ε.
One names langage a whole of mots associated with some rules of interpretation (without this last restriction, any table of random values could be named language ). In the case of the DNA, these rules are contained in the Ribosome, in the natural languages, they are contained in their Lexique, on a computer, they are present in the circuits of the Central processing unit.
Operations
That is to say an alphabet
and a natural entirety
.
One notes
the whole of all the words length
on
and
the whole of all the words of
.
One has:
(
Closing of Kleene).
The operation of concatenation is defined
which with
associates a word
which is consisted of the continuation of letters of
then that of
.
Example: “marc”
“and Sophie” = “marc and Sophie” (the quotation marks are used to delimit the symbols, they are not elements of
).
- Properties :
- is associative:
- admits ε like neutral element:
- is not not commutative.
Codings and codes
Coding
That is to say L and M two langages.
A
codage C of L in M is a morphism (for the operation
) injective.
In other words, it is a correspondence between the words of L and those of M, where with any word of L a single word of M is associated and such as coding with concaténée either equal to concaténée with codings. (
).
Code
A language L on an alphabet has is a code if and only if there do not exist two factorizations different from the words with words of L.
Applications, examples