The theorem of Cox-Jaynes (1946), due in its original version to the physicist Richard Cox, is a coding of the processes of training starting from a certain whole of postulates. This coding is to coincide at the end of these considerations with that - historically of very different origin - of Probabilité.
It thus induces a Interprétation “logical” of the probabilities independent of that of Fréquence. It also provides a rational base to the logical mechanism of induction, and thus of the training by machines. Who more is, the invalid theorem - under the conditions of the postulates - any other form of representation of knowledge as skewed. It is thus about an extremely strong result. (source: Myron Tribes, rational Decisions in dubious the , Masson, 1974)
The results of Cox had not touched that a reduced audience before E.T. Jaynes redécouvre this theorem and does not clear of it a series of implications for the methods bayésiennes, and Irving John Good for the Artificial intelligence.
In the name of what to generalize that what was checked in a limited number of cases will also check in the cases which were not tested?
Cox seeks to pose the desirable desideratas for a robot which would reason according to an inductive logical :
It is well indeed necessary to be able constantly to say of two plausibilities which is larger than the other , which suggests a quantitative representation, and the digital form seems convenient.
Adopted convention, arbitrarily, is that larger plausibilities will be represented by larger numbers .
In other words, which appears obvious to us should not be contradicted by the model (with the difference of what occurs with the Paradoxe from Condorcet).
Example:
if has is preferable with B,
For the five following sections, all the formulas are here:
Cox-Jaynes (pdf)
If a conclusion can be obtained by more than one means, then all these means must give the same result well.
This rule eliminates from the field of examination the heuristic multiples since they could contain between them contradictions (as do it for example sometimes the criteria of Wald and the minimax in Game theory).
The robot must always take into account the totality of the information which is provided to him. It should not be unaware of a part of them deliberately and base its conclusions on the remainder. In other words, the robot must be completely nonideological , neutral from point of view.
The robot represents equivalent states of knowledge by equivalent plausibilities. If two problems are identical to a simple labelling of proposals close, the robot must assign same plausibilities in both cases.
That means in particular that proposals will be considered a priori as equivalent plausibility when they are characterized only by their name - what hardly arrives but in very particular cases, like a part or a die having satisfied criteria from non-pipage.
Alan Turing had pointed out in its time that the expression of the probabilities was much easier to handle by replacing a probability p varying from 0 to 1 by the expression ln (p (1-p)) varying between less the infinite one and more the infinite one. In particular, in this form, a contribution of information by the rule of Bayes results in the addition of a single algebraic quantity to this expression (that Turing named log-odd ), that whatever the probability a priori starting before the observation.
It named corresponding measurement, W = 10 log10 (p (1-p)), weight off obviousness because it made it possible “to weigh” the testimony of the facts according to waitings - expressed by “subjective” probabilities former to the observation - independently of these waitings .
And logic = produced probabilities
This consideration led to the invention in the Années 1970 of the stochastic Calculateurs promoted by the company Alsthom (which was written with a H at the time) and which intended to combine the low costs of the gates with the processing capability of the analog computers. Some were carried out at the time.
August 1st
Perhaps in the case of an electric assembly, a specialist in electrical engineering will make an estimate of dissipated power (IH ²) while an other of current low will prefer to estimate the intensity itself (I). If the convergence in the long term of the estimates is ensured in both cases, it will not be done in the same way, even with distributions a priori identical , because the expectation of a square is not mathematically related to the square of a hope. It is the principal stone of obstacle of the methods bayésiennes.
| Random links: | Standard Athletic Club | Arzviller | Canton of Chomérac | Daguenière | Reverberation (acoustic) |