Difference between revisions of "Transition probabilities from selected texts"
(<br/> tags) |
(→Second Order Transition Probabilies: second order added) |
||
Line 31: | Line 31: | ||
==Second Order Transition Probabilies== | ==Second Order Transition Probabilies== | ||
− | + | English (1984 - George Orwell) | |
+ | <br/>Markov Probability: 2.115089006082431E-43 | ||
+ | <br/>Corrected Zeroes: 14 | ||
+ | |||
+ | German (Traumdeutung - Sigmund Freud) | ||
+ | <br/>Note: does not account for Eszett (sharp s) character | ||
+ | <br/>Markov Probability: 3.79644909538402E-35 | ||
+ | <br/>Corrected Zeroes: 21 | ||
+ | |||
+ | French (Les Orientales - Victor Hugo) | ||
+ | <br/>Markov Probability: 4.429249667204738E-34 | ||
+ | <br/>Corrected Zeroes: 18 | ||
+ | |||
+ | Vigenere (English - 1984 - Orwell) | ||
+ | <br/>Markov Probability: 1.6699440985106574E-60 | ||
+ | <br/>Corrected Zeroes: 8 | ||
+ | |||
+ | English Initial Letters (1984 - George Orwell) | ||
+ | <br/>Markov Probability: 7.009981410871232E-53 | ||
+ | <br/>Corrected Zeroes: 2 | ||
+ | |||
+ | German Initial Letters (Traumdeutung - Sigmund Freud) | ||
+ | <br/>Note: does not account for Eszett (sharp s) character | ||
+ | <br/>Markov Probability: 2.908650572588623E-32 | ||
+ | <br/>Corrected Zeroes: 17 | ||
+ | |||
+ | Les Orientales - Victor Hugo.txt | ||
+ | <br/>Not counting l' as a word (but counting the word contracted with it): | ||
+ | <br/>Markov Probability: 1.0762921500526206E-40 | ||
+ | <br/>Corrected Zeroes: 12 | ||
+ | <br/>Counting the "l'" as one word and the other contracted word as another word: | ||
+ | <br/>Markov Probability: 2.970787716759867E-41 | ||
+ | <br/>Corrected Zeroes: 12 |
Revision as of 18:09, 5 June 2009
The Somerton Man's code (without the extra line) is 44 characters long. So, if the text is purely random (1/26 chance of each letter appearing) then the probability of attaining this particular string of 44 is (1/26)^44 = 5.51027E-63. This is a good initial comparison.
First Order Transition Probabilities
English (1984 - George Orwell)
1.4641414719132793E-67
French (Les Orientales - Victor Hugo)
1.1571661202766258E-70
Vigenere Cipher (1984 - George Orwell, Keyword LEMON)
1.646391769425068E-70
German (Traumdeutung - Sigmund Freud)
Note: does not account for Eszett (sharp s) character
3.8662593620911806E-73
English Initial Letters (1984 - George Orwell)
2.351742120702543E-57
French Initial Letters (Les Orientales - Victor Hugo)
counting words like l'hopital as two words ("le" followed by "hopital"):
7.809561685705767E-61
discounting the "l'" (only consider the "hopital")
1.1841007473332175E-60
German Initial Letters (Traumdeutung - Sigmund Freud)
Note: does not account for Eszett (sharp s) character. Though I don't think a word can ever start with this character
4.29592233581315E-64
Second Order Transition Probabilies
English (1984 - George Orwell)
Markov Probability: 2.115089006082431E-43
Corrected Zeroes: 14
German (Traumdeutung - Sigmund Freud)
Note: does not account for Eszett (sharp s) character
Markov Probability: 3.79644909538402E-35
Corrected Zeroes: 21
French (Les Orientales - Victor Hugo)
Markov Probability: 4.429249667204738E-34
Corrected Zeroes: 18
Vigenere (English - 1984 - Orwell)
Markov Probability: 1.6699440985106574E-60
Corrected Zeroes: 8
English Initial Letters (1984 - George Orwell)
Markov Probability: 7.009981410871232E-53
Corrected Zeroes: 2
German Initial Letters (Traumdeutung - Sigmund Freud)
Note: does not account for Eszett (sharp s) character
Markov Probability: 2.908650572588623E-32
Corrected Zeroes: 17
Les Orientales - Victor Hugo.txt
Not counting l' as a word (but counting the word contracted with it):
Markov Probability: 1.0762921500526206E-40
Corrected Zeroes: 12
Counting the "l'" as one word and the other contracted word as another word:
Markov Probability: 2.970787716759867E-41
Corrected Zeroes: 12