Final Report/Thesis 2015
=====Significance Level Calculation=====
The chi-squared values and ''p-values'' calculated showed that English was the closest language to the Somerton Man code, so ''hypothesis testing'' could be performed based on the English results. After consultation with Prof. Abbott and Dr. Berryman, it was decided not to choose an arbitrary significance level such as the commonly used p=0.05. Instead, a significance level was calculated from the ''p-value'' obtained using real English texts, and this was taken as the level at which we could confidently say that the most likely language of origin of the Somerton Man code is English.

This was achieved by collecting 20 excerpts of 44 letters each from English novels on ''Project Gutenberg'' (see Appendix C), performing ''chi-squared testing'' on these samples against the English ''Project Gutenberg'' novel used as our English base text, averaging the resulting chi-squared values, and calculating a ''p-value'' from that average. This result was then compared with the results from the English portion of the ''chi-squared testing'' performed on the variants of the code, and plotted as seen in Figure 40.

The same testing was then run on the English samples and code variants against the original English translation of the ''Universal Declaration of Human Rights'' as a means of comparison between the two base texts. Significance levels could not be calculated using the ''Universal Declaration of Human Rights'' because the chi-squared values were too large, causing the calculated ''p-values'' to be too small (approaching 0). The results can be seen in Figure 39.

It was unnecessary to extend the analysis to collect benchmarks and perform ''hypothesis testing'' on the other European languages against the code, since the chi-squared values produced were too large and the resulting ''p-values'' were therefore unusable.
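
The benchmark procedure described in this section can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the function names (<code>letter_counts</code>, <code>chi_squared</code>, <code>chi2_sf</code>, <code>benchmark_p_value</code>) are hypothetical, and it assumes a Pearson chi-squared statistic over A to Z letter counts, with degrees of freedom equal to the number of distinct letters in the base text minus one. The thesis may have used different categories or a different mapping from the averaged statistic to a ''p-value''.

```python
import math
import string
from collections import Counter


def letter_counts(text):
    """Count A-Z letters only, ignoring case, spaces and punctuation."""
    return Counter(ch for ch in text.upper() if ch in string.ascii_uppercase)


def chi_squared(sample, base_text):
    """Pearson chi-squared of a sample's letter counts against the
    letter frequencies of a base text (assumed form of the test)."""
    observed = letter_counts(sample)
    base = letter_counts(base_text)
    n = sum(observed.values())
    base_total = sum(base.values())
    if n == 0 or base_total == 0:
        raise ValueError("sample and base text must both contain letters")
    stat = 0.0
    # Letters absent from the base text have zero expected count and are
    # skipped here (an assumption made for this sketch).
    for letter, base_count in base.items():
        expected = n * base_count / base_total
        stat += (observed.get(letter, 0) - expected) ** 2 / expected
    return stat


def chi2_sf(x, df):
    """Upper-tail probability of a chi-squared distribution, built from
    the recurrence Q(a+1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a+1)."""
    if df < 1:
        raise ValueError("df must be at least 1")
    if x <= 0:
        return 1.0
    z = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)                # Q(1, z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))     # Q(1/2, z)
    while a < df / 2.0 - 1e-9:
        q += math.exp(a * math.log(z) - z - math.lgamma(a + 1.0))
        a += 1.0
    return q


def benchmark_p_value(samples, base_text):
    """Average the chi-squared values of the benchmark samples and
    convert that average to a p-value (assumed df = categories - 1)."""
    stats = [chi_squared(s, base_text) for s in samples]
    mean_stat = sum(stats) / len(stats)
    df = len(letter_counts(base_text)) - 1
    return mean_stat, chi2_sf(mean_stat, df)
```

Under these assumptions, passing the 20 benchmark excerpts and the base novel text to <code>benchmark_p_value</code> would return the averaged chi-squared value and a candidate significance level. For very large chi-squared values the returned ''p-value'' underflows towards 0, which is consistent with the behaviour reported for the ''Universal Declaration of Human Rights'' base text.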