Final Report/Thesis 2019
===Results and discussion===
Matching DNA kits were found in the database for both DNA samples. Sample 1 has 8182 matching kits and sample 2 has 5968. The top 30 matching kits for sample 1 are shown in Figure 20. The Kit, Name and Email columns give the kit number, the name of the kit and the email address of the kit's owner. The Total cM column shows the total centimorgans (cM), a measure of genetic linkage between the two DNA kits. Note that the top 30 matching kits are the kits with the largest total cM. Finally, the Overlap column shows how many SNPs were used in the comparison between the two kits.

[[File:match_results_sample1.png|thumb|600px|center|Figure 20: Match results of DNA sample 1]]

Next, the top 30 matching kits for each degraded DNA reference file were recorded, and the false negatives and false positives were calculated. Every degraded file except those with 10% of SNPs remaining has more than 30 matching kits, so both the original and the degraded top-30 lists contain exactly 30 kits. Each kit missing from the degraded list is therefore replaced by one kit that is not in the original list, and the numbers of false negatives and false positives are equal. The degraded files with 10% of SNPs remaining return no matches. Four degradation strategies were introduced in task 3, so four sets of false negatives and false positives are available for analysis.

Figure 21 presents a line graph of the number of false negatives and false positives against degradation level. Each plotted value is the mean of the four sets of data. The degradation level with 10% of SNPs remaining is omitted from the graph because it produced no matches. A similar graph produced by last year's project is shown in Figure 22. The DNA sample used in Figure 22 is completely different from the samples used in Figure 21. Both graphs show that the numbers of false positives and false negatives differ between DNA samples, but the trends are similar: as more SNPs are removed, the number of false positives and false negatives increases until 50% of the SNPs are removed.

When more than half of the SNPs are removed, the number of false positives and false negatives reaches its maximum of 30, which indicates that the match results of the original and degraded kits are completely different at these levels. These results show that as more SNPs are removed from the original DNA reference file, the match results become less accurate. When only half of the SNPs remain in the DNA kit, the match results are completely different and therefore unreliable. Moreover, when 10% of the SNPs are removed, more than half of the match results differ, which indicates that removing even a small fraction of the SNPs can make a large difference to the DNA match test.

[[File:false1.png|thumb|600px|center|Figure 21: False positives and false negatives test for DNA samples 1 and 2]]

[[File:false2.png|thumb|600px|center|Figure 22: False positives and false negatives test from previous year [14]]]
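The false positive and false negative counts discussed in this section could be computed along the following lines. This is only a minimal sketch, not the project's actual code: it assumes each match result is available as a mapping from kit ID to total cM, and the function names, data structure and kit IDs are all illustrative.

```python
from statistics import mean

TOP_N = 30  # the section compares top-30 match lists


def top_matches(matches, n=TOP_N):
    """Return the kit IDs of the n matches with the largest total cM.

    `matches` maps a kit ID to its total cM (assumed structure).
    """
    ranked = sorted(matches, key=matches.get, reverse=True)
    return set(ranked[:n])


def count_errors(original_top, degraded_top):
    """Count false negatives and false positives between two top-N sets."""
    # False negatives: kits in the original list that the degraded file lost.
    false_negatives = len(original_top - degraded_top)
    # False positives: kits in the degraded list that the original did not have.
    false_positives = len(degraded_top - original_top)
    return false_negatives, false_positives


def mean_errors(original_top, degraded_tops):
    """Average the counts over several degraded files (one per strategy)."""
    counts = [count_errors(original_top, d) for d in degraded_tops]
    mean_fn = mean(fn for fn, _ in counts)
    mean_fp = mean(fp for _, fp in counts)
    return mean_fn, mean_fp


# Toy example with made-up kit IDs and cM values, using a top-3 list.
original = {"A": 120.0, "B": 90.5, "C": 60.0, "D": 10.0}
degraded = {"A": 100.0, "B": 20.0, "C": 55.0, "D": 40.0, "E": 70.0}
fn, fp = count_errors(top_matches(original, 3), top_matches(degraded, 3))
```

When both lists are full, the two set differences have the same size, which is why the two counts coincide at every degradation level that still returns at least 30 matches.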