===Results and discussion===
Matching kits were successfully found in the database for both DNA samples. Sample 1 has 8182 matching kits, and 5968 DNA files were found to be related to sample 2. The top 30 matching kits for sample 1 are shown in figure 20. The Kit, Name and Email columns give the kit number, the name of the kit and the email address of the kit's owner. The Total cM column shows the total length, in centimorgans (cM, a unit of genetic linkage), of the DNA shared between the two kits. Note that the top 30 matching kits are those with the largest total cM. Finally, the Overlap column shows how many SNPs were used in the comparison between the two kits.
[[File:match_results_sample1.png|thumb|600px|center|Figure 20: Match results for DNA sample 1]]
Next, the top 30 matching kits for each degraded DNA reference file were recorded, and the false negatives and false positives were calculated. Because every degraded file except those with 10% of SNPs remaining has more than 30 matching kits, each top 30 list is full, so the number of false negatives always equals the number of false positives. The degraded files with 10% of SNPs remaining returned no matches. Four degradation strategies were introduced in task 3, so four sets of false negatives and false positives are available for analysis.

Figure 21 presents a line graph of the number of false negatives and false positives against degradation level. Each plotted value is the mean of the four sets of data. The level with 10% of SNPs remaining is omitted from the graph because it produced no matches. A similar graph from last year's project is shown in figure 22. The DNA sample used in figure 22 is completely different from the samples used in figure 21.

The two graphs show that the numbers of false positives and false negatives differ between DNA samples, but the trends are similar. As more SNPs are removed, the number of false positives and false negatives increases until 50% of SNPs are removed. Once more than half of the SNPs are removed, the number reaches the maximum of 30, which indicates that the match results of the original and degraded kits are completely different at these levels. These results show that the more SNPs are removed from the original DNA reference file, the less accurate the match results become. When only half of the SNPs remain in the DNA kit, the match results are completely different and therefore unreliable. Moreover, when 10% of SNPs are removed, more than half of the match results differ, which indicates that removing even a small proportion of SNPs can make a large difference to a DNA match test.
[[File:false1.png|thumb|600px|center|Figure 21: False positive and false negative test for DNA samples 1 and 2]]
[[File:false2.png|thumb|600px|center|Figure 22: False positive and false negative test from the previous year [14]]]
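The false positive and false negative counts discussed above can be illustrated with a minimal sketch. It assumes that a false negative is a kit in the original top list that is missing from the degraded top list, and that a false positive is a kit in the degraded top list that is absent from the original one. The function names, kit IDs and cM values are hypothetical and are not taken from the project's scripts or data. The sketch also shows why the two counts coincide whenever both lists are full: two sets of equal size always differ by the same number of elements in each direction.

```python
from typing import Dict, List, Tuple


def top_matches(total_cm: Dict[str, float], n: int = 30) -> List[str]:
    """Return the n kit IDs with the largest total cM (ties broken by kit ID)."""
    ranked = sorted(total_cm.items(), key=lambda kv: (-kv[1], kv[0]))
    return [kit for kit, _ in ranked[:n]]


def false_counts(original_top: List[str], degraded_top: List[str]) -> Tuple[int, int]:
    """Return (false_positives, false_negatives) for a degraded match list.

    Assumed definitions (not confirmed by the report):
      false negative = kit in the original top list but not in the degraded one
      false positive = kit in the degraded top list but not in the original one
    """
    original = set(original_top)
    degraded = set(degraded_top)
    false_negatives = len(original - degraded)
    false_positives = len(degraded - original)
    return false_positives, false_negatives


# Hypothetical match results: kit ID -> total cM (illustrative values only).
original_results = {"A1": 3400.0, "B2": 1700.5, "C3": 850.2, "D4": 90.1}
degraded_results = {"A1": 2900.0, "C3": 700.0, "D4": 120.3, "E5": 110.0, "B2": 95.0}

# Use n=3 instead of 30 to keep the example small.
orig_top = top_matches(original_results, n=3)
deg_top = top_matches(degraded_results, n=3)
fp, fn = false_counts(orig_top, deg_top)
print(f"false positives: {fp}, false negatives: {fn}")
```

With the value of n raised to 30, the same comparison could be repeated for each degradation strategy and the four results averaged per degradation level, as was done for figure 21.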