Add like
Add dislike
Add to saved papers

Distinguishing the disease-associated SNPs based on composition frequency analysis.

Single-nucleotide polymorphism (SNP) is a basical variation in genome. When SNPs occur at the binding sites of microRNA, they can influence the binding efficiency, cause a fluctuation of the mRNA in vivo, and thus arouse posttranscriptional level abnormality. Therefore, SNP has a strong correlation with diseases. Although enormous SNPs have been experimentally identified, only a tiny proportion of them are truly disease-associated SNPs (dSNPs) that relate to microRNA modification and then are involved in disease causing process. Therefore, it is important to distinguish dSNPs from the usual SNPs. Analysis here shows that composition is different between sequence segments centered by dSNP and SNP. Inspired by the composition, transition and distribution features which are meaningful and effective in characterizing proteins' sequence information, we improved and applied it to represent the frequency and physicochemical properties of a gene sequence. Binary encoding scheme was also used for further labelling four nucleic acids (A, T, C, and G). First, clustering analysis was performed to gain reasonable negative samples. Then, optimization tests were implemented on different ratios of positive vs negative samples and different feature subsets retrieved by evaluation method of F score. The optimal model constructed by random forest achieves an accuracy of more than 90% on the testing data set. Moreover, the promising results of the external validation also demonstrate the practical applicability of our method. Finally, principal component analysis on the features indicates that all features in our method gain the gross contribution to the prediction model.

Full text links

We have located links that may give you full text access.
Can't access the paper?
Try logging in through your university/institutional subscription. For a smoother one-click institutional access experience, please use our mobile app.

Related Resources

For the best experience, use the Read mobile app

Mobile app image

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app

All material on this website is protected by copyright, Copyright © 1994-2024 by WebMD LLC.
This website also contains material copyrighted by 3rd parties.

By using this service, you agree to our terms of use and privacy policy.

Your Privacy Choices Toggle icon

You can now claim free CME credits for this literature searchClaim now

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app