Empirical evaluation of data normalization methods for molecular classification.

Huei-Chung Huang, Li-Xuan Qin

PeerJ 2018

Background: Data artifacts due to variations in experimental handling are ubiquitous in microarray studies, and they can lead to biased and irreproducible findings. A popular approach to correct for such artifacts is through post hoc data adjustment such as data normalization. Statistical methods for data normalization have been developed and evaluated primarily for the discovery of individual molecular biomarkers. Their performance has rarely been studied for the development of multi-marker molecular classifiers-an increasingly important application of microarrays in the era of personalized medicine.

Methods: In this study, we set out to evaluate the performance of three commonly used methods for data normalization in the context of molecular classification, using extensive simulations based on re-sampling from a unique pair of microRNA microarray datasets for the same set of samples. The data and code for our simulations are freely available as R packages at GitHub.

Results: In the presence of confounding handling effects, all three normalization methods tended to improve the accuracy of the classifier when evaluated in an independent test data. The level of improvement and the relative performance among the normalization methods depended on the relative level of molecular signal, the distributional pattern of handling effects (e.g., location shift vs scale change), and the statistical method used for building the classifier. In addition, cross-validation was associated with biased estimation of classification accuracy in the over-optimistic direction for all three normalization methods.

Conclusion: Normalization may improve the accuracy of molecular classification for data with confounding handling effects; however, it cannot circumvent the over-optimistic findings associated with cross-validation for assessing classification accuracy.

Full text links

We have located links that may give you full text access.

Show additional links to paperHide additional links to paper

PubMed

Add to Saved Papers

Get 1-tap access

Related Resources

Executive Summary: State-of-the-Art Review: Unintended Consequences: Risk of Opportunistic Infections Associated with Long-term Glucocorticoid Therapies in Adults.Daniel B Chastain et al.Clinical Infectious Diseases 2024 April 11

Lung ultrasound for diagnosis and management of ARDS.Marry R Smit, Paul H Mayo, Silvia MongodiIntensive Care Medicine 2024 April 25

Autoimmune Hemolytic Anemias: Classifications, Pathophysiology, Diagnoses and Management.Melika Loriamini et al.International Journal of Molecular Sciences 2024 April 13

Clinical practice guidelines on the management of status epilepticus in adults: A systematic review.Luca Vignatelli et al.Epilepsia 2024 April 13

Embolic strokes of undetermined source: a clinical consensus statement of the ESC Council on Stroke, the European Association of Cardiovascular Imaging and the European Heart Rhythm Association of the ESC.George Ntaios et al.European Heart Journal 2024 April 31

Detecting and managing the patient with chronic kidney disease in primary care: A review of the latest guidelines.Kaitlin J Mayne, Peter Hanlon, Jennifer S LeesDiabetes, Obesity & Metabolism 2024 May 4

For the best experience, use the Read mobile app

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app

All material on this website is protected by copyright, Copyright © 1994-2024 by WebMD LLC.
This website also contains material copyrighted by 3rd parties.

By using this service, you agree to our terms of use and privacy policy.

Your Privacy Choices

You can now claim free CME credits for this literature searchClaim now

Get seemless 1-tap access through your institution/university

For the best experience, use the Read mobile app

Empirical evaluation of data normalization methods for molecular classification.

Full text links

Related Resources

Trending Papers

For the best experience, use the Read mobile app