On Data-Preparation Efficiency Application on Breast Cancer Classification
Résumé
Quantifying the informative state of a dataset for a classification task is an important question. In this paper, we try to circumscribe this notion by introducing some new measures and enunciating some principles about data preparation. We experiment the interest of these measures and the validity of these principles by introducing several protocols designed for comparing different ways to prepare the data. We conclude by relating the efficiency of the data preparation and its theoretical diversity.
Origine : Fichiers produits par l'(les) auteur(s)