How to Avoid Data Leakage When Performing Data Preparation - MachineLearningMastery.com

Source: MachineLearningMastery.com

Data preparation is the process of transforming raw data into a form that is appropriate for modeling. A naive approach to preparing data applies the transform on the entire dataset before evaluating the performance of the model. This results in a problem referred to as data leakage, where knowledge of the hold-out test set leaks […]
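The difference between the naive approach and a leakage-free one can be sketched in a few lines. This is a minimal illustration using only the Python standard library, not code from the article: the helper names `fit_standardizer` and `apply_standardizer`, the toy dataset, and the 67/33 split are all assumptions made for the example. The key point is that the scaling statistics are computed from the training rows only and then reused, unchanged, on the test rows.

```python
import random
from statistics import mean, stdev


def fit_standardizer(rows):
    """Compute per-column mean and standard deviation from the given rows only."""
    columns = list(zip(*rows))
    return [mean(c) for c in columns], [stdev(c) for c in columns]


def apply_standardizer(rows, means, stds):
    """Scale rows using statistics that were computed elsewhere."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


# Hypothetical toy data: 100 rows, 2 numeric features.
rng = random.Random(1)
data = [[rng.gauss(50, 10), rng.gauss(0, 1)] for _ in range(100)]

# Split first, so the test rows are never seen while fitting the transform.
rng.shuffle(data)
train, test = data[:67], data[67:]

# Naive (leaky): statistics come from train and test rows together.
leaky_means, leaky_stds = fit_standardizer(data)

# Leakage-free: statistics come from the training rows only.
means, stds = fit_standardizer(train)
train_scaled = apply_standardizer(train, means, stds)
test_scaled = apply_standardizer(test, means, stds)
```

In the leaky variant, `leaky_means` and `leaky_stds` already encode information about the hold-out rows, so any model evaluated on data scaled with them has indirectly seen the test set. The same split-then-fit ordering applies to other transforms whose parameters are learned from data, such as normalization ranges or imputation values.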