Data leakage are common pain points while ML model are being trained.
This condition is defined as the situation where the ML model is aware of the data or the data characteristics of the test set, at the training phase, which could lead to biased inference.
some of the common cases of data leakage are found in:
to ensure the data does not leak between training and test data set, some of the common points to consider are