Pipeline is a set of multiple functions that is created to avoid repeating the same preprocessing functions on the new data whenever it is added to the original data set by automating them.
Analyzing the correlations among various features in a data set provides more insight and better understanding of the data set.