Author name: Ankur Raj

ColumnTransformer in Scikit-Learn for Data Preprocessing

ColumnTransformer in Scikit-Learn is very robust tool for doing operation on different subsets of features in dataset. In this tutorial we will learn how to use it. Steps: Import necessary libraries Create sample dataset Define transformations Apply ColumnTransformer Import necessary libraries from sklearn.compose import ColumnTransformer from sklearn.preprocessing import StandardScaler, OneHotEncoder from sklearn.impute import SimpleImputer from …

ColumnTransformer in Scikit-Learn for Data Preprocessing Read More »

Data wrangling using Pandas

Data wrangling is process of preparing raw data for analysis. It involves cleaning, structuring and enriching of raw data. It is also known as data preprocessing. Pandas is great tool which provides with all the features required for data preprocessing. The process includes: Data Cleaning Data Transformation Data Integration Data Filtering Data Validation Data Formatting …

Data wrangling using Pandas Read More »

A quick guide to Python’s NumPy random.rand()

NumPy is library of python that performs operations on array. It is a library rich with features used for mathematical computations and further application in machine learning domain. It comes in very handy while generating any random arrays for operations while using machine learning algorithms. NumPy arrays are also used for speed and efficiency in …

A quick guide to Python’s NumPy random.rand() Read More »

Scroll to Top