Pandas

Machine Learning Project: Anemia Detection

Machine Learning Project: Anemia Detection “Detect anemia we must, with the force of machine learning, progress we achieve.”- Not Yoda but close. Introduction Anemia is no laughing matter (although my puns might be). Affecting over 1.6 billion people globally, anemia is a condition where the body doesn’t produce enough red blood cells, leading to fatigue, …

Machine Learning Project: Anemia Detection Read More »

Predicting the Wine Quality using ML in Python

Prediction of wine quality is a great way to practice end-to-end machine learning workflows, including data cleaning, model training, and optimization. Steps involving to solve the question: Step 1: Set Up Your Environment First step is to ensure we have python and necessary libraries installed. Or we can install essential packages using: pip install pandas numpy …

Predicting the Wine Quality using ML in Python Read More »

Company name matching in Python from two csv datasets

This Python script matches company names between two datasets using fuzzy matching with the thefuzz  library. It ensures that company names are standardized and finds the best match from another dataset. Key Features: Loads two CSV datasets containing company names. Standardize the names by converting them to lowercase and removing extra spaces. It uses fuzzy …

Company name matching in Python from two csv datasets Read More »

Creating a movie recommendation system on the user-like basis

Hey fellas! Let us create a movie recommendation system based on what the user likes using Python and Pandas library. We shall process the movie dataset imported online consisting of thousands of movies including the genre, date of release, and other essential features to obtain a set of movies with a score that defines the …

Creating a movie recommendation system on the user-like basis Read More »

Pandas and Seaborn: Exploratory Data Analysis (EDA) in Python

Using Pandas and Seaborn, this lesson explores exploratory data analysis (EDA) in Python, covering fundamental ideas, methods for exploring data, and visualization using real-world examples. It’s intended to improve your capacity for efficient data trend analysis and interpretation. EDA in Python Using Pandas and Seaborn The steps involved are illustrated in the code below. Follow …

Pandas and Seaborn: Exploratory Data Analysis (EDA) in Python Read More »

ARIMA time – series generator

This tutorial explores the ARIMA model for time series forecasting in Python, covering fundamentals, model building, and tuning with practical examples. It’s designed to help you effectively forecast and analyze trends in time series data. ARIMA Model For Time Series The steps which involved are shown below: Step 1: Import Libraries import pandas as pd …

ARIMA time – series generator Read More »

Python-based Minimum Detectable Effect (MDE)

In this tutorial, we explore how to calculate Minimum Detectable Effect (MDE) using Python. This guide introduces you to the concept of MDE in A/B testing, providing a step-by-step approach with code examples, making it easy to determine the smallest change that can be statistically detected in your experiments. Minimum-Detectable-Effect-(MDE)-Using-Python The Approach can be discussed …

Python-based Minimum Detectable Effect (MDE) Read More »

For identifying similarities between strings in two CSV files

In this tutorial, we will learn how to match the strings in large datasets When working with huge datasets, especially in cases where data comes from different sources, it’s common to encounter slightly different versions of the same string. This post will guide you through performing fuzzy matching to identify and link similar strings between …

For identifying similarities between strings in two CSV files Read More »

Scroll to Top