Author name: Pallabi Samaddar

Get official website URL from company name ( even the company name is misspelled)

In this article, we will understand how we can extract all the links from a URL or an HTML document using Python.   FIRST METHOD :   Libraries Required: bs4 (BeautifulSoup): It is a library in python which makes it easy to scrape information from web pages, and helps in extracting the data from HTML and XML …

Get official website URL from company name ( even the company name is misspelled) Read More »

Best string matching algorithms in Machine Learning

INTRODUCTION There are many applicable business cases for string matching. When unique identifier does not exist for data points, there are some unstructured data to make matching possible. In our case, the unstructured data is text. For example, some can match customers, products, company data come from different data sources. Furthermore a bank can match …

Best string matching algorithms in Machine Learning Read More »

Scroll to Top