This is a web scraping packet that reads the contents of a site and extract pdf links from that site using python library scrapy.
Introduction:
This is a Web scraping packet that reads the contents of a website and extracts Book Pdf links with their title and author names.
This packet uses the scrapy library for web scraping.
The Data Extracted by the packet gets saved in a CSV file.
Technologies Used:
Python
Installation guide:
Hardware Requirements:
Processor: Minimum 1 GHz; Recommended 2GHz or more.
Ethernet connection (LAN) OR a wireless adapter (Wi-Fi)
Hard Drive: Minimum 32 GB; Recommended 64 GB or more.
Memory (RAM): Minimum 1 GB; Recommended 4 GB or above.
Software Requirements:
Python 3
Text Editor
Libraries Requirements:
Scrapy
Pandas
Submitted by MANISH BASWAL (manishbaswal6)
Download packets of source code on Coders Packet
Comments