How to Extract Monkeypox Data By Using Web Scraping & Pandas?
When scraping web data, you will often want to extract tabular data from websites. The usual approach is to write a web scraper using Python tools such as Selenium, Scrapy, or BeautifulSoup.
However, there is an easier way to get tabular data from web pages using pandas, and it takes about a minute and five lines of Python code.
In this example, we work with the Monkeypox data available here. This website happens to have only one table, which holds the Monkeypox infection data, but the technique works regardless of how many tables a page has.
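Phase 1 - Installing Pandas
If you haven't installed pandas yet, install it by running the following command in the terminal.
pip install pandas
Note that read_html() also needs an HTML parser. By default pandas looks for lxml first, which can be installed the same way (pip install lxml) if it is missing.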
Phase 2 - Start Scraping Monkeypox Data
The code given here extracts the table from the page into a CSV file. The explanation that follows describes how it works.
import pandas as pd
url = 'https://www.Monkeypox.global.health/'
df_list = pd.read_html(url)
Monkeypox = df_list[0]
Monkeypox.to_csv('Monkeypox.csv')
In the first line, we import the pandas library. Next, we tell the scraper that the table we want to extract is at the URL https://www.Monkeypox.global.health/.
The line after that is the most important one. We ask pandas to use the read_html() function to fetch the tables on the web page. read_html() returns a list of data frames, one for each table it finds on the page.
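To see what that list looks like without touching the network, here is a minimal sketch that runs read_html() on a small inline HTML snippet. The snippet, its figures, and the helper name tables_in are made up for illustration, and an HTML parser such as lxml is assumed to be installed.

```python
import io

import pandas as pd

# Made-up HTML with two tables; the numbers are placeholders, not real data.
SAMPLE_HTML = """
<table>
  <tr><th>Country</th><th>Confirmed</th></tr>
  <tr><td>Country A</td><td>10</td></tr>
  <tr><td>Country B</td><td>20</td></tr>
</table>
<table>
  <tr><th>Region</th><th>Deaths</th></tr>
  <tr><td>North</td><td>1</td></tr>
</table>
"""


def tables_in(html: str) -> list:
    """Return one DataFrame per <table> element found in the HTML text."""
    # Wrapping the text in StringIO lets read_html treat it like a file.
    return pd.read_html(io.StringIO(html))
```

Calling tables_in(SAMPLE_HTML) should give a list of two data frames: element [0] holds the Country/Confirmed table and element [1] the Region/Deaths table.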
Since there is only one table here, the first element of the list holds the Monkeypox data. We retrieve it by its index:
Monkeypox = df_list[0]
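On pages with several tables, a positional index can be brittle if the page layout changes. As one alternative (not what the code above does), read_html() accepts a match argument that keeps only the tables whose text matches a given string or regular expression. The sketch below uses made-up inline HTML and a hypothetical helper, first_table_matching, and again assumes an HTML parser such as lxml is installed.

```python
import io

import pandas as pd

# Made-up HTML with two tables; the numbers are placeholders, not real data.
SAMPLE_HTML = """
<table>
  <tr><th>Country</th><th>Confirmed</th></tr>
  <tr><td>Country A</td><td>10</td></tr>
</table>
<table>
  <tr><th>Region</th><th>Deaths</th></tr>
  <tr><td>North</td><td>1</td></tr>
</table>
"""


def first_table_matching(html: str, text: str) -> pd.DataFrame:
    """Return the first table whose contents match the given text."""
    # match filters the result down to tables containing the text.
    tables = pd.read_html(io.StringIO(html), match=text)
    return tables[0]
```

For example, first_table_matching(SAMPLE_HTML, "Deaths") should return the Region/Deaths table even though it is the second table in the snippet.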
The last step is to write the data to a CSV file. We use the to_csv() function to save the Monkeypox data frame as CSV:
Monkeypox.to_csv('Monkeypox.csv')
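One detail worth knowing: by default, to_csv() also writes the data frame's row index as an unnamed first column. The sketch below uses a small made-up data frame (placeholder values, not real figures) to compare the default output with index=False, which leaves the index out.

```python
import pandas as pd

# Made-up placeholder data standing in for the scraped table.
df = pd.DataFrame({"Country": ["Country A", "Country B"], "Confirmed": [10, 20]})

# With no file path, to_csv() returns the CSV text instead of writing a file.
with_index = df.to_csv()
without_index = df.to_csv(index=False)

print(with_index.splitlines()[0])     # header starts with an empty index column
print(without_index.splitlines()[0])  # header has only the real columns
```

If the extra column is not wanted in the output file, passing index=False to the to_csv() call is one way to drop it.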
And that's it! That is how we scrape tabular data from a web page into a CSV file with only five lines of code, in under a minute.
In this blog, we've discussed how to use Python and pandas to scrape Monkeypox data from a Global Health site. We've shown you how to use web scraping, which tools to use, and how to write the code that scrapes the right data.
Web scraping is an excellent way to save money and time in business by automating jobs that would otherwise take hours or days to complete manually. With Actowiz Solutions' expertise in large-scale data scraping, we can help you find and run the best web scraping services for your requirements.
Contact Actowiz Solutions for all your web data scraping requirements today!
Learn more: https://www.actowizsolutions.com/how-to-extract-monkeypox-data-by-using-web-scraping-pandas.php
Written by
Actowiz Solutions
Actowiz is a fully managed enterprise-grade web scraping service. We convert websites into structured data. We do everything for our customers when it comes to data extraction: setting up scrapers, running them, cleaning the data, checking the data quality, and making sure the data is delivered on time. We make significant investments in automation, scalability and process efficiency that allow us to provide an exceptional service at no additional cost to our customers. Our clients get a better-quality, dependable service at pricing comparable to all other options.