How to Web Scrape an HTML Table using Pandas Library in Python

Опубликовано: 12 Ноябрь 2024
на канале: CODE MENTAL
1,425
12

#pandas #python #datascience #datanalysis #scrape

It is possible to use the Pandas library to web scrape directly an HTML table from a URL. This can be incredibly useful and a quick way to extract useful data, when you are in a hurry.

Article:

https://spltech.co.uk/how-to-create-a...

Github:

https://github.com/armindocachada/wor...

FOLLOW ME

Instagram:   / ​  
Blog: https://spltech.co.uk/blog/​

SUPPORT THE CHANNEL or get MENTORING

  / codemental  

Timestamps

0:00 Web Scraping an HTML Table from Wikipedia about all the Nvidia GPUs
1:55 Understanding the pandas.read_html function parameters: io, match and flavour
2:58 Counting number of tables parsed by panda.read_html
4:10 pandas.read_html(…, match=)
5:30 Checking the Pandas data frame schema with df.info()
5:55 Removing the first level of columns from a multi-level column data frame
6:30 Data cleanup for a column containing a date using df.str.replace
8:02 Converting column from object to date time
9:00 Plotting a very simple scatter plot with Pandas
10:00 Read HTML table from an HTML String copied directly from the browser