Web scraping

From Publication Station

Revision as of 08:37, 2 September 2022

Web scraping extracts data such as text and images from websites. In this example we will scrape data from the Gutenberg website.

The purpose of web scraping is to transform web content into usable data for other programs or for analysis. In this case we will transform the following web page:

[Screenshot: Alice Wonderland Gutenberg.png]

We will use a browser extension called WebScraper.io. You can install the extension for Firefox (https://addons.mozilla.org/en-US/firefox/addon/web-scraper/) or for Chrome.
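
To make "transforming web content into usable data" concrete, here is a minimal Python sketch that is independent of the WebScraper.io extension and is not how the extension itself works. It uses only the standard library's `html.parser`. The sample markup, the `HeadingScraper` class and the `scrape_headings` function are hypothetical names made up for illustration, and the HTML is an assumed stand-in rather than the real structure of the Gutenberg page.

```python
from html.parser import HTMLParser

# Hypothetical sample markup for illustration only;
# this is NOT the real structure of the Gutenberg page.
SAMPLE_HTML = """
<html><body>
  <h1>Alice's Adventures in Wonderland</h1>
  <h2>CHAPTER I. Down the Rabbit-Hole</h2>
  <p>Alice was beginning to get very tired of sitting by her sister.</p>
  <h2>CHAPTER II. The Pool of Tears</h2>
  <p>Curiouser and curiouser!</p>
</body></html>
"""


class HeadingScraper(HTMLParser):
    """Collects the text of every <h1> and <h2> element as a row of data."""

    def __init__(self):
        super().__init__()
        self._current = None  # heading tag we are currently inside, if any
        self._buffer = []     # text fragments seen inside that heading
        self.headings = []    # result rows: {"level": ..., "text": ...}

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2"):
            self._current = tag
            self._buffer = []

    def handle_data(self, data):
        # Only keep text that appears inside a heading.
        if self._current is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._current:
            text = "".join(self._buffer).strip()
            self.headings.append({"level": tag, "text": text})
            self._current = None


def scrape_headings(html):
    """Turn an HTML string into a list of heading rows."""
    parser = HeadingScraper()
    parser.feed(html)
    parser.close()
    return parser.headings
```

Calling `scrape_headings(SAMPLE_HTML)` returns a list of dictionaries, one per heading, which could then be written to a CSV file or passed to another program. The browser extension performs a comparable selection step through its point-and-click interface instead of code.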