Training: Web Scraping News Sites with Python

* Please ensure to bring your own laptop / device to this workshop *

There is more information on the Internet than any individual can absorb in a lifetime. What is needed is not merely access to that information, but a scalable way to collect, organize, and analyse it. Web scraping is a technique to automatically access and extract large amounts of information from a website.

In this workshop we will learn how to use Python to create a program to automatically retrieve and process information from a range of news sites and save this data in a structured format. This approach opens up a world of possibilities in data mining, data analysis, statistical analysis, and much more.

 

Digital Scholarship Centre

Digital Scholarship Centre, 6th floor

Main Library 

University of Edinburgh 

Edinburgh EH8 9LJ

You might be interested in

A collage image of historical material

Beyond Social Networks: Advanced Uses of Gephi in Humanities Research

An illustrative collage with & symbol and old graphs

Getting Started with Regression in R

A collage image of historical material

Digital Method of the Month: Text Analysis

UoE archive image with title of the training event

Foundations of Machine Learning

An illustrative collage with & symbol and a historical item

Getting Started with Bayesian Statistics

An illustrative collage with & symbol and an old photograph

Building Personal and Project Websites

An illustrative collage with & symbol and some patterns in squares

Modelling Unstructured Data with Bert

A collage image of historical map and images

Processing Geographical Data in QGIS

Thumbnail with title of the training

Comparing Sentiment Analysis Models in R

An illustrative collage with & symbol and a maths graph

Linear Mixed Effects Modelling

Promotional graphic for a workshop titled ‘Getting Started with Python for Research.’ The background is a black-and-white photograph of people screen-printing in a studio. Overlaid is a large teal ampersand featuring an illustration of Ada Lovelace. The logo of the Centre for Data, Culture & Society (DCS) appears in the top right corner.

Getting Started with Python for Research