Go Digital History!

Aarhus University

Wednesday to Friday 22 - 24 May 2024

9:00 - 16:00 CEST

Instructors: Adela Sobotkova, Helle Strandgaard Jensen

Helpers: Stephan Smuts, Petra Hermankova

General Information

The Carpentries project comprises the Software Carpentry, Data Carpentry, and Library Carpentry communities of Instructors, Trainers, Maintainers, helpers, and supporters who share a mission to teach foundational computational and data science skills to researchers.

Want to learn more and stay engaged with The Carpentries? Carpentries Clippings is The Carpentries' biweekly newsletter, where we share community news, community job postings, and more. Sign up to receive future editions and read our full archive: https://carpentries.org/newsletter/

Data Carpentry develops and teaches workshops on the fundamental data skills needed to conduct research. Its target audience is researchers who have little to no prior computational experience, and its lessons are domain specific, building on learners' existing knowledge to enable them to quickly apply skills learned to their own research. Participants will be encouraged to help one another and to apply what they have learned to their own research problems.

For more information on what we teach and why, please see our paper "Good Enough Practices for Scientific Computing".

Who: The course is aimed at graduate students and other researchers. You don't need to have any previous knowledge of the tools that will be presented at the workshop.

Where: Building 1465, room 315, Jens Skous Vej 5, 8000 Aarhus. Get directions with OpenStreetMap or Google Maps.

When: Wednesday to Friday 22 - 24 May 2024. Add to your Google Calendar.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) on which they have administrative privileges. They should have a few specific software packages installed (listed below).

Accessibility: We are committed to making this workshop accessible to everybody. Materials will be provided in advance of the workshop, and large-print handouts are available on request if you notify the organizers in advance. If we can make learning easier for you in any other way (e.g. sign-language interpreters, lactation facilities), please get in touch using the contact details below and we will try to arrange it.

Contact: Please email adela@cas.au.dk for more information.

Roles: To learn more about the roles at the workshop (who will be doing what), refer to our Workshop FAQ.

Code of Conduct

Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.


Surveys

Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey


Schedule

Day 1

9:15 - 9:45 Introductions, pre-workshop survey
9:45 - 10:30 Regular Expressions
10:30 - 10:45 Coffee break
10:45 - 12:30 Practising Regular Expressions
12:30 - 13:00 Lunch
13:00 - 14:00 Data Organization in Spreadsheets
14:00 - 14:15 Coffee break
14:15 - 15:00 OpenRefine for Data Cleaning
15:00 - 16:00 Streamlining real data: Aarhus Occupations 1801

Day 2

9:15 - 10:30 Introduction to R
10:30 - 10:45 Coffee break
10:45 - 12:30 Starting with data
12:30 - 13:00 Lunch
13:00 - 14:00 Data wrangling in tidyverse
14:00 - 14:15 Coffee break
14:15 - 16:00 Continue data wrangling and visualise data with ggplot

Day 3

9:15 - 10:00 Introduction to different types of analysis for the following group work:
10:00 - 12:30 Text mining with tidytext in R
10:00 - 12:30 Sentiment Analysis in R
10:00 - 12:30 Interactive maps in R
10:00 - 12:30 Learn about web scraping
10:00 - 16:00 Own project elaboration
12:30 - 13:00 Lunch
13:00 - 14:00 Introduction to APIs
14:00 - 15:50 From API to OpenRefine
14:00 - 15:50 Own project elaboration
15:50 - 16:00 Class wrap-up and post-workshop evaluation


Setup

To participate in a Data Carpentry workshop, you will need access to software as described below. In addition, you will need an up-to-date web browser.

We maintain a list of common installation issues on the Configuration Problems and Solutions wiki page. It is written as a reference for instructors, but may also be useful if you run into problems during setup.

The setup instructions for the Data Carpentry Social Sciences workshops (with R) can be found at the workshop overview site.