Find us on GitHub

A Data Carpentry Workshop

North-West University

1 - 3 August, 2017

Aug 1 from 13:00 16:00, Aug 2-3 from 08:30 - 16:15

Instructors: Angelique van Rensburg, Martin Dreyer, Caroline Ajilogba, Riaan van der Walt, Juan Steyn

Helpers: Zine Sapula

General Information

Thank your for your interest in Digital Humanities focussed Data Carpentry workshop.

Researchers, students, and librarians are increasingly starting to engage in projects with more complex computing components to harness the power of larger and often open datasets and technology. Those who are interested in pursuing Digital Humanities projects often find themselves in unknown territory without a clear idea of a good starting point for acquiring new skills or finding potential collaborators. One of the aims of the South African Center for Digital Language Resources (SADiLaR) as well as the Digital Humanities Organisation of Southern Africa (DHASA) is to build local communities to both facilitate collaboration and skills transfer.


Workshop Aims

This workshop aims to provide a broad introduction to the following concepts and tools

  • Introduction to Digital Humanities
  • Introducing existing training opportunities and available resources
  • Research Data Management
  • Data formatting, cleaning, and manipulation
  • The use of existing data repositories to augment research
  • Introduction to R for analysing data
  • Community building

The workshop is jointly funded by the South African Center for Digital Language Resources as well as the NWU-Digital Humanities and aims to create awareness about Digital Humanities

The workshop is organised by the Digital Humanities Organisation of Southern African in collaboration with NWU-Digital Humanities and we-Deliver


Registration and other information

Please register for the workshop through this online form: https://goo.gl/SMjRcd by 27 July 2017 . Space is limited, if you have any questions please contact Juan.Steyn@nwu.ac.za.


Data Carpentry workshops are for any graduate student or researcher who has data they want to analyze, and no prior computational experience is required. This hands-on workshop teaches basic concepts, skills and tools for working more effectively with data.

We will cover , Digital Humanities - How does it apply to me?, Organising and cleaning my research data and Introduction to in R. Participants should bring their laptops and plan to participate actively. By the end of the workshop learners should be able to more effectively manage and analyze data and be able to apply the tools and approaches directly to their ongoing research.

Who: The course is aimed at graduate students and other researchers.

Where: Building F13, 214, Potchefstroom Campus, 11 Hoffman Street, Potchefstroom, South Africa. Get directions with Google Maps.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating sytem (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below). They are also required to abide by Data Carpentry's Code of Conduct.

Contact: Please mail juan.steyn@nwu.ac.za for more information and if you have any trouble registering for the workshop.


Preliminary Schedule

Surveys

Please be sure to complete these surveys before and after the workshop.

Pre-workshop Survey

Post-workshop Survey

Day 1

1 August 2017

Afternoon Digital Humanities - How does it apply to me?

Day 2

2 August 2017

Morning Organising and cleaning my research data
Afternoon Introduction to in R

Day 3

2 August 2017

Morning Data analysis and visualization in R
Afternoon Data analysis and visualization in R

Meals:

Lunch will be catered for.

Tea/Coffee will be provided according to schedule.

Etherpad: http://pad.software-carpentry.org/2017-08-01-potch.
We will use this Etherpad for chatting, taking notes, and sharing URLs and bits of code.


Detail Programme

Day 1 - 1 August 2017

13:15 Digital Humanities - How does it apply to me?
14:00 Existing resources and Future opportunities
14:30 Coffee, Tea and Snacks
15:00 Research Data Management and Workflows
15:30 Wrap-up and Software Installation for day 2 and 3
16:00 Finish

Day 2 - 2 August 2017

08:00 Coffee, Tea and Software Installations
08:30 Organising and Preparing my research data
10:00 Coffee and Tea Break
10:20 Exploring and Augmenting research data
12:00 Lunch
13:00 Exploring and Augmenting research data
14:30 Coffee and Tea Break
14:50 Part 1: Introduction to R
16:15 Wrap-up

Day 3 - 3 August 2017

08:00 Coffee, Tea
08:30 Part 2: Starting with Data in R
10:00 Coffee and Tea Break
10:20 Part 3: Manipulating Data in R
12:00 Lunch
13:00 Part 4: Visualising Data in R
14:30 Coffee and Tea Break
14:50 Part 5: Visualising Data in R
16:15 Wrap-up

Syllabus

Research Data Management and Workflows

  • Changing Research Landscape
  • FAIR - Data Principles
  • 101 - innovations
  • My own workflow

Using spreadsheet programs for scientific data

  • Formatting data & problems
  • Dates as data
  • Quality control
  • Exporting data
  • Data format caveats
  • Cleaning data with OpenRefine

OpenRefine

OpenRefine (previously Google Refine) is a tool for data cleaning that runs through a web browser, and any browser - Safari, Firefox, Chrome, Explorer - should work fine. You will need to download OpenRefine and install it, and when you open it, it will run through the browser, but you don't need an internet connection, and the data will all be stored on your computer.

R for data analysis and visualization

  • Introduction to R
  • Starting with data
  • Aggregating and analyzing data with dplyr
  • Data visualization with ggplot2
  • R and Databases

Setup

To participate in a Data Carpentry based workshop, you will need working copies of the described software. Please make sure to install everything (or at least to download the installers) before the start of your workshop. Participants should bring and use their own laptops to insure the proper setup of tools for an efficient workflow once you leave the workshop.

Please follow these Setup Instructions.

We maintain a list of common issues that occur during installation as a reference for instructors that may be useful on the Configuration Problems and Solutions wiki page.


SADiLaR