This lesson is still being designed and assembled (Pre-Alpha version)

Research Data Management

Overview

Teaching: 20 min
Exercises: 75 min
Questions
  • What is Research Data Management?

  • Why should a researcher care about data management?

Objectives
  • Discuss research data

  • Learn about the research data lifecycle

  • Discover anti-sharing patterns

Research Data

What is research data? What examples can you think of?

Some reading here: https://ardc.edu.au/wp-content/uploads/2020/01/What-is-research-data.pdf

Research Data lifecycle

Research data lifecycle diagrams give an overview of the stages and processes required for the successful management of data. By showing the different phases a dataset goes through, the data lifecycle concept describes each action as a research project moves from begin an idea to making discoveries and then sharing those discoveries.

This is an example of a research data lifecycle diagram. Is there something surprising in there for you? Or something missing? Where are you at in this cycle right now?

Research Data Lifecycle

Image credit: UC Santa Cruz University Library

Research Data Management (aka RDM)

Research data management includes a range of activities, all related to how to maintain sound practices around research data collection, organisation and sharing.

More info here: https://staff.mq.edu.au/research/strategy-priorities-and-initiatives/data-science-and-eresearch/research-data-toolkit

What could possibly go wrong? Why RDM?

We’ve all no doubt heard of some data disasters by now - people losing data in different ways, data being shared inappropriately.

Let’s tart with a chat about the kinds of things that happen when researchers wing it.

Challenge - A video on data sharing (30 minutes)

  1. In breakout rooms, watch this video in small groups: Data Sharing and Management Snafu in 3 Short Acts
  2. Discuss: Have you run into any of these scenarios? What happened? How should this have gone? Write your room’s conclusions in the etherpad.
  3. Share: When we get back together, each room should share some of its worst horror stories (with the names changed to protect the innocent).

But rather than being motivated by fear, there are lots of good reasons to be your own best data manager - the one most likely to benefit (and suffer) is you!

Data Management Policies and Plans

Best laid plans

Research data is important! Please take care of it.

Challenge - Responsibilities (5 minutes)

Which of these does NOT count as active research data? Put a +1 in the shared document next to which one you think is right!

  1. A database
  2. A research publication
  3. Field notes
  4. Audio and video tapes

Solution

A research publication. These are taken care of by publishers. Even when held within an institution, either on open access or for research reporting purposes, these tend to be managed separately from other research data.

Universities around the world have developed a range of policies and procedures outlining their expectations around research data management. Let’s have a look at some university’s RDM statements.

Practice makes perfect. :smile:

Challenge - RDM, status quo (30 minutes)

As a group, choose (at least) one project that one person in your breakout room is running.

Try to answer the following questions in the shared document:

  1. What backup requirements does your project have? What risks are they protecting against? How confident are you in the backups?
  2. What data publishing requirements do the funders or universities involved in the project have?
  3. Follow the “responsbilities of institutions and responsibilities of researchers sections” in the NHMRC’s requirements. How well does the project stack up?

Write your answers in the shared document as a team.

OK. Let’s say that you have suddenly become responsible for an entire team’s research data. If anything happens, it’s on you. What can you do?

Challenge - Arguments for Collaboration (15 minutes)

Take a quick read of Dropbox sharing article.

In the shared document, in breakout rooms, make a single paragraph argument for a team to switch to cloud for collaboration. Make sure you address why which cloud provider they choose, matters.

As everyone: discuss which paragraphs are most persuasive?

Key Points

  • Research Data Management is the planning and application of a plan to manage the full data lifecycle, from creation to collection and analysis and then to publication and archiving.

  • A good data management plan will result in more consistent data, less time spent cleaning data, and reusable data that can generate better impact.