Skip to content

Learning Unit 1: Reproducibility and Replicability in Scientific Research

This unit focuses on reproducibility and replicability in scientific research. It is the first unit in the learning path title "Technical skills are the bridge to reproducible research: An introduction for data librarians". In this unit we discuss what reproducibility is, the barriers there are to reproducibility and the benefits of reproducibility.

Learning Objectives

After completing this unit, you will be able to:

  • Define reproducibility and replicability in scientific research
  • Explain how reproducibility and replicability are related but also different
  • Describe benefits of reproducibility to researchers
  • Describe common barriers to reproducibility

Target Audience

  • Data Librarians and other data curious

Duration

  • 60 mins

Prerequisites

  • None

Learning Tools

  • Training BBB room
  • Browser

The content is designed to be taught online, therefore a virtual teaching platform with breakout rooms is used to facilitate the content and activities.

Reproducibility and Replicability in Scientific Research

The unit discusses the difference between an original research study and a replication study. Both play an important role in the scientific process. But how do we go from science to knowledge? Or from a single scientific result to reliable knowledge and understanding of some phenomenon or topic?

Step one in the scientific method is to formulate a research question or hypothesis.

Step two is to design a research study to answer that question or test that hypothesis, using appropriate research methods. For instance, we may want to conduct an experiment, which means manipulating an independent variable and measuring a dependent variable to determine cause-and-effect. Once we decide on the appropriate study design, step three is data collection and step four is data analysis.

In step five, after we analyze all the data, we get our results – we get the answer to our research question or the test of our hypothesis.

However, one research study and one result can be very exciting and can give us a hint of how something works, but it is not reliable enough. We could get lucky and get a great result purely by chance. Or the phenomenon we study could be very complex and the data could be messy. There are lots of reasons why a single research result is only the start of the scientific investigation, not the end of it.

This unit introduces the learner to what reproducibility is and how it benefits science. In conclusion, the unit addresses the barriers to reproducibility, which are described as:

  1. Cultural Barriers

    • Researchers are rewarded for producing high-impact, novel results, not for making their work reproducible.
    • The ‘publish or perish’ culture pressures them to prioritize speed over careful validation.
    • Success is often measured by quantity and impact, not quality or reproducibility.
  2. Publishing Barriers

    • Journals favor new findings over replication studies.
    • Many journals don’t require authors to share data, code, or methods, and peer reviewers don’t often have the time or expertise to verify reproducibility.
  3. Technical Barriers

    • (Lack of Standardization) Different fields often use their own methods, tools, and protocols, which makes it harder to reproduce work across studies.
    • (Data and Software Availability) Outdated or unavailable software can make it impossible to reproduce results.
    • (Lack of Infrastructure) Some fields don’t have reliable platforms to store and share large datasets, and reproducing complex simulations can require resources that not everyone has.
    • (Collaboration and Data Integration Challenges) Interdisciplinary work can face unique problems, like differences in data formats, methods, or terminology. Large datasets also come with logistical and technical challenges.

The content of the unit is described in more detail in this document where each slide used in the training is enriched with speaker notes: Unit1_InstructorNotes

Follow the link to find activities. Activities 1 and 2 are used in this unit. Activities

Summary

Reproducibility means getting the same results with the same data, while replicability means getting the same results with new data.

Reproducibility is the first step toward replicability, and replicating results is how we build scientific knowledge.

Reproducibility benefits researchers directly, not just the broader scientific community. It takes time and effort, but that investment pays off in the long run!

Suggested Reading