Snakemake Hackathon

Materials for Snakemake hackathon organized on 22.05.24. Licensed under CC-BY-4.0 license.

Logos of all organizers and partners: Geoportti, CSC, CodeRefinery, ENCCS, NAISS

Prerequisites

About the course

Snakemake is a common entry tool in the world of computational workflows, especially in Bioinformatics.

The tool is designed to support you in setting up a workflow system, which opposed to a “your favorite language here” script, helps you keep track of your processes.

For Snakemake, the user has to define inputs and outputs and the tool will figure out the order of steps and which steps can be run in parallel.

This Hackathon is open for all, free of charge and independent of field of Science and it is all about getting your own workflows to the supercomputer (Puhti, Mahti, Lumi or other).

Content

We will offer short introduction to snakemake and a supercomputing concepts refresher session in the beginning, which you can attend if needed.

Then we will talk about opportunities and challenges with moving workflows from laptop to the supercomputer and give tips on how to go about it.

The afternoon is reserved for your own work. You can either work on your own “Snakefile”, moving your workflow from laptop to supercomputer or work on our toy example, to help understand the concepts better before moving to your own work.

Learning outcomes

After this course, participants will be able to…

  • Understand what is snakemake

  • Decide if using Snakemake fits their workflow

  • Implement own workflow with Snakemake

  • Move own workflow to the supercomputer

See also