Skip to content

Common Workflow Language for Bioinformatics

Summary

Common Workflow Language (or CWL), is a growing language for defining workflows in a cross-platform and cross-domain manner. In biology in particular, we need workflows to automate complex analyses such as DNA variant calling, RNA sequencing, and genome assembly. CWL provides a simple and well-defined format for automating these analysis by specifying their stages and connections using readable CWL documents.

CWL makes use of a number of existing standards, including support for cluster computing using SLURM or PBS, containerisation using Docker, and deployment using common packaging formats. In addition, the CWL ecosystem has grown to include workflow visualisation tools, graphical workflow editors, libraries for interacting with CWL programatically and tools that convert to and from CWL and other workflow formats.

Outcomes

At the end of the course, you will be able to:

  • Find and use CWL tool definitions online
  • Use the Rabix Composer, a graphical editor for CWL
  • Understand how to write CWL tool definitions for command line tools
  • Use Docker with CWL to provide software dependencies and ensure reproducibility
  • Join CWL tools into a workflow
  • Read and write CWL files written in YAML
  • Understand advanced CWL features like secondary files, parameter references and subworkflows
  • Run CWL workflows on local and HPC systems

Requirements

General

  • This workshop is aimed at anyone with basic Unix command-line experience
  • Attendees are required to bring their own laptop computers

Software

All of this software is free, and should run on any operating system (Mac, Windows, or Linux):

Slides

Workshop Slides (use the arrow keys to navigate)