Snakemake training: Building data pipelines in Python
Develop training materials to build reproducible, reusable, and efficient workflows in Python using Snakemake
We will develop training on using the Snakemake workflow management system to build data workflows. Snakemake organizes common data tasks (for example, downloading data, converting it into useful forms, analyzing it, and making plots and figures) into a series of connected steps. It tracks which steps are complete and makes it easy to run every step needed to obtain a result, running independent steps simultaneously when possible to take advantage of parallel computing resources. Snakemake syntax is based on the Python programming language, so USGS Python users can learn it quickly. Snakemake helps ensure that data workflows are reproducible, readable, and reusable. Our training will introduce data pipelines and their benefits, provide a hands-on tutorial on building Snakemake pipelines, and offer tips and best practices. We will lead training sessions, and our materials will be available to anyone at USGS, either with an instructor or as a self-guided course.
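To illustrate the idea of tracking which steps are complete, here is a minimal plain-Python sketch. It is not Snakemake syntax and not how Snakemake is implemented; the `Step` class, the helper functions, the file names, and the sample data are all hypothetical and chosen only for illustration. Each step declares its input and output files, and a step is rerun only when its output is missing, an input is newer than the output, or an upstream step was just rerun.

```python
import tempfile
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class Step:
    """One pipeline step: a name, its input files, its output file, and an action."""
    name: str
    inputs: List[Path]
    output: Path
    action: Callable[[List[Path], Path], None]


def needs_run(step, refreshed):
    """A step is stale if its output is missing, an input was just rebuilt,
    or an input file is newer than the output file."""
    if not step.output.exists():
        return True
    if any(p in refreshed for p in step.inputs):
        return True
    out_time = step.output.stat().st_mtime_ns
    return any(p.stat().st_mtime_ns > out_time for p in step.inputs)


def run_pipeline(steps):
    """Run stale steps in the order given; return the names of steps that ran."""
    ran = []
    refreshed = set()  # outputs rebuilt during this invocation
    for step in steps:
        if needs_run(step, refreshed):
            step.action(step.inputs, step.output)
            refreshed.add(step.output)
            ran.append(step.name)
    return ran


def download(inputs, output):
    # Stand-in for a real download: writes made-up sample values.
    output.write_text(" 1 \n\n2\n 3\n")


def convert(inputs, output):
    # Strip whitespace and drop blank lines.
    lines = [ln.strip() for ln in inputs[0].read_text().splitlines() if ln.strip()]
    output.write_text("\n".join(lines) + "\n")


def summarize(inputs, output):
    # Sum the cleaned values.
    total = sum(int(ln) for ln in inputs[0].read_text().splitlines())
    output.write_text(f"total={total}\n")


def build_steps(workdir):
    """Assemble the three hypothetical steps, listed in dependency order."""
    raw = workdir / "raw.txt"
    clean = workdir / "clean.txt"
    summary = workdir / "summary.txt"
    return [
        Step("download", [], raw, download),
        Step("convert", [raw], clean, convert),
        Step("summarize", [clean], summary, summarize),
    ]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        steps = build_steps(Path(tmp))
        print(run_pipeline(steps))  # first run: every step executes
        print(run_pipeline(steps))  # second run: nothing is stale
```

In this sketch the steps must be listed in dependency order by hand. A workflow manager such as Snakemake works out that order from the declared inputs and outputs, and can also run independent steps in parallel.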