CASSIE USER MANUAL

Cloud-Native Genomics Workflow Platform

This page is the web version of the original CASSIE user manual. It covers the full user workflow, from account creation and storage uploads to pipeline publishing and troubleshooting.

Version 1.0
Format Web Manual
Assets Integrated Screenshots
Section 1

Introduction

CASSIE is a cloud-native platform for genomics workflows. It allows researchers to upload genomic data, configure analyses through guided UI selections, translate user-defined settings into reproducible Nextflow workflows, execute jobs on scalable Kubernetes infrastructure, and share successful workflows with the community without manually managing servers or containers.

Section 2

Getting Started

2.1 Registering an Account

  1. Open CASSIE in your web browser.
  2. Click "Register" or "Sign Up" on the landing page.
  3. Fill in your name, email address, and choose a password.
  4. Check your inbox for a verification email and click the confirmation link.
  5. After verification, you will be redirected to the login page.
CASSIE registration page
Registering a new account in CASSIE.

2.2 Logging In

  1. Go to the CASSIE login page.
  2. Enter your registered email and password.
  3. Click "Log In" to open the Home dashboard.
  4. If you forget your password, use the "Forgot Password" link on the login page to request a reset email.
CASSIE login page
Logging in to the platform.

2.3 Navigating the Interface

The main navigation bar runs across the top of the application. Depending on whether you are signed in, you will typically see the following links and controls:

  • Home: Your dashboard and main starting point.
  • Jobs: History of all jobs you have run.
  • Pipelines: Manage your saved workflows and starter templates.
  • Storage: Manage uploaded inputs and generated outputs after signing in.
  • Community: Browse shared pipelines published by other users.
  • Forum: Read and create discussion threads.
  • Balance: Review available and reserved funds for job execution.
  • Account Menu: Open your profile and account actions.

The Home page also provides shortcut cards for viewing jobs, opening pipelines, creating a job, and launching the pipeline builder.

CASSIE interface overview
Main CASSIE interface and navigation.
Section 3

Storage

3.1 Uploading Files

  1. Click "Storage" in the navigation menu.
  2. Select the Input Files tab if it is not already active.
  3. Click "Upload from Computer" to choose local files, or use the Google Drive import option if it is configured.
  4. Select one or more files from your computer.
  5. Wait for the upload queue to finish and for the files to appear in the storage library.
  6. Confirm that the uploaded file appears in your input file list.
Uploading files in CASSIE storage
Uploading genomic files into Storage.

3.2 Supported File Formats

CASSIE recognizes the following common genomic and workflow-related file formats:

  • FASTQ (.fastq, .fq, .fastq.gz): Short-read sequencing data.
  • FASTA (.fasta, .fa, .fna): Reference sequences.
  • GFF (.gff, .gff3): Genome annotations.
  • Additional supported formats include .gtf, .hal, .gfa, .csv, .tsv, .json, and .txt, including supported compressed variants.

If CASSIE does not automatically detect a file format, you can set it manually using the override format option next to the file.

Section 4

Running a Job

4.1 Creating a One-off Job

  1. Open the Create Job flow from the Home page shortcut or from a pipeline action such as "Use in Job".
  2. Select the desired bioinformatics tool from the tool dropdown.
  3. Under input files, add the file or files required by the selected tool.
  4. Review the estimated runtime and cost shown on the page.
  5. Click "Submit Job" to queue the job for execution.

4.2 Configuring Tool Parameters

Most tools expose optional parameters below the file picker. Common examples include genome size, coverage depth, or output format flags. Hover over parameter labels to view short descriptions. Leave fields blank to use the default tool settings.

Tool parameter configuration in CASSIE
Configuring optional tool parameters.

4.3 Monitoring Job Progress

  1. Open the "Jobs" page from the navigation menu.
  2. Click a job row to open its detail view.
  3. Review the current status, the stage-by-stage progress indicator, and the live log output.
  4. If a job fails, review the error summary and use the retry option if available.
Monitoring job progress in CASSIE
Tracking status and logs for a running job.

4.4 Downloading Results

  1. Once the job reaches Completed status, scroll to the outputs section.
  2. Download files one by one or use the "Download All" option to retrieve everything as a ZIP archive.
Downloading job results in CASSIE
Downloading generated outputs from a completed job.
Section 5

Pipeline Builder

5.1 Creating a New Pipeline

  1. Open the builder from the Home page shortcut or from the Pipelines page using "Create New Pipeline".
  2. If no pipeline is open yet, start with a blank pipeline canvas.
  3. Enter a name for the new pipeline when prompted.
Creating a new pipeline in the CASSIE builder
Creating a new pipeline.

5.2 Adding and Connecting Nodes

The builder canvas supports several node types:

  • Input: Represents an uploaded file from Storage.
  • Tool: A bioinformatics processing step.
  • Checkpoint: A pause point for manual review.
  • Result: Marks the final output of the pipeline.

To build a pipeline, add an input node, place a tool node beside it, connect output handles to input handles, continue linking tools in sequence, and finish with a result node.

Connecting nodes in the CASSIE pipeline builder
Building and connecting pipeline nodes.

5.3 Saving and Naming Pipelines

  1. Click the Save button in the toolbar whenever you want to preserve your work.
  2. Rename a pipeline by editing the title shown at the top of the builder.
  3. Clone an existing pipeline through the pipeline menu if you want to reuse and modify it.
Saving and renaming a pipeline in CASSIE
Saving and naming your pipeline.

5.4 Running a Pipeline Job

  1. Open an existing pipeline in the builder or from your Pipelines page.
  2. Click "Run Pipeline".
  3. Map each input node to a file from Storage.
  4. Submit the pipeline run and verify that it appears in your Jobs list.
Running a saved pipeline in CASSIE
Launching a pipeline job.

5.5 Publishing a Pipeline to the Community

  1. Open the Pipelines page to see your saved pipelines.
  2. Find the pipeline you want to share and use the Share action on its card.
  3. After sharing, the pipeline is marked as Shared in your own pipeline list.
  4. Open the Community page to confirm that the shared pipeline is visible to other users.
Section 6

Community Library

The Community page is where you browse workflows shared by other users.

You can search by name, sort by most recent or most popular, filter by tool labels, open a shared pipeline in the builder, and save a copy into your own account for later editing or execution.

Pipelines browser in CASSIE
Browsing shared pipelines in the Community library.
Section 7

Community Forum

The Forum is used to share results, discuss workflows, and ask questions.

Community forum in CASSIE
Forum browsing and discussion features.
Section 8

Profile & Account Settings

Use the Profile page to manage your public information, security settings, and account controls.

Section 9

Admin Panel (Admin Users Only)

If your account has administrator privileges, you can access the dedicated admin console. The Admin Panel provides access to operational controls such as:

CASSIE admin console overview
Overview of the CASSIE admin console.
Section 10

Supported Bioinformatics Tools

CASSIE currently supports the following tools. Each tool displays its own required inputs and parameter fields when selected.

Tool Purpose
FastQCRead quality control and QC reports
GenomeScope2Genome size and heterozygosity estimation
SPAdesShort-read genome assembly
metaSPAdesMetagenomic assembly
HifiasmHiFi long-read genome assembly
VerkkoLong-read telomere-to-telomere assembly
QUASTAssembly quality assessment
BUSCOGenome completeness evaluation
CATTaxonomic classification of contigs
LiftoffAnnotation liftover between assemblies
MerylK-mer database construction
MerquryReference-free assembly evaluation
Section 11

Tips and Troubleshooting

Job stays queued for a long time

The compute cluster may be at capacity. Wait a few minutes and refresh the job detail page. Administrators can also inspect queue status from the Admin Panel.

File upload fails

Make sure the file is in a supported format and is not corrupted. Very large uploads may require a more stable connection.

Tool parameter errors

Review the tooltip text beside each parameter. Leaving a field blank usually applies the default value.

Pipeline node will not connect

Connections are type-checked. Make sure you are dragging from the correct output handle to a compatible input handle.

Published pipeline does not appear immediately

Refresh the Pipelines page and switch back to the Community tab after a short delay.

For further help, post a question in the Community Forum or contact your system administrator.