Skip to main content

General Usage

CBIcall is normally run with a YAML parameters file and a thread count:

bin/cbicall run -p parameters.yaml -t 4

Use this page for command syntax and common execution patterns. For YAML keys and supported combinations, see Configuration Reference.

Typical workflow

Choose or edit a YAML file, run bin/cbicall run -p ... -t ..., then inspect the generated run directory and log.json.

Input Model

For WES/WGS single-sample runs, input_dir points to a sample directory containing paired FASTQ files:

CNAG999_exome/CNAG99901P_ex/
*_R1_*.fastq.gz
*_R2_*.fastq.gz

For WES/WGS cohort runs, sample_map points to a TSV containing sample IDs and gVCF paths.

For mtDNA runs, CBIcall expects BAM files from previous WES/WGS single-sample runs. mtDNA workflows do not start from FASTQ files.

CLI Synopsis

cbicall run -p <parameters_file.yaml> -t <n_threads> [options]

Use run for normal analysis execution. Other subcommands are documented on their dedicated pages.

ArgumentMeaning
-p, --paramYAML parameters file.
-t, --threadsNumber of CPUs/cores/threads passed to the workflow.
OptionMeaning
-verboseEnable verbose output.
-debug, --debugDebugging level.
-nc, --no-colorDisable ANSI colors explicitly. Colors are automatically disabled when output is redirected.
-v, --versionShow version information.
-h, --helpShow brief help.
-manShow full command-line documentation.

Common Commands

bin/cbicall run -p wes_single.yaml -t 4
bin/cbicall run -p wes_cohort.yaml -t 8 -verbose
bin/cbicall run -p mit_single.yaml -t 4 > run.log 2>&1
nohup bin/cbicall run -p parameters.yaml -t 4 > run.log 2>&1 &
PatternUse when
bin/cbicall run -p wes_single.yaml -t 4Normal foreground run.
-verboseYou want more CLI output while the workflow starts.
> run.log 2>&1You are saving terminal output to a file. ANSI colors are disabled automatically.
nohup ... &You need a simple long-running background job outside a scheduler.
Thread choice

For most WES/WGS runs, start with 4 threads per task. See Performance for the benchmark and scaling guidance.

Minimal YAML Examples

WES Single-Sample

mode: single
pipeline: wes
workflow_engine: bash
gatk_version: gatk-4.6
input_dir: CNAG999_exome/CNAG99901P_ex
genome: b37

WES Cohort

mode: cohort
pipeline: wes
workflow_engine: bash
gatk_version: gatk-4.6
genome: b37
sample_map: ./sample_map.tsv

mtDNA Single-Sample

mode: single
pipeline: mit
workflow_engine: bash
gatk_version: gatk-3.5
input_dir: CNAG999_exome/CNAG99901P_ex

Partial Runs

Partial runs are intended for targeted Snakemake execution and restarts. Leave workflow_rule unset for a normal full run.

workflow_engine: snakemake
workflow_rule: call_variants
allow_partial_run: true

Behavior:

  • if workflow_rule is unset, the full workflow runs
  • if workflow_rule is set, the run is marked as partial in metadata and CLI output
  • if workflow_rule is set without allow_partial_run: true, CBIcall refuses to start

Outputs and Logs

Each run creates a directory containing:

  • the main workflow log
  • log.json with CLI arguments, resolved configuration, and YAML parameters
  • workflow outputs such as VCFs, BAMs, statistics, and browser reports

See Outputs for the file reference.

Reproducibility

Keep log.json with the workflow outputs. It records the CLI arguments, resolved configuration, and parameters used for the run.

CBIcall is designed for multi-core Linux desktop, workstation, server, and HPC environments.

ResourceRecommendation
CPUAt least 4 cores
RAMAt least 8 GB for single-sample runs; more for cohort joint genotyping
DiskAt least 250 GB, depending on input size and whether BAM cleanup is enabled
Source installationPython 3.8+, Java 8, Snakemake when using Snakemake workflows

Next Steps