Skip to main content

What Should I Run?

Use this page to choose the right bff-tools mode from your starting point. It is a decision page, not a tutorial.

If you want to run the bundled test data now, go to Quick Start. If you already know the mode and want copy-paste commands, use Command Recipes.

Decision Table

I have...I want...Run
XLSX metadataValidate Beacon metadata and write BFF JSON collectionsbff-tools validate
BFF JSON metadataValidate existing JSON collectionsbff-tools validate
A VCF or VCF.gz fileGenerate BFF genomicVariationsbff-tools vcf
A SNP-array TSV/TXT fileConvert SNP-array data into BFF genomicVariationsbff-tools tsv
Existing BFF filesLoad them into MongoDBbff-tools load
Metadata plus VCF/TSV inputConvert genomic data and load everything into MongoDBbff-tools full
Generated BFF outputInspect it without MongoDBbff-browser
BFF data already in MongoDBQuery it with a small web/API layerbff-portal
Many repeated jobsQueue and monitor them locallybff-queue

How to Decide

Start with validate when

  • your first input is a Beacon metadata workbook
  • you want to check existing BFF JSON metadata
  • you need individuals.json, biosamples.json, runs.json, or datasets.json before loading

Start with vcf or tsv when

  • your first input is genomic data
  • you only need to generate genomicVariations
  • MongoDB loading can wait until after you inspect the output

Use vcf for VCF or VCF.gz. Use tsv for SNP-array style TSV or TXT input.

Start with load when

  • BFF metadata collections already exist
  • genomic variation output already exists, if your Beacon includes genomic data
  • MongoDB is configured and reachable

Use full when

  • you already have a working parameter file
  • metadata paths are configured
  • MongoDB settings are configured
  • you want conversion and loading in one run

For a first real dataset, running validate, then vcf or tsv, then load is easier to debug than starting with full.

Avoid These Mixups

SituationBetter choice
You have not tested the install yetRun Quick Start first
You only want to see command syntaxUse Command Recipes
You need file and log locationsRead Outputs
You are reviewing a dataset for sharingRead Validation and Reproducibility
You need the complete workflow explanationRead Data Beaconization

Next Step

If you are new to the toolkit, run the Quick Start first. If you already know your mode, continue to Command Recipes, the CLI reference, or the data beaconization workflow.