Usage

This page collects a few common convert-pheno command-line patterns. For the flag reference, see Use as a command-line interface.

Convert Phenopackets to individuals-only BFF output

convert-pheno -ipxf pxf.json -obff individuals.json

Use this when your input is a Phenopackets v2 file and you want the individuals-only BFF output.

Convert BFF output to Phenopackets

convert-pheno -ibff individuals.json -opxf pxf.json

Use this when your starting point is a BFF individuals file.

If the source BFF record does not preserve an original Phenopackets subject.vitalStatus, the output still needs a fallback status. By default convert-pheno uses ALIVE, but you can change it:

convert-pheno -ibff individuals.json -opxf pxf.json --default-vital-status UNKNOWN_STATUS

Convert OMOP SQL to BFF output

For smaller inputs:

convert-pheno -iomop omop.sql -obff individuals.json

For larger inputs, the streaming mode is usually more practical:

convert-pheno -iomop omop.sql.gz -obff individuals.json.gz --stream --ohdsi-db

OMOP streaming

--stream is mainly intended for large OMOP exports. The individuals-only -iomop ... -obff path still emits individuals by default. Use --no-stream to explicitly keep the default in-memory mode.

Emit multi-entity BFF output

convert-pheno -ipxf pxf.json -obff --entities biosamples --out-dir out/

This is the entity-aware BFF form. Keep -obff to select BFF output, then use --entities and --out-dir to choose which entity files are written.

This currently works when the PXF input contains biosample data. The output file will be written as out/biosamples.json.

You can also request synthesized datasets and cohorts:

convert-pheno -icsv clinical_data.csv --mapping-file clinical_data_mapping.yaml -obff --entities individuals datasets cohorts --out-dir out/

datasets and cohorts are synthesized from the normalized individuals collection, so they are available from BFF conversion routes beyond PXF. Mapping-based augmentation of these synthesized entities is currently available only for the conversion routes that use a mapping file: csv2bff, redcap2bff, and cdisc2bff. In those conversions, the top-level beacon section can override metadata such as id, name, description, version, or cohortType.

If you want both individuals and biosamples:

convert-pheno -ipxf pxf.json -obff --entities individuals biosamples --out-dir out/

If you want OMOP SPECIMEN rows as Beacon biosamples only:

convert-pheno -iomop PERSON.csv CONCEPT.csv SPECIMEN.csv -obff --entities biosamples --out-dir out/

If you want a custom biosample filename:

convert-pheno -ipxf pxf.json -obff --entities individuals biosamples --out-dir out/ --out-name biosamples=samples.json

-obff FILE Individuals-Only Behavior

convert-pheno -ipxf pxf.json -obff individuals.json keeps the backward-compatible single-output path and emits only individuals. If the input also contains biosamples, the CLI prints a warning and preserves them under info.phenopacket.biosamples.

Review ontology search results in mapping-file conversions

When using a mapping file, you can ask convert-pheno to write a TSV audit of ontology lookups:

convert-pheno -iredcap redcap.csv --redcap-dictionary dictionary.csv --mapping-file mapping.yaml -obff individuals.json --search-audit-tsv search-audit.tsv

This is useful when you want to review how original source terms were mapped to ontology labels and identifiers. The audit also records the effective search configuration for the run, whether each lookup produced a real database match or fell back to NA, and whether the result came from exact matching, similarity search, or fallback.

Omit raw source provenance from BFF output

By default, BFF output preserves copied source values under info for auditability and source-level querying. For smaller exports, or when raw source values should not be carried forward, use:

convert-pheno -iomop omop.sql -obff individuals.json --no-source-info

This omits raw payloads such as OMOP_columns, CSV_columns, and REDCap_columns, while keeping regular mapped fields and conversion metadata.

Work with repository fixtures

The repository test fixtures under t/ are useful as small examples:

bin/convert-pheno -ipxf t/pxf2bff/in/pxf.json -obff individuals.json
bin/convert-pheno -ipxf t/pxf2bff/in/pxf.json -obff --entities biosamples --out-dir out/
bin/convert-pheno -ipxf t/pxf2bff/in/pxf.json -obff --entities individuals biosamples --out-dir out/ --out-name biosamples=samples.json
bin/convert-pheno -icsv t/csv2bff/in/csv_data.csv --mapping-file t/csv2bff/in/csv_mapping.yaml -obff --entities individuals datasets cohorts --out-dir out/
bin/convert-pheno -iomop t/omop2bff/in/omop_cdm_eunomia.sql -opxf phenopackets.json

Convert Phenopackets to individuals-only BFF output​

Convert BFF output to Phenopackets​

Convert OMOP SQL to BFF output​

Emit multi-entity BFF output​

Review ontology search results in mapping-file conversions​

Omit raw source provenance from BFF output​

Work with repository fixtures​

Need more detail?​