ClarID-Tools Documentation
This page is the map for the ClarID-Tools documentation. Use it to decide where to start, which command style applies to your setup, and when to use each ID format.
ClarID-Tools is the reference command-line implementation for encoding and
decoding subject and biosample identifiers from structured biomedical
metadata.
Before You Start
Pick the command style that matches your setup.
- If you cloned this repository, run commands from the repository root with
bin/clarid-tools. - If you installed ClarID-Tools system-wide or with CPAN, run
clarid-tools. - You can omit
--codebookunless you want to use a custom codebook. The packaged default codebook is used automatically. - The
qrcodesubcommand needsqrencodeandzbarimg; the Docker image already includes them.
Start Here
What You Can Do
- Encode subject and biosample identifiers from structured metadata.
- Decode human-readable or stub identifiers back into tabular values.
- Validate YAML codebooks against the bundled JSON Schema.
- Run single-record, bulk CSV, QR-code, and preprocessing workflows from the CLI.
Choosing an ID Format
human
Use this when identifiers will be read directly by people in reports, spreadsheets, filenames, or manual review workflows.
stub
Use this when compactness matters more than readability, for example in QR codes, labels, or constrained downstream systems.
Both formats are generated from the same metadata and codebook, so you can encode and decode between structured metadata and IDs consistently.
What This Repository Implements
The ClarID paper defines the identifier concept and rationale. This repository provides the reference command-line implementation, the codebook/schema validation workflow, and end-to-end examples for subject and biosample identifiers.