Skip to main content

Proposal: Implementing Pheno-Ranker in a Federated Network

In this proposal, we aim to explore the potential application of Pheno-Ranker within two distinct contexts: the Inter-Hospital Network and the Beacon v2 Network.

Federated network diagram

Pheno-Ranker
Image created by GPT-5.5

The current version of Pheno-Ranker is designed for file-based operations and initiates calculations from scratch each time. To adapt the algorithm for use in multiple hospitals without directly sharing clinical data, we propose the following approach:

1. Preparation Stage:

Vector Standardization: Ensure all hospitals use a standardized vector format.

  • Store each patient’s vector in a local-database.
    • “id_1": "1101010101010...n",
    • "id_2": "0101010101000...n
  • Utilize a network aggregator to regularly update a global reference vector. Each update gets a new version identifier.
  • Periodically update the vector database at each site to ensure current data.

Privacy Protocols: Set up differential privacy mechanisms or encryption protocols.

Threshold Agreement: Establish a common threshold for the Hamming distance (or other metric) for matches.

Preparation stage of Pheno-Ranker algorithm in an inter-hospital network

2. Query Initiation:

The querying hospital prepares a vector representation of the individual or set of individuals. The vector is processed using the agreed-upon privacy protocols.

3. Aggregator Mediation:

The querying hospital sends the processed vector to the network aggregator. The network aggregator distributes the query to all hospitals in the federated network.

4. Local Computation:

Each receiving hospital computes the Hamming distance against its local patient vectors. The computation is done entirely within the local environment of each hospital.

5. Thresholding:

Each hospital applies the agreed-upon thresholding to identify vectors that are considered a "match."

6. Response to Aggregator:

Each hospital sends its response (list of matching vectors, counts, etc.) back to the network aggregator.

7. Aggregation:

The network aggregator collects all the responses, processes them, and sends the aggregated result to the querying hospital.

8. Post-Processing:

The querying hospital undertakes further analysis, potentially reaching out to specific hospitals for more information based on the aggregated results, and decides on subsequent actions.

Pheno-Ranker algorithm in a federated network