Introduce new prioritization algorithm #90

cthoyt · 2025-06-30T08:51:22Z

Alternative to #80

The prioritization algorithm already assumed that a set of mappings had been processed with both inversion and chain inference.

Old algorithm:

Make an undirected graph
Get all connected components
In each connected component, get the highest priority node
Make mappings from each node to that one

This algorithm would produce incorrect results if there was no pre-existing mapping from a given node in a connected component to the highest priority node in the component (i.e., it would skip it entirely). This should not have happened because of the precondition for using the function

New algorithm:

Still assume that inference has been run and that in each connected component, there are exact match mappings between all nodes (in both directions). Further, assume that assemble_evidences() has been run / there is only one exact match mapping for each subject/object pair
Make a subject-object-mapping index
For each subject
1. assume that all objects comprise all nodes in the connected component that the subject belongs to.
2. choose the highest priority object from the list and the associated mapping with that object

Benefits:

the new algorithm only has to loop through the mappings once
It doesn't have to create a networkx graph data structure nor run the connected components algorthm

Still todo:

document/harden behavior for mapping sets that don't induce fully connected components

cthoyt added 3 commits June 30, 2025 00:04

Update api.py

593cd84

Update api.py

31ab071

Refactor

2868964

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Introduce new prioritization algorithm #90

Introduce new prioritization algorithm #90

Uh oh!

cthoyt commented Jun 30, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Introduce new prioritization algorithm #90

Are you sure you want to change the base?

Introduce new prioritization algorithm #90

Uh oh!

Conversation

cthoyt commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cthoyt commented Jun 30, 2025 •

edited

Loading