Hi team,
Could we schedule a focused review of the current mis-mapping findings in:
https://github.com/dieterich-lab/CardioGuidelinesGraph/blob/master/docs/trackers/grounding/procedure_mis_mapping_report.md
The goal is to understand why Ground Truth and model labeling are still not fully synchronized and define concrete fixes.
Proposed To-Dos
- Review disagreement cases and classify each mismatch into one of these buckets:
- Ground Truth likely incorrect or outdated.
- Model labeling likely incorrect.
- Task / label semantics ambiguous (needs rule clarification).
- For each bucket, capture:
- Representative examples.
- Root-cause hypothesis.
- Required action (data correction, prompt/rule update, ontology change, or evaluation-policy change).
- Define a short remediation plan with owners and expected impact:
- Quick wins (can be fixed immediately).
- Medium-term fixes (pipeline or labeling policy changes).
- Validation plan for confirming reduced mismatch rate.
- Re-check SNOMED subset dependency (important reminder):
- Verify whether we are still using the previously created cardiology-oriented SNOMED subset.
- If this subset step/file was dropped or bypassed, re-create and reinstate the cardiology-focused SNOMED subset as a priority.
- Document where this subset is generated, stored, and consumed in the current pipeline.
Desired Outcome
- A clear decision per mismatch type (GT fix vs model fix vs ambiguity handling).
- A tracked action list with owners.
- Confirmed status of cardiology SNOMED subset usage, with re-creation task initiated if needed.
Thanks everyone. This should help us stabilize labeling quality and speed up SNOMED grounding.
Hi team,
Could we schedule a focused review of the current mis-mapping findings in:
https://github.com/dieterich-lab/CardioGuidelinesGraph/blob/master/docs/trackers/grounding/procedure_mis_mapping_report.md
The goal is to understand why Ground Truth and model labeling are still not fully synchronized and define concrete fixes.
Proposed To-Dos
Desired Outcome
Thanks everyone. This should help us stabilize labeling quality and speed up SNOMED grounding.