Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Description

Mitigation

Responsible Party

Failure to identify business questions, or picking too many or too few

Draft appropriate business questions - DONE

Lee Harland, John Wise, Bruce Press, Peter Revill

Validate that Open Targets either has a ready to use Knowledge Graph implementation, or can be converted into a KG with reasonable cost

BioCypher could be the right KG. It contains Open Targets - DONE

The Hyve
Jordan Ramsdell
Robert Gill
Brian Evarts

Open Targets/EBI:

Sebastian Lobentanzer

Ellen McDonagh

Failure to identify a suitable LLM

Technology research, feature and cost analysis, and selection

Jon Stevens, Etzard Stolte, Helena Deus; Brian Evarts; Wouter Franke, Matthijs van der Zee

Does Open Targets use an ontology?

Perhaps The Hyve team has a ready answer

Failure to download a large volume of data (all of the PubMed as a maximum) for the prompt-tuning of the LLM

TBD

Failure to perform KG generation from text by an LLM

  1. Technology research

  2. If no ready-to-use technology exists, estimate bespoke development (tuning an existing LLM for this purpose)

Failure to perform local KG comparison with calculation of a score

  1. Technology research

  2. If no ready-to-use technology exists, estimate bespoke development

  3. If estimates indicate infeasibility, this may become a gap

Failure to generate a proper query for a KG database system by an LLM

Technology research.

  • See refs 7, 8 below

  • BioCypher by EBI may have this capability already - needs evaluation

The Hyve
Jordan Ramsdell
Robert Gill
Brian Evarts

Open Targets/EBI:

Sebastian Lobentanzer

Ellen McDonagh

Failure to build a prototypical target discovery pipeline on the limited budget in case of mounting technical difficulties

Schedule the project in phases. Aim to answer known unknowns and to establish risk mitigation strategies early in this phase (“project elaboration”)

Is there need for an architecture that would be broadly applicable beyond the narrow POC?

  • Technology research

  • Need agents that can collect data from any source

  • How is BioChatter/BioCypher architecture deficient?

Need to assemble a new sub-team

Project Stakeholders

Sponsors:

...

References

  1. https://www.sciencedirect.com/science/article/pii/S1359644613001542

  2. https://www.nature.com/articles/s41573-020-0087-3

  3. https://www.epam.com/about/newsroom/press-releases/2023/epam-launches-dial-a-unified-generative-ai-orchestration-platform

  4. https://epam-rail.com/open-source

  5. Open LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

  6. Chatbot Arena: https://chat.lmsys.org/?arena

  7. Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning

    https://arxiv.org/abs/2310.01061

  8. Knowledge-Consistent Dialogue Generation with Language Models and Knowledge Graphs

    https://openreview.net/forum?id=WhWlYzUTJfP&source=post_page-----97a4cf96eb69--------------------------------

  9. BioChatter Benchmark Results: https://biochatter.org/benchmark-results/#biochatter-query-generation

  10. MBET Benchmark (embeddings) https://huggingface.co/spaces/mteb/leaderboard

  11. Lora-Land and Lorax: https://predibase.com/lora-land