Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

2024.04.19 Sub-team meeting

Agenda:Recording: https://pistoiaalliance-org.zoom.us/rec/share/hdZ_7adOrM9TmAxX2H3Byist0IETyMvMulABcJYk6PZR2TMnHsuZeJWtzPBtwXfB.q3UOy_kaD6Zmb0Pt

Passcode: PMik3&s%

Main objective: need to set-up a testing environment to systematically evaluate and improve LLM ability to generate Cypher queries

  • LLM team members, please remind me of what open-source LLM we picked (in addition to GPT4)

    • VM will look up in recording

  • How is Cypher query generation testing done by the BioCypher team?

  • Propose a set of English questions that we will use (limited by the current OT contents in BioCypher) - Confirm with The Hyve that the questions in our list currently flagged as feasible are indeed such

    • Do we need to write “ideal” Cypher queries for these questions? - yes, make sure we understand the questions asked -this is a line item in the RFP

    • VM Fwd questions in column M to experts

  • Comment from Etzard: in his system elastic search is used across documents to by-pass the failing SPARQL queries generated by LLM, is this conceptually similar to the method proposed by Abbvie? VM shared the recording of the Jon’s talk; ask for the slides

  • In general, it is good for us to collect these and similar hacks that force LLMs to produce better queries

  • Action item for the team (all members): please think about any additional requirements that we need to include into the RFP

  • Note, if payments (beyond minimum cloud expenses reimbursement) are needed we must do a competitive RFP/RFQ

...