-
Notifications
You must be signed in to change notification settings - Fork 17k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Google Cloud Enterprise Search retriever #7857
Google Cloud Enterprise Search retriever #7857
Conversation
The latest updates on your projects. Learn more about Vercel for Git ↗︎ 1 Ignored Deployment
|
This PR adds a LangChain retriever that encapsulates Enterprise Search on Google Cloud Gen App Builder. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one small comment, otherwise looks awesome. thanks @jarokaz!!
"""Filter expression.""" | ||
get_extractive_answers: bool = False | ||
"""If True the retriever will return Extractive Answers. Otherwise Extractive Segments.""" | ||
max_documents: int = Field(default=5, ge=1, le=100) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
most retriever / doc store interfaces refer to this as top_k
, could we change for consistency?
and out of curiosity, are the bounds enforced on the server side as well? if not, do we need them?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This setting is a little bit different than top_k as the underlying retrieval mechanism is a little bit different than in many other vector stores . The max_documents, max_extractive_answer_count, and max_extractive_segment_count settings together decide how many matching "chunks" will be returned. The max_documents controls how many top ranked documents - as ingested by Enterprise Search engine - will be used to prepare extractive answers or extractive segments. When you populate Enterprise Search you ingest full document (rather than chunks) and the engine does chunking and embedding generations in the backend.
For example, if you set max_documents to 3 and max_extractive_answers_count to 3 you can get up to 9 LangChain documents.
When we have the retriever in the main, we are planning to publish a number of code samples that show how to use different LangChain retrieval chains and agents with Enterprise Search. In these code samples we plan to go into pretty deep detail in describing different strategies for ingesting/quering both structured and unstructured data with Enterprise Search.
lgtm, thanks @jarokaz!! |
@jarokaz - I just spoke to Kristopher Overholt, and he suggested I talk to you about having me work on this for LangChainJS if work isn't already being done and anything I should be aware of in doing so. |
No description provided.