Feature: Allow embedding vector search for max_marginal_relevance_search #2620
Comments
Would it help you if the maximal_marginal_relevance function was exposed? Or do you really just want an option to only provide |
I was actually looking for an option to only provide
P.S. I am not sure how it works, because I have never contributed to an open source library, but I would like to work on this if possible. Thanks |
Sure! The first step is to get the test to pass locally: elasticsearch-py/test_elasticsearch/test_server/test_vectorstore/test_vectorstore.py Lines 818 to 877 in 6521b55
https://github.com/elastic/elasticsearch-py/blob/main/CONTRIBUTING.md could help you here. The formatting step ( |
Thanks! Let me give this a go. |
Please ask for help if you get stuck. The current system is quite fragile and does not provide helpful errors when things are not set up in exactly the same way as in continuous integration. |
Thanks! So I started Elasticsearch locally using the script |
What STACK_VERSION did you set for And are you able to connect to that Elasticsearch instance?
$ curl --resolve instance:9200:127.0.0.1 --cacert .buildkite/certs/ca.pem https://elastic:changeme@instance:9200
{
  "name" : "instance",
  "cluster_name" : "elasticsearch-8-11-0-SNAPSHOT-rest-test",
  "cluster_uuid" : "8UO33t3DS9mziGEfeJNfGw",
  "version" : {
    "number" : "8.11.0-SNAPSHOT",
    "build_flavor" : "default",
    "build_type" : "docker",
    "build_hash" : "edf70163e6e5371a90090831e055b31973958cf0",
    "build_date" : "2023-11-06T18:49:41.957873063Z",
    "build_snapshot" : true,
    "lucene_version" : "9.8.0",
    "minimum_wire_compatibility_version" : "7.17.0",
    "minimum_index_compatibility_version" : "7.0.0"
  },
  "tagline" : "You Know, for Search"
}
The |
Yeah, I was able to connect to the Elasticsearch instance:
The error that I get for the test:
|
OK, this is pretty annoying. I had to make "instance" resolvable by adding the following entry to /etc/hosts:
Now, that test runs fine with:
and then:
Also, I added the following change to only run that test:
|
Thanks a lot! I was able to run the test, and I have created a PR |
Signed-off-by: Rishabh Gupta <rishabhgupta@Rishabhs-MacBook-Pro.local>
Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com>
Hi @pquentin, just wanted to check with you on the status of this, any estimate of when this might be reviewed/merged? |
* allow embeddings vector to be used for mmr searching (#2620) Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> * Use embedding service if provided --------- Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co>
* allow embeddings vector to be used for mmr searching (#2620) Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> * Use embedding service if provided --------- Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co> (cherry picked from commit 3b1bce7)
…2639) * allow embeddings vector to be used for mmr searching (#2620) Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> * Use embedding service if provided --------- Signed-off-by: rishabh208gupta <rishabhgupta.52pp@gmail.com> Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co> (cherry picked from commit 3b1bce7) Co-authored-by: Rishabh Gupta <71093470+rishabh208gupta@users.noreply.github.com>
Describe the feature: Currently there is no option to perform max_marginal_relevance_search with only a query embedding vector. The query text is mandatory, which means an embedding service has to be passed even when the caller already has the embedding.
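To illustrate why the query text is not strictly needed, here is a minimal sketch of maximal marginal relevance selection that works purely on vectors. It is not the elasticsearch-py implementation: the names `cosine_similarity` and `mmr_select`, and the parameters `k` and `lambda_mult`, are hypothetical and chosen only for this example. It assumes cosine similarity as the relevance measure and plain Python lists as embeddings.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def mmr_select(
    query_embedding: Sequence[float],
    candidate_embeddings: Sequence[Sequence[float]],
    k: int = 4,
    lambda_mult: float = 0.5,
) -> List[int]:
    """Return indices of up to k candidates chosen by maximal marginal relevance.

    Hypothetical helper for illustration only. lambda_mult=1.0 ranks purely by
    relevance to the query; lower values penalize similarity to already
    selected candidates more strongly.
    """
    # Relevance of each candidate to the query vector; no query text involved.
    relevance = [cosine_similarity(query_embedding, c) for c in candidate_embeddings]
    selected: List[int] = []
    remaining = list(range(len(candidate_embeddings)))

    while remaining and len(selected) < k:
        best_idx = remaining[0]
        best_score = -math.inf
        for i in remaining:
            # Redundancy: highest similarity to anything already selected.
            redundancy = max(
                (cosine_similarity(candidate_embeddings[i], candidate_embeddings[j])
                 for j in selected),
                default=0.0,
            )
            score = lambda_mult * relevance[i] - (1.0 - lambda_mult) * redundancy
            if score > best_score:  # strict comparison keeps the earliest on ties
                best_idx, best_score = i, score
        selected.append(best_idx)
        remaining.remove(best_idx)

    return selected


# Two identical candidates plus one that points in a different direction.
candidates = [[1.0, 0.0], [1.0, 0.0], [3.0, 4.0]]
diverse = mmr_select([1.0, 0.0], candidates, k=2, lambda_mult=0.3)  # [0, 2]
```

With a low `lambda_mult` the duplicate candidate is skipped in favor of the more distinct one. The only query-side input is the embedding, which is the behavior the feature request asks for.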