Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Neural sparse query two-phase search processor's bwc test #777

Merged
merged 30 commits into from
Jul 9, 2024

Conversation

conggguan
Copy link
Contributor

Description

Add BWC test for the Neural sparse query two-phase search processor.

Issues Resolved

Resolve #646.

Check List

  • New functionality includes testing.
    • All tests pass
  • New functionality has been documented.
    • New functionality has javadoc added
  • Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

conggguan and others added 18 commits May 10, 2024 23:50
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
…rocess funciton.

Signed-off-by: conggguan <congguan@amazon.com>
…uilder.

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
…e search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>
…processor.

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
@github-actions github-actions bot added the Features Introduces a new unit of functionality that satisfies a requirement label Jun 6, 2024
@conggguan conggguan marked this pull request as ready for review June 6, 2024 08:38
Signed-off-by: conggguan <congguan@amazon.com>
@conggguan
Copy link
Contributor Author

Can you please check why bwc are failing in CI for rolling upgrade? I've restarted them few times but it still failing, doesn't look like a flaky tests

Hi, I have made a test qa:rolling-upgrade:testAgainstOneThirdUpgradedCluster on a aws ec2, which based ubuntu 22 and JDK17.
The test branch is main and without this PR's code, and get a similar error log.

* What went wrong:
Execution failed for task ':qa:rolling-upgrade:testAgainstOneThirdUpgradedCluster'.
> `cluster{:qa:rolling-upgrade:neuralSearchBwcCluster-rolling}` failed to wait for cluster health yellow after 40 SECONDS
    IO error while waiting cluster
    503 Service Unavailable

Based this, I think the failed maybe caused by something besides my PR.

@yuye-aws
Copy link
Member

Can you update the changelog?

Signed-off-by: conggguan <congguan@amazon.com>
@conggguan
Copy link
Contributor Author

Can you update the changelog?

Sure, add a line in changelog.

CHANGELOG.md Outdated
@@ -17,6 +17,7 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
### Enhancements
### Bug Fixes
### Infrastructure
- Add backward test cases for neural sparse two phase processor.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please make a proper changelog entry - pr number and link are missing

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, have added it to the change log.

@conggguan conggguan force-pushed the search-pipeline-bwc branch 2 times, most recently from 15aeb02 to 7f7aa8d Compare July 8, 2024 03:16
…fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
@zhichao-aws zhichao-aws merged commit b0b5128 into opensearch-project:main Jul 9, 2024
80 of 81 checks passed
@zhichao-aws zhichao-aws added the backport 2.x Label will add auto workflow to backport PR to 2.x branch label Jul 9, 2024
@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-777-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 b0b51284d29532ed67fcff745fee1d45653ce776
# Push it to GitHub
git push --set-upstream origin backport/backport-777-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-777-to-2.x.

vibrantvarun pushed a commit to vibrantvarun/neural-search that referenced this pull request Jul 9, 2024
…-project#777)

* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
vibrantvarun added a commit that referenced this pull request Jul 9, 2024
* Adds method_parameters in neural search query to support ef_search (#787) (#814)

Signed-off-by: Tejas Shah <shatejas@amazon.com>

* Add BWC for batch ingestion (#769)

* Add BWC for batch ingestion

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Update Changelog

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix spotlessLicenseCheck

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix comments

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Reuse the same code

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename some functions

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename a function

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Minor change to trigger rebuild

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

---------

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Neural sparse query two-phase search processor's bwc test (#777)

* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>

* Enable '.' for nested field in text embedding processor (#811)

* Added nested structure for text embed processor mapping

Signed-off-by: Martin Gaievski <gaievski@amazon.com>

* Fix linux build CI error due to action runner env upgrade node 20 (#821)

* Fix linux build CI error due to action runner env upgrade node 20

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix linux build on additional integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>
vibrantvarun added a commit that referenced this pull request Jul 9, 2024
…827)

* Fix jdk version for CI test secure cluster action (#801) (#806)

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* [Part 2] Normalization Phase for Sorting (#802)

* Normalization Phase for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fixing compile test issue

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimize code

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add method description

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* HybridSearchSortUtil class

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Integ Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Sorting Integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add integ test for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring normalization processor workflow

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Unit Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimising Normalization

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Vijay comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Vijay Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Update bwc workflow to include 2.16.0-SNAPSHOT (#809) (#810)

* Increment BWC version



* Append 2.16.0-SNAPSHOTn in restart upgrade tests



---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 3] Concurrent segment search bug in Sorting (#808)

* Cherry picking Concurrent Segment Search Bug Commit

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Concurrent Segment Search Bug in Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Functional Interface

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Removing comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin commnents

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Rebasing with main (#826)

* Adds method_parameters in neural search query to support ef_search (#787) (#814)

Signed-off-by: Tejas Shah <shatejas@amazon.com>

* Add BWC for batch ingestion (#769)

* Add BWC for batch ingestion

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Update Changelog

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix spotlessLicenseCheck

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix comments

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Reuse the same code

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename some functions

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename a function

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Minor change to trigger rebuild

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

---------

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Neural sparse query two-phase search processor's bwc test (#777)

* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>

* Enable '.' for nested field in text embedding processor (#811)

* Added nested structure for text embed processor mapping

Signed-off-by: Martin Gaievski <gaievski@amazon.com>

* Fix linux build CI error due to action runner env upgrade node 20 (#821)

* Fix linux build CI error due to action runner env upgrade node 20

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix linux build on additional integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Add changelog

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
opensearch-trigger-bot bot pushed a commit that referenced this pull request Jul 9, 2024
…827)

* Fix jdk version for CI test secure cluster action (#801) (#806)

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* [Part 2] Normalization Phase for Sorting (#802)

* Normalization Phase for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fixing compile test issue

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimize code

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add method description

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* HybridSearchSortUtil class

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Integ Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Sorting Integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add integ test for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring normalization processor workflow

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Unit Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimising Normalization

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Vijay comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Vijay Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Update bwc workflow to include 2.16.0-SNAPSHOT (#809) (#810)

* Increment BWC version

* Append 2.16.0-SNAPSHOTn in restart upgrade tests

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 3] Concurrent segment search bug in Sorting (#808)

* Cherry picking Concurrent Segment Search Bug Commit

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Concurrent Segment Search Bug in Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Functional Interface

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Removing comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin commnents

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Rebasing with main (#826)

* Adds method_parameters in neural search query to support ef_search (#787) (#814)

Signed-off-by: Tejas Shah <shatejas@amazon.com>

* Add BWC for batch ingestion (#769)

* Add BWC for batch ingestion

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Update Changelog

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix spotlessLicenseCheck

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix comments

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Reuse the same code

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename some functions

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename a function

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Minor change to trigger rebuild

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

---------

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Neural sparse query two-phase search processor's bwc test (#777)

* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>

* Enable '.' for nested field in text embedding processor (#811)

* Added nested structure for text embed processor mapping

Signed-off-by: Martin Gaievski <gaievski@amazon.com>

* Fix linux build CI error due to action runner env upgrade node 20 (#821)

* Fix linux build CI error due to action runner env upgrade node 20

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix linux build on additional integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Add changelog

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
(cherry picked from commit d22e1b8)
vibrantvarun added a commit that referenced this pull request Jul 9, 2024
…827) (#829)

* Fix jdk version for CI test secure cluster action (#801) (#806)

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* [Part 2] Normalization Phase for Sorting (#802)

* Normalization Phase for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fixing compile test issue

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimize code

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add method description

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 1] Collector for Sorting Results (#797)

* HybridSearchSortUtil class

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Integ Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add Sorting Integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Add integ test for Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring normalization processor workflow

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Unit Tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Refactoring

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Optimising Normalization

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Vijay comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Vijay Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Update bwc workflow to include 2.16.0-SNAPSHOT (#809) (#810)

* Increment BWC version

* Append 2.16.0-SNAPSHOTn in restart upgrade tests

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

* [Part 3] Concurrent segment search bug in Sorting (#808)

* Cherry picking Concurrent Segment Search Bug Commit

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix Concurrent Segment Search Bug in Sorting

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Functional Interface

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Removing comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Addressing Martin commnents

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Address Martin Comments

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Rebasing with main (#826)

* Adds method_parameters in neural search query to support ef_search (#787) (#814)

Signed-off-by: Tejas Shah <shatejas@amazon.com>

* Add BWC for batch ingestion (#769)

* Add BWC for batch ingestion

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Update Changelog

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix spotlessLicenseCheck

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Fix comments

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Reuse the same code

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename some functions

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Rename a function

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Minor change to trigger rebuild

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

---------

Signed-off-by: Liyun Xiu <xiliyun@amazon.com>

* Neural sparse query two-phase search processor's bwc test (#777)

* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>

* Enable '.' for nested field in text embedding processor (#811)

* Added nested structure for text embed processor mapping

Signed-off-by: Martin Gaievski <gaievski@amazon.com>

* Fix linux build CI error due to action runner env upgrade node 20 (#821)

* Fix linux build CI error due to action runner env upgrade node 20

Signed-off-by: Varun Jain <varunudr@amazon.com>

* Fix linux build on additional integ tests

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>

* Add changelog

Signed-off-by: Varun Jain <varunudr@amazon.com>

---------

Signed-off-by: Martin Gaievski <gaievski@amazon.com>
Signed-off-by: Varun Jain <varunudr@amazon.com>
Signed-off-by: Tejas Shah <shatejas@amazon.com>
Signed-off-by: Liyun Xiu <xiliyun@amazon.com>
Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
Co-authored-by: Martin Gaievski <gaievski@amazon.com>
Co-authored-by: Tejas Shah <shatejas@amazon.com>
Co-authored-by: Liyun Xiu <chishui2@gmail.com>
Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
(cherry picked from commit d22e1b8)

Co-authored-by: Varun Jain <varunudr@amazon.com>
@zhichao-aws zhichao-aws added backport 2.x Label will add auto workflow to backport PR to 2.x branch and removed backport 2.x Label will add auto workflow to backport PR to 2.x branch labels Jul 11, 2024
opensearch-trigger-bot bot pushed a commit that referenced this pull request Jul 11, 2024
* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
(cherry picked from commit b0b5128)
zane-neo pushed a commit that referenced this pull request Jul 13, 2024
* Poc of pipeline

Signed-off-by: conggguan <congguan@amazon.com>

* Complete some settings for two phase pipeline.

Signed-off-by: conggguan <congguan@amazon.com>

* Change the implement of two-phase from QueryBuilderVistor to custom process funciton.

Signed-off-by: conggguan <congguan@amazon.com>

* Add It and fix some bug on the state of multy same neuralsparsequerybuilder.

Signed-off-by: conggguan <congguan@amazon.com>

* Simplify some logic, and correct some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some test case.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize some logic for zhichao-aws's comments.

Signed-off-by: conggguan <congguan@amazon.com>

* Optimize a line without application.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments, remove some redundant lines, fix some format.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove a redundant null check, fix a if format.

Signed-off-by: conggguan <congguan@amazon.com>

* Fix a typo for a comment, camelcase format for some variable.

Signed-off-by: conggguan <congguan@amazon.com>

* Add some comments to illustrate the influence of the modify on 2-phase search pipeline to neural sparse query builder.

Signed-off-by: conggguan <congguan@amazon.com>

* Add restart and rolling upgrade bwc test for neural sparse two phase processor.

Signed-off-by: conggguan <congguan@amazon.com>

* Spotless on qa.

Signed-off-by: conggguan <congguan@amazon.com>

* Update change log for two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Remove redundant lines of two-phase BWC test.

Signed-off-by: conggguan <congguan@amazon.com>

* Add changelog.

Signed-off-by: conggguan <congguan@amazon.com>

* Add the PR link and number for the CHANGELOG.md.

Signed-off-by: conggguan <congguan@amazon.com>

* [Fix] NeuralSparseTwoPhaseProcessorIT created wrong ingest pipeline, fix it to correct API.

Signed-off-by: conggguan <congguan@amazon.com>

---------

Signed-off-by: conggguan <congguan@amazon.com>
Signed-off-by: conggguan <157357330+conggguan@users.noreply.github.com>
(cherry picked from commit b0b5128)

Co-authored-by: conggguan <157357330+conggguan@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport 2.x Label will add auto workflow to backport PR to 2.x branch Features Introduces a new unit of functionality that satisfies a requirement
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[RFC] Enhancing Neural Sparse Query Speed with a Two-Phase Approach
4 participants