Enable vllm for DocSum #1716
Merged
Signed-off-by: letonghan <letong.han@intel.com>
Dependency Review: ✅ No vulnerabilities or license issues found.
@XinyaoWa Xinyao will help check the DocSum TGI issues related to max tokens and langchain dependency versions.
XinyaoWa approved these changes on Mar 28, 2025
Spycsh reviewed on Mar 28, 2025
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request on Apr 1, 2025:
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Fix issue opea-project#1436
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request on May 16, 2025:
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Fix issue opea-project#1436
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>
letonghan pushed a commit that referenced this pull request on Sep 17, 2025:
* initial file structure created. Populated with unimplemented files
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added relevant code to files within comps/router/deployment
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added Dockerfile, opea_router_microservice.py, README.md, and requirements.txt contents
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added controller components for router instances
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added initial routellm controller test script in router directory
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* fixed requirements.txt issue
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added HUGGINGFACEHUB_API_TOKEN as an env variable
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* removed hard OPENAI dependency and made OPENAI_API_KEY default to empty str
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* removed empty str fallback for OPENAI_API_KEY var
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* target localhost in RouteLLM E2E test to avoid Docker network issues
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* fixed e2e test issue for routellm test
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* changed the checkpoint path for the custom mf model weights. Now using 'routellm-e5-base-V2' under OPEA HF group
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* moved RouteEndpointDoc class into 'api_protocol.py' under cores/proto
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added 'router-compose.yaml' to workflows/docker/compose
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* pre commit format updates
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* removed the forked version of RouteLLM from requirements.txt dependency. Now pulls from the referenced repo and then applies the patch located at 'comps/router/src/hf_compatibility.patch'
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* updated README to reflect the patch usage for modified RouteLLM repo
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* added H1 title to README
  Signed-off-by: Madison Evans <madison.evans@intel.com>
* comply with formatting requests.
  Signed-off-by: Haim Barad <haim.barad@intel.com>
* fix pre-commit issues: remove trailing whitespace and add newline
  Signed-off-by: Haim Barad <haim.barad@intel.com>
---------
Signed-off-by: Madison Evans <madison.evans@intel.com>
Signed-off-by: Haim Barad <haim.barad@intel.com>
Co-authored-by: Haim Barad <haim.barad@intel.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request on Dec 19, 2025:
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Fix issue opea-project#1436
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>
Description
Set vLLM as the default LLM serving backend, and add the related Docker Compose files, READMEs, and test scripts.
Issues
#1436
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
None
Tests
Tested locally.