Skip to content

[AudioQnA] Enable vLLM and set it as default LLM serving#1657

Merged
yinghu5 merged 19 commits into
opea-project:mainfrom
wangkl2:audioqna-vllm
Mar 14, 2025
Merged

[AudioQnA] Enable vLLM and set it as default LLM serving#1657
yinghu5 merged 19 commits into
opea-project:mainfrom
wangkl2:audioqna-vllm

Conversation

@wangkl2
Copy link
Copy Markdown
Collaborator

@wangkl2 wangkl2 commented Mar 13, 2025

Description

  • Enable vLLM for AudioQnA and set it as the default LLM serving framework on both Xeon and Gaudi
  • Update the default LLM to avoid the chat template issue with neural-chat
  • Make asr/llm/tts service ports and vllm-gaudi parameters configurable in compose files
  • Modify readmes
  • Add test scripts

Issues

#1436

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)

Dependencies

n/a

Tests

Local tests

wangkl2 added 15 commits March 12, 2025 03:09
- Enable vLLM and set it as the default LLM serving framework
- Make service ports configurable in compose files
- Modify readmes
- Add test scripts

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
…LLM_SERVER_PORT configurable in compose files

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
…able in compose file

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Mar 13, 2025

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

pre-commit-ci Bot and others added 3 commits March 13, 2025 07:22
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
@yinghu5 yinghu5 added aitce A2 ready to fix labels Mar 13, 2025
@yinghu5 yinghu5 added this to OPEA Mar 13, 2025
@yinghu5 yinghu5 added this to the v1.3 milestone Mar 13, 2025
@yinghu5 yinghu5 added the feature New feature or request label Mar 13, 2025
Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
@yinghu5 yinghu5 self-requested a review March 14, 2025 00:50
Copy link
Copy Markdown
Collaborator

@yinghu5 yinghu5 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thank you a lot!

@yinghu5
Copy link
Copy Markdown
Collaborator

yinghu5 commented Mar 14, 2025

@Spycsh @WenjiaoYue Could you please help to review the PR? thank you!

@yinghu5 yinghu5 added A1 high prority and removed A2 ready to fix labels Mar 14, 2025
@yinghu5 yinghu5 merged commit 8fe1929 into opea-project:main Mar 14, 2025
@github-project-automation github-project-automation Bot moved this to Done in OPEA Mar 14, 2025
@joshuayao joshuayao removed this from OPEA Mar 18, 2025
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Mar 21, 2025
…t#1657)

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Apr 1, 2025
…t#1657)

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request Apr 1, 2025
…t#1657)

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
chyundunovDatamonsters pushed a commit to chyundunovDatamonsters/OPEA-GenAIExamples that referenced this pull request May 16, 2025
…t#1657)

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
…t#1657)

Signed-off-by: Wang, Kai Lawrence <kai.lawrence.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

A1 high prority aitce feature New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants