Enable vLLM Profiling for ChatQnA#1124

Merged: yinghu5 merged 1 commit into opea-project:main from intel-ai-tce:vLLM_Profiling on Nov 13, 2024
Conversation

@louie-tsai (Collaborator) commented Nov 13, 2024

Description

Enable vLLM PyTorch profiling for ChatQnA.
Advanced users who want to profile vLLM performance will benefit from having this feature enabled.
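
For readers unfamiliar with how vLLM profiling is usually driven, here is a minimal sketch. It assumes the vLLM OpenAI-compatible server was started with the `VLLM_TORCH_PROFILER_DIR` environment variable set, which, to my understanding, makes it accept `POST /start_profile` and `POST /stop_profile`. The base URL `http://localhost:8000` and the helper names below are hypothetical; this PR does not state the host or port used by the ChatQnA deployment.

```python
import urllib.request


def profile_url(base_url: str, action: str) -> str:
    """Build the profiling endpoint URL for action 'start' or 'stop'."""
    if action not in ("start", "stop"):
        raise ValueError(f"action must be 'start' or 'stop', got {action!r}")
    return f"{base_url.rstrip('/')}/{action}_profile"


def build_profile_request(base_url: str, action: str) -> urllib.request.Request:
    """Create (but do not send) an empty-body POST request to the endpoint."""
    return urllib.request.Request(
        profile_url(base_url, action), data=b"", method="POST"
    )


def send_profile_request(base_url: str, action: str, timeout: float = 600.0) -> int:
    """Send the request and return the HTTP status code.

    The generous default timeout is an assumption: stopping the profiler
    may take a while because the trace has to be written to disk.
    """
    request = build_profile_request(base_url, action)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


if __name__ == "__main__":
    base = "http://localhost:8000"  # hypothetical vLLM server address
    send_profile_request(base, "start")
    # ... send a few ChatQnA / completion requests here ...
    send_profile_request(base, "stop")
```

The trace files would then appear in the directory named by `VLLM_TORCH_PROFILER_DIR`, where they can be opened with a PyTorch trace viewer.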

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

NA

Tests

Manual Testing

@yinghu5 (Collaborator) left a comment

Thank you for the good feature.

@yinghu5 yinghu5 merged commit 7adbba6 into opea-project:main Nov 13, 2024
lkk12014402 pushed a commit that referenced this pull request Jan 17, 2025
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>