Skip to content

change codegen tgi model#646

Merged
daisy-ycguo merged 3 commits into
opea-project:mainfrom
KfreeZ:changeModelForCodegen
Aug 22, 2024
Merged

change codegen tgi model#646
daisy-ycguo merged 3 commits into
opea-project:mainfrom
KfreeZ:changeModelForCodegen

Conversation

@KfreeZ
Copy link
Copy Markdown
Contributor

@KfreeZ KfreeZ commented Aug 22, 2024

Description

Same as #625 mentioned, the tgi 2.2.0 in v0.9 is also not compatible with "ise-uiuc/Magicoder-S-DS-6.7B" model, which GMC e2e is using in the codegen tests.

Update the model to "meta-llama/CodeLlama-7b-hf" in gaudi and "HuggingFaceH4/mistral-7b-grok" in xeon for codegen in GMC pipeline config

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

n/a.

Tests

CI/CD will cover this

Signed-off-by: KfreeZ <kefei.zhang@intel.com>
@KfreeZ KfreeZ requested a review from lvliang-intel as a code owner August 22, 2024 02:10
KfreeZ added 2 commits August 22, 2024 10:34
Signed-off-by: KfreeZ <kefei.zhang@intel.com>
Signed-off-by: KfreeZ <kefei.zhang@intel.com>
Copy link
Copy Markdown
Collaborator

@mkbhanda mkbhanda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@lianhao lianhao added this to the v0.9 milestone Aug 22, 2024
Copy link
Copy Markdown
Collaborator

@daisy-ycguo daisy-ycguo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why we use different models for xeon and Gaudi?

@KfreeZ
Copy link
Copy Markdown
Contributor Author

KfreeZ commented Aug 22, 2024

Why we use different models for xeon and Gaudi?

the tests with same model running on gaudi is faster than on xeon, and in GMC's test on xeon failed with the timeout result. So for not blocking the tests purpose, switch to another model, but I have created an issue opea-project/GenAIInfra#338 to identify whether it's GMC specific and find out the root cause

@mkbhanda
Copy link
Copy Markdown
Collaborator

mkbhanda commented Aug 22, 2024 via email

@daisy-ycguo daisy-ycguo merged commit 06cb308 into opea-project:main Aug 22, 2024
dmsuehir pushed a commit to dmsuehir/GenAIExamples that referenced this pull request Sep 11, 2024
* change codegen tgi model

Signed-off-by: KfreeZ <kefei.zhang@intel.com>
wangkl2 pushed a commit to wangkl2/GenAIExamples that referenced this pull request Dec 11, 2024
* add resume finetuning checkpoint ut.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add final tuned model.

---------

Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
cogniware-devops pushed a commit to Cogniware-Inc/GenAIExamples that referenced this pull request Dec 19, 2025
* change codegen tgi model

Signed-off-by: KfreeZ <kefei.zhang@intel.com>
Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants