change codegen tgi model by KfreeZ · Pull Request #646 · opea-project/GenAIExamples

KfreeZ · 2024-08-22T02:10:36Z

Description

Same as #625 mentioned, the tgi 2.2.0 in v0.9 is also not compatible with "ise-uiuc/Magicoder-S-DS-6.7B" model, which GMC e2e is using in the codegen tests.

Update the model to "meta-llama/CodeLlama-7b-hf" in gaudi and "HuggingFaceH4/mistral-7b-grok" in xeon for codegen in GMC pipeline config

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

n/a.

Tests

CI/CD will cover this

Signed-off-by: KfreeZ <kefei.zhang@intel.com>

mkbhanda

LGTM

daisy-ycguo

Why we use different models for xeon and Gaudi?

KfreeZ · 2024-08-22T03:39:17Z

Why we use different models for xeon and Gaudi?

the tests with same model running on gaudi is faster than on xeon, and in GMC's test on xeon failed with the timeout result. So for not blocking the tests purpose, switch to another model, but I have created an issue opea-project/GenAIInfra#338 to identify whether it's GMC specific and find out the root cause

mkbhanda · 2024-08-22T03:41:50Z

Good job on filing an issue to investigate more

…

________________________________ From: Kefei Zhang ***@***.***> Sent: Wednesday, August 21, 2024 8:39:39 PM To: opea-project/GenAIExamples ***@***.***> Cc: Bhandaru, Malini ***@***.***>; Comment ***@***.***> Subject: Re: [opea-project/GenAIExamples] change codegen tgi model (PR #646) Why we use different models for xeon and Gaudi? the model running on guadi is faster than on xeon, and in GMC's test on xeon failed with the timeout result. So for not blocking the tests, switch to another model, but I have created an issue opea-project/GenAIInfra#338<opea-project/GenAIInfra#338> to identify whether it's GMC specific and find out the root cause — Reply to this email directly, view it on GitHub<#646 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAUTXVM526DLSMAKOSRZUJTZSVMPXAVCNFSM6AAAAABM5FDG5KVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMBTGYYTCNRUHE>. You are receiving this because you commented.Message ID: ***@***.***>

* change codegen tgi model Signed-off-by: KfreeZ <kefei.zhang@intel.com>

* add resume finetuning checkpoint ut. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add final tuned model. --------- Co-authored-by: root <root@idc708073.jf.intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* change codegen tgi model Signed-off-by: KfreeZ <kefei.zhang@intel.com> Signed-off-by: cogniware-devops <ambarish.desai@cogniware.ai>

change codegen tgi model

fed908c

Signed-off-by: KfreeZ <kefei.zhang@intel.com>

KfreeZ requested a review from lvliang-intel as a code owner August 22, 2024 02:10

KfreeZ added 2 commits August 22, 2024 10:34

change codegen tgi model, set xeon to default model

f67a8da

Signed-off-by: KfreeZ <kefei.zhang@intel.com>

change xeon to HuggingFaceH4/mistral-7b-grok

b28555a

Signed-off-by: KfreeZ <kefei.zhang@intel.com>

mkbhanda approved these changes Aug 22, 2024

View reviewed changes

lianhao added this to the v0.9 milestone Aug 22, 2024

daisy-ycguo reviewed Aug 22, 2024

View reviewed changes

daisy-ycguo approved these changes Aug 22, 2024

View reviewed changes

daisy-ycguo merged commit 06cb308 into opea-project:main Aug 22, 2024

KfreeZ mentioned this pull request Aug 28, 2024

Improve the performance of GMC router opea-project/GenAIInfra#356

Merged

3 tasks

dmsuehir pushed a commit to dmsuehir/GenAIExamples that referenced this pull request Sep 11, 2024

change codegen tgi model (opea-project#646)

515a902

* change codegen tgi model Signed-off-by: KfreeZ <kefei.zhang@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

change codegen tgi model#646

change codegen tgi model#646
daisy-ycguo merged 3 commits into
opea-project:mainfrom
KfreeZ:changeModelForCodegen

KfreeZ commented Aug 22, 2024 •

edited

Loading

Uh oh!

mkbhanda left a comment

Uh oh!

daisy-ycguo left a comment

Uh oh!

KfreeZ commented Aug 22, 2024 •

edited

Loading

Uh oh!

mkbhanda commented Aug 22, 2024 via email

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

KfreeZ commented Aug 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Issues

Type of change

Dependencies

Tests

Uh oh!

mkbhanda left a comment

Choose a reason for hiding this comment

Uh oh!

daisy-ycguo left a comment

Choose a reason for hiding this comment

Uh oh!

KfreeZ commented Aug 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mkbhanda commented Aug 22, 2024 via email

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

KfreeZ commented Aug 22, 2024 •

edited

Loading

KfreeZ commented Aug 22, 2024 •

edited

Loading