Add Supervised Fine Tuning Train Operator, Hook, Tests, Docs by CYarros10 · Pull Request #41807 · apache/airflow

CYarros10 · 2024-08-27T20:32:40Z

This pull request adds the following:

SupervisedFineTuningHook: Hook for Google Cloud Vertex AI Supervised Fine Tuning APIs.
SupervisedFineTuningTrainOperator: Use the Google Cloud Supervised Fine Tuning API to create a tuning job.

About Model tuning: a crucial process in adapting Gemini to perform specific tasks with greater precision and accuracy. Model tuning works by providing a model with a training dataset that contains a set of examples of specific downstream tasks.

A sample DAG containing these operators could look like:
JSONL training data arrives in GCS >> GCSObjectExistenceSensor >> SupervisedFineTuningTrainOperator >> GenerativeModelGenerateContentOperator

MaksYermak · 2024-08-29T14:43:38Z

@CYarros10 What do you think about adding Links to this Operator and, maybe, for the previous operators related to generative AI? It is an example of code how it looks for PipelineJob https://github.com/apache/airflow/blob/main/airflow/providers/google/cloud/operators/vertex_ai/pipeline_job.py#L115 and https://github.com/apache/airflow/blob/main/airflow/providers/google/cloud/links/vertex_ai.py#L329

… tests

CYarros10 · 2024-08-29T23:48:53Z

@CYarros10 What do you think about adding Links to this Operator and, maybe, for the previous operators related to generative AI? It is an example of code how it looks for PipelineJob https://github.com/apache/airflow/blob/main/airflow/providers/google/cloud/operators/vertex_ai/pipeline_job.py#L115 and https://github.com/apache/airflow/blob/main/airflow/providers/google/cloud/links/vertex_ai.py#L329

I think this is a great idea, I would love to do this but want to prioritize CountTokensAPI and EvaluationAPI as these could be part of a broader LLM pipeline with SupervisedTuningFineTrainOperator and GenerativeModelGenerateContentOperator. Will work on Links when I've completed those first!

CYarros10 · 2024-08-29T23:50:41Z

Refactored the code in this PR to be included in airflow/airflow/providers/google/cloud/operators/vertex_ai/generative_model.py - to keep all generative AI / generative model operations in one place. as well as hooks, tests, docs, etc. Feel free to add thoughts @MaksYermak - thank you for the review!

MaksYermak

LGTM

potiuk

Nice one

add supervised_fine_tuning

fdd9e19

boring-cyborg Bot added area:providers area:system-tests kind:documentation provider:google Google (including GCP) related issues labels Aug 27, 2024

CYarros10 added 3 commits August 27, 2024 21:54

build fix

6b3eac6

build,test fix

e55bcce

unit test build fix

d8ada16

CYarros10 commented Aug 28, 2024

View reviewed changes

Comment thread tests/providers/google/cloud/hooks/vertex_ai/test_supervised_fine_tuning.py Outdated

CYarros10 mentioned this pull request Aug 28, 2024

Conflicting results between Ruff linting and Pydantic tests #41847

Closed

xcom fix

619ec5e

MaksYermak reviewed Aug 29, 2024

View reviewed changes

CYarros10 added 2 commits August 29, 2024 23:39

refactor supervised tuning into generative_model module, PR feedback,…

748d964

… tests

minor system test fix

3b17677

CYarros10 added 3 commits August 29, 2024 23:54

update provider.yaml

838f16f

doc fix

415e752

Update Vertex AI Documentation

ddf9150

MaksYermak approved these changes Aug 30, 2024

View reviewed changes

potiuk approved these changes Aug 30, 2024

View reviewed changes

potiuk merged commit 35ce2f1 into apache:main Aug 30, 2024

CYarros10 deleted the sft-tuning-operator branch August 30, 2024 16:15

CYarros10 restored the sft-tuning-operator branch August 30, 2024 16:16

CYarros10 deleted the sft-tuning-operator branch August 30, 2024 16:26

eladkal mentioned this pull request Sep 21, 2024

Status of testing Providers that were prepared on September 21, 2024 #42393

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Supervised Fine Tuning Train Operator, Hook, Tests, Docs#41807

Add Supervised Fine Tuning Train Operator, Hook, Tests, Docs#41807
potiuk merged 10 commits into
apache:mainfrom
CYarros10:sft-tuning-operator

CYarros10 commented Aug 27, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MaksYermak commented Aug 29, 2024

Uh oh!

CYarros10 commented Aug 29, 2024 •

edited

Loading

Uh oh!

CYarros10 commented Aug 29, 2024 •

edited

Loading

Uh oh!

MaksYermak left a comment

Uh oh!

potiuk left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

CYarros10 commented Aug 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MaksYermak commented Aug 29, 2024

Uh oh!

CYarros10 commented Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CYarros10 commented Aug 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaksYermak left a comment

Choose a reason for hiding this comment

Uh oh!

potiuk left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CYarros10 commented Aug 27, 2024 •

edited

Loading

CYarros10 commented Aug 29, 2024 •

edited

Loading

CYarros10 commented Aug 29, 2024 •

edited

Loading