fix(s3_task_handler.py): upload log with put_object #68619

Merged
vincbeck merged 1 commit into apache:main from FrankYang0529:airflow-68585 on Jun 16, 2026
@FrankYang0529 FrankYang0529 commented Jun 16, 2026

Member

When S3 is configured as the remote log backend, task log files are incorrectly reported as task outputs in OpenLineage events.

Upload the log via the boto3 client directly (get_conn().put_object(...)) instead of S3Hook.load_string(), so the log write never touches the hook lineage collector.
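The change described above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: `upload_log` and `S3HookLike` are hypothetical names, and the sketch assumes only that the hook exposes the boto3 S3 client through `get_conn()`, as the description states. Options such as encryption or overwrite checks are left out.

```python
from typing import Any, Protocol


class S3HookLike(Protocol):
    """Hypothetical minimal interface: any object whose get_conn() returns a boto3 S3 client."""

    def get_conn(self) -> Any: ...


def upload_log(hook: S3HookLike, log: str, bucket: str, key: str) -> None:
    """Write a log string to S3 through the raw client.

    Calling put_object on the client directly avoids going through
    hook helpers such as load_string(), which is the point of the fix
    described above.
    """
    client = hook.get_conn()
    client.put_object(Bucket=bucket, Key=key, Body=log.encode("utf-8"))
```

With a real `S3Hook` instance this would presumably be called as `upload_log(hook, log_text, bucket, key)`. Note that `put_object` overwrites an existing object at the same key, so any "do not replace" behavior would need a separate check.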

closes: #68585

Was generative AI tooling used to co-author this PR?
  • Yes (please specify the tool below)

Generated-by: [Claude Code with Opus 4.8] following the guidelines


  • Read the Pull Request Guidelines for more information. Note: commit author/co-author name and email in commits become permanently public when merged.
  • For fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
  • When adding a dependency, check compliance with the ASF 3rd Party License Policy.
  • For significant user-facing changes, create a newsfragment: {pr_number}.significant.rst, in airflow-core/newsfragments. You can add this file in a follow-up commit after the PR is created so you know the PR number.

Signed-off-by: PoAn Yang <payang@apache.org>
@FrankYang0529 FrankYang0529 marked this pull request as ready for review June 16, 2026 11:33
@FrankYang0529 FrankYang0529 requested a review from o-nikolas as a code owner June 16, 2026 11:33
@vincbeck vincbeck merged commit 98ea29a into apache:main Jun 16, 2026
188 checks passed
@FrankYang0529 FrankYang0529 deleted the airflow-68585 branch June 16, 2026 13:50
RulerChen pushed a commit to RulerChen/airflow that referenced this pull request Jun 16, 2026
Development

Successfully merging this pull request may close these issues.

Remote task logs in S3 being added as task outputs in openlineage events