Deprecate Parquet Support for Bulk Data Export #3139

@prb112

Is your feature request related to a problem? Please describe.
The BulkData feature includes an export to application/parquet.
This pulls in extra dependencies on Spark (and spark-sql), which are used to cache Parquet data and flush it to S3/COS through stocator-fs.

This feature is fairly brittle. Our recommendation is to export to ndjson and then transform the ndjson to Parquet as a separate step.
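
For illustration, here is a minimal sketch of that two-step path, assuming Python with the pyarrow package for the Parquet write. The function names, the sample records, and the projected fields are hypothetical and not part of this project; real FHIR resources are deeply nested, so a production transform would need a proper schema or flattening strategy.

```python
import io
import json
from typing import Any, Dict, Iterable, Iterator, List


def read_ndjson(lines: Iterable[str]) -> Iterator[Dict[str, Any]]:
    """Yield one parsed JSON object per non-blank line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def to_columns(records: Iterable[Dict[str, Any]], fields: List[str]) -> Dict[str, List[Any]]:
    """Project each record onto the chosen top-level fields (missing becomes None)."""
    columns: Dict[str, List[Any]] = {name: [] for name in fields}
    for record in records:
        for name in fields:
            columns[name].append(record.get(name))
    return columns


def write_parquet(columns: Dict[str, List[Any]], path: str) -> None:
    """Write the columns to a Parquet file; requires the pyarrow package."""
    import pyarrow as pa
    import pyarrow.parquet as pq

    pq.write_table(pa.table(columns), path)


# Hypothetical sample standing in for an exported ndjson file.
sample = io.StringIO(
    '{"resourceType": "Patient", "id": "p1", "gender": "female"}\n'
    '{"resourceType": "Patient", "id": "p2"}\n'
)
columns = to_columns(read_ndjson(sample), ["id", "resourceType", "gender"])
# write_parquet(columns, "Patient.parquet")  # run this where pyarrow is installed
```

Doing the conversion outside the server keeps Spark and the storage connector out of the export path, which is the dependency reduction this issue is after.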

Describe the solution you'd like
  • Remove the Parquet feature.
