Skip to content

[Python] parquet.read_table nested fields in columns does not work for use_legacy_dataset=False #30143

@asfimport

Description

@asfimport

Reading nested field does not work with use_legacy_dataset=False.

This works:

 

import pyarrow.parquet as pq
t = pq.read_table(
 source=*filename*,
 columns=['store_key', 'properties.country'], 
 use_legacy_dataset=True,
).to_pandas()

This does not work (for the same parquet file):

 

import pyarrow.parquet as pq

t = pq.read_table(
 source=*filename*,
 columns=['store_key', 'properties.country'], 
 use_legacy_dataset=False,
).to_pandas()

 

Reporter: Tom Scheffers
Assignee: Miles Granger / @milesgranger

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-14596. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions