Query failing to return any results when filter is an equality check on strings (bad statistics in parquet) #1433

@maxburke

Description

With the attached file, running the query:

CREATE EXTERNAL TABLE t STORED AS PARQUET LOCATION 'test.parquet';
SELECT "adt" FROM t WHERE "direction" = 'Two Way';

returns 0 rows, but:

SELECT "adt" FROM t WHERE "direction" LIKE 'Two Way';

returns the expected 101 rows.

I noticed that if I disable the FilterPushDown optimization pass in DataFusion, the equality query also returns the expected result.
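To illustrate why incorrect Parquet statistics could make an equality filter return nothing while an unpruned scan finds the rows, here is a minimal sketch of min/max row-group pruning. It is not DataFusion's implementation: `RowGroup`, `may_contain`, and `scan_eq` are hypothetical names, and the statistics values "A" and "B" are invented for the example rather than read from the attached file.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RowGroup:
    # Hypothetical stand-in for a Parquet row group: the values it holds
    # plus the min/max statistics recorded for the column in the footer.
    values: List[str]
    stat_min: Optional[str]
    stat_max: Optional[str]


def may_contain(rg: RowGroup, literal: str) -> bool:
    """Return False only when the statistics rule out `literal`."""
    if rg.stat_min is None or rg.stat_max is None:
        return True  # no statistics, so the row group has to be scanned
    return rg.stat_min <= literal <= rg.stat_max


def scan_eq(groups: List[RowGroup], literal: str, use_pruning: bool = True) -> List[str]:
    """Evaluate `column = literal`, optionally skipping row groups via statistics."""
    rows: List[str] = []
    for rg in groups:
        if use_pruning and not may_contain(rg, literal):
            continue  # skipped without reading any values
        rows.extend(v for v in rg.values if v == literal)
    return rows


# Assumed scenario: the data holds 'Two Way', but the recorded statistics
# (invented here) claim every value lies between "A" and "B".
bad = RowGroup(values=["Two Way"] * 101, stat_min="A", stat_max="B")

pruned_count = len(scan_eq([bad], "Two Way"))
unpruned_count = len(scan_eq([bad], "Two Way", use_pruning=False))
```

In this sketch the pruned scan skips the row group because 'Two Way' falls outside the recorded range, while the scan without pruning reads the values and finds all of them. A predicate that is not turned into a min/max check would behave like the unpruned path, which would be consistent with the LIKE query succeeding.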

test.zip

Labels: bug (Something isn't working)
