fix: gc string view arrays in RepartitionExec#20500
Samyak2 wants to merge 6 commits into apache:main
Conversation
(force-pushed from 6a1547d to 471de13)
```rust
#[tokio::test]
async fn hash_repartition_string_view_compaction() -> Result<()> {
```

---

This test does not actually exercise the regression it is meant to cover.
It only checks that repartition returns all rows. That would also pass before the gc() change.
As a result, we still do not have a test that would catch the over-counting bug if this logic regresses.
Please add an assertion that observes the compaction or accounting behavior directly. For example:
- Check that the total `get_array_memory_size()` across the repartitioned outputs stays close to the original batch, instead of scaling with the number of output partitions.
- Test spill behavior under a tight memory limit (e.g., spilled bytes).
- Verify `StringViewArray` buffer ownership after repartition, so outputs no longer all retain the original shared payload buffer.

---

Yeah, I did try to add a test that checks for memory size specifically, but it seemed a bit fragile to assert on those numbers. Let me try the other approaches, thanks!

---

I've updated the test, please take a look!
Without the fix, the mem usage blows up to 4x the original size. With the fix, it's actually less than the original size. To leave some margin for error, I have used a threshold of 2x for the mem usage assertion.
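For readers following along, the kind of bound described above can be sketched as a small helper. This is illustrative only: the function name, the byte counts, and the use of a plain factor are assumptions, not the actual test code in the PR.

```rust
// Hypothetical sketch of the assertion shape (names and numbers assumed):
// total memory across the repartitioned outputs should stay within a small
// constant factor of the input batch, instead of scaling ~4x without gc.
fn assert_memory_bounded(input_bytes: usize, total_output_bytes: usize, factor: usize) {
    assert!(
        total_output_bytes <= factor * input_bytes,
        "outputs use {total_output_bytes} bytes, more than {factor}x the {input_bytes}-byte input"
    );
}

fn main() {
    // With the fix, outputs come in below the original batch size, so a 2x
    // threshold leaves comfortable margin without being fragile.
    assert_memory_bounded(1024, 900, 2);
    println!("memory bound holds");
}
```

Asserting a loose ratio like this, rather than exact byte counts, is what keeps the test from being fragile across Arrow versions.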
```rust
if let Some(sv) =
    col.as_any().downcast_ref::<StringViewArray>()
{
    Arc::new(sv.gc())
```

---

The new `StringViewArray::gc()` pass in repartition is very similar to the existing `organize_stringview_arrays` logic in `datafusion/physical-plan/src/sorts/sort.rs`.
A small shared helper would keep the workaround in one place and reduce the chance that sort/repartition diverge when Arrow view handling changes again.

---

Agreed. I'll change this

---

I have made this change.
- I wasn't sure where to put this util. I have made it `pub(crate)` and kept it in `physical-plan/src/common.rs`.
- In the case of repartition, we would be doing the `RecordBatch` constructor validation twice when string view arrays are present (when there's no string view array, we return the same batch back). But this should be okay since the actual gc time will dominate over the time for schema validation, etc.
(will fix the other comment tomorrow)
(force-pushed from 4385fb4 to 0733758)
Fixes apache#20491

- Took the fix from `ExternalSorter` introduced in apache#14823
- If any `StringViewArray` columns are present in the repartitioned input, we gc them to reduce duplicate tracking of the same string view buffer.
- Fixes over-counting when there's a `RepartitionExec` above a partial agg on a `StringViewArray` column.
This is not needed for round-robin repartition.
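To illustrate why hash repartitioning inflates tracked memory for view arrays, here is a minimal std-only model. This is not the arrow-rs API: `ViewArray`, `tracked_bytes`, and this `gc` are simplified stand-ins. The point it demonstrates is that every partition's slice retains the whole shared payload buffer, so a memory pool that charges each consumer counts the buffer once per partition, while a gc-style copy leaves each partition owning only its referenced bytes.

```rust
use std::rc::Rc;

// Toy model of a string-view column: views are (offset, len) pairs into a
// shared payload buffer, so any slice keeps the entire buffer alive.
struct ViewArray {
    buffer: Rc<Vec<u8>>,        // shared payload
    views: Vec<(usize, usize)>, // (offset, len) into `buffer`
}

impl ViewArray {
    // Bytes a memory pool would charge this array for: the full shared buffer.
    fn tracked_bytes(&self) -> usize {
        self.buffer.len()
    }

    // Copy only the bytes this array's views reference into a fresh buffer,
    // analogous to what StringViewArray::gc() does.
    fn gc(&self) -> ViewArray {
        let mut buffer = Vec::new();
        let views = self
            .views
            .iter()
            .map(|&(off, len)| {
                let new_off = buffer.len();
                buffer.extend_from_slice(&self.buffer[off..off + len]);
                (new_off, len)
            })
            .collect();
        ViewArray { buffer: Rc::new(buffer), views }
    }
}

fn main() {
    // 32-byte payload, hash-partitioned two ways: each output still retains
    // the whole payload, so accounting doubles it.
    let payload = Rc::new(vec![b'a'; 32]);
    let parts = vec![
        ViewArray { buffer: Rc::clone(&payload), views: vec![(0, 16)] },
        ViewArray { buffer: Rc::clone(&payload), views: vec![(16, 16)] },
    ];
    let before: usize = parts.iter().map(|p| p.tracked_bytes()).sum();
    let after: usize = parts.iter().map(|p| p.gc().tracked_bytes()).sum();
    assert_eq!(before, 64); // over-counted: 2 partitions x 32-byte buffer
    assert_eq!(after, 32);  // after gc, each partition owns only its 16 bytes
    println!("before={before} after={after}");
}
```

Round-robin repartition forwards whole batches rather than row subsets, which is why the gc pass is unnecessary there.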
(force-pushed from 3fb8e02 to 979decc)
CI is green now after rebasing on main.
run benchmarks

🤖 Benchmark running (GKE)
🤖 Benchmark completed (GKE): tpch — base (merge-base) vs branch [results table omitted]
🤖 Benchmark completed (GKE): tpcds — base (merge-base) vs branch [results table omitted]
🤖 Benchmark completed (GKE): clickbench_partitioned — base (merge-base) vs branch [results table omitted]
Do these benchmarks have string view arrays enabled? If not, I don't see why the numbers are getting affected (although it's a small delta).
For the above benchmark runs, the Parquet-backed benchmark data is expected to use view types by default. That means both of the benchmark outputs under discussion should be assumed to have string view arrays enabled for Parquet-backed string columns, unless view types were explicitly disabled.
Yes, they do all use string views by default.
run benchmarks

🤖 Benchmark running (GKE)
```rust
// pool to count the same data buffers multiple times, once for each
// consumer of the repartition.
// So we gc the output arrays, which creates new data buffers.
let batch = gc_stringview_arrays(batch)?;
```

---

I think it would be best to use the coalesce kernels (which do GC-ing already I think) before sending them to the upstream partitions.
I got some mixed performance results from that before, but I think the upcoming morsel / workstealing changes might be able to improve this (as it won't benefit from pushing the copying work over to a new (possibly idling) thread)
🤖 Benchmark completed (GKE): tpch — base (merge-base) vs branch [results table omitted]
🤖 Benchmark completed (GKE): tpcds — base (merge-base) vs branch [results table omitted]
🤖 Benchmark completed (GKE): clickbench_partitioned — base (merge-base) vs branch [results table omitted]
🤔 Some of the benchmarks look like they got slower. I'll rerun to be sure.

run benchmark clickbench_partitioned
```rust
for array in batch.columns() {
    if let Some(string_view_array) = array.as_any().downcast_ref::<StringViewArray>()
    {
        let new_array = string_view_array.gc();
```

---

Calling gc here basically forces all the data to be copied, which is a lot of work. I am not sure it is a good idea to do so unconditionally.
We had to do some pretty sophisticated heuristics of when to do a GC as part of the arrow BatchCoalescer
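One possible shape for such a heuristic, sketched with assumed names and an assumed 50% threshold (this is not the actual `BatchCoalescer` logic): only pay the copy cost when the views keep a small enough fraction of the shared buffers alive that gc reclaims more memory than the copy costs.

```rust
// Assumed heuristic for illustration: gc pays off only when most of the
// shared buffer is dead weight relative to what the views still reference.
fn should_gc(referenced_bytes: usize, total_buffer_bytes: usize) -> bool {
    // Copying everything to reclaim little is wasted work; here we only gc
    // when less than half of the shared buffer bytes are still referenced.
    referenced_bytes.saturating_mul(2) < total_buffer_bytes
}

fn main() {
    assert!(should_gc(10, 100));  // views keep 10% alive: gc frees a lot
    assert!(!should_gc(90, 100)); // mostly live: gc would mostly just copy
    println!("heuristic behaves as expected");
}
```

The threshold itself is the tunable part; an unconditional gc is equivalent to always returning `true`, which is what the concern above is about.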

---

Interesting. I see that here. Repartition does use the batch coalescer, but on the receiver side, not the sender side. Memory tracking happens on the sender, before coalesce.
@Dandandan mentioned above that coalescing before sending had mixed results. I can try re-doing that on latest main to see if it still has the same effect. If so, I'll try branching out from the morsel PR and try it out.

---

Yes, I think in theory it should be slightly better to do the coalescing directly when batches are potentially already in cache, and it saves `partitions - 1` wake-ups (which are not used).
But it seems (my theory) pushing it upstream might create some parallelism / mitigating some skew as also for a slow partition it can do the coalescing in parallel.

---

@Dandandan I see that you had tried moving the coalesce upstream in repartition here: #21550
But it looks like you closed it. I see an improvement in clickbench from the benchmarks there. Why was it closed?
I noticed that the morsel PRs are now merged, so I was planning to try out moving the coalesce to the producer.

---

Yeah, I tried it again, but I did still see a small regression compared to the morsel PR coalesce upstream.
Perhaps it can still extract some more parallelism in certain cases (e.g. it can still do coalescing in another task/thread and start triggering IO request again in the current task?). Or when we only have a few files left we still can use some more threads to do the coalesce in another thread.
I couldn't solve it yet but feel free to try again!

---

Alright, let me try this.
But if your guess is correct, then we're blocking the next IO call from happening due to CPU compute above the data source. So keeping the coalesce downstream is just removing some compute from that path. The core issue seems to be that we're not doing pre-fetching of IO while the compute is running?
I'm not sure, but I hope to get more insights when I try this out.
🤖 Benchmark running (GKE): comparing fix-repartition-string-view-counting (979decc) to cdaecf0 (merge-base) using clickbench_partitioned
🤖 Benchmark completed (GKE): clickbench_partitioned — base (merge-base) vs branch [results table omitted]
alamb left a comment:

I think we need to resolve the performance regression before merging this PR.
Which issue does this PR close?

- Closes `Utf8View`/`StringViewArray` #20491.

Rationale for this change

- Fixes over-counting when there's a `RepartitionExec` above a partial agg on a `StringViewArray` column.
- Took the fix from `ExternalSorter` introduced in "Fix: External sort failing on `StringView` due to shared buffers" #14823.

What changes are included in this PR?

- If any `StringViewArray` columns are present in the repartitioned input, we gc them to reduce duplicate tracking of the same string view buffer.

Are these changes tested?

- `Utf8View`/`StringViewArray` #20491

Are there any user-facing changes?

No