melt(measure.vars=list) returns indices in variable column by tdhock · Pull Request #5247 · Rdatatable/data.table

tdhock · 2021-11-02T06:15:12Z

This was not a big problem, but it would be nice for consistency: whenever user specifies measure.vars=list, variable column should contains indices rather than column names. Increased consistency may help avoid issues such as #5201 in which a user thought that measure.vars=list(...) and measure.vars=c(...) should yield same result (docs say otherwise).

Previous behavior: (master/CRAN)

> melt(data.table(a=10, b=20), measure.vars=list("a"))
    b variable value
1: 20        a    10

New behavior:

> melt(data.table(a=10, b=20), measure.vars=list("a"))
    b variable value
1: 20        1    10

Main fix involves checking if measure.vars is list in fmelt C code.
To make some old tests keep passing, I had to change

patterns: when one argument/pattern input, used to output a list with one element (integer vector of match indices), now returns just the integer vector (not in a list). When used like melt(DT, measure.vars=patterns("^a") we therefore still get the same result as before this PR (variable is column name, not index).
.SDcols=patterns("^a") again patterns returns vector instead of list so that needed to be special cased in [.data.table -- just use that vector instead of trying to pass it through Reduce(intersect, ...) -- again same results as before this PR.

…length>1

codecov · 2021-11-02T06:22:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.53%. Comparing base (53df7e5) to head (548d94c).
Report is 3 commits behind head on master.

❗ Current head 548d94c differs from pull request most recent head d738a12. Consider uploading reports for the commit d738a12 to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5247      +/-   ##
==========================================
+ Coverage   97.51%   97.53%   +0.02%     
==========================================
  Files          80       80              
  Lines       14920    14920              
==========================================
+ Hits        14549    14552       +3     
+ Misses        371      368       -3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

mnazarov · 2023-03-01T14:17:35Z

Another inconsistency with length-1 list in measure.vars, is that names are discarded, i.e.

# CRAN version
> melt(data.table(a=10, b=20), measure.vars=list("n" = "a"))
    b variable value
1: 20        a    10

And this PR seems to fix that too (even if it seems that the names were returned when PR was created in 2021):

# with this PR
> melt(data.table(a=10, b=20), measure.vars=list("n" = "a"))
    b variable  n
1: 20        1 10

Maybe adding an extra test with names could be useful.

tdhock · 2024-02-15T23:00:58Z

hi @jangorecki @MichaelChirico I would like to include this fix in next release, and I believe that it is ready to merge into master, so could you please review and/or merge if you agree that it is ready? Thanks!

NEWS.md

R/fmelt.R

src/fmelt.c

MichaelChirico · 2024-03-20T19:35:18Z

inst/tests/tests.Rraw

 test(2182.4, melt(DTid, measure.vars=list(a=c(NA,"a2"), b=c("b1","b2")), id.vars="id"), exid)
 test(2182.5, melt(DT.wide, measure.vars=list(a=c(NA,1), b=2:3), na.rm=TRUE), data.table(variable=factor(2), a=2, b=2))
-test(2182.6, melt(DT.wide, measure.vars=list(b=c("b1","b2"))), data.table(a2=2, variable=factor(c("b1","b2")), b=c(1,2))) # measure.vars named list length=1, #5065
+test(2182.6, melt(DT.wide, measure.vars=list(b=c("b1","b2"))), data.table(a2=2, variable=factor(c("1","2")), b=c(1,2))) # measure.vars named list length=1, #5065


why change the test?

this is the new result using this PR

> melt(DT.wide, measure.vars=list(b=c("b1","b2"))) a2 variable b <num> <fctr> <num> 1: 2 1 1 2: 2 2 2 > melt(DT.wide, measure.vars=c("b1","b2")) a2 variable value <num> <fctr> <num> 1: 2 b1 1 2: 2 b2 2

src/fmelt.c

MichaelChirico · 2024-03-20T19:39:12Z

Minor feedback only. It does look like it could be a breaking change, though? We can merge and see what revdep testing tells us.

Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>

tdhock · 2024-04-05T03:38:28Z

yes it could be a breaking change. I can follow up and suggest fixes for any revdeps that show up.

src/fmelt.c

NEWS.md

MichaelChirico · 2024-04-08T05:12:34Z

yes it could be a breaking change. I can follow up and suggest fixes for any revdeps that show up.

Here are some places to look beyond CRAN packages:

https://github.com/search?q=lang%3AR+%2Fmeasure%5B.%5Dvars%5Cs*%3D%5Cs*list%5B%28%5D%5B%5E%2C%29%5D%2B%5B%29%5D%2F&type=code

MichaelChirico

Feel free to merge after addressing latest round of review. Thanks!

jangorecki · 2024-04-08T06:34:56Z

I feel we are much less strict about backward compatibility than we used to be. Checking revdeps is not sufficient IMO, it is not sufficient by a huge factor. First, most of pkgs on CRAN does not have good test coverage. Second, we don't know how many revdeps there are which are not o CRAN or github, I would assume at least 3-4 times more than CRAN. Third, there are thousands of plain scripts. Current process of looking at cran/bioc revdeps, and checking github for the usage is as much as we can do, but IMO not enough to jump in with breaking changes. Let's try to give at least 1 release cycle for transition. We could have extra section in NEWS file "breaking changes in next release" which described how to switch to new behavior.

MichaelChirico · 2024-04-08T15:47:54Z

While you make a very good point in general, in this case, we are aligning implementation with documented behavior. I am not sure we should be as deferential to users relying on bugs/undocumented behavior.

Revdeps/GH search can help us in this case -- if many users wound up relying on this, we should proceed more carefully.

Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>

tdhock · 2024-04-08T16:16:31Z

We could have extra section in NEWS file "breaking changes in next release" which described how to switch to new behavior.

great idea Jan, I added a sentence about that:

melt returns an integer column for variable when measure.vars is a list of length=1, consistent with the documented behavior, #5209. Thanks to @tdhock for reporting and fixing. Any users who were relying on this behavior can change measure.vars=list("col_name") (output variable was column name, now is column index/integer) to measure.vars="col_name" (variable still is column name).

tdhock added 3 commits November 1, 2021 21:45

expect factor(1) when measure=list

0273b12

melt checks if measure.vars is list

8fa8c8f

inconsistent variable between measure.vars with list of length=1 and …

9659f7a

…length>1

tdhock requested a review from mattdowle November 2, 2021 06:22

link related issue

eff2622

tdhock added consistency reshape dcast melt labels Feb 15, 2023

tdhock added this to the 1.16.0 milestone Jan 5, 2024

tdhock and others added 3 commits February 15, 2024 15:11

Merge branch 'master' into fix5209

1bc57de

move news item up

afdcab7

add test suggested by @mnazarov

33d7b2a

Merge branch 'master' into fix5209

aeadb67

tdhock requested review from MichaelChirico and jangorecki as code owners March 20, 2024 19:28

MichaelChirico reviewed Mar 20, 2024

View reviewed changes

NEWS.md Outdated Show resolved Hide resolved

MichaelChirico reviewed Mar 20, 2024

View reviewed changes

R/fmelt.R Outdated Show resolved Hide resolved

MichaelChirico reviewed Mar 20, 2024

View reviewed changes

src/fmelt.c Outdated Show resolved Hide resolved

MichaelChirico reviewed Mar 20, 2024

View reviewed changes

src/fmelt.c Outdated Show resolved Hide resolved

tdhock and others added 4 commits March 26, 2024 09:43

use Michael wording

e24663a

Update R/fmelt.R

dbfd853

Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>

Update src/fmelt.c

d3d00a9

Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>

Rboolean->bool

548d94c

MichaelChirico reviewed Apr 8, 2024

View reviewed changes

src/fmelt.c Outdated Show resolved Hide resolved

MichaelChirico reviewed Apr 8, 2024

View reviewed changes

NEWS.md Outdated Show resolved Hide resolved

Merge branch 'master' into fix5209

38c3e36

MichaelChirico approved these changes Apr 8, 2024

View reviewed changes

tdhock and others added 3 commits April 8, 2024 09:02

Update src/fmelt.c

e127f48

Co-authored-by: Michael Chirico <michaelchirico4@gmail.com>

mention length=1

2e1f5ef

comment about how to upgrade

d287ba8

Merge branch 'master' into fix5209

d738a12

tdhock merged commit 1329230 into master Apr 8, 2024

MichaelChirico mentioned this pull request Apr 8, 2024

fix docs/errors for melt, measure, patterns, eval_with_cols: cols arg should not be provided by user #5115

Closed

tdhock mentioned this pull request Aug 1, 2024

melt warns for measure.vars=list of length=1 #6333

Merged

MichaelChirico deleted the fix5209 branch July 8, 2025 17:44

Mukulyadav2004 mentioned this pull request Aug 23, 2025

Align melt with docs for list measure.vars #7257

Closed

Conversation

tdhock commented Nov 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mnazarov commented Mar 1, 2023

Uh oh!

tdhock commented Feb 15, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

MichaelChirico Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

tdhock Apr 5, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MichaelChirico commented Mar 20, 2024

Uh oh!

tdhock commented Apr 5, 2024

Uh oh!

Uh oh!

Uh oh!

MichaelChirico commented Apr 8, 2024

Uh oh!

MichaelChirico left a comment

Choose a reason for hiding this comment

Uh oh!

jangorecki commented Apr 8, 2024

Uh oh!

MichaelChirico commented Apr 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tdhock commented Apr 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tdhock commented Nov 2, 2021 •

edited

Loading

codecov bot commented Nov 2, 2021 •

edited

Loading

MichaelChirico commented Apr 8, 2024 •

edited

Loading