Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same#1989

Closed

bbbales2 wants to merge 57 commits intodevelopfrom

bugfix/issue-1861-scalars-vs-vectors

Member

bbbales2 commented Jul 28, 2020 •

edited

Loading

Summary

This addresses the missing tests from #1861

Edit: Breaking up this pull into pieces. I'm gonna leave this here until it's done.

So far #2039, #2041, and #2042

Tests

Side Effects

Release notes

Added extra tests to check lpdfs/lpmfs evaluated with vectors produce the same results as with scalars

Checklist

Math issue Probability test framework didn't catch bug with vectorization #1861
Copyright holder: Columbia University

The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
the basic tests are passing
- unit tests pass (to run, use: ./runTests.py test/unit)
- header checks pass, (make test-headers)
- dependencies checks pass, (make test-math-dependencies)
- docs build, (make doxygen)
- code passes the built in C++ standards checks (make cpplint)
the code is written in idiomatic C++ and changes are documented in the doxygen
the new changes are tested

bbbales2 and others added 3 commits

July 27, 2020 21:11


          Added test to check that vectorized lpdfs/lpmfs the same as evaluatin…

d6c5225

…g them in a non-vectorized way (Issue #1861)


          Re-indenting code (Issue #1861)

bd2ebc1


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

ea07b9e

…4.1 (tags/RELEASE_600/final)

bbbales2 changed the title ~~Bugfix/issue 1861 scalars vs vectors~~ Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same

bbbales2 and others added 4 commits

July 28, 2020 14:08


          Changed how memory is handled to avoid segfaults. Fixed vector handli…

0aa88c1

…ng in as_scalars_vs_as_vectors to be like repeat_as_vectors (Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

fdc7757

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Merge commit 'fdf5db851ea6f2c5dc59fcb9e9aa45b24b202afe' into HEAD

5db698a


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

35b74c4

…4.1 (tags/RELEASE_600/final)

Member Author

bbbales2 commented Jul 28, 2020

I also intend to fix: #1978 with this pull request. And similarly there should be tests for the lccdfs and the cdfs. These should basically be copy-paste to add once the lpdf/lpmf code is there.

bbbales2 and others added 21 commits

July 28, 2020 15:16


          Finished merge (Issue #1861)

6ebcaad


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

0ef320e

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

778280f

…4.1 (tags/RELEASE_600/final)


          Fixed handling of Eigen matrices of fvar<T> types in test framework (…

95b4288

…Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

a631664

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

f2d7504

…4.1 (tags/RELEASE_600/final)


          Fixed Frechet distribution for higher order autodiff (Issue #1861)

a080c63


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

84310eb

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Added Frechet test in mix (Issue #1861)

3f2c5ea


          Merge commit 'd34f10a67df9affb3e12af4b7f2a7fd4d6f757d3' into HEAD

69045e7


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

acb7a3f

…4.1 (tags/RELEASE_600/final)


          Switched test framework to use equality checks from unit tests which …

9304cf9

…work with things near zero better (Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

79231ca

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

e68772a

…4.1 (tags/RELEASE_600/final)


          Fixed Gumbel test distribution implementation (Issue #1861)

70de131


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

e84ff0a

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Replaced the comparisons in probability distribution comparisons with…

8e4886c

… expect_near_rel from test/unit (Issue #1861)


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

791d99c

…4.1 (tags/RELEASE_600/final)


          Adjusted tolerances for finite difference comparison (Issue #1861)

bd880ee


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

e0277ca

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

f934306

…4.1 (tags/RELEASE_600/final)

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

    
                  }

                  plus[n] += e;

                  minus[n] -= e;

                  auto f_wrap = [&](const Eigen::VectorXd& e) {

Member Author

bbbales2 Aug 10, 2020

The stan math finite difference function is higher order and easier to use, so I defer to it.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

                 // works for <var>
                 double calculate_gradients_1storder(vector<double>& grad, var& ccdf_log,
                                                     vector<var>& x) {
+                  stan::math::set_zero_all_adjoints();

Member Author

bbbales2 Aug 10, 2020

These gradient functions get called a lot in a sequence. If we do stan::math::recover_memory we clear the autodiff stack and then the tests aren't meaningful. I switched the recover_memory s to set_zero_all_adjoint s and put recover_memory calls in the tests that use the calculate_gradients_* functions.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

                 double calculate_gradients_1storder(vector<double>& grad,
                                                     fvar<double>& ccdf_log, vector<var>& x) {
-                  x.push_back(ccdf_log.d_);
+                  grad.push_back(ccdf_log.d_);

Member Author

bbbales2 Aug 10, 2020

Pushing stuff into x doesn't do anything. I think this was a bug. grad is the thing that gets checked.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

+                           << "  grads:        " << gradients;
+                    stan::test::expect_near_rel(stream.str(), finite_dif[i], gradients[i],
+                                                stan::test::relative_tolerance(1e-4, 1e-7));

Member Author

bbbales2 Aug 10, 2020

1e-4 relative error (this is what the unit tests use for gradients, see here: https://github.com/stan-dev/math/blob/develop/test/unit/math/ad_tolerances.hpp)

I think I used 1e-7 for the minimum error tolerance cause 1e-8 didn't work for some function. I'm fuzzy on this.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    test_gradients_equal(expected_gradients1, gradients1);
-                    test_gradients_equal(expected_gradients2, gradients2);
-                    test_gradients_equal(expected_gradients3, gradients3);
+                    test_gradients_equal(expected_gradients1, gradients1, 1e-3);

Member Author

bbbales2 Aug 10, 2020

The reference implementation gradients are quite bad. I had to use a relative tolerance of 1e-3 to get them to pass (finite difference worked with 1e-4).

It's stuff like gamma_p and gamma_q I think (#2006).

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    add_vars(s2, p0_, p1_, p2_, p3_, p4_, p5_);
-                    add_vars(s3, p0_, p1_, p2_, p3_, p4_, p5_);
+                    vector<var> scalar_vars;
+                    add_vars(scalar_vars, p0_, p1_, p2_, p3_, p4_, p5_);

Member Author

bbbales2 Aug 10, 2020

s1, s2, and s3 seemed like duplicates so I simplified things.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    calculate_gradients_1storder(multiple_gradients3, multiple_ccdf_log, x1);
+                    calculate_gradients_1storder(multiple_gradients1, multiple_ccdf_log,
+                                                 vector_vars);
+                    calculate_gradients_2ndorder(multiple_gradients2, multiple_ccdf_log,

Member Author

bbbales2 Aug 10, 2020

Previously we were only computing 1st order gradients.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

+                  }
+                }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This test should catch errors like #1978 and #1861 for lccdfs.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

                   }
                 }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This test should catch errors like #1978 and #1861 for cdfs.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

               #include <stan/math/rev.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

All the changes in this file are similar to the equivalent ones in the lccdf file.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf_log.hpp

               #include <stan/math/rev.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

This should fix #1978. The changes in this file are similar to the ones in the lccdf and cdf checks.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

-                    T_return_type cdf
-                        = TestClass.template cdf<Scalar0, Scalar1, Scalar2, Scalar3, Scalar4,
-                                                 Scalar5>(p0_, p1_, p2_, p3_, p4_, p5_);
+                    T_return_type single_cdf = pow(

Member Author

bbbales2 Aug 10, 2020

You'll notice a pow here. In the old version we were comparing gradients of something like grad(x) with the gradients of something like grad(x^r). To do this there was extra compare logic in test_multiple_gradient_values.

As I recall this was confusing for higher order things, so I just added a pow here so we're comparing grad(x^r) directly against grad(x^r) computed another way and there's no confusion.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_distr.hpp

               #include <stan/math/mix.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

This should basically be the same as the lccdf, cdf, and lcdf checks.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_distr.hpp

                   }
                 }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This should fix #1861.

bbbales2 commented

View reviewed changes

test/prob/utility.hpp


		// ------------------------------------------------------------

		template <typename T>

Member Author

bbbales2 Aug 10, 2020

I moved these definitions up here so they are visible to get_params.

bbbales2 commented

View reviewed changes

test/prob/utility.hpp

    
                for (size_t n = 0; n < parameters.size(); n++)

                  if (p < parameters[0].size())

                    param(n) = parameters[n][p];

                    param(n) = get_param<stan::scalar_type_t<T>>(parameters[n], p);

Member Author

bbbales2 Aug 10, 2020

For higher order autodiff types, we need to initialize each of the params with something more than just casting up a double.

For vars, we assign the derivative term to 1.0, for instance.

If we don't do this code that depends on get_params and get_param getting the same params fails.

bbbales2 commented

View reviewed changes

test/prob/von_mises/von_mises_test.hpp

                   param[0] = boost::math::constants::third_pi<double>();
                   param[1] = boost::math::constants::sixth_pi<double>();
-                  param[2] = 1e-8;
+                  param[2] = 1e-2;

Member Author

bbbales2 Aug 10, 2020

Larger test value to avoid finite difference out of range errors

bbbales2 commented

View reviewed changes

test/unit/math/mix/prob/frechet_test.cpp

+                  return stan::math::frechet_lpdf<false>(y, alpha, beta);
+                };
+                stan::test::expect_ad(f, 2.0, 1.0, 1.0);

Member Author

bbbales2 Aug 10, 2020

I added this test since the higher order Frechet stuff was failing. I think I could remove it but it seems fine to me.

bbbales2 commented

View reviewed changes

test/unit/math/prim/functor/ode_rk45_prim_test.cpp

               #include <stan/math/prim.hpp>
               #include <gtest/gtest.h>
               #include <test/unit/util.hpp>
+              #include <test/unit/math/prim/functor/ode_test_functors.hpp>

Member Author

bbbales2 Aug 10, 2020

All the changes in the ODE files were an alternate fix to a problem that popped up here: #1993 (comment)

They don't change behavior. They just rearrange test code a bit so that the jumbo tests (#1965) build correctly.

Member Author

bbbales2 commented Aug 10, 2020

This is ready to review. There's a ton of different sorts of changes in here. I went through I tried to explain each of them, cause some of them probably look pretty weird. When I got into the testing framework and started pulling threads I ended up working on a lot more things than I intended to.

Member Author

bbbales2 commented Aug 17, 2020

@syclik you think you'd have a chance to review this in like the next week or so? If not I'll grab someone else.

Member Author

bbbales2 commented Aug 20, 2020

@t4c1 yo can you review this? There were a few of problems with the testing framework. I wanna get these in before we make all the expression-compatibility changes.

Contributor

t4c1 commented Aug 21, 2020

This is a huge PR. Could you split it into 2 or 3 smaller ones?

There is a lot of math stuff in here. I am not sure I feel comfortable reviewing that.

This was referenced Aug 24, 2020

Generalize cauchy #1944

Merged

Reduced some duplicate code in ODE tests #2039

Merged

bbbales2 marked this pull request as draft

August 26, 2020 20:41

bbbales2 mentioned this pull request

Fix problems with higher order gradients in probability test framework #2042

Merged

5 tasks

bbbales2 mentioned this pull request

Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same #2085

Merged

5 tasks

Member Author

bbbales2 commented Oct 12, 2020

Closed by #2085

bbbales2 closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet