Fix mode(Binomial) and modes(Binomial) #1931

marcusps · 2024-12-17T05:11:10Z

Fix #1927

Fix JuliaStats#1927

marcusps · 2024-12-17T05:17:27Z

The impact on runtime is minimal

julia> d = Binomial(5, 2//3)
Binomial{Rational{Int64}}(n=5, p=2//3)

julia> @benchmark mode(d) # current master
BenchmarkTools.Trial: 10000 samples with 994 evaluations.
 Range (min … max):  23.342 ns … 76.490 ns  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     23.791 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   25.491 ns ±  3.586 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

  █▇▅▅▅▆▃▂▁              ▄▃ ▃▂▁▂▂  ▁                          ▂
  █████████▇▇▇█▄██▇▇█▆█▆▆██▆█████▇██▆▆▆▆▆▆▄▅▇█▄▆▆▅▆▄▅▆▅▆▄▄▄▆▇ █
  23.3 ns      Histogram: log(frequency) by time      39.2 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> @benchmark mode(d) # this PR
BenchmarkTools.Trial: 10000 samples with 995 evaluations.
 Range (min … max):  22.868 ns … 90.999 ns  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     23.201 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   25.739 ns ±  4.551 ns  ┊ GC (mean ± σ):  0.00% ± 0.00%

  █▄▃   ▂▂    ▁▂▂▂▂▅▄▃▃                                       ▁
  ████▇███▇▇▇▇█████████▇█▇▇█▅▇▇▄██▅▄▇▇▄▆█▆▄▇▇▅▅▄▅▅▅▄▅▅▄▁▅▅▅▄▅ █
  22.9 ns      Histogram: log(frequency) by time      44.2 ns <

 Memory estimate: 0 bytes, allocs estimate: 0.

similarly for modes

julia> @benchmark modes(d) # current master
BenchmarkTools.Trial: 10000 samples with 984 evaluations.
 Range (min … max):  43.023 ns …   6.794 μs  ┊ GC (min … max): 0.00% … 98.72%
 Time  (median):     47.259 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   54.389 ns ± 143.785 ns  ┊ GC (mean ± σ):  8.71% ±  3.40%

     ▅▂▂▁▆█▅▃                                                   
  ▁▂▆█████████▅▄▂▂▂▂▂▂▂▂▃▃▃▄▅▄▄▄▃▃▂▂▃▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁ ▃
  43 ns           Histogram: frequency by time         69.7 ns <

 Memory estimate: 64 bytes, allocs estimate: 1.

julia> @benchmark modes(d) # this PR
BenchmarkTools.Trial: 10000 samples with 980 evaluations.
 Range (min … max):  47.515 ns …   3.625 μs  ┊ GC (min … max): 0.00% … 97.66%
 Time  (median):     56.248 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   63.108 ns ± 127.480 ns  ┊ GC (mean ± σ):  7.78% ±  3.79%

     ▁▄█▆▆▄        ▁ ▁▁                                         
  ▁▂▅███████▆▆▆▆▇▇█████▇▅▄▃▂▂▂▂▂▂▂▁▂▂▁▁▁▁▁▁▁▁▁▁▁▁▂▂▂▂▂▂▂▂▂▁▁▁▁ ▃
  47.5 ns         Histogram: frequency by time         89.3 ns <

 Memory estimate: 80 bytes, allocs estimate: 1.

devmotion · 2024-12-17T08:31:55Z

src/univariate/discrete/binomial.jl

+    v = (n + 1) * p
+    quasi_mode = floor(Int, v)
+    if quasi_mode == v
+        if p == 1
+            n
+        else
+            quasi_mode-1
+        end
+    else
+        quasi_mode
+    end


AFAICT alternatives with fewer branches would be

Suggested change

v = (n + 1) * p

quasi_mode = floor(Int, v)

if quasi_mode == v

if p == 1

n

else

quasi_mode-1

end

else

quasi_mode

end

iszero(p) ? 0 : ceil(Int, (n + 1) * p) - 1

or

Suggested change

v = (n + 1) * p

quasi_mode = floor(Int, v)

if quasi_mode == v

if p == 1

n

else

quasi_mode-1

end

else

quasi_mode

end

max(0, ceil(Int, (n + 1) * p) - 1)

or

Suggested change

v = (n + 1) * p

quasi_mode = floor(Int, v)

if quasi_mode == v

if p == 1

n

else

quasi_mode-1

end

else

quasi_mode

end

clamp(ceil(Int, (n + 1) * p) - 1, 0, n)

There is little to no difference in runtime between these options (or the option in the PR). The main difference is in the max runtime, and the PR options and the max(0, ceil(...)) have the shorter max runtime (the two are nearly identical with a max of about 10x the average runtime), while the max runtime for the other two options is about twice as long (about 20x the average runtime). The standard deviations show a similar pattern (but they are smaller than the average runtime).

Given there isn't much of a different in run times, I propose the PR version is preferable because it is easiest to read and reason about.

@devmotion Is there another reason to consider fewer branches other than runtime?

test/univariate/discrete/binomial.jl

Co-authored-by: David Widmann <[email protected]>

codecov-commenter · 2024-12-19T06:11:15Z

Codecov Report

Attention: Patch coverage is 88.23529% with 2 lines in your changes missing coverage. Please review.

Project coverage is 86.02%. Comparing base (ceb6343) to head (6c0abd5).

Files with missing lines	Patch %	Lines
src/univariate/discrete/binomial.jl	88.23%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1931      +/-   ##
==========================================
+ Coverage   86.01%   86.02%   +0.01%     
==========================================
  Files         144      144              
  Lines        8696     8710      +14     
==========================================
+ Hits         7480     7493      +13     
- Misses       1216     1217       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

marcusps added 2 commits December 17, 2024 04:34

Returns smallest mode in mode(Binomial), fix modes

194adf9

Fix JuliaStats#1927

Update tests for mode(Binomial) and modes(Binomial)

ea39724

devmotion reviewed Dec 17, 2024

View reviewed changes

test/univariate/discrete/binomial.jl Outdated Show resolved Hide resolved

marcusps and others added 4 commits December 19, 2024 03:24

Fix incorrect test

87df67f

Co-authored-by: David Widmann <[email protected]>

Add a few more tests

1116bf1

Fix bug with mode calculation condition

0fac6c0

Merge branch 'master' into marcusps/fix-mode

6c0abd5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mode(Binomial) and modes(Binomial) #1931

Fix mode(Binomial) and modes(Binomial) #1931

marcusps commented Dec 17, 2024

marcusps commented Dec 17, 2024

devmotion Dec 17, 2024

marcusps Dec 19, 2024

marcusps Dec 28, 2024

codecov-commenter commented Dec 19, 2024 •

edited

Loading

Fix mode(Binomial) and modes(Binomial) #1931

Are you sure you want to change the base?

Fix mode(Binomial) and modes(Binomial) #1931

Conversation

marcusps commented Dec 17, 2024

marcusps commented Dec 17, 2024

devmotion Dec 17, 2024

Choose a reason for hiding this comment

marcusps Dec 19, 2024

Choose a reason for hiding this comment

marcusps Dec 28, 2024

Choose a reason for hiding this comment

codecov-commenter commented Dec 19, 2024 • edited Loading

Codecov Report

codecov-commenter commented Dec 19, 2024 •

edited

Loading