Report ESS 0 if khat too large #191

avehtari · 2021-12-02T16:36:39Z

Example output from https://discourse.mc-stan.org/t/understanding-loo-results-for-non-standard-contaminated-controls-model/25493

Pareto k diagnostic values:
                         Count Pct.    Min. n_eff
(-Inf, 0.5]   (good)     717   73.2%   151       
 (0.5, 0.7]   (ok)       133   13.6%   116       
   (0.7, 1]   (bad)       74    7.6%   38        
   (1, Inf)   (very bad)  56    5.7%   839       
See help('pareto-k-diagnostic') for details.

where the very bad category shows quite high effective sample size. PSIS paper demonstrates that the effective sample size estimate is unreliable when k>0.7. We are already reporting for MCSE NA if any khat>0.7, so I propose we report effective sample size 0 in these categories. This would reduce possible confusion.

The text was updated successfully, but these errors were encountered:

avehtari · 2022-01-14T17:08:41Z

Just report 0, or also store in loo object $diagnostics$ess value 0?

sethaxen · 2022-01-14T17:57:24Z

Why prefer ESS of 0 over NaN?

MansMeg · 2022-01-15T16:24:49Z

I agree with Seth, maybe it is more defensive to return NaN. Or is there a specific reason to return 0 here?

avehtari · 2022-01-17T19:07:28Z

Why NaN instead of NA? Currently if any k>0.7, we report MCSE as NA

I don't have a strong preference for 0, but I would assume 0 would produce fewer questions. But as we report NA for MCSE, I'm with NA, too

MansMeg · 2022-01-19T15:23:37Z

Oh. I have no strong feelings on NA vs NaN. If NA is better I think that is good, but I think you said that MCSE returned NaN? I just think NA/NaN is better than 0 since it will cast a more explicit error and hence be more defensive. People would not accidentally use the value.

Maybe we can throw a warning when returning NA to avoid questions?

avehtari · 2022-01-19T15:57:08Z

but I think you said that MCSE returned NaN?

Where? In the above comment, I write "Currently if any k>0.7, we report MCSE as NA"

sethaxen · 2022-01-19T15:59:13Z

NA seems better, since IIUC, it indicates the value is unknown or unavailable.

avehtari · 2022-01-19T15:59:58Z

I just think NA/NaN is better than 0 since it will cast a more explicit error and hence be more defensive. People would not accidentally use the value.

NA (or NaN) is not an error. I'm happy if they use ESS 0, when we know the efficiency is very close to 0.

Maybe we can throw a warning when returning NA to avoid questions?

I think we already have too many warnings thrown that are annoying after the first time. I'm fine adding an explanation in the output if there are k>0.7 as we have planned to do anyway.

MansMeg · 2022-01-19T16:59:09Z

Sorry. I see ” We are already reporting for MCSE NaN if any khat>0.7, so I propose we report…” in the comment above?

I meant that if people would use the n_eff they would explicitly need to set its value to 0 for k>0.7. Hence it would need an active decision to use it. The current recommendations is to not use the estimates of elpd_i in the first place if k>0.7?

I agree, a warning is not good. I think the current documentation is quite clear. Maybe add a sentance there that the NA can be replaced with zero under the paragraph of effective sample size? If we go with a defensive solution.

Although, this is no big thing for me so just decide what you think is best. Its just me that have a very defensive coding style.

ps. When reading I realized that a ref in the docs is incorrect. I guess Vehtari et al (2017b) should be Vehtari et al (2019).

avehtari · 2022-01-19T17:20:27Z

Sorry. I see ” We are already reporting for MCSE NaN if any khat>0.7, so I propose we report…” in the comment above?

Oh, in the first post! I fixed that. The actual printed output is NA.

I meant that if people would use the n_eff

btw. we're switching to use ESS instead of n_eff #192

they would explicitly need to set its value to 0 for k>0.7.

Based on the questions about the current NA, many novice users don't know the meaning of NA. For ESS, 0 might be more informative.

The current recommendations is to not use the estimates of elpd_i in the first place if k>0.7?

The current recommendation is not to use it, if someone's life depend on it (or something similar). In early phases of the workflow you may use it with caution. Of course in the output we need to opt for a short message, and then provide the more cautious explanation.

0: Pros: might be easier to interpret in comparison of other ESS values (ie 0 is smaller than any positive value). Pro or cons: if used to estimate MCSE produces Inf, while we're now reporting NA.
NA: Pros: would follow the current use of NA for MCSE, would mean more of "we don't know" than certain 0.

MansMeg · 2022-01-19T18:07:17Z

I think this is a good summary. I would add that using NA would probably nudge users to read the Pareteto-k-diagnostic docs with higher probability / be more defensive.

I still lean more toward NA, but I have no strong opinion here.

MansMeg · 2022-01-19T18:22:07Z

I now fixed the refs in PR #194 as well.

avehtari mentioned this issue Feb 17, 2024

show NA for n_eff/ESS if k > k_threshold #248

Merged

jgabry closed this as completed in #248 Feb 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report ESS 0 if khat too large #191

Report ESS 0 if khat too large #191

avehtari commented Dec 2, 2021 •

edited

Loading

avehtari commented Jan 14, 2022

sethaxen commented Jan 14, 2022

MansMeg commented Jan 15, 2022

avehtari commented Jan 17, 2022

MansMeg commented Jan 19, 2022

avehtari commented Jan 19, 2022

sethaxen commented Jan 19, 2022

avehtari commented Jan 19, 2022

MansMeg commented Jan 19, 2022 •

edited

Loading

avehtari commented Jan 19, 2022

MansMeg commented Jan 19, 2022

MansMeg commented Jan 19, 2022

Report ESS 0 if khat too large #191

Report ESS 0 if khat too large #191

Comments

avehtari commented Dec 2, 2021 • edited Loading

avehtari commented Jan 14, 2022

sethaxen commented Jan 14, 2022

MansMeg commented Jan 15, 2022

avehtari commented Jan 17, 2022

MansMeg commented Jan 19, 2022

avehtari commented Jan 19, 2022

sethaxen commented Jan 19, 2022

avehtari commented Jan 19, 2022

MansMeg commented Jan 19, 2022 • edited Loading

avehtari commented Jan 19, 2022

MansMeg commented Jan 19, 2022

MansMeg commented Jan 19, 2022

avehtari commented Dec 2, 2021 •

edited

Loading

MansMeg commented Jan 19, 2022 •

edited

Loading