simulator: add counterexample minimization #623

alpaylan · 2025-01-06T15:28:21Z

This PR introduces counterexample minimization(shrinking) in the simulator. It will require changes to the current structure in various places, so I've opened it as a draft PR for now, in order to not overwhelm the reviewers all at once.

Turn interactions plans into sequences of properties instead of sequences of interactions, adding a semantic layer.
Add assumptions to the properties, rendering a property invalid if its assumptions are not valid.
Record the failure point in a failing assertion, as shrinking by definition works when the same assertion fails for a smaller input.
Add shrinking at three levels, top level(removing whole properties), property level(removing interactions within properties), interaction level(shrinking values in interactions to smaller ones).
Add marauders as a dev dependency, inject custom mutations to a testing branch for evaluating the simulator performance.
Integrate the simulator evaluation with the CI.

…perations that did not reflect the internal structure, in which they were actually concatenations of properties, which are a coherent set of interactions that are meaningful by themselves. this commit introduces this semantic layer into the data model by turning interaction plans into a sequence of properties, which are a sequence of interactions

the execution of the current property and switches to the next one.

three indexes(connection, interaction pointer, and secondary pointer) that can uniquely identify the executed interaction at any point. we will use the history for shrinking purposes.

pereman2 · 2025-01-07T11:29:07Z

This looks quite cool @alpaylan. Could you post some literature you think is relevant for this?

alpaylan · 2025-01-07T12:56:59Z

This looks quite cool @alpaylan. Could you post some literature you think is relevant for this?

Thanks @pereman2 ! Of course, here are some rather informal writing discussing different shrinking strategies(warning: they're quite opinionated with respect to author's position, which I don't agree)

Two other informal articles,

ECOOP20 paper from the hypothesis author,

https://drmaciver.github.io/papers/reduction-via-generation-preview.pdf

I view shrinking very related to delta debugging, which I think is a lot more used in the literature. I have a submission that discusses shrinking too, I'm not sure if sharing it here breaks double blind, but I can send it over email if you would like.

jussisaurio · 2025-01-07T15:03:30Z

I thought this page had a nice layman definition for shrinking:

In property-based testing, the initially found failing case may contain a lot of complexity that does actually cause the test to fail. Shrinking is the mechanism through which a property-based testing framework can simplify failing cases in order to find out the minimal reproducible case is.

We could even add it to our code

jussisaurio · 2025-01-07T15:08:04Z

simulator/generation/plan.rs

@@ -251,31 +283,67 @@ impl Interaction {
            Self::Assertion(_) => {
                unreachable!("unexpected: this function should only be called on queries")
            }
-            Interaction::Fault(_) => {
+            Self::Assumption(_) => {


Btw you can just do sth like

_ => unreachable!("unexpected: ...")

to reduce some lines-of-code noise from the pattern matching. or alternatively

if let Self::Query(query) = self { ... } else { unreachable!(...) }

That's a nice touch, adding it in the next commit.

alpaylan · 2025-01-08T08:17:44Z

The shrinking is a bit harder than I hoped for, particularly due to the fact that we cannot shrink closures, which limits our ability to minimize aggressively by removing columns from tables, because we wouldn't be able to modify the assertion that checks it.

The solution is to turn the assertions themselves into a small DSL, but at that point it goes into too much research, which I don't think is the right choice for now. I've instead focused on other shrinking mechanisms;

Removing properties after the failing one,
Removing properties that do not refer to the tables in the failing interaction,
Removing extensional parts in the property(I'll also add these with some bit of explanation, probably today)

alpaylan added 3 commits January 6, 2025 18:16

add assumptions to the interactions, where a failing assumption stops

daa77fe

the execution of the current property and switches to the next one.

add execution history to the simulator, the history records

cc56276

three indexes(connection, interaction pointer, and secondary pointer) that can uniquely identify the executed interaction at any point. we will use the history for shrinking purposes.

jussisaurio reviewed Jan 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simulator: add counterexample minimization #623

simulator: add counterexample minimization #623

alpaylan commented Jan 6, 2025 •

edited

Loading

pereman2 commented Jan 7, 2025

alpaylan commented Jan 7, 2025

jussisaurio commented Jan 7, 2025

jussisaurio Jan 7, 2025

alpaylan Jan 8, 2025

alpaylan commented Jan 8, 2025

simulator: add counterexample minimization #623

Are you sure you want to change the base?

simulator: add counterexample minimization #623

Conversation

alpaylan commented Jan 6, 2025 • edited Loading

pereman2 commented Jan 7, 2025

alpaylan commented Jan 7, 2025

jussisaurio commented Jan 7, 2025

jussisaurio Jan 7, 2025

Choose a reason for hiding this comment

alpaylan Jan 8, 2025

Choose a reason for hiding this comment

alpaylan commented Jan 8, 2025

alpaylan commented Jan 6, 2025 •

edited

Loading