
Add covering inputs as @example(...)s in the code #13

Closed
Zac-HD opened this issue Oct 31, 2022 · 4 comments · Fixed by #25
Zac-HD (Owner) commented Oct 31, 2022

Here's a neat workflow, combining the benefits of PBT and fuzzing with deterministic tests:

  1. Use the fuzzer to find a reasonably diverse set of covering examples (already works)
  2. Automatically edit them into the code as explicit @example(...) cases (this issue! see the sketch after this list)
  3. Run your standard CI with only fully-explicit deterministic examples (already works; see also python/cpython#22863, "GH-86275: Implementation of hypothesis stubs for property-based tests, with zoneinfo tests")
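
For concreteness, here's a minimal sketch of the end state this workflow aims for. The test, its arguments, and the covering values are hypothetical stand-ins for whatever the fuzzer actually finds:

```python
# Sketch only: the test, its arguments, and the covering values are hypothetical;
# in practice HypoFuzz would discover the values and insert the @example(...) lines.
from hypothesis import Phase, example, given, settings, strategies as st

@settings(phases=[Phase.explicit])  # CI replays only the explicit examples below
@example(a=0, b=0)                  # machine-maintained covering example
@example(a=-1, b=2**31 - 1)         # machine-maintained covering example
@given(a=st.integers(), b=st.integers())
def test_addition_commutes(a, b):
    assert a + b == b + a
```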

So what will it take to have automatically-maintained explicit examples? Some quick notes:

  • This only works for test cases which can be written using the @example() decorator, which rules out stateful tests or those using st.data(). We'll also have trouble with reprs that can't be eval'd back to an equivalent object - we might get a short distance by representing objects from st.builds() as the result of the call (also useful for HypothesisWorks/hypothesis#3411, "Explaining failing examples - by showing which arguments (don't) matter"), but this seems like a fundamental limitation.
  • We need to know where the test is, and how to insert the decorator. Introspection works, albeit with some pretty painful edge cases we'll need to bail out on, and I think LibCST should make the latter pretty easy - we can construct the call as a string, attempt to parse it, and then insert it into the decorator list (see the sketch after this list).
  • My preferred UX for this is "HypoFuzz dumps a <hash>.patch file and the user does git apply ...". We can dump the file on disk, and also make it downloadable from the dashboard for remote use. The patch shouldn't be too ugly, e.g. one line per arg, but users are expected to run their choice of autoformatter.
  • I mentioned "automatically-maintained": it'd be nice to remove previously-covering examples when the set updates - and it's crucial if we haven't shrunk to a minimal covering example (and currently we don't!). This probably means using magic comments to distinguish human-added examples from machine-maintained covering examples. Note that fuzzer-discovered minimal failing examples might be automatically added to the former set!
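
As a rough illustration of the LibCST step - this is not HypoFuzz's actual implementation, and the target test name and call string below are made up - inserting a decorator into a known test function could look something like this:

```python
# Rough sketch of the "parse a call string and insert it as a decorator" step.
# Not HypoFuzz's actual implementation; the test name and call string are made up.
import libcst as cst

class AddExampleDecorator(cst.CSTTransformer):
    def __init__(self, test_name: str, example_call: str) -> None:
        self.test_name = test_name
        # Parse the call string up front, so un-eval'able reprs fail fast here.
        self.new_decorator = cst.Decorator(decorator=cst.parse_expression(example_call))

    def leave_FunctionDef(self, original_node, updated_node):
        if updated_node.name.value != self.test_name:
            return updated_node
        # Prepend, so the new @example(...) sits above the existing decorator list;
        # real code would need to be more careful about placement and duplicates.
        return updated_node.with_changes(
            decorators=[self.new_decorator, *updated_node.decorators]
        )

source = open("tests/test_foo.py").read()
patched = cst.parse_module(source).visit(AddExampleDecorator("test_foo", "example(x=0)"))
print(patched.code)  # diff this against `source` to produce the patch file
```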

This seems fiddly, but not actually that hard - we already report covering examples on the dashboard, after all. No timeline on when I'll get to this, but I'd be very happy to provide advice and code review to anyone interested in contributing 🙂

mentalisttraceur commented Nov 15, 2022

So if I understand correctly, the big-picture benefit is better support for gracefully degrading Hypothesis tests to plain unit tests with hard-coded cases?

By automatically providing those hard-coded cases in a form accessible to stub implementations of the hypothesis APIs which want to stay so simple that they can't even read the example database?

(Besides working around any hassle with making the example database available everywhere, which to me seems better solved by providing a separate solution with great UX for hosting/distributing/syncing the example database.)

Zac-HD (Owner, Author) commented Nov 15, 2022

Smoothly stepping down to a parametrized unit-test, yes. For some users (e.g. CPython) this is so that they can use a stub implementation; others might be happy to use Hypothesis' phases setting which already supports this!

At scale, say 100+ people and 1M+ lines of code, it can be really important to have fast and deterministic CI because you're testing for regressions rather than bugs via that workflow above, and can run a separate bug-hunting program 'alongside' your CI system. Otherwise I agree that sharing the example database is almost always going to be a better and easier solution, via e.g. Hypothesis' RedisExampleDatabase or just writing your own trivial wrapper around whatever datastore you like.
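
For reference, the stepping-down itself is just a settings profile; a minimal sketch (profile names are arbitrary, and the commented-out Redis wiring is illustrative rather than a recommendation):

```python
from hypothesis import Phase, settings

# Deterministic CI: replay only the explicit @example(...) cases, no generation.
settings.register_profile("ci", phases=[Phase.explicit])

# A bug-hunting job elsewhere can share its findings via a networked database,
# e.g. Hypothesis' Redis backend (connection details here are placeholders):
# import redis
# from hypothesis.extra.redis import RedisExampleDatabase
# settings.register_profile("fuzz", database=RedisExampleDatabase(redis.Redis("hypothesis-db")))

settings.load_profile("ci")
```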

Zac-HD (Owner, Author) commented Apr 26, 2023

Turns out that HypothesisWorks/hypothesis#3631 is a rather nice feature for Hypothesis itself, so we'll ship almost all of the code upstream and HypoFuzz can just call into the hooks I left 😁

Zac-HD (Owner, Author) commented May 31, 2023

OK, we've shipped all the upstreamable internals in Hypothesis - including removal of @example()s with a specific tag - and I have an MVP branch up at https://github.com/Zac-HD/hypofuzz/compare/write-patches. Further notes:

  • Do we want explain mode for covering examples? On one hand, the "# or any other generated value" comments are actually pretty nice; on the other, it can be a bit of a performance hog at the moment. The branch doesn't include these yet.
  • We should probably offer both a failing-examples-only and a failing-and-covering-examples patch.
  • Dashboard interface: I'm imagining some fairly small links between the main-page chart and the table of details, to download each patch. It might also be nice to have a view-patch-in-browser option; not sure that's worth it, but if we do it, it should have syntax highlighting.
  • Do we want per-test patches accessible from their pages? We'd have to compute these on demand - which, come to think of it, we should do for all the patches to save on CPU time.
  • We want to serve the "long-running CI job which uploads a patch file" niche, and I'd prefer to have an atexit handler rather than saving every X minutes. To make uploading patch files easier, we can copy to a canonical .hypothesis/patches/latest-{covering,failing}.patch location. To support naive use of timeout, we'll also need to register a handler for SIGTERM (see the sketch after this list). Finally, of course, we'll need to document the full GitHub Actions configuration we recommend.
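
Here's a minimal sketch of that shutdown handling; write_patches() and the patch contents are hypothetical stand-ins for whatever HypoFuzz ends up implementing:

```python
# Sketch of "dump patches on exit"; write_patches() and the patch contents are
# hypothetical stand-ins, but the canonical paths match the ones proposed above.
import atexit
import signal
import sys
from pathlib import Path

PATCH_DIR = Path(".hypothesis/patches")

def write_patches() -> None:
    PATCH_DIR.mkdir(parents=True, exist_ok=True)
    # In reality these would be rendered from the covering/failing examples
    # collected so far; here we only show the canonical filenames.
    (PATCH_DIR / "latest-covering.patch").write_text("...")
    (PATCH_DIR / "latest-failing.patch").write_text("...")

# Write patches on normal interpreter shutdown...
atexit.register(write_patches)

# ...and turn SIGTERM (e.g. from naive use of `timeout`) into a clean exit,
# so the atexit handler still runs.
signal.signal(signal.SIGTERM, lambda signum, frame: sys.exit(0))
```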
