Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add reason for failed match to RaisesGroup #3145

Open
wants to merge 36 commits into
base: main
Choose a base branch
from

Conversation

jakkdl
Copy link
Member

@jakkdl jakkdl commented Nov 29, 2024

As another preparatory step for adding RaisesGroup into pytest, I thought I'd finally get around to making it.. not a nightmare to debug why it fails to match. When I've been using RaisesGroup with a complicated structure I've sometimes found it very tricky to figure out why it's not matching - and I imagine that's 10x worse for somebody not intimately familiar with its innards.

This does introduce a major change in behavior, previously RaisesGroup, like pytest.raises, would silently pass through the exception if it didn't match - but now it will instead fail with an AssertionError. This also has precedent upstream though, pytest.raises will fail with an AssertionError iff you've specified match.

You still see the original exception in the traceback, and in many ways I think always failing with an AssertionError is more legible.
I don't think this will impact end user's test suites in a majority of cases, unless they're either testing RaisesGroup behavior itself, or doing some very weird nesting. But even if so, they can rewrite their code as:

with pytest.raises(AssertionError) as exc_info:
    with RaisesGroup(...):
        ...
assert isinstance(exc_info.type, ValueError)

Another improvement would be making .matches() directly raise an AssertionError, instead of quietly setting .fail_reason. This would break any test currently doing if not RaisesGroup(...).matches(...):, or even more plausibly assert not RaisesGroup(...).matches(...), so
I think I should add a new method .assert_matches() to both RaisesGroup and Matcher, which either calls .matches() and asserts the return value with fail_reason as the message - or do it the other way around and have .matches() call .assert_matches() in a try:.

There's lots of bikeshedding possible with the phrasing of each error message, and am not completely happy with nested cases, so would very much appreciate any suggestions.

@jakkdl jakkdl requested a review from Zac-HD November 29, 2024 12:16
Copy link

codecov bot commented Nov 29, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00000%. Comparing base (3e0058b) to head (561660b).

Additional details and impacted files
@@               Coverage Diff                @@
##                 main        #3145    +/-   ##
================================================
  Coverage   100.00000%   100.00000%            
================================================
  Files             124          124            
  Lines           18460        18723   +263     
  Branches         1216         1265    +49     
================================================
+ Hits            18460        18723   +263     
Files with missing lines Coverage Δ
src/trio/_tests/test_exports.py 100.00000% <100.00000%> (ø)
src/trio/_tests/test_testing_raisesgroup.py 100.00000% <100.00000%> (ø)
src/trio/testing/_raises_group.py 100.00000% <100.00000%> (ø)

Copy link
Member

@Zac-HD Zac-HD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm excited to see continued progress here! Really good support for ExceptionGroup in testing is one of those things that doesn't seem like a huge problem until you're living with it 😅

src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
Comment on lines 445 to 464
with pytest.raises(
AssertionError,
match=wrap_escape(
"Raised exception did not match: RuntimeError():\n"
" RuntimeError() is not of type 'ValueError'\n"
" Matcher(TypeError): RuntimeError() is not of type 'TypeError'\n"
" RaisesGroup(RuntimeError): RuntimeError() is not an exception group, but would match with `allow_unwrapped=True`\n"
" RaisesGroup(ValueError): RuntimeError() is not an exception group",
),
):
with RaisesGroup(
ValueError,
Matcher(TypeError),
RaisesGroup(RuntimeError),
RaisesGroup(ValueError),
):
raise ExceptionGroup(
"a",
[RuntimeError(), TypeError(), ValueError(), ValueError()],
)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Even after staring at this for a while, I'm having trouble matching up the corresponding lines to matchers to leaf exceptions. Can you add some comments laying out the intent?

I think that's mostly because RaisesGroup doesn't care about the order of matches, but I'm concerned that a greedy algorithm will fail to find a passing match when one exists (although exhaustive checks could be quite expensive in the worst case!).

I think our error messages are also unhelpful or even misleading here, in that the lines of output all seem to be describing the first leaf exception, without accounding for the fact that they might be (close) matches for other leaves or subgroups. Instead, I suggest trying to match each sub-exception.

  • Set aside whatever matchers and subgroups have a 1:1 correspondence. This is probably the common case; i.e. one or more missing or unexpected exceptions. Report each missing exception and unused matcher.
  • If it's more complicated than that - for example, two TypeErrors and only one matcher - report all of those as well. I don't think it's worth trying to work out which are "closer to matching", or which to set aside; the user can sort it out.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry yeah, it's failing to match any of the expected exceptions with the first exception in the group. But had to pass more exceptions in the group to make it pass the number-of-exceptions check. Will add comments :)

The algorithm is currently described in the class docstring:

The matching algorithm is greedy, which means cases such as this may fail::
with RaisesGroups(ValueError, Matcher(ValueError, match="hello")):
raise ExceptionGroup("", (ValueError("hello"), ValueError("goodbye")))
even though it generally does not care about the order of the exceptions in the group.
To avoid the above you should specify the first ValueError with a Matcher as well.

But I suppose I should at the very least also mention it in the docstring of matches().

I very much like your idea, will give it a shot!
This probably also means we should stop abandoning as soon as the number of exceptions is wrong, with RaisesGroup(ValueError, TypeError): raise ExceptionGroup("", [ValueError]) can point out that TypeError is the one unmatched exception.
This makes my current suggest-flattening logic impossible though, so I'll go ahead with run-it-again-but-with-flattening-if-it-fails.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Made an attempt at generating a more verbose error message, and while this is an intentionally complicated example it's also fairly reasonable that a user expects a complicated structure, but instead gets a very different exceptiongroup with very different errors.

So for this code:

with RaisesGroup(
    RaisesGroup(ValueError),
    RaisesGroup(RaisesGroup(ValueError)),
    RaisesGroup(Matcher(TypeError, match="foo")),
    RaisesGroup(TypeError, ValueError)
):
    raise ExceptionGroup(
        "", [TypeError(), ExceptionGroup("", [TypeError()]),
             ExceptionGroup("", [TypeError(), TypeError()])]
    )
my current WIP implementation outputs the following:
=========================================================================================== FAILURES ===========================================================================================
__________________________________________________________________________________ test_assert_message_nested __________________________________________________________________________________
  + Exception Group Traceback (most recent call last):
  |   File "/home/h/Git/trio/raisesgroup_fail_reason/src/trio/_tests/test_testing_raisesgroup.py", line 533, in test_assert_message_nested
  |     raise ExceptionGroup(
  | ExceptionGroup:  (3 sub-exceptions)
  +-+---------------- 1 ----------------
    | TypeError
    +---------------- 2 ----------------
    | ExceptionGroup:  (1 sub-exception)
    +-+---------------- 1 ----------------
      | TypeError
      +------------------------------------
    +---------------- 3 ----------------
    | ExceptionGroup:  (2 sub-exceptions)
    +-+---------------- 1 ----------------
      | TypeError
      +---------------- 2 ----------------
      | TypeError
      +------------------------------------

During handling of the above exception, another exception occurred:

    def test_assert_message_nested() -> None:
>       with RaisesGroup(
            RaisesGroup(ValueError),
            RaisesGroup(RaisesGroup(ValueError)),
            RaisesGroup(Matcher(TypeError, match="foo")),
            RaisesGroup(TypeError, ValueError)
        ):
E       AssertionError: Raised exception did not match: Failed to match expected and raised exceptions.
E       The following expected exceptions did not find a match: {RaisesGroup(Matcher(TypeError, match='foo')), RaisesGroup(ValueError), RaisesGroup(RaisesGroup(ValueError)), RaisesGroup(TypeError, ValueError)}
E       The following raised exceptions did not find a match
E         TypeError():
E           RaisesGroup(Matcher(TypeError, match='foo')): TypeError() is not an exception group
E           RaisesGroup(ValueError): TypeError() is not an exception group
E           RaisesGroup(RaisesGroup(ValueError)): TypeError() is not an exception group
E           RaisesGroup(TypeError, ValueError): TypeError() is not an exception group
E         ExceptionGroup('', [TypeError()]):
E           RaisesGroup(Matcher(TypeError, match='foo')): Failed to match: Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E           RaisesGroup(ValueError): Failed to match: TypeError() is not of type 'ValueError'
E           RaisesGroup(RaisesGroup(ValueError)): Failed to match: RaisesGroup(ValueError): TypeError() is not an exception group
E           RaisesGroup(TypeError, ValueError): Too few exceptions raised! The following expected exception(s) found no match: {<class 'ValueError'>}
E         ExceptionGroup('', [TypeError(), TypeError()]):
E           RaisesGroup(Matcher(TypeError, match='foo')): Failed to match expected and raised exceptions.
E             The following expected exceptions did not find a match: {Matcher(TypeError, match='foo')}
E             The following raised exceptions did not find a match
E               TypeError():
E                 Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E               TypeError():
E                 Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E       
E           RaisesGroup(ValueError): Failed to match expected and raised exceptions.
E             The following expected exceptions did not find a match: {<class 'ValueError'>}
E             The following raised exceptions did not find a match
E               TypeError():
E                 TypeError() is not of type 'ValueError'
E               TypeError():
E                 TypeError() is not of type 'ValueError'
E       
E           RaisesGroup(RaisesGroup(ValueError)): Failed to match expected and raised exceptions.
E             The following expected exceptions did not find a match: {RaisesGroup(ValueError)}
E             The following raised exceptions did not find a match
E               TypeError():
E                 RaisesGroup(ValueError): TypeError() is not an exception group
E               TypeError():
E                 RaisesGroup(ValueError): TypeError() is not an exception group
E       
E           RaisesGroup(TypeError, ValueError): Failed to match expected and raised exceptions.
E             The following expected exceptions did not find a match: {<class 'ValueError'>}
E             The following raised exceptions did not find a match
E               TypeError():
E                 TypeError() is not of type 'ValueError'
E                 It matches <class 'TypeError'> which was paired with TypeError()

src/trio/_tests/test_testing_raisesgroup.py:527: AssertionError

There's about a million things to tweak, and this doesn't hit all logic (in particular when some-but-not-all exceptions match), but do you like this as the basic structure? It feels kind of insane to have an exception message that's 45 lines - and given the quadratic nature could quickly spiral even further - but maybe that's fine?

src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
Comment on lines 591 to 595
# TODO: try to avoid printing the check function twice?
# it's very verbose with printing out memory location.
# And/or don't print memory location and just print the name
"Raised exception did not match: OSError(6, ''):\n"
f" Matcher(OSError, check={check_errno_is_5!r}): check {check_errno_is_5!r} did not return True for OSError(6, '')",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Matcher(OSError, check={check_errno_is_5!r}): check did not return True for OSError(6, '') seems reasonable to me? (and see above for Hypothesis' pprinter idea...)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The problem with that is in the RaisesGroup case, if you don't want RaisesGroup(RaisesGroup(ValueError, check=my_fun)) to print the function twice - then RaisesGroup(ValueError, check=my_fun) will not print it at all.
The only way out of that is to not print it in the overview (maybe something like Raises(ValueError, check=...): check {check_errno_is_5!r} did not return True for OsError(6, '')), or to use logic to see if we're nested or not (I suppose we already have _depth)*

Never printing it in the un-nested case is probably fine a majority of the time, but if you're doing something like for f in ...: with RaisesGroup(..., check=f) then you'd need an extra debugging cycle.

* I'll go ahead with this latter one for now

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed that case, but still printing it twice in nested cases:

# in nested cases you still get it multiple times though...
with pytest.raises(
AssertionError,
match=wrap_escape(
f"Raised exception group did not match: RaisesGroup(Matcher(OSError, check={check_errno_is_5!r})): Matcher(OSError, check={check_errno_is_5!r}): check did not return True for OSError(6, '')",
),
):
with RaisesGroup(RaisesGroup(Matcher(OSError, check=check_errno_is_5))):
raise ExceptionGroup("", [ExceptionGroup("", [OSError(6, "")])])

src/trio/_tests/test_testing_raisesgroup.py Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
@jakkdl jakkdl requested a review from Zac-HD December 9, 2024 16:11
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
src/trio/_tests/test_testing_raisesgroup.py Outdated Show resolved Hide resolved
@jakkdl
Copy link
Member Author

jakkdl commented Jan 8, 2025

The only remaining TODO now is

# TODO: this one should maybe say that it matched an already matched exception
with (
fails_raises_group(
"1 matched exception. Too few exceptions raised, found no match for: [<class 'ValueError'>]"
),
RaisesGroup(ValueError, ValueError),
):
raise ExceptionGroup("", (ValueError(),))

which I'm procrastinating on because _check_exceptions is becoming an unwieldy beast.
I think I'm sticking to my guns on the matching being greedy, but _check_exceptions is so close to being fully exhaustive on fail already that we can probably have a message for "there is a pairing that would've worked, you should specify your requirements more stringently to allow the greedy matching to work"

@CoolCat467
Copy link
Member

Run failure for pypi looks familiar, I think the issue last time was that package cache pypi runner was using had cached an outdated version of what uv versions were available, leading it to believe that constraint-specified version of uv does not exist when it actually does in reality, just using an old cache.

…h no matches. Print if there's a possible covering that failed due to too-general expected.
@jakkdl jakkdl marked this pull request as ready for review January 9, 2025 17:29
@jakkdl
Copy link
Member Author

jakkdl commented Jan 9, 2025

Code is perhaps a bit messy, and the tests definitely are (I think I'll restructure them when moving to pytest), but otherwise I'm happy with current functionality. So please review away!

Copy link
Member

@Zac-HD Zac-HD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suggest we check the two tweaks to messages below, and then come back for the hypothesis-plugin-monkeypatch in a followup PR. Otherwise, looks great and I can't wait to use it!



def _check_repr(check: Callable[[BaseExcT_1], bool]) -> str:
"""Split out so it can be monkeypatched (e.g. by hypothesis)"""
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
"""Split out so it can be monkeypatched (e.g. by hypothesis)"""
"""Split out so it can be monkeypatched (e.g. by our hypothesis plugin)"""

and then in this function, add

try:
    from ..testing import raises_group
    from hypothesis.internal.reflection import get_pretty_function_description

    raises_group._check_repr = get_pretty_function_description
except ImportError:
    pass

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ooh, making trio do the monkeypatching is nice. Though gonna require a bunch of pytest.skipif/wildcards in the test suite

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ah hmm, the hook does not appear to execute in time. What I think is happening is that function only executes when hypothesis tries to do its thing, which might be after pytest runs other test functions, so it's not enough to merely have hypothesis installed in the venv.
Adding

try:
    from hypothesis.internal.reflection import get_pretty_function_description
    _check_repr = get_pretty_function_description
except ImportError:
    pass

directly to _raises_group.py works perfectly fine though, but at that point we're perhaps better off adding it to hypothesis.

We will want to wrap the return value of get_pretty_function_description in quotes though:
Raised exception group did not match: check lambda x: x is exc did not return True

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ah, yeah, this triggers when Hypothesis is imported; just having it installed won't do anything. And indeed quotes are useful here, python is quite similar to english sometimes!

Copy link
Member Author

@jakkdl jakkdl Jan 13, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hm, I guess if we were to put monkeypatching in Hypothesis it'd also require it to be imported. It would be great to be able to say "just install hypothesis and you get prettier repr's"; or pretty bad if you do

pytest tests/my_specific_test_file_that_doesnt_import_hypothesis.py

and suddenly you don't get pretty repr's anymore. (edit: actually that probably isn't an issue if hypothesis puts it in the pytest plugin)

It'd be kinda great if pytest incorporated get_pretty_function_description itself, surely that would be of use elsewhere too

Comment on lines +930 to +932
assert res is not NotChecked
# why doesn't mypy pick up on the above assert?
return res # type: ignore[return-value]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think because type[NotChecked] allows subclasses; if you did assert not issubclass(res, NotChecked) it might work?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that's why I tried adding @final... 🤔 @A5rocks

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤔 maybe mypy doesn't distinguish this for @final clases? It's an interaction of some weird edge cases, that wouldn't surprise me much.

# and `isinstance(raised, expected_type)`.
with (
fails_raises_group(
"Regex pattern 'foo' did not match 'bar' of 'ExceptionGroup', but matched expected exception. You might want RaisesGroup(Matcher(ValueError, match='foo'"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe "matched the inner exception"?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suppose we can just name the expected exception

self.fail_reason += f", but matched the expected {expected.__name__!r}. You might want RaisesGroup(Matcher({expected.__name__}, match={_match_pattern(self.match_expr)!r}"

Comment on lines +583 to +585
# Ideally the "did you mean to re.escape" should be indented twice
"Matcher(match='h(ell)o'): Regex pattern 'h(ell)o' did not match 'h(ell)o'\n"
"Did you mean to `re.escape()` the regex?",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we just put " " at the start of the message to get this effect?

src/trio/_tests/test_testing_raisesgroup.py Show resolved Hide resolved
match=(
r"^Raised exception group did not match: \n"
r"The following expected exceptions did not find a match:\n"
r" Matcher\(check=<function test_assert_message_nested.<locals>.<lambda> at .*>\)\n"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ah... if we do the monkeypatch from Trio's hypothesis plugin as I suggest above, we'll then also want to use the maybe-monkeypatched _check_repr function to construct the expected string; and maybe also assert that that string is one of two known possibilities. Which is kinda ugly, but imo showing the lambda source here is nice enough to be worth it.

Copy link
Contributor

@A5rocks A5rocks left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I haven't looked that hard at the logic but this looks good! (note to self: restart review at _check_exceptions)

This is mostly wording/nitpicks/minor changes. A bunch of them, but it's a few concerns duplicated:

  • I don't like _depth and proposed some changes to get rid of it
  • __str__ defaults to __repr__
  • honestly I can't remember, probably just suggestions about errors/docs

# Ignore classes that don't use attrs, they only define their members once
# __init__ is called (and reason they don't use attrs is because they're going
# to be reimplemented in pytest).
# I think that's why these are failing at least?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You could try tossing attrs on them and seeing! But I think so too.

Maybe even just try putting __slots__ on them and using that to double check, then removing this (checked) comment.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ugh, they're passing locally (but a bunch of other classes aren't 🙃)
and this is running through a minimal tox with -r test-requirements.txt as the only dep so I have no clue what's going on. That's for another day

newsfragments/3145.feature.rst Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
src/trio/testing/_raises_group.py Outdated Show resolved Hide resolved
@jakkdl
Copy link
Member Author

jakkdl commented Jan 13, 2025

Okay I like the new structure with an ABC, just a couple last things.
A couple methods should probably be renamed to disambiguate _check_xxx methods that set fail_reason and those that don't.

@A5rocks I still can't get rid of the type: ignore, if you want to take a look feel free. But even so I'm not sure it's worth having a bunch of @overloads (if they're necessary) and sending a dummy variable to _check_exceptions just to get rid of a type: ignore/cast.

@jakkdl
Copy link
Member Author

jakkdl commented Jan 13, 2025

Okay I like the new structure with an ABC, just a couple last things. A couple methods should probably be renamed to disambiguate _check_xxx methods that set fail_reason and those that don't.

@A5rocks I still can't get rid of the type: ignore, if you want to take a look feel free. But even so I'm not sure it's worth having a bunch of @overloads (if they're necessary) and sending a dummy variable to _check_exceptions just to get rid of a type: ignore/cast.

I ... moved the type: ignore to RaisesGroup.__init__ 🙃

@jakkdl jakkdl requested a review from A5rocks January 13, 2025 16:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants