-
-
Notifications
You must be signed in to change notification settings - Fork 348
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add reason for failed match to RaisesGroup #3145
base: main
Are you sure you want to change the base?
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #3145 +/- ##
================================================
Coverage 100.00000% 100.00000%
================================================
Files 124 124
Lines 18460 18723 +263
Branches 1216 1265 +49
================================================
+ Hits 18460 18723 +263
|
….. though somewhat skeptical of their complexity cost vs benefit
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm excited to see continued progress here! Really good support for ExceptionGroup
in testing is one of those things that doesn't seem like a huge problem until you're living with it 😅
with pytest.raises( | ||
AssertionError, | ||
match=wrap_escape( | ||
"Raised exception did not match: RuntimeError():\n" | ||
" RuntimeError() is not of type 'ValueError'\n" | ||
" Matcher(TypeError): RuntimeError() is not of type 'TypeError'\n" | ||
" RaisesGroup(RuntimeError): RuntimeError() is not an exception group, but would match with `allow_unwrapped=True`\n" | ||
" RaisesGroup(ValueError): RuntimeError() is not an exception group", | ||
), | ||
): | ||
with RaisesGroup( | ||
ValueError, | ||
Matcher(TypeError), | ||
RaisesGroup(RuntimeError), | ||
RaisesGroup(ValueError), | ||
): | ||
raise ExceptionGroup( | ||
"a", | ||
[RuntimeError(), TypeError(), ValueError(), ValueError()], | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Even after staring at this for a while, I'm having trouble matching up the corresponding lines to matchers to leaf exceptions. Can you add some comments laying out the intent?
I think that's mostly because RaisesGroup
doesn't care about the order of matches, but I'm concerned that a greedy algorithm will fail to find a passing match when one exists (although exhaustive checks could be quite expensive in the worst case!).
I think our error messages are also unhelpful or even misleading here, in that the lines of output all seem to be describing the first leaf exception, without accounding for the fact that they might be (close) matches for other leaves or subgroups. Instead, I suggest trying to match each sub-exception.
- Set aside whatever matchers and subgroups have a 1:1 correspondence. This is probably the common case; i.e. one or more missing or unexpected exceptions. Report each missing exception and unused matcher.
- If it's more complicated than that - for example, two TypeErrors and only one matcher - report all of those as well. I don't think it's worth trying to work out which are "closer to matching", or which to set aside; the user can sort it out.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sorry yeah, it's failing to match any of the expected exceptions with the first exception in the group. But had to pass more exceptions in the group to make it pass the number-of-exceptions check. Will add comments :)
The algorithm is currently described in the class docstring:
trio/src/trio/testing/_raises_group.py
Lines 376 to 382 in 993a173
The matching algorithm is greedy, which means cases such as this may fail:: | |
with RaisesGroups(ValueError, Matcher(ValueError, match="hello")): | |
raise ExceptionGroup("", (ValueError("hello"), ValueError("goodbye"))) | |
even though it generally does not care about the order of the exceptions in the group. | |
To avoid the above you should specify the first ValueError with a Matcher as well. |
But I suppose I should at the very least also mention it in the docstring of
matches()
.
I very much like your idea, will give it a shot!
This probably also means we should stop abandoning as soon as the number of exceptions is wrong, with RaisesGroup(ValueError, TypeError): raise ExceptionGroup("", [ValueError])
can point out that TypeError
is the one unmatched exception.
This makes my current suggest-flattening logic impossible though, so I'll go ahead with run-it-again-but-with-flattening-if-it-fails.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Made an attempt at generating a more verbose error message, and while this is an intentionally complicated example it's also fairly reasonable that a user expects a complicated structure, but instead gets a very different exceptiongroup with very different errors.
So for this code:
with RaisesGroup(
RaisesGroup(ValueError),
RaisesGroup(RaisesGroup(ValueError)),
RaisesGroup(Matcher(TypeError, match="foo")),
RaisesGroup(TypeError, ValueError)
):
raise ExceptionGroup(
"", [TypeError(), ExceptionGroup("", [TypeError()]),
ExceptionGroup("", [TypeError(), TypeError()])]
)
my current WIP implementation outputs the following:
=========================================================================================== FAILURES ===========================================================================================
__________________________________________________________________________________ test_assert_message_nested __________________________________________________________________________________
+ Exception Group Traceback (most recent call last):
| File "/home/h/Git/trio/raisesgroup_fail_reason/src/trio/_tests/test_testing_raisesgroup.py", line 533, in test_assert_message_nested
| raise ExceptionGroup(
| ExceptionGroup: (3 sub-exceptions)
+-+---------------- 1 ----------------
| TypeError
+---------------- 2 ----------------
| ExceptionGroup: (1 sub-exception)
+-+---------------- 1 ----------------
| TypeError
+------------------------------------
+---------------- 3 ----------------
| ExceptionGroup: (2 sub-exceptions)
+-+---------------- 1 ----------------
| TypeError
+---------------- 2 ----------------
| TypeError
+------------------------------------
During handling of the above exception, another exception occurred:
def test_assert_message_nested() -> None:
> with RaisesGroup(
RaisesGroup(ValueError),
RaisesGroup(RaisesGroup(ValueError)),
RaisesGroup(Matcher(TypeError, match="foo")),
RaisesGroup(TypeError, ValueError)
):
E AssertionError: Raised exception did not match: Failed to match expected and raised exceptions.
E The following expected exceptions did not find a match: {RaisesGroup(Matcher(TypeError, match='foo')), RaisesGroup(ValueError), RaisesGroup(RaisesGroup(ValueError)), RaisesGroup(TypeError, ValueError)}
E The following raised exceptions did not find a match
E TypeError():
E RaisesGroup(Matcher(TypeError, match='foo')): TypeError() is not an exception group
E RaisesGroup(ValueError): TypeError() is not an exception group
E RaisesGroup(RaisesGroup(ValueError)): TypeError() is not an exception group
E RaisesGroup(TypeError, ValueError): TypeError() is not an exception group
E ExceptionGroup('', [TypeError()]):
E RaisesGroup(Matcher(TypeError, match='foo')): Failed to match: Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E RaisesGroup(ValueError): Failed to match: TypeError() is not of type 'ValueError'
E RaisesGroup(RaisesGroup(ValueError)): Failed to match: RaisesGroup(ValueError): TypeError() is not an exception group
E RaisesGroup(TypeError, ValueError): Too few exceptions raised! The following expected exception(s) found no match: {<class 'ValueError'>}
E ExceptionGroup('', [TypeError(), TypeError()]):
E RaisesGroup(Matcher(TypeError, match='foo')): Failed to match expected and raised exceptions.
E The following expected exceptions did not find a match: {Matcher(TypeError, match='foo')}
E The following raised exceptions did not find a match
E TypeError():
E Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E TypeError():
E Matcher(TypeError, match='foo'): Regex pattern 'foo' did not match ''
E
E RaisesGroup(ValueError): Failed to match expected and raised exceptions.
E The following expected exceptions did not find a match: {<class 'ValueError'>}
E The following raised exceptions did not find a match
E TypeError():
E TypeError() is not of type 'ValueError'
E TypeError():
E TypeError() is not of type 'ValueError'
E
E RaisesGroup(RaisesGroup(ValueError)): Failed to match expected and raised exceptions.
E The following expected exceptions did not find a match: {RaisesGroup(ValueError)}
E The following raised exceptions did not find a match
E TypeError():
E RaisesGroup(ValueError): TypeError() is not an exception group
E TypeError():
E RaisesGroup(ValueError): TypeError() is not an exception group
E
E RaisesGroup(TypeError, ValueError): Failed to match expected and raised exceptions.
E The following expected exceptions did not find a match: {<class 'ValueError'>}
E The following raised exceptions did not find a match
E TypeError():
E TypeError() is not of type 'ValueError'
E It matches <class 'TypeError'> which was paired with TypeError()
src/trio/_tests/test_testing_raisesgroup.py:527: AssertionError
There's about a million things to tweak, and this doesn't hit all logic (in particular when some-but-not-all exceptions match), but do you like this as the basic structure? It feels kind of insane to have an exception message that's 45 lines - and given the quadratic nature could quickly spiral even further - but maybe that's fine?
# TODO: try to avoid printing the check function twice? | ||
# it's very verbose with printing out memory location. | ||
# And/or don't print memory location and just print the name | ||
"Raised exception did not match: OSError(6, ''):\n" | ||
f" Matcher(OSError, check={check_errno_is_5!r}): check {check_errno_is_5!r} did not return True for OSError(6, '')", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Matcher(OSError, check={check_errno_is_5!r}): check did not return True for OSError(6, '')
seems reasonable to me? (and see above for Hypothesis' pprinter idea...)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The problem with that is in the RaisesGroup
case, if you don't want RaisesGroup(RaisesGroup(ValueError, check=my_fun))
to print the function twice - then RaisesGroup(ValueError, check=my_fun)
will not print it at all.
The only way out of that is to not print it in the overview (maybe something like Raises(ValueError, check=...): check {check_errno_is_5!r} did not return True for OsError(6, '')
), or to use logic to see if we're nested or not (I suppose we already have _depth
)*
Never printing it in the un-nested case is probably fine a majority of the time, but if you're doing something like for f in ...: with RaisesGroup(..., check=f)
then you'd need an extra debugging cycle.
* I'll go ahead with this latter one for now
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed that case, but still printing it twice in nested cases:
trio/src/trio/_tests/test_testing_raisesgroup.py
Lines 718 to 726 in a8e263c
# in nested cases you still get it multiple times though... | |
with pytest.raises( | |
AssertionError, | |
match=wrap_escape( | |
f"Raised exception group did not match: RaisesGroup(Matcher(OSError, check={check_errno_is_5!r})): Matcher(OSError, check={check_errno_is_5!r}): check did not return True for OSError(6, '')", | |
), | |
): | |
with RaisesGroup(RaisesGroup(Matcher(OSError, check=check_errno_is_5))): | |
raise ExceptionGroup("", [ExceptionGroup("", [OSError(6, "")])]) |
… on check fail. Fix output issues when multiple copies of the same object were expected/raised
…. Also fix succesful->successful typo.
…raised-exception by adding another example
…and suggest using Matcher if match would match against a lone expected & raised exception
The only remaining TODO now is trio/src/trio/_tests/test_testing_raisesgroup.py Lines 103 to 110 in 9612d1e
which I'm procrastinating on because |
Run failure for pypi looks familiar, I think the issue last time was that package cache pypi runner was using had cached an outdated version of what uv versions were available, leading it to believe that constraint-specified version of uv does not exist when it actually does in reality, just using an old cache. |
…h no matches. Print if there's a possible covering that failed due to too-general expected.
Code is perhaps a bit messy, and the tests definitely are (I think I'll restructure them when moving to pytest), but otherwise I'm happy with current functionality. So please review away! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest we check the two tweaks to messages below, and then come back for the hypothesis-plugin-monkeypatch in a followup PR. Otherwise, looks great and I can't wait to use it!
src/trio/testing/_raises_group.py
Outdated
|
||
|
||
def _check_repr(check: Callable[[BaseExcT_1], bool]) -> str: | ||
"""Split out so it can be monkeypatched (e.g. by hypothesis)""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"""Split out so it can be monkeypatched (e.g. by hypothesis)""" | |
"""Split out so it can be monkeypatched (e.g. by our hypothesis plugin)""" |
and then in this function, add
try:
from ..testing import raises_group
from hypothesis.internal.reflection import get_pretty_function_description
raises_group._check_repr = get_pretty_function_description
except ImportError:
pass
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ooh, making trio do the monkeypatching is nice. Though gonna require a bunch of pytest.skipif
/wildcards in the test suite
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah hmm, the hook does not appear to execute in time. What I think is happening is that function only executes when hypothesis tries to do its thing, which might be after pytest runs other test functions, so it's not enough to merely have hypothesis
installed in the venv.
Adding
try:
from hypothesis.internal.reflection import get_pretty_function_description
_check_repr = get_pretty_function_description
except ImportError:
pass
directly to _raises_group.py
works perfectly fine though, but at that point we're perhaps better off adding it to hypothesis.
We will want to wrap the return value of get_pretty_function_description
in quotes though:
Raised exception group did not match: check lambda x: x is exc did not return True
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah, yeah, this triggers when Hypothesis is imported; just having it installed won't do anything. And indeed quotes are useful here, python is quite similar to english sometimes!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hm, I guess if we were to put monkeypatching in Hypothesis it'd also require it to be imported. It would be great to be able to say "just install hypothesis and you get prettier repr's"; or pretty bad if you do
pytest tests/my_specific_test_file_that_doesnt_import_hypothesis.py
and suddenly you don't get pretty repr's anymore. (edit: actually that probably isn't an issue if hypothesis puts it in the pytest plugin)
It'd be kinda great if pytest incorporated get_pretty_function_description
itself, surely that would be of use elsewhere too
assert res is not NotChecked | ||
# why doesn't mypy pick up on the above assert? | ||
return res # type: ignore[return-value] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think because type[NotChecked]
allows subclasses; if you did assert not issubclass(res, NotChecked)
it might work?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
that's why I tried adding @final
... 🤔 @A5rocks
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🤔 maybe mypy doesn't distinguish this for @final
clases? It's an interaction of some weird edge cases, that wouldn't surprise me much.
# and `isinstance(raised, expected_type)`. | ||
with ( | ||
fails_raises_group( | ||
"Regex pattern 'foo' did not match 'bar' of 'ExceptionGroup', but matched expected exception. You might want RaisesGroup(Matcher(ValueError, match='foo'" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe "matched the inner exception"?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suppose we can just name the expected exception
self.fail_reason += f", but matched the expected {expected.__name__!r}. You might want RaisesGroup(Matcher({expected.__name__}, match={_match_pattern(self.match_expr)!r}"
# Ideally the "did you mean to re.escape" should be indented twice | ||
"Matcher(match='h(ell)o'): Regex pattern 'h(ell)o' did not match 'h(ell)o'\n" | ||
"Did you mean to `re.escape()` the regex?", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we just put " "
at the start of the message to get this effect?
match=( | ||
r"^Raised exception group did not match: \n" | ||
r"The following expected exceptions did not find a match:\n" | ||
r" Matcher\(check=<function test_assert_message_nested.<locals>.<lambda> at .*>\)\n" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah... if we do the monkeypatch from Trio's hypothesis plugin as I suggest above, we'll then also want to use the maybe-monkeypatched _check_repr
function to construct the expected string; and maybe also assert that that string is one of two known possibilities. Which is kinda ugly, but imo showing the lambda source here is nice enough to be worth it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I haven't looked that hard at the logic but this looks good! (note to self: restart review at _check_exceptions
)
This is mostly wording/nitpicks/minor changes. A bunch of them, but it's a few concerns duplicated:
- I don't like
_depth
and proposed some changes to get rid of it __str__
defaults to__repr__
- honestly I can't remember, probably just suggestions about errors/docs
src/trio/_tests/test_exports.py
Outdated
# Ignore classes that don't use attrs, they only define their members once | ||
# __init__ is called (and reason they don't use attrs is because they're going | ||
# to be reimplemented in pytest). | ||
# I think that's why these are failing at least? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You could try tossing attrs on them and seeing! But I think so too.
Maybe even just try putting __slots__
on them and using that to double check, then removing this (checked) comment.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ugh, they're passing locally (but a bunch of other classes aren't 🙃)
and this is running through a minimal tox with -r test-requirements.txt
as the only dep so I have no clue what's going on. That's for another day
Co-authored-by: A5rocks <[email protected]>
…n check suggestion
Okay I like the new structure with an ABC, just a couple last things. @A5rocks I still can't get rid of the |
I ... moved the |
As another preparatory step for adding
RaisesGroup
into pytest, I thought I'd finally get around to making it.. not a nightmare to debug why it fails to match. When I've been usingRaisesGroup
with a complicated structure I've sometimes found it very tricky to figure out why it's not matching - and I imagine that's 10x worse for somebody not intimately familiar with its innards.This does introduce a major change in behavior, previously
RaisesGroup
, likepytest.raises
, would silently pass through the exception if it didn't match - but now it will instead fail with anAssertionError
. This also has precedent upstream though,pytest.raises
will fail with anAssertionError
iff you've specifiedmatch
.You still see the original exception in the traceback, and in many ways I think always failing with an
AssertionError
is more legible.I don't think this will impact end user's test suites in a majority of cases, unless they're either testing RaisesGroup behavior itself, or doing some very weird nesting. But even if so, they can rewrite their code as:
Another improvement would be making.matches()
directly raise anAssertionError
, instead of quietly setting.fail_reason
. This would break any test currently doingif not RaisesGroup(...).matches(...):
, or even more plausiblyassert not RaisesGroup(...).matches(...)
, soI think I should add a new method
.assert_matches()
to bothRaisesGroup
andMatcher
, which either calls.matches()
and asserts the return value withfail_reason
as the message - or do it the other way around and have.matches()
call.assert_matches()
in atry:
.There's lots of bikeshedding possible with the phrasing of each error message, and am not completely happy with nested cases, so would very much appreciate any suggestions.