-
Notifications
You must be signed in to change notification settings - Fork 667
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Skip observables contained in particle groups #4615
Skip observables contained in particle groups #4615
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hello there first time contributor! Welcome to the MDAnalysis community! We ask that all contributors abide by our Code of Conduct and that first time contributors introduce themselves on GitHub Discussions so we can get to know you. You can learn more about participating here. Please also add yourself to package/AUTHORS
as part of this PR.
Linter Bot Results:Hi @PythonFZ! Thanks for making this PR. We linted your code and found the following: Some issues were found with the formatting of your code.
Please have a look at the Please note: The |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## develop #4615 +/- ##
===========================================
- Coverage 93.61% 93.59% -0.02%
===========================================
Files 171 183 +12
Lines 21243 22316 +1073
Branches 3934 3936 +2
===========================================
+ Hits 19886 20886 +1000
- Misses 898 971 +73
Partials 459 459 ☔ View full report in Codecov by Sentry. |
faafe8b
to
078d168
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like the idea of using the logger but logging at every step is potentially a lot of output and performance hit. Furthermore, I don’t think users care. Thus I suggest you remove the logging and just pass
.
We also need at least one test case.
@hmacdope would you be able to shepherd this PR? If not switch it to me. |
…com/PythonFZ/mdanalysis into h5md-skip-per-particles-observables
prec = 3 | ||
ext = 'h5md' | ||
|
||
@pytest.fixture(scope='class') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I may be mistaken, but I think the test_n_frames
method isn't testing the issue- if the reader was going to except out due to a key error, this would happen during the line in the universe
fixture:
reader = H5MDReader(H5MD_energy, convert_units=True)
when the H5MDReader's __init__
is called and the first call to _read_next_timestep
is made. The test method doesn't call any code that reads a frame, it just ends up calling the reader's n_frames
method. I agree with your approach to fixing the issue, just maybe have the test initialize or initialize and iterate through the reader so that the potential issue will occur in the scope of the test
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also make sure to update the CHANGELOG!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should be fixed now. The CI failing seems not related to this PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Small nitpick- can you change the name of the test to make it represent what it is testing? Also, the assert statement seems unrelated to what the test is testing. Maybe write an except and pytest.fail
statement upon raising a KeyError but otherwise don't have an assert statement. Here's an example from testsuite/coordinates/base.py
:
def test_frame_jump_issue1942(self, ref, reader):
"""Test for issue 1942 (especially XDR on macOS)"""
reader.rewind()
try:
for ii in range(ref.n_frames + 2):
reader[0]
except StopIteration:
pytest.fail("Frame-seeking wrongly iterated (#1942)")
Otherwise LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I updated the test and used pytest.fail
. I hope the test structure now is a bit cleaner. Thanks for the review.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work! This is a good start until we decide what to do with the other form of observables.
@@ -231,6 +232,7 @@ class MockH5pyFile: | |||
HAS_H5PY = True | |||
|
|||
|
|||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
undo blank line change.
try: | ||
_ = H5MDReader(H5MD_energy, convert_units=True) | ||
except KeyError: | ||
pytest.fail("Could not read H5MD file with observables.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can just have
_ = H5MDReader(H5MD_energy, convert_units=True)
As the body of the test, no need to try, except
. Pytest can catch all the errors.
See also comment about warnings, we should catch suggested warning.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FWIW @PythonFZ the following will successfully read into ts.data
with the key form group/observable
def _copy_to_data(self):
"""assigns values to keys in data dictionary"""
if "observables" in self._file:
for key in self._file["observables"].keys():
try:
# if has value as subkey read directly into data
if "value" in self._file["observables"][key]:
self.ts.data[key] = self._file["observables"][key][
"value"
][self._frame]
# if value is not a subkey, read dict of subkeys
else:
for subkey in self._file["observables"][key].keys():
self.ts.data[key + "/" + subkey] = self._file["observables"][key][
subkey
]["value"][self._frame]
except KeyError:
warnings.warn(f"Could not read {key} from observables group, not a legal H5MD observable specification or data not ")
@hmacdope What is the expected way to auto-format the code? I'm used to running either Maybe there are some |
@@ -372,6 +373,7 @@ | |||
TPR_xvf = (_data_ref / 'cobrotoxin.tpr').as_posix() | |||
TRR_xvf = (_data_ref / 'cobrotoxin.trr').as_posix() | |||
H5MD_xvf = (_data_ref / 'cobrotoxin.h5md').as_posix() | |||
H5MD_energy = (_data_ref / 'cu.h5').as_posix() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is it the convention for other h5md
files out there to use the *.h5
extension rather than *.h5md
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can rename the file to *.h5md
. I've seen both and personally prefer *.h5
because I was lazy and did not configure the VS Code H5WEB extension to also read *.h5md
. Afaik H5MD is the only format specification for H5 files, so there is no risk of confusion.
Edit: I haven't seen a recommendation for the *.h5md
suffix in the H5MD definition so I would actually prefer keeping the more general *.h5
.
@PythonFZ lets go with |
I renamed the files and ensured everything is properly formatted. I hope everything is good to go? |
There seems to be some failing test on the |
All tests are passing for me locally. From what I see in the logs, it is the following test (correct me if I am wrong)
All tests from that file run locally for me. Also, these tests should not have been affected by this PR and the tests I changed also all run locally. The same seems to be true for the GH hosted runners. I'm a bit lost here on how to fix this, because I don't know the differences between the Azure runner and the GH runners. |
This failing test is flaky and is not related to your work. It will not block the PR. OliverAm 6/28/24 um 07:20 schrieb Fabian Zills ***@***.***>:
There seems to be some failing test on the azure runner. Can your reproduce locally @PythonFZ ?
All tests are passing for me locally. From what I see in the logs, it is the following test (correct me if I am wrong)
/home/vsts/work/1/s/testsuite/MDAnalysisTests/parallelism/test_multiprocessing.py
All tests from that file run locally for me. Also, these tests should not have been affected by this PR and the tests I changed also all run locally. The same seems to be true for the GH hosted runners. I'm a bit lost here on how to fix this, because I don't know the differences between the Azure runner and the GH runners.
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: ***@***.***>
|
Let me kick CI IIRC there were some failing tests on |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@orbeckst are you happy with changes here? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good overall, just two docs/comments requests.
"observables" | ||
][key][subkey]["value"][self._frame] | ||
except KeyError: | ||
warnings.warn( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If it's not legal, shouldn't we be failing? Or is the reasoning that we don't overly care about observables as opposed to x/v/f?
Perhaps add a comment explaining your rationale.
Also, given that this is not tested, add a #pragma: nocover
(or whatever you have to do to show that this code block isn't covered). It will tell anyone else reading the code that it's un-tested.
Tested would be better, of course, but if we don't care about observables then I don't insist on a test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd be in favor of raising an Error here as well. Shall I create an additional h5md
file which contains wrong data to test against this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That would be ideal!
@PythonFZ if you can have a quick look at my comments then we should be able to merge this PR soon. |
@orbeckst I've added a file that tests for I'm not sure how to account for
If you are referring to
This should not longer be the case and all valid, non-fixed time step, H5MD datasets should work. |
lgtm, thanks for adding the test @PythonFZ ! |
Thank you for your contribution @PythonFZ ! 🎉 |
This is not a fix for #4598 but enables reading files that contain data in
Changes made in this Pull Request:
PR Checklist
Developers certificate of origin
📚 Documentation preview 📚: https://mdanalysis--4615.org.readthedocs.build/en/4615/
This file hasn't been touched in 9 months and I don't think there are some open PRs so there would be a chance to format the entire file using
ruff
which I accidently did here faafe8b - is this of interest?