-
Notifications
You must be signed in to change notification settings - Fork 125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PyImath Bindings fail to build when using Clang on Windows (Release branch, 3.17, issue still present on Main) #324
Comments
I'm having trouble getting the python binding tests to run; ctest is not properly setting up the environment to load all required dll's for the module. This is probably because I'm making use of python 3.10, and if I recall correctly python 3.8+ require some extra steps to load dll's that aren't in the same directory (it no longer checks PATH). That said, setting up the directories in a test script I wrote appears to properly load the module. |
See also #312 (comment) and further discussion. There is indeed something not quite right w/ the exports on Windows... |
An update; I have managed to get the numpy test and python3 test to run (still no luck on PyImathTestC, I'll have to figure out what's up with that). PyImathTest_Python3 fails on M44
I'll probably go back through and add some prints to the test to figure out why that part of the test fails. Currently it appears that at that part of the test,
Currently in the process of figuring out why the assert is failing; I don't think it has to do with the base imath implementation, considering those tests pass. |
Belated thanks for the investigation. The PYIMATH_EXPORT on both the deleted members and in the anonymous namespace are almost certainly oversights. If you have a PR you can submit for that, we'll happily accept it. It looks like we need to re-examine the other exports in PyImath. We have a bunch of internal code that relies on this library, but we don't compile it with either clang or Windows, so I can't easily reproduce #312. Regarding the failed assertion in testM44, could you also print out s-Sinq? I suspect it's close to baseTypeEpsilon but would like to see for sure. |
Apologies for the large delay @cary-ilm; I've had a hectic couple of weeks. I'll go ahead and submit a PR, details on the changes will be in the request (not many lines were changed; I didn't spot any other exports that were clearly not needed. At the very least, no other lines caused errors). Edit: PR #331 submitted. I closed #330 because I had to revise the commit to contain a signoff and use the correct e-mail. Here's the values on the failed M44d test: [ctest] M44d
[ctest] s: V3d(1, 2, 3)
[ctest] sInq: V3d(1, 2, 3)
[ctest] sInq.baseTypeEpsilon(): 2.220446049250313e-16
[ctest] s-sInq: V3d(0, -4.44089e-16, 0)
[ctest] s[1]: 2.0
[ctest] sInq[1]: 2.0000000000000004 It looks like the precision error is due to sInq[1] being the next represent-able double value after 2, which means the difference is going to be greater than the epsilon returned by the standard library's I did manage to get PyImathTestC running -- it fails on Plane. I haven't had a chance to dig into that yet. Should I open a separate issue for the failed tests? |
Thanks for that analysis. I'm pretty sure the purpose of that test is to confirm that |
I can believe I'm an edge case on the tests; my environment has caused some strange behavior on other projects in the past. Turns out compiling with The assert that fails is on line 6244 of pyImathTest.in, which I believe is the scale; it doesn't have a multiplier on the epsilon. I think I know which code you're referring to, though -- it looks like in the M33x tests there is a multiplier on the epsilon for the scale; perhaps a similar issue came up in the past and that extra wiggle room was added there, but the M44x test wasn't changed. Looking at the revision history it looks like both tests were added 12 years ago and this section hasn't been touched much since then; looks like I'm the lucky one where the difference became relevant, haha. # The M33x test
b = m.extractSHRT(sInq, hInq, rInq, tInq)
assert sInq.equalWithAbsError(s, 2 * sInq.baseTypeEpsilon())
assert hInq.equalWithAbsError(h, hInq.baseTypeEpsilon())
assert rInq.equalWithAbsError((-a, 0), 2 * rInq.baseTypeEpsilon())
assert tInq.equalWithAbsError(t, tInq.baseTypeEpsilon()) # The M44x test
b = m.extractSHRT(sInq, hInq, rInq, tInq)
assert sInq.equalWithAbsError(s, sInq.baseTypeEpsilon()) # line 6244, this is the assert that fails
assert hInq.equalWithAbsError(h, hInq.baseTypeEpsilon())
assert rInq.equalWithAbsError((0, 0, -a), 2 * rInq.baseTypeEpsilon())
assert tInq.equalWithAbsError(t, tInq.baseTypeEpsilon()) Adding a multiplier to be 2*basetypeepsilon should work, since what I'm getting is the next representable value after 2 -- the difference should be exactly twice the base epsilon for double. I'll check in a bit and edit this comment afterwards. edit: Yep. Adding a multiplier of two works fine. This has me thinking about how the floating point values are compared; I'll be posting a comment shortly on that. PR #333 submitted, one line change. |
As a follow-up before I submit a PR to just change the test, a quick note: #include <iostream>
#include <iomanip>
#include <limits>
#include <cmath>
using namespace std;
int main()
{
double baseEp = std::numeric_limits<double>::epsilon();
double deltaone = std::nextafter(1.0, 2.0) - 1.0;
double deltatwo = std::nextafter(2.0, 3.0) - 2.0;
double deltathree = std::nextafter(3.0, 4.0) - 3.0;
double deltafour = std::nextafter(4.0, 5.0) - 4.0;
double deltaseven = std::nextafter(7.0, 8.0) - 7.0;
double deltaeight = std::nextafter(8.0, 9.0) - 8.0;
cout << std::setprecision(17)
<< "numeric_limits<double>::epsilon(): " << baseEp << endl
<< "Delta after one: " << deltaone << endl
<< "Delta after two: " << deltatwo << endl
<< "Delta after three: " << deltathree << endl
<< "Delta after four: " << deltafour << endl
<< "Delta after seven: " << deltaseven << endl
<< "Delta after eight: " << deltaeight << endl
<< "Deltatwo / deltaone: " << deltatwo / deltaone << endl
<< "Deltafour / deltaone: " << deltafour / deltaone << endl
<< "Deltaeight / deltaone: " << deltaeight / deltaone << endl
;
return 0;
} The output of this is: numeric_limits<double>::epsilon(): 2.2204460492503131e-16
Delta after one: 2.2204460492503131e-16
Delta after two: 4.4408920985006262e-16
Delta after three: 4.4408920985006262e-16
Delta after four: 8.8817841970012523e-16
Delta after seven: 8.8817841970012523e-16
Delta after eight: 1.7763568394002505e-15
Deltatwo / deltaone: 2
Deltafour / deltaone: 4
Deltaeight / deltaone: 8 This means that in the tests when checking values >= 2.0, unless there's a multiplier used in the test for the epsilon value, |
@cary-ilm I believe I understand why the bounds are raised for scale but not for the shear or the translation, looking at the matrix the vectors are extracted from and the values being compared. This is for M44x tests, but the same holds true for the M33x tests where you saw the difference in bounds. s = Vec(1, 2, 3)
h = Vec(0.5, 1, 0.75)
a = pi/4
t = Vec(4, 5, 6)
...
# this isn't in the code, but this is the value of m before extraction
m = M44d((0.70710678118654757, -0.70710678118654757, 0, 0),
(2.1213203435596428, 0.70710678118654757, 0, 0),
(3.7123106012293747, -0.5303300858899106, 3, 0),
(4, 5, 6, 1)) S contains values >= 2.0 but < 4.0, so multiplying the baseTypeEpsilon value by 2 allows for bounds up to one representable value away. The values for shear are all less than 2.0, so this multiplier isn't needed. The values expected for translation are unaltered, and there shouldn't be precision error introduced, so it's fine to check for exact equivalence. I am mildly surprised to see that there's a multiplier for the rotation, since the comparison is on values less than one; I wouldn't expect there to be error greater than one value away. |
Thank you for that analysis and the fix. I'm not all that familiar with the history of the various sections of that test, many people have contributed to it over the years, but it certainly looks like many of the comparisons aren't especially scientific or precise, both the ones that are looser than necessary as well as in the case of your failure, too tight. But it's certainly appropriate for the M33 and M44 tests to be similar. |
OS: Windows 10 22H2, 19045.2965
Tools:
Libraries:
cmake configuration:
I've tested using both the Ninja generator and the Visual Studio generator, as well as specifying cxx standards 11 and 17.
Using clang (or clang-cl) to compile the release branch (commit
a4f9d5c
) will error out on the python bindings. I've truncated the log to one of the relevant sections. Apologies if it's messy, it's from VS Code's output window.The issue is tied to
src/python/PyImath/PyImathUtil.h
specifically, wherePYIMATH_EXPORT
is assigned to the deleted copy and move constructors/operators. Clang on Windows will provide a_MSC_VER
definition, so this resolves to__declspec(dllexport)
inPyImathExport.h
. The issue is with the declarations of PyAcquireLock and PyReleaseLock.Clang explicitly disallows dllexport to be assigned to deleted functions; while clang-cl in general attempts to mimic the behavior of MSVC, it would appear that it is rigid in this scenario, while MSVC will still compile. My understanding is that dllexports only apply to non-deleted functions, and as such it's not good practice to be declaring them on deleted functions regardless. Looking through the dumpbin on the generated library file from MSVC, it appears that MSVC's behavior is to not export the deleted functions (the constructors/destructors for
PyAcquireLock
andPyReleaseLock
are still properly exported), implying it sees the declaration and ignores it for the deleted copy/move constructor/operator functions. Performing a diff on the dumpbins compiled using MSVC with and without thePYIMATH_EXPORT
declarations on the deleted functions shows that output is equivalent.Changing the code to
works.
The next issue has to do with the exports declared in an anonymous namespace in
PyImathStringTable.cpp
-- these have internal linkage, and so Clang errors out because__declspec(dllexport)
was used on functions without external linkage. I'm actually unsure as to how MSVC handles this case (I haven't checked a dump for these symbols), but I'm assuming that it's similar to the above scenario.I'm currently working on a fork to fix compile errors when using Clang on Windows. I've managed to get the program to compile; I still need to run tests to verify that I didn't inadvertently break the python bindings. Before I propose the removal of
PYIMATH_EXPORT
being assigned to the deleted functions, I did want to ask if there's a reason for it to be there -- I'll admit I'm not very familiar with building libraries for sharing, and know little about how symbol visibility and exports are handled beyond what I've managed to look up in response to this error, or how__attribute__((visibility("default")))
behaves in environments where that is declared instead. If it wouldn't cause issues (and assuming it's not too minor for a PR), I'll submit a PR once I've got things working -- currently I'm fixing my PATH to include the required DLL's from boost for the python tests to run.The text was updated successfully, but these errors were encountered: