Tribits: MPI_EXEC if unaccessible doesn't produce an error related to MPI but fails obscurely #594
Comments
I think so - or a clearer error/warning that MPI didn't set up correctly inside CMake. The user came to me with this obscure unit test configure error. I'm guessing Tribits requires that MPI_EXEC be defined by that point, but I guess nowhere is that assumption enforced? |
It seems in a perfect world,
|
Just throw an error there, so that later stuff can assume MPI_EXEC must be legal |
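A minimal sketch of the kind of early check suggested above. This is a hypothetical guard, not the actual TriBITS code, and where it would belong in the TriBITS MPI setup is an assumption. It also assumes `MPI_EXEC` is passed as a full path, since `if(EXISTS ...)` is only well-defined for full paths.

```cmake
# Sketch only: a hypothetical guard, not the actual TriBITS implementation.
# Assumes MPI_EXEC was given as a full path, e.g. -DMPI_EXEC=/path/to/launcher.
if(DEFINED MPI_EXEC AND IS_ABSOLUTE "${MPI_EXEC}")
  # EXISTS is false when the file is missing or the path cannot be accessed,
  # for example when the launcher sits in a folder this user cannot see.
  if(NOT EXISTS "${MPI_EXEC}")
    message(FATAL_ERROR
      "MPI_EXEC='${MPI_EXEC}' does not exist or is not accessible to this user. "
      "Fix the path or its permissions, or unset MPI_EXEC.")
  endif()
endif()
```

Failing at this point would stop the configure with a message that names `MPI_EXEC`, instead of letting an unrelated unit test configure error surface later.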
@jjellio, it is interesting that it has taken 15 years for this issue to come up. (I don't ever remember hearing about a use case like this.) |
This is trivial to address, except for perhaps an automated test to ensure that a "false" |
@jjellio, how urgent is this? Sounds like other users might hit this on that system? |
Not urgent. I did want to report it though. I'm guessing this will come up again though. If you remember we wrote a Well on El Cap, we don't need that type of tool - but to get faster testing throughput, I wrote a tool I shared with the FLUX team (the job manager on El Cap), that allows for much higher job throughput. But that script was in a folder that my colleague couldn't see. Hopefully we won't need middle-man scripts like that on ATS-4 (El Cap), but in the stand-up process they are nice. What stinks is how ambiguous the error was. I guess I have enough intuition now to know how to diagnose things better. But anyone else who saw this would likely have done a lot of wrong things trying to resolve it. |
@jjellio, many errors often are, depending who sees them. That is why it is good to be paranoid and check everything you can (unless there is a significant performance hit, which is often not the case for issues like this). |
I believe this falls under Tribits, since I think Trilinos doesn't use the official Kitware MPI module.
What happened is a user on the El Capitan systems was using my build setup, which set
`-DMPI_EXEC=/some/path/they/cant/read`
The user's configure only gave this as an error:
Yet when I used the same configure and environment, it worked, so I diffed the CMakeCache and we see:
Looking in their configure log (saved cmake output), this is reported, but it is not an error:
Instead, the user got the Teuchos unit test error which would send most users looking in the wrong direction.
We could probably produce this failure anywhere. Just set `MPI_EXEC` to a location you can't read.