Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Graceful shutdown stuck #65

Open
manuelbernhardt opened this issue Jun 17, 2024 · 1 comment
Open

Graceful shutdown stuck #65

manuelbernhardt opened this issue Jun 17, 2024 · 1 comment

Comments

@manuelbernhardt
Copy link

We're deploying the snowplow-lake-loader using Google Cloud Run (for convenience). Sometimes the process receives a SIGTERM and proceeds to graceful shutdown. However, it looks like there's something off with the shutdown and in the end an exception is thrown (excerpt from the log below):

INFO com.snowplowanalytics.snowplow.lakes.Run - Received signal 15. Cancelling execution.
INFO com.snowplowanalytics.snowplow.lakes.processing.SparkUtils - Closing the global spark session...
java.lang.IllegalStateException: supervisor already shutdown

This leads to the container to be in a zombie state - the health probe is still correct, but the application hasn't exited and so nothing happens any longer. Is there perhaps a way to catch this exception should it occur?

@istreeter
Copy link
Collaborator

Hi @manuelbernhardt which output format are you writing to? i.e. Delta/Iceberg/Hudi. I'm asking because the shutdown process is slightly different depending on which spark libraries are being used. This might help me track down where the problem is coming from.

Also, which stream are you consuming from? Kinesis/pubsub/kafka?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants