Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document running the RAPIDS Notebooks container as a Snowpark Service #419

Closed
jacobtomlinson opened this issue Sep 5, 2024 · 2 comments
Closed
Assignees

Comments

@jacobtomlinson
Copy link
Member

jacobtomlinson commented Sep 5, 2024

The Snowflake documentation has an example of running a container with Jupyter in as a service in Snowpark and accessing the Jupyter UI via a Service endpoint.

It would be great to add a documentation page that walks through the same steps but using the RAPIDS notebooks container on a GPU compute pool.

I imagine the steps will be something along the lines of:

  • Build an image based on the RAPIDS notebooks image and install the snowflake Python library
  • Push the image to the Snowflake container registry
  • Create a service specification with the RAPIDS container
  • Launch the service
  • Expose/access the Jupyter UI
  • Show reading data from Snowflake into a cudf dataframe
@ncclementi
Copy link
Contributor

ncclementi commented Dec 17, 2024

I looked a bit more into this, and it looks like the steps are pretty much what's detailed. Couple of notes and things to figure out are:

Here is a walkthrough video that also covers part of the rpocess: https://youtu.be/izeG3pvvy9M?list=PLavJpcg8cl1GN7AwaY95PXqwUw4qj8zLo&t=411

Snowflake setup:

TODO:

local env setup/config:

Build an image based on the RAPIDS notebooks image and install the snowflake Python library

TODO:

  • Determine what extra dependencies need to be added (probably "snowflake-snowpark-python[pandas]" snowflake-connector-python ? )
  • Create docker file that is based on the RAPIDS notebook image, create the image

For what I could gather from the demos, it seems like you build the image locally and then push it to the container registry. I was hoping one would be able to pull from let's say dockerhub or similar directly into snowpark registry but I can't find docs on that. Which it might be a bit limiting for a doc example

Push the image to the Snowflake container registry

From the demo docs steps are the image is build locally and they use snowpark CLI + docker to push it to the registry.

Create a service specification with the RAPIDS container
Launch the service
Expose/access the Jupyter UI

This should be almost identical to the process here https://quickstarts.snowflake.com/guide/intro_to_snowpark_container_services/#3 except some rapids caveats that we will find out.

Show reading data from Snowflake into a cudf dataframe

Do we need to create a snowflake table then, is there any snowflake table we can "copy"? what do we have in mind here?

@jacobtomlinson
Copy link
Member Author

Closed by #493

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants