-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
use 'sagemaker-python-sdk' instead of 'sagemaker' #504
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
- **IAM_ROLE** = Create a new role > Create role | ||
- **Notebook instance name** = Name of the notebook instance | ||
- **Notebook instance type** = Type of notebook instance. Select a RAPIDS-compatible GPU ([see the RAPIDS docs](https://docs.rapids.ai/install#system-req)) as the SageMaker Notebook instance type (e.g., `ml.p3.2xlarge`). | ||
- **Platform identifier** = 'Amazon Linux 2, Jupyter Lab 4' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes here:
- use the same case as the UI
- remove "IAM Role" (these examples work with the default role added by SageMaker)
- update to
Amazon Linux 2, Jupyter Lab 4
(the latest platform SageMaker supports)
@@ -1,27 +0,0 @@ | |||
ARG RAPIDS_IMAGE |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is code in the notebook that writes this... it's not necessary to have a copy checked into the repo too.
" cupy \\\n", | ||
" dask-ml \\\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was missing, and so the example code failed like this:
Traceback (most recent call last):
File "/opt/ml/code/train.py", line 75, in <module>
train()
File "/opt/ml/code/train.py", line 27, in train
ml_workflow = create_workflow(hpo_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/ml/code/MLWorkflow.py", line 43, in create_workflow
from workflows.MLWorkflowMultiGPU import MLWorkflowMultiGPU
File "/opt/ml/code/workflows/MLWorkflowMultiGPU.py", line 34, in <module>
from dask_ml.model_selection import train_test_split
ModuleNotFoundError: No module named 'dask_ml'
" flask \\\n", | ||
" protobuf \\\n", | ||
" sagemaker\n", | ||
" rapids-dask-dependency=${{ rapids_version }} \\\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To ensure dask-ml
installation doesn't result in upgrading / downgrading dask
and distributed
and therefore changing the installed version of RAPIDS libraries.
}, | ||
"nightly": { | ||
"rapids_version": f"{nightly_version}-nightly", | ||
"rapids_version": f"{nightly_version}", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nothing in the repo relies on this -nightly
being there, as far as I can tell.
git grep rapids_version
And in fact, this difference is causing some bugs. Look at https://docs.rapids.ai/deployment/nightly/platforms/databricks/#install-rapids-and-dask .... it's saying you should install dask-cuda==25.02-nightly
, which does not exist.
![Screenshot 2025-02-11 at 9 00 42 PM](https://private-user-images.githubusercontent.com/7608904/412245674-e9d665a5-6e04-45d8-82d4-efd4d2f7c69d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2OTA3NTUsIm5iZiI6MTczOTY5MDQ1NSwicGF0aCI6Ii83NjA4OTA0LzQxMjI0NTY3NC1lOWQ2NjVhNS02ZTA0LTQ1ZDgtODJkNC1lZmQ0ZDJmN2M2OWQucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MDIxNiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTAyMTZUMDcyMDU1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9NjAzMGE2NzM1ZTQwZTlkMDJlZTg2ZDRjYWZiNzFiMzA2NTI2MzE4NjM2ZTgxOGU0OGUxYWIzMGM4MDFlZDdiNCZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.ZLtXL_dogePSIve3Wv9alrFASzWQj_OqGLt7hOFnRpA)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I think this might be a holdover from a previous naming scheme. Fine to remove it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good!
Could we add a warning admonition that explains to users that we need to pin until Sagemaker moves off Amazon Linux 2?
}, | ||
"nightly": { | ||
"rapids_version": f"{nightly_version}-nightly", | ||
"rapids_version": f"{nightly_version}", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I think this might be a holdover from a previous naming scheme. Fine to remove it.
RAPIDS `>24.12` will not be installable on SageMaker Notebook Instances until those instances support | ||
Amazon Linux 2023 or other Linux distributions with GLIBC of at least 2.28. | ||
For more details, see https://github.com/rapidsai/deployment/issues/520. | ||
``` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jacobtomlinson what do you think about this language for a warning?
I checked the preview and looks like it's rendering OK:
![image](https://private-user-images.githubusercontent.com/7608904/412546465-262b9a9c-b7e2-4e97-843f-6ae8e53c8055.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2OTA3NTUsIm5iZiI6MTczOTY5MDQ1NSwicGF0aCI6Ii83NjA4OTA0LzQxMjU0NjQ2NS0yNjJiOWE5Yy1iN2UyLTRlOTctODQzZi02YWU4ZTUzYzgwNTUucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI1MDIxNiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNTAyMTZUMDcyMDU1WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9MTkxYjY4ZDE5ZTBmYzE1YTk4OWY1NjMzZDg0M2Y1NDRkNjA2YjJlOWVlMWJlNjM2NWYxNDQ3NGNkZjFhNWJlNSZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QifQ.ZFAbL7Z8FWGMUBFICCnc7uleRusri45K4cvMoDRMFV0)
https://rapids-deployment--504.org.readthedocs.build/en/504/cloud/aws/sagemaker/
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Language looks good 👍
Co-authored-by: Jacob Tomlinson <[email protected]>
Fixes #443
Contributes to #506
sagemaker
tosagemaker-python-sdk
conda-forge packagessagemaker
is just abandoned: Archive this feedstock? conda-forge/sagemaker-feedstock#61sagemaker-python-sdk
to avoid fallbacks to older, less-constrained versionsNotes for Reviewers
I tested the SageMaker pages as part of this:
ml.p3.2xlarge
instances, could not provision larger instances likeml.p3.8xlarge
(quota limit hit)ml.p3.2xlarge
instances, could not provision larger instances likeml.p3.8xlarge
(no capacity)