Ability to ignore specific files/folders in ModelTrainer's script mode #5091
Labels
component: pysdk-team
Related to SageMaker Python SDK Core Issues
component: training
Relates to the SageMaker Training Platform
type: feature request
Describe the feature you'd like
I do not want having
.git
,.env
,.vscode
,data
,__pycache__
or any irrelevant files/folders to be uploaded to S3 artifacts when I use script mode ofModelTrainer
in SourceCode. Moreover, copingsource_dir
may be time-consuming due to the large number of files, such as.git
or/and.env
.How would this feature be used? Please describe.
During development/sanity-checing on local machine I have some unnecessary files/folders. The idea is to not upload them during using a script mode.
Let's say I have project structure:
expected result:
s3://<default_bucket_path>/<base_job_name>/input/code/
w/o ignored files/foldersDescribe alternatives you've considered
script
mode (isn't practical for some cases, e.g. sanity-check)Additional context
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: