Loading all splits of object detection dataset #940
Replies: 3 comments
-
Hi @DANISHFAYAZNAJAR 👋🏻 Let me convert this issue into a discussion and put it into the Q&A section.
-
Hi @DANISHFAYAZNAJAR 👋🏻 YOLO is the only data format that uses one file to store information about subsets like this. COCO, PASCAL, and other formats treat subsets as separate datasets. To keep the API consistent, we made YOLO behave the same. At this point, we do not plan to change that behavior. I recommend writing a utility using supervision datasets as building blocks.
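For reference, a minimal sketch of such a utility is shown below. It assumes the Roboflow-style YOLO folder layout from the original question (`train`/`valid`/`test` subfolders, each with `images` and `labels`, plus a shared `data.yaml`) and returns the splits as a plain dict; the function name `load_yolo_splits` is hypothetical and not part of the supervision API.

```python
from pathlib import Path

import supervision as sv


def load_yolo_splits(dataset_dir: str) -> dict:
    """Load every available YOLO split (train/valid/test) as a separate DetectionDataset."""
    root = Path(dataset_dir)
    splits = {}
    for split in ("train", "valid", "test"):
        images_dir = root / split / "images"
        labels_dir = root / split / "labels"
        # Only load splits that actually exist in the downloaded dataset.
        if images_dir.is_dir() and labels_dir.is_dir():
            splits[split] = sv.DetectionDataset.from_yolo(
                images_directory_path=str(images_dir),
                annotations_directory_path=str(labels_dir),
                data_yaml_path=str(root / "data.yaml"),
            )
    return splits


# Usage (dataset.location comes from the Roboflow download in the question below):
# splits = load_yolo_splits(dataset.location)
# splits["train"].classes  # e.g. ['dog', 'person']
```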
-
Hi @SkalskiP, thank you for your response. Indeed, we can write a function that loads the dataset splits: given the path to the dataset folder, it can look for the 'train', 'valid', and 'test' folders, create three datasets, merge them, and then split them again. However, merging and re-splitting may mix samples across the splits (see the sketch below). Furthermore, the merged dataset would not give us the same structure with three distinct splits that Hugging Face's Dataset provides.
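To make that concern concrete, a merge-and-re-split approach might look like the sketch below. It assumes your supervision version provides `DetectionDataset.merge` and `DetectionDataset.split`, and that `dataset.location` comes from the Roboflow download shown in the original question; because the re-split boundaries are random, the new splits no longer match the original train/valid partition.

```python
import supervision as sv

# Load two of the original splits separately (paths follow the YOLO layout).
train_ds = sv.DetectionDataset.from_yolo(
    images_directory_path=f"{dataset.location}/train/images",
    annotations_directory_path=f"{dataset.location}/train/labels",
    data_yaml_path=f"{dataset.location}/data.yaml",
)
valid_ds = sv.DetectionDataset.from_yolo(
    images_directory_path=f"{dataset.location}/valid/images",
    annotations_directory_path=f"{dataset.location}/valid/labels",
    data_yaml_path=f"{dataset.location}/data.yaml",
)

# Merge, then split again. The new split is drawn randomly from the merged pool,
# so samples from the original train and valid sets end up mixed together.
merged = sv.DetectionDataset.merge([train_ds, valid_ds])
new_train, new_valid = merged.split(split_ratio=0.8, random_state=42, shuffle=True)
```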
-
Question
Instead of loading just a single split, why can't we load the whole dataset with all of its splits?
```python
import roboflow
from roboflow import Roboflow
import supervision as sv

roboflow.login()

rf = Roboflow()
project = rf.workspace(WORKSPACE_ID).project(PROJECT_ID)
dataset = project.version(PROJECT_VERSION).download("yolov5")

ds = sv.DetectionDataset.from_yolo(
    images_directory_path=f"{dataset.location}/train/images",
    annotations_directory_path=f"{dataset.location}/train/labels",
    data_yaml_path=f"{dataset.location}/data.yaml"
)

ds.classes
# ['dog', 'person']
```
Additional
"I attempted to load my dataset, which is divided into training, validation, and testing sets. However, when I attempted to use sv.DetectionDataset.from_yolo to load the dataset, it required me to provide paths for the images folder, the annotations directory and data.yaml file. Is there a way to simply pass the main dataset folder path and have the function automatically load all the available splits?"
Beta Was this translation helpful? Give feedback.
All reactions