Commit 97187bc

Adding image segmentation 01 and 11 notebooks
1 parent 3dd9914 commit 97187bc

23 files changed, +3024 -648 lines changed

README.md

+5 -4
@@ -2,7 +2,7 @@

 ```diff
 + March 27: Released v1.1 with new and improved
-+ functionality for image retrieval, object detection,
++ functionality for image retrieval, object detection,
 + keypoint detection and action recognition.
 + For additional details, please refer to our releases page.
 ```
@@ -41,7 +41,7 @@ instructions on how to setup the compute environment and dependencies needed to
 notebooks in this repo. Once your environment is setup, navigate to the
 [Scenarios](scenarios) folder and start exploring the notebooks.

-Alternatively, we support Binder
+Alternatively, we support Binder
 [![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/PatrickBue/computervision-recipes/master?filepath=scenarios%2Fclassification%2F01_training_introduction_BINDER.ipynb)
 which makes it easy to try one of our notebooks in a web-browser simply by following this link. However, Binder is free, and as as result only comes with limited CPU compute power and without GPU support. Expect the notebook to run very slowly (this is somewhat improved by reducing image resolution to e.g. 60 pixels but at the cost of low accuracies).
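
Reviewer note: the Binder badge in the hunk above links to a URL whose `filepath` query parameter is the percent-encoded notebook path. A minimal sketch of how such a link can be rebuilt follows; the helper name `binder_url` is hypothetical and not part of this repository.

```python
from urllib.parse import quote


def binder_url(user: str, repo: str, ref: str, notebook_path: str) -> str:
    """Build a mybinder.org launch URL for a notebook in a GitHub repo.

    Hypothetical helper for illustration only; it mirrors the URL shape
    used by the badge in README.md.
    """
    # safe="" forces "/" to be encoded as %2F, as in the badge link.
    encoded_path = quote(notebook_path, safe="")
    return f"https://mybinder.org/v2/gh/{user}/{repo}/{ref}?filepath={encoded_path}"


url = binder_url(
    "PatrickBue",
    "computervision-recipes",
    "master",
    "scenarios/classification/01_training_introduction_BINDER.ipynb",
)
print(url)
```

Passing `safe=""` matters here: by default `quote` leaves `/` unescaped, which would not reproduce the `%2F` separators seen in the badge link.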

@@ -51,10 +51,11 @@ The following is a summary of commonly used Computer Vision scenarios that are c

 | Scenario | Support | Description |
 | -------- | ----------- | ----------- |
-| [Classification](scenarios/classification) | Base | Image Classification is a supervised machine learning technique that allows you to learn and predict the category of a given image. |
+| [Classification](scenarios/classification) | Base | Image Classification is a supervised machine learning technique to learn and predict the category of a given image. |
 | [Similarity](scenarios/similarity) | Base | Image Similarity is a way to compute a similarity score given a pair of images. Given an image, it allows you to identify the most similar image in a given dataset. |
 | [Detection](scenarios/detection) | Base | Object Detection is a technique that allows you to detect the bounding box of an object within an image. |
 | [Keypoints](scenarios/keypoints) | Base | Keypoint detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
+| [Segmentation](scenarios/segmentation) | Base | Image Segmentation assigns a category to each pixel in an image. |
 | [Action recognition](contrib/action_recognition) | Contrib | Action recognition to identify in video/webcam footage what actions are performed (e.g. "running", "opening a bottle") and at what respective start/end times.|
 | [Crowd counting](contrib/crowd_counting) | Contrib | Counting the number of people in low-crowd-density (e.g. less than 10 people) and high-crowd-density (e.g. thousands of people) scenarios.|

@@ -67,7 +68,7 @@ Note that for certain computer vision problems, you may not need to build your o
 The following Microsoft services offer simple solutions to address common computer vision tasks:

 - [Vision Services](https://docs.microsoft.com/en-us/azure/cognitive-services/computer-vision/)
-are a set of pre-trained REST APIs which can be called for image tagging, face recognition, OCR, video analytics, and more. These APIs work out of the box and require minimal expertise in machine learning, but have limited customization capabilities. See the various demos available to get a feel for the functionality (e.g. [Computer Vision](https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/#analyze)). The service can be used through API calls or through SDKs (available in .NET, Python, Java, Node and Go languages)
+are a set of pre-trained REST APIs which can be called for image tagging, face recognition, OCR, video analytics, and more. These APIs work out of the box and require minimal expertise in machine learning, but have limited customization capabilities. See the various demos available to get a feel for the functionality (e.g. [Computer Vision](https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/#analyze)). The service can be used through API calls or through SDKs (available in .NET, Python, Java, Node and Go languages)

 - [Custom Vision](https://docs.microsoft.com/en-us/azure/cognitive-services/custom-vision-service/home)
 is a SaaS service to train and deploy a model as a REST API given a user-provided training set. All steps including image upload, annotation, and model deployment can be performed using an intuitive UI or through SDKs (available in .NEt, Python, Java, Node and Go languages). Training image classification or object detection models can be achieved with minimal machine learning expertise. The Custom Vision offers more flexibility than using the pre-trained cognitive services APIs, but requires the user to bring and annotate their own data.

scenarios/README.md

+5 -2
@@ -2,10 +2,13 @@

 | Scenario | Description |
 | -------- | ----------- |
-| [Classification](classification) | Image Classification is a supervised machine learning technique that allows you to learn and predict the category of a given image. |
+| [Classification](classification) | Image Classification is a supervised machine learning technique to learn and predict the category of a given image. |
 | [Similarity](similarity) | Image Similarity is a way to compute a similarity score given a pair of images. Given an image, it allows you to identify the most similar image in a given dataset. |
 | [Detection](detection) | Object Detection is a technique that allows you to detect the bounding box of an object within an image. |
-| [Keypoints](keypoints) | Keypoint detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
+| [Keypoints](keypoints) | Keypoint Detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
+| [Segmentation](segmentation) | Image Segmentation assigns a category to each pixel in an image. |
+| [Action Recognition](action_recognition) | Action Recognition (also known as activity recognition) consists of classifying various actions from a sequence of frames, such as "reading" or "drinking". |
+

 # Scenarios
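
Reviewer note: the Similarity row above describes scoring a pair of images and retrieving the most similar image in a dataset. The sketch below illustrates that idea under an assumption that is not stated in this diff: each image has already been reduced to a numeric feature vector, and cosine similarity is used as the score. The function names and the toy vectors are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Return the cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def most_similar(query: List[float], dataset: Dict[str, List[float]]) -> Tuple[str, float]:
    """Return (image name, score) of the dataset entry closest to the query."""
    scored = ((name, cosine_similarity(query, feats)) for name, feats in dataset.items())
    return max(scored, key=lambda pair: pair[1])


# Toy feature vectors (assumed; real features would come from a trained model).
dataset = {
    "cat.jpg": [1.0, 0.0, 0.0],
    "dog.jpg": [0.0, 1.0, 0.0],
    "kitten.jpg": [0.9, 0.1, 0.0],
}
best_name, best_score = most_similar([0.8, 0.2, 0.0], dataset)
print(best_name, round(best_score, 3))
```

Other distance measures (for example Euclidean distance between features) would fit the same description; cosine similarity is used here only because it is short to write down.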

scenarios/media/cv_overview.jpg

13.2 KB

scenarios/media/figures.pptx

318 KB
Binary file not shown.

scenarios/segmentation/01_training_introduction.ipynb

+873
Large diffs are not rendered by default.
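
Reviewer note: the notebook diff is not rendered on this page, so the following is not taken from it. As an illustration of the README's one-line description ("assigns a category to each pixel"), here is a minimal sketch that assumes a model has already produced one score map per class; the function name `scores_to_mask` and the toy scores are hypothetical.

```python
from typing import List

ScoreMap = List[List[float]]  # scores[y][x] for a single class


def scores_to_mask(class_scores: List[ScoreMap]) -> List[List[int]]:
    """Turn per-class score maps into a segmentation mask.

    class_scores[c][y][x] is the (assumed) model score for class c at pixel
    (y, x). The mask stores, for every pixel, the index of the class with
    the highest score.
    """
    num_classes = len(class_scores)
    height = len(class_scores[0])
    width = len(class_scores[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            # Ties resolve to the lowest class index.
            best = max(range(num_classes), key=lambda c: class_scores[c][y][x])
            row.append(best)
        mask.append(row)
    return mask


# Two classes (0 = background, 1 = object) on a 2x2 image.
scores = [
    [[0.9, 0.2], [0.1, 0.5]],  # class 0
    [[0.1, 0.8], [0.9, 0.5]],  # class 1
]
mask = scores_to_mask(scores)
print(mask)
```

The output has the same height and width as the input image, which is what distinguishes segmentation from classification (one label per image) in the scenario table.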
