You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
which makes it easy to try one of our notebooks in a web-browser simply by following this link. However, Binder is free, and as as result only comes with limited CPU compute power and without GPU support. Expect the notebook to run very slowly (this is somewhat improved by reducing image resolution to e.g. 60 pixels but at the cost of low accuracies).
47
47
@@ -51,10 +51,11 @@ The following is a summary of commonly used Computer Vision scenarios that are c
51
51
52
52
| Scenario | Support | Description |
53
53
| -------- | ----------- | ----------- |
54
-
|[Classification](scenarios/classification)| Base | Image Classification is a supervised machine learning technique that allows you to learn and predict the category of a given image. |
54
+
|[Classification](scenarios/classification)| Base | Image Classification is a supervised machine learning technique to learn and predict the category of a given image. |
55
55
|[Similarity](scenarios/similarity)| Base | Image Similarity is a way to compute a similarity score given a pair of images. Given an image, it allows you to identify the most similar image in a given dataset. |
56
56
|[Detection](scenarios/detection)| Base | Object Detection is a technique that allows you to detect the bounding box of an object within an image. |
57
57
|[Keypoints](scenarios/keypoints)| Base | Keypoint detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
58
+
|[Segmentation](scenarios/segmentation)| Base | Image Segmentation assigns a category to each pixel in an image. |
58
59
|[Action recognition](contrib/action_recognition)| Contrib | Action recognition to identify in video/webcam footage what actions are performed (e.g. "running", "opening a bottle") and at what respective start/end times.|
59
60
|[Crowd counting](contrib/crowd_counting)| Contrib | Counting the number of people in low-crowd-density (e.g. less than 10 people) and high-crowd-density (e.g. thousands of people) scenarios.|
60
61
@@ -67,7 +68,7 @@ Note that for certain computer vision problems, you may not need to build your o
67
68
The following Microsoft services offer simple solutions to address common computer vision tasks:
are a set of pre-trained REST APIs which can be called for image tagging, face recognition, OCR, video analytics, and more. These APIs work out of the box and require minimal expertise in machine learning, but have limited customization capabilities. See the various demos available to get a feel for the functionality (e.g. [Computer Vision](https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/#analyze)). The service can be used through API calls or through SDKs (available in .NET, Python, Java, Node and Go languages)
71
+
are a set of pre-trained REST APIs which can be called for image tagging, face recognition, OCR, video analytics, and more. These APIs work out of the box and require minimal expertise in machine learning, but have limited customization capabilities. See the various demos available to get a feel for the functionality (e.g. [Computer Vision](https://azure.microsoft.com/en-us/services/cognitive-services/computer-vision/#analyze)). The service can be used through API calls or through SDKs (available in .NET, Python, Java, Node and Go languages)
is a SaaS service to train and deploy a model as a REST API given a user-provided training set. All steps including image upload, annotation, and model deployment can be performed using an intuitive UI or through SDKs (available in .NEt, Python, Java, Node and Go languages). Training image classification or object detection models can be achieved with minimal machine learning expertise. The Custom Vision offers more flexibility than using the pre-trained cognitive services APIs, but requires the user to bring and annotate their own data.
Copy file name to clipboardexpand all lines: scenarios/README.md
+5-2
Original file line number
Diff line number
Diff line change
@@ -2,10 +2,13 @@
2
2
3
3
| Scenario | Description |
4
4
| -------- | ----------- |
5
-
|[Classification](classification)| Image Classification is a supervised machine learning technique that allows you to learn and predict the category of a given image. |
5
+
|[Classification](classification)| Image Classification is a supervised machine learning technique to learn and predict the category of a given image. |
6
6
|[Similarity](similarity)| Image Similarity is a way to compute a similarity score given a pair of images. Given an image, it allows you to identify the most similar image in a given dataset. |
7
7
|[Detection](detection)| Object Detection is a technique that allows you to detect the bounding box of an object within an image. |
8
-
|[Keypoints](keypoints)| Keypoint detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
8
+
|[Keypoints](keypoints)| Keypoint Detection can be used to detect specific points on an object. A pre-trained model is provided to detect body joints for human pose estimation. |
9
+
|[Segmentation](segmentation)| Image Segmentation assigns a category to each pixel in an image. |
10
+
|[Action Recognition](action_recognition)| Action Recognition (also known as activity recognition) consists of classifying various actions from a sequence of frames, such as "reading" or "drinking". |
0 commit comments