ML-Kit is a free-to-use machine learning API for Android and
iOS apps, focusing
on on-device AI solutions.
In this repo, I have made a demo of all the AI techniques I have performed with ML-Kit on Android.
The ML techniques used in this demo app include but are not limited to:
- This algorithm detects face mesh info on close-range images.
- This algorithm can recognize and extract text from images.
- Detecting the position of human body in any given picture/video in real-time.
- Separates the background of a picture/video from users within it. Helps to focus on more important objects in the picture/video.
- Localize and tag in realtime one or more objects in the live camera feed.
- Scanning and processing most kinds of barcodes. Supports various standard 1D and 2D (a.k.a. QR) barcode formats.
- This algorithm identifies objects, locations, activities, animal species, products and more in a given picture.
- Detects faces and facial landmarks in a given image/video.
- This amazing demo camera app firstly determines the language of a string of text with just a few words. And then, translates that text between 58 languages; completely on device.
- This part of the app recognizes handwritten text and hand-drawn shapes (such as emojis) on a digital surface, such as a touch screen.
Example results:
data:image/s3,"s3://crabby-images/e2906/e2906f2fc0efb79291352dda3d1f68ba45335d6d" alt="face mesh 1"
Some examples:
data:image/s3,"s3://crabby-images/564b1/564b1753eed22a65a4affef223ba1be39c8206ef" alt="before applying text recognition"
data:image/s3,"s3://crabby-images/4d5c7/4d5c71a3c9af5771452878ef29d3afc3d745f761" alt="after applying text recognition"
Example result:
data:image/s3,"s3://crabby-images/52607/5260792be58f51717e4c70669d4f8ef26fe8bf06" alt="pose detection 1"
data:image/s3,"s3://crabby-images/0b74a/0b74a82ac1c074343f45d2d6d74e41142a31c577" alt="selfie segmentation 1"
data:image/s3,"s3://crabby-images/9ec54/9ec54e7fdb69a843be2595e0e1edd8e9804ac424" alt="selfie segmentation 2"
Example results of object detection:
data:image/s3,"s3://crabby-images/9614f/9614f8853a95b98c7a949a4b5cae4e698e4695e0" alt="Object detection 1"
data:image/s3,"s3://crabby-images/a7bb4/a7bb49635b1a9ce2f067507f2a86154462a19b79" alt="Object detection 2"
data:image/s3,"s3://crabby-images/1c558/1c5581343111e10555e4c8f5e1d189f5ee3d5199" alt="Object detection 3"
data:image/s3,"s3://crabby-images/bc060/bc0600fdecd8b893bc8d69d57ba746400689d42c" alt="Object detection 4"
data:image/s3,"s3://crabby-images/efe71/efe71c977ed6588f77688df496415c43c9dfd5cd" alt="Object detection 5"
Example result of barcode scanner:
data:image/s3,"s3://crabby-images/55c5b/55c5bce20ef5253c294f7e4d66a899788f085518" alt="Barcode scanner 1"
A video demo of barcode scanner:
Realtime barcode scanner video demo
barcode_demo.mp4
For example, in the picture below, it has managed to label the road, Jeans, Jacket, and Buildings in the picture correctly.
data:image/s3,"s3://crabby-images/0775d/0775d75089ba9165d7dce2602e9ab587c1995799" alt="face detection 1"
Example results:
data:image/s3,"s3://crabby-images/7cb63/7cb6334aae0c1e8037b0c8b45ebfa60e1681f8df" alt="face detection 1"
A video demo of realtime camera translator:
Live camera translator video demo
live.camera.translator.mp4
Some examples:
data:image/s3,"s3://crabby-images/705be/705be134bcf9e32908f91938b5d91b1b95655e38" alt="Digital ink recognition 1"
data:image/s3,"s3://crabby-images/e43fa/e43fa97041707bd38f5664ef8505b4d8412d72ed" alt="Digital ink recognition 2"
A video demo of digital ink recognition:
Digital ink recognition video demo
2023-04-11.01.05.42.mp4
Please star and fork this repo, I will be maintaining it over time and will try to add more ML
related library demos to
it.
Feel free to open issues and point out the bugs and short-comings.
new architecture of the app is going to be like this: we will have these 5 main modules in our codebase:
- feature_ink_recognition
- feature_live_translator
- feature_still_image
- feature_live_preview
- shared