Efficiently detect and segment document regions.
For this challenge,
Input consists in a set of files containing a document from a predefined set
Output should be cropped images in which one can find the document without any other noise for extracting data.
1. Load the Image and resize it.
2. Convert the Image to GrayScale.
3. Image Process
3.1. Bilaternal Filter to reduce Noise and smooth out image.
3.2. Adaptive Threshold to Convert the Image to an Binary Image
3.3. Median Blur to filter out small pixel details
3.4. Black Border around Image to handle Documents that maybe touching the Border of Image
4. Detect Edges using Canny Edge Detector.
5. Find Contours in the Edge Detected Image.
6. Pick Points which Cover the Document.
7. Transform the Image
git clone https://github.com/Yashs744/Inkredo-Programming-Challenge.git
cd Inkredo-Programming-Challenge/
pip3 install -r requirements.txt
python3 detect-Segment.py --image path/to/the/image.jpg
To get results close to human level cropping and segmentation with minimum error rate we can use Auto-Encoder Network.
We can create a dataset of images by cropping the document manually then the network would work as follow:
- Pass the Image to Encoder part of the Network.
- Expect a Cropped Document from the Network as an Output.
- Minimize the Loss between the Document produced by the Network and manually Segmented Document.