Skip to content

Yashs744/Inkredo-Programming-Challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Inkredo-Programming-Challenge

Task

Efficiently detect and segment document regions.

For this challenge,
Input consists in a set of files containing a document from a predefined set
Output should be cropped images in which one can find the document without any other noise for extracting data.


Detection and Segmentataion of Documents in Images.

Steps to Detect Document in the Image

1. Load the Image and resize it.
2. Convert the Image to GrayScale.
3. Image Process
    3.1. Bilaternal Filter to reduce Noise and smooth out image.
    3.2. Adaptive Threshold to Convert the Image to an Binary Image
    3.3. Median Blur to filter out small pixel details
    3.4. Black Border around Image to handle Documents that maybe touching the Border of Image
4. Detect Edges using Canny Edge Detector.
5. Find Contours in the Edge Detected Image.
6. Pick Points which Cover the Document.
7. Transform the Image

Installation

  git clone https://github.com/Yashs744/Inkredo-Programming-Challenge.git
  cd Inkredo-Programming-Challenge/

Requirements

  pip3 install -r requirements.txt

Execution

  python3 detect-Segment.py --image path/to/the/image.jpg

Example

result-1

Input

result-2(i)

Output

result-2(ii)


What's Next ?

To get results close to human level cropping and segmentation with minimum error rate we can use Auto-Encoder Network.
We can create a dataset of images by cropping the document manually then the network would work as follow:

  • Pass the Image to Encoder part of the Network.
  • Expect a Cropped Document from the Network as an Output.
  • Minimize the Loss between the Document produced by the Network and manually Segmented Document.

References

Releases

No releases published

Packages

No packages published

Languages