Difference between revisions of "OCR templates"
Line 1: | Line 1: | ||
{{TOCright}} __TOC__ | {{TOCright}} __TOC__ | ||
− | {{Note|'''From OpenKM 6.4+ the Plug-in system of OpenKM''' allows you to quickly expand the functionality offered by the platform, extending the available OCR field parsers without having to rebuild the system to add/change the existing functionality. Refer to [[Extend_OCR_field_parsers]] if you need to extend them. | + | {{Note|'''From OpenKM 6.4+ the Plug-in system of OpenKM''' allows you to quickly expand the functionality offered by the platform, extending the available OCR field parsers without having to rebuild the system to add/change the existing functionality.}} |
+ | |||
+ | Refer to [[Extend_OCR_field_parsers]] if you need to extend them. | ||
OCR Templates allows to create zonal OCR templates which allows to recognise and extract estructured text from scanned images. | OCR Templates allows to create zonal OCR templates which allows to recognise and extract estructured text from scanned images. |
Revision as of 19:06, 7 September 2013
Refer to Extend_OCR_field_parsers if you need to extend them.
OCR Templates allows to create zonal OCR templates which allows to recognise and extract estructured text from scanned images.
Images should be scanned at least at 200 dpi of resolution to get good text recognition from OCR engine. |
Template creation
Open OCR template administration option.
Then click on the new ocr template icon
Fill the form and click create button.
Add field zones
Fill the form and select the zone and click create button.
Test OCR template
Click on the check icon and will be shown all the data fields extracted by zone.
Recognise testing
From main OCR template list, click recognise icon.
Fill the form selecting some scanned image to test recognision.
Click recognise button.
Enable Zonal OCR data capture
OCR data capture can be enabled from profiles and automation.
Profiles
To enable zonal OCR data capture from profiles should be enabled the check OCR data capture.
Automation
Automation is divided in two operations, validations and actions.
There's a validation called IsOCRDataCaptureFile which validates if OCR data capture engine supports the image format.
There're two actions OCRDataCapture and AddOCRDataCaptureToWizard. OCRDataCapture capture data and store to metadata. AddOCRDataCaptureToWizard enables end user wizard to see live ocr data capture process.
For more information take a look at Automation.