Click or drag to resize

Tesseract Class

http://www.emgu.com
The tesseract OCR engine
Inheritance Hierarchy

Namespace:  Emgu.CV.OCR
Assembly:  Emgu.CV.World (in Emgu.CV.World.dll) Version: 4.1.1.3497 (4.1.1.3497)
Syntax
public class Tesseract : UnmanagedObject

The Tesseract type exposes the following members.

Constructors
  NameDescription
Public methodTesseract(Boolean)
Create a default tesseract engine. Needed to Call Init function to load language files in a later stage.
Public methodTesseract(String, String, OcrEngineMode, String, Boolean)
Create a Tesseract OCR engine.
Top
Properties
  NameDescription
Public propertyStatic memberDefaultTesseractDirectory
Get the default tesseract ocr directory. This should return the folder of the dll in most situations.
Public propertyOem
Get the OCR Engine Mode
Public propertyPageSegMode
Gets or sets the page seg mode.
Public propertyPtr
Pointer to the unmanaged object
(Inherited from UnmanagedObject.)
Public propertyStatic memberVersion
Get the tesseract version
Public propertyStatic memberVersionString
Get the tesseract version as String
Top
Methods
  NameDescription
Public methodAnalyseLayout
Runs page layout analysis in the mode set by SetPageSegMode. May optionally be called prior to Recognize to get access to just the page layout results. Returns an iterator to the results. Returns NULL on error or an empty page. The returned iterator must be deleted after use. WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End DetectOS, or anything else that changes the internal PAGE_RES.
Public methodDispose
The dispose function that implements IDisposable interface
(Inherited from DisposableObject.)
Protected methodDisposeObject
Release the unmanaged resource associated with this class
(Overrides DisposableObjectDisposeObject.)
Public methodEquals (Inherited from Object.)
Protected methodFinalize
Destructor
(Inherited from DisposableObject.)
Public methodGetBoxText
The recognized text is returned as coded in the same format as a box file used in training.
Public methodGetCharacters
Detect all the characters in the image.
Public methodGetHashCode (Inherited from Object.)
Public methodGetHOCRText
Make a HTML-formatted string with hOCR markup from the internal data structures.
Public methodStatic memberGetLangFileUrl
Get the url to download the tessdata file for the specific language
Public methodGetOpenCLDevice
If compiled with OpenCL AND an available OpenCL device is deemed faster than serial code, then "device" is populated with the cl_device_id and returns sizeof(cl_device_id) otherwise *device=nullptr and returns 0.
Public methodGetOsdText
The recognized text
Public methodGetTSVText
Make a TSV-formatted string from the internal data structures.
Public methodGetType (Inherited from Object.)
Public methodGetUNLVText
The recognized text is returned coded as UNLV format Latin-1 with specific reject and suspect codes
Public methodGetUTF8Text
Get all the text in the image
Public methodInit
Initialize the OCR engine using the specific dataPath and language name.
Public methodIsValidWord
Check whether a word is valid according to Tesseract's language model
Protected methodMemberwiseClone (Inherited from Object.)
Public methodProcessPage
Turn a single image into symbolic text.
Public methodRecognize
Recognize the image from SetAndThresholdImage, generating Tesseract internal structures.
Protected methodReleaseManagedResources
Release the managed resources. This function will be called during the disposal of the current object. override ride this function if you need to call the Dispose() function on any managed IDisposable object created by the current object
(Inherited from DisposableObject.)
Public methodSetImage(IInputArray)
Set the image for optical character recognition
Public methodSetImage(Pix)
Set the image for optical character recognition
Public methodSetVariable
Set the variable to the specific value.
Public methodToString (Inherited from Object.)
Top
Fields
  NameDescription
Protected field_ptr
A pointer to the unmanaged object
(Inherited from UnmanagedObject.)
Top
See Also