Click or drag to resize
Tesseract Class
The tesseract OCR engine
Inheritance Hierarchy

Namespace: Emgu.CV.OCR
Assembly: Emgu.CV.World (in Emgu.CV.World.dll) Version: (
public class Tesseract : UnmanagedObject

The Tesseract type exposes the following members.

Public methodTesseract
Create a default tesseract engine. Needed to Call Init function to load language files in a later stage.
Public methodTesseract(String, String, OcrEngineMode)
Create an tesseract OCR engine.
Public methodTesseract(String, String, OcrEngineMode, String)
Create an tesseract OCR engine.
Public propertyOem
Get the OCR Engine Mode
Public propertyPageSegMode
Gets or sets the page seg mode.
Public propertyPtr
Pointer to the unmanaged object
(Inherited from UnmanagedObject.)
Public propertyStatic memberVersion
Get the tesseract version
Public propertyStatic memberVersionString
Get the tesseract version as String
Public methodAnalyseLayout
Runs page layout analysis in the mode set by SetPageSegMode. May optionally be called prior to Recognize to get access to just the page layout results. Returns an iterator to the results. Returns NULL on error or an empty page. The returned iterator must be deleted after use. WARNING! This class points to data held within the TessBaseAPI class, and therefore can only be used while the TessBaseAPI class still exists and has not been subjected to a call of Init, SetImage, Recognize, Clear, End DetectOS, or anything else that changes the internal PAGE_RES.
Public methodDispose
The dispose function that implements IDisposable interface
(Inherited from DisposableObject.)
Protected methodDisposeObject
Release the unmanaged resource associated with this class
(Overrides DisposableObjectDisposeObject.)
Public methodEquals (Inherited from Object.)
Protected methodFinalize
(Inherited from DisposableObject.)
Public methodGetBoxText
The recognized text is returned as coded in the same format as a box file used in training.
Public methodGetCharacters
Detect all the characters in the image.
Public methodGetHashCode (Inherited from Object.)
Public methodGetHOCRText
Make a HTML-formatted string with hOCR markup from the internal data structures.
Public methodGetOpenCLDevice
Get the OpenCL device pointer
Public methodGetOsdText
The recognized text
Public methodGetTSVText
Make a TSV-formatted string from the internal data structures.
Public methodGetType (Inherited from Object.)
Public methodGetUNLVText
The recognized text is returned coded as UNLV format Latin-1 with specific reject and suspect codes
Public methodGetUTF8Text
Get all the text in the image
Public methodInit
Initialize the OCR engine using the specific dataPath and language name.
Public methodIsValidWord
Check whether a word is valid according to Tesseract's language model
Protected methodMemberwiseClone (Inherited from Object.)
Public methodProcessPage
Turn a single image into symbolic text.
Public methodRecognize
Recognize the image from SetAndThresholdImage, generating Tesseract internal structures.
Protected methodReleaseManagedResources
Release the managed resources. This function will be called during the disposal of the current object. override ride this function if you need to call the Dispose() function on any managed IDisposable object created by the current object
(Inherited from DisposableObject.)
Public methodSetImage(IInputArray)
Set the image for optical character recognition
Public methodSetImage(Pix)
Set the image for optical character recognition
Public methodSetVariable
Set the variable to the specific value.
Public methodToString (Inherited from Object.)
Protected field_ptr
A pointer to the unmanaged object
(Inherited from UnmanagedObject.)
See Also