Difference between revisions of "Working with Images"
Line 1: | Line 1: | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Depth and Color as Generic Parameter == | == Depth and Color as Generic Parameter == | ||
An Image is defined by its generic parameters: '''color''' and '''depth'''. To create a 8bit unsigned Grayscale image, in [[Emgu CV]] it is done by calling | An Image is defined by its generic parameters: '''color''' and '''depth'''. To create a 8bit unsigned Grayscale image, in [[Emgu CV]] it is done by calling | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Byte> image = new Image<Gray, Byte>( width, height); |
− | + | </source> | |
− | Not only this syntax make you aware the color and the depth of the image, it also restrict the way you use functions and capture errors in compile time. For example, the | + | Not only this syntax make you aware the color and the depth of the image, it also restrict the way you use functions and capture errors in compile time. For example, the <code>SetValue([[#Image Color |TColor]] color, Image<Gray, Byte> mask)</code> function in Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class (version >= [[Version_History#Emgu.CV-1.2.2.0|1.2.2.0]]) will only accept colors of the same type, and mask has to be an 8-bit unsigned grayscale image. Any attempts to use a 16-bit floating point or non-grayscale image as a mask will results a compile time error! |
==Creating Image== | ==Creating Image== | ||
− | Although it is possible to create image by calling | + | Although it is possible to create image by calling <code>CvInvoke.cvCreateImage</code>, it is suggested to construct a Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> object instead. There are several advantages using the managed Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class |
* Memory is automatically released by the garbage collector | * Memory is automatically released by the garbage collector | ||
− | * Image | + | * Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class can be examined by [[Debugger Visualizer]] |
− | * Image | + | * Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class contains advanced method that is not available on [[OpenCV]], for example, [[Tutorial#Generic_Operation | generic operation on image pixels]], conversion to Bitmap etc. |
===Image Color=== | ===Image Color=== | ||
The first generic parameter of the Image class specific the color of the image type. For example | The first generic parameter of the Image class specific the color of the image type. For example | ||
− | + | <code>Image<Gray, ...> img1; </code> | |
− | indicates that | + | indicates that <code>img1</code> is a single channel grayscale image. |
Color Types supported in [[Version_History#Emgu.CV-1.3.0.0|Emgu CV 1.3.0.0]] includes: | Color Types supported in [[Version_History#Emgu.CV-1.3.0.0|Emgu CV 1.3.0.0]] includes: | ||
Line 39: | Line 31: | ||
===Image Depth=== | ===Image Depth=== | ||
− | Image Depth is specified using the second generic parameter | + | Image Depth is specified using the second generic parameter <code>Depth</code>. |
The types of depth supported in [[Version_History#Emgu.CV-1.4.0.0|Emgu CV 1.4.0.0]] include | The types of depth supported in [[Version_History#Emgu.CV-1.4.0.0|Emgu CV 1.4.0.0]] include | ||
* Byte | * Byte | ||
Line 51: | Line 43: | ||
===Creating a new image=== | ===Creating a new image=== | ||
To create an 480x320 image of Bgr color and 8-bit unsigned depth. The code in C# would be | To create an 480x320 image of Bgr color and 8-bit unsigned depth. The code in C# would be | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Bgr, Byte> img1 = new Image<Bgr, Byte>(480, 320); |
− | + | </source> | |
If you wants to specify the background value of the image, let's say in Blue. The code in C# would be | If you wants to specify the background value of the image, let's say in Blue. The code in C# would be | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Bgr, Byte> img1 = new Image<Bgr, Byte>(480, 320, new Bgr(255, 0, 0)); |
− | + | </source> | |
===Reading image from file=== | ===Reading image from file=== | ||
Creating image from file is also simple. If the image file is "MyImage.jpg", in C# it is | Creating image from file is also simple. If the image file is "MyImage.jpg", in C# it is | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Bgr, Byte> img1 = new Image<Bgr, Byte>("MyImage.jpg"); |
− | + | </source> | |
===Creating image from Bitmap=== | ===Creating image from Bitmap=== | ||
− | It is also possible to create an Image | + | It is also possible to create an Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> from a .Net Bitmap object. The code in C# would be |
− | + | <source lang="csharp"> | |
− | Image | + | Image<Bgr, Byte> img = new Image<Bgr, Byte>(bmp); //where bmp is a Bitmap |
− | + | </source> | |
==Automatic Garbage Collection== | ==Automatic Garbage Collection== | ||
− | The Image | + | The Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class automatically take care of the memory management and garbage collection. |
− | Once the garbage collector decided that there is no more reference to the Image | + | Once the garbage collector decided that there is no more reference to the Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> object, it will call the <code>Disposed</code> method, which release the unmanaged IplImage structure. |
− | The time of when garbage collector decides to dispose the image is not guaranteed. When working with large image, it is recommend to call the | + | The time of when garbage collector decides to dispose the image is not guaranteed. When working with large image, it is recommend to call the <code>Dispose()</code> method to explicitly release the object. Alternatively, use the '''''using''''' keyword in C# to limit the scope of the image |
− | + | <source lang="csharp"> | |
− | using (Image | + | using (Image<Gray, Single> image = new Image<Gray, Single>(1000, 800)) |
{ | { | ||
... //do something here in the image | ... //do something here in the image | ||
} //The image will be disposed here and memory freed | } //The image will be disposed here and memory freed | ||
− | + | </source> | |
==Getting or Setting Pixels== | ==Getting or Setting Pixels== | ||
===The safe (slow) way=== | ===The safe (slow) way=== | ||
− | * Suppose you are working on an Image | + | * Suppose you are working on an Image<Bgr, Byte>. You can obtain the pixel on the y-th row and x-th column by calling |
− | + | <source lang="csharp"> | |
Bgr color = img[y, x]; | Bgr color = img[y, x]; | ||
− | + | </source> | |
* Setting the pixel on the y-th row and x-th column is also simple | * Setting the pixel on the y-th row and x-th column is also simple | ||
− | + | <source lang="csharp"> | |
img[y,x] = color; | img[y,x] = color; | ||
− | + | </source> | |
===The fast way=== | ===The fast way=== | ||
* The Image pixels values are stored in the Data property, a 3D array. Use this property if you need to iterate through the pixel values of the image. | * The Image pixels values are stored in the Data property, a 3D array. Use this property if you need to iterate through the pixel values of the image. | ||
Line 100: | Line 92: | ||
==Methods== | ==Methods== | ||
===Naming Convention=== | ===Naming Convention=== | ||
− | * Method | + | * Method <code>XYZ</code> in Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class corresponds to the [[OpenCV]] function <code>cvXYZ</code>. For example, Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]>.Not() function corresponds to <code> cvNot </code> function with the resulting image being returned. |
− | * Method | + | * Method <code>_XYZ</code> is usually the same as Method <code>XYZ</code> except that the operation is performed '''inplace''' rather than returning a value. For example, Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]>._Not() function performs the bit-wise inversion inplace. |
==Operators Overload== | ==Operators Overload== | ||
− | The operators | + | The operators <code>+ - * /</code> has been overloaded (version > [[Version_History#Emgu.CV-1.2.2.0|1.2.2.0]]) such that it is perfectly legal to write codes like: |
− | Image | + | Image<Gray, Byte> image3 = (image1 + image2 - 2.0) * 0.5; |
==Generic Operation== | ==Generic Operation== | ||
Line 112: | Line 104: | ||
It's best if I demonstrate this with an example. Suppose we have an grayscale image of bytes | It's best if I demonstrate this with an example. Suppose we have an grayscale image of bytes | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Byte> img1 = new Image<Gray, Byte>(400, 300, new Gray(30)); |
− | + | </source> | |
To invert all the pixels in this image we can call the Not function | To invert all the pixels in this image we can call the Not function | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Byte> img2 = img1.Not(); |
− | + | </source> | |
− | As an alternative, we can also use the generic method | + | As an alternative, we can also use the generic method <code> Convert </code> available from the Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class |
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Byte> img3 = img1.Convert<Byte>( delegate(Byte b) { return (Byte) (255-b); } ); |
− | + | </source> | |
− | The resulting image | + | The resulting image <code>img2</code> and <code>img3</code> contains the same value for each pixel. |
− | At first glance it wouldn't seems to be a big gain when using generic operations. In fact, since [[OpenCV]] already has an implementation of the | + | At first glance it wouldn't seems to be a big gain when using generic operations. In fact, since [[OpenCV]] already has an implementation of the <code>Not</code> function and performance-wise it is better than the generic version of the equivalent <code>Convert</code> function call. However, there comes to cases when generic functions provide the flexibility with only minor performance penalty. |
− | Let's say you have an | + | Let's say you have an <code>Image<Gray, Byte> img1</code> with pixels set. You wants to create a single channel floating point image of the same size, where each pixel of the new image, correspond to the old image, described with the following delegate |
− | + | <source lang="csharp"> | |
delegate(Byte b) { return (Single) Math.cos( b * b / 255.0); } | delegate(Byte b) { return (Single) Math.cos( b * b / 255.0); } | ||
− | + | </source> | |
This operation can be completed as follows in [[Emgu CV]] | This operation can be completed as follows in [[Emgu CV]] | ||
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Single> img4 = img1.Convert<Single>( delegate(Byte b) { return (Single) Math.cos( b * b / 255.0); } ); |
− | + | </source> | |
− | The syntax is simple and meaningful. On the other hand, this operation in [[OpenCV]] is hard to perform since equivalent function such as | + | The syntax is simple and meaningful. On the other hand, this operation in [[OpenCV]] is hard to perform since equivalent function such as <code>Math.cos</code> is not available. |
==Drawing Objects on Image== | ==Drawing Objects on Image== | ||
− | The | + | The <code> Draw( )</code> method in Image< [[#Image Color |Color]], [[#Image Depth |Depth]]> can be used to draw different types of objects, including fonts, lines, circles, rectangles, boxes, ellipses as well as contours. Use the documentation and intellisense as a guideline to discover the many functionality of the <code> Draw </code> function. |
==Color and Depth Conversion== | ==Color and Depth Conversion== | ||
− | Converting an Image | + | Converting an Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> between different colors and depths are simple. For example, if you have <code> Image<Bgr, Byte> img1 </code> and you wants to convert it to a grayscale image of Single, all you need to do is |
− | + | <source lang="csharp"> | |
− | Image | + | Image<Gray, Single> img2 = img1.Convert<Gray, Single>(); |
− | + | </source> | |
== Displaying Image == | == Displaying Image == | ||
Line 154: | Line 146: | ||
===Converting to Bitmap=== | ===Converting to Bitmap=== | ||
− | The Image class has a | + | The Image class has a <code>ToBitmap()</code> function that return a Bitmap object, which can easily be displayed on a PictureBox control using Windows Form. |
==XML Serialization== | ==XML Serialization== | ||
===Why do I care?=== | ===Why do I care?=== | ||
− | One of the future of Emgu CV is that Image | + | One of the future of Emgu CV is that Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> can be XML serializated. You might ask why we need to serialization an Image. The answer is simple, we wants to use it in a web service! |
− | Since the Image | + | Since the Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> class implements ISerializable, when you work in WCF (Windows Communication Fundation), you are free to use Image<[[#Image Color |TColor]], [[#Image Depth |TDepth]]> type as parameters or return value of a web service. |
− | This will be ideal, for example, if you are building a cluster of computers to recognize different groups of object and have a central computer to coordinate the tasks. I will also be useful if your wants to implement remote monitoring software that constantly query image from a remote server, which use the | + | This will be ideal, for example, if you are building a cluster of computers to recognize different groups of object and have a central computer to coordinate the tasks. I will also be useful if your wants to implement remote monitoring software that constantly query image from a remote server, which use the <code>Capture</code> class in [[Emgu CV]] to capture images from camera. |
===Conversion to XML=== | ===Conversion to XML=== | ||
− | You can use the following code to convert an | + | You can use the following code to convert an <code>Image<Bgr, Byte> image</code> to <code>XmlDocument</code>: |
− | + | <source lang="csharp"> | |
StringBuilder sb = new StringBuilder(); | StringBuilder sb = new StringBuilder(); | ||
− | (new XmlSerializer(typeof(Image | + | (new XmlSerializer(typeof(Image<Bgr, Byte>))).Serialize(new StringWriter(sb), o); |
XmlDocument xDoc = new XmlDocument(); | XmlDocument xDoc = new XmlDocument(); | ||
xDoc.LoadXml(sb.ToString()); | xDoc.LoadXml(sb.ToString()); | ||
− | + | </source> | |
===Conversion from XML=== | ===Conversion from XML=== | ||
− | You can use the following code to convert a | + | You can use the following code to convert a <code>XmlDocument xDoc</code> to <code>Image<Bgr,Byte></code> |
− | + | <source lang="csharp"> | |
− | Image | + | Image<Bgr, Byte> image = (Image<Bgr, Byte>) |
− | (new XmlSerializer(typeof(Image | + | (new XmlSerializer(typeof(Image<Bgr, Byte>))).Deserialize(new XmlNodeReader(xDoc)); |
− | + | </source> |
Revision as of 09:52, 26 November 2010
Depth and Color as Generic Parameter
An Image is defined by its generic parameters: color and depth. To create a 8bit unsigned Grayscale image, in Emgu CV it is done by calling
Image<Gray, Byte> image = new Image<Gray, Byte>( width, height);
Not only this syntax make you aware the color and the depth of the image, it also restrict the way you use functions and capture errors in compile time. For example, the SetValue(TColor color, Image<Gray, Byte> mask)
function in Image<TColor, TDepth> class (version >= 1.2.2.0) will only accept colors of the same type, and mask has to be an 8-bit unsigned grayscale image. Any attempts to use a 16-bit floating point or non-grayscale image as a mask will results a compile time error!
Creating Image
Although it is possible to create image by calling CvInvoke.cvCreateImage
, it is suggested to construct a Image<TColor, TDepth> object instead. There are several advantages using the managed Image<TColor, TDepth> class
- Memory is automatically released by the garbage collector
- Image<TColor, TDepth> class can be examined by Debugger Visualizer
- Image<TColor, TDepth> class contains advanced method that is not available on OpenCV, for example, generic operation on image pixels, conversion to Bitmap etc.
Image Color
The first generic parameter of the Image class specific the color of the image type. For example
Image<Gray, ...> img1;
indicates that img1
is a single channel grayscale image.
Color Types supported in Emgu CV 1.3.0.0 includes:
- Gray
- Bgr (Blue Green Red)
- Bgra (Blue Green Red Alpha)
- Hsv (Hue Saturation Value)
- Hls (Hue Lightness Saturation)
- Lab (CIE L*a*b*)
- Luv (CIE L*u*v*)
- Xyz (CIE XYZ.Rec 709 with D65 white point)
- Ycc (YCrCb JPEG)
Image Depth
Image Depth is specified using the second generic parameter Depth
.
The types of depth supported in Emgu CV 1.4.0.0 include
- Byte
- SByte
- Single (float)
- Double
- UInt16
- Int16
- Int32 (int)
Creating a new image
To create an 480x320 image of Bgr color and 8-bit unsigned depth. The code in C# would be
Image<Bgr, Byte> img1 = new Image<Bgr, Byte>(480, 320);
If you wants to specify the background value of the image, let's say in Blue. The code in C# would be
Image<Bgr, Byte> img1 = new Image<Bgr, Byte>(480, 320, new Bgr(255, 0, 0));
Reading image from file
Creating image from file is also simple. If the image file is "MyImage.jpg", in C# it is
Image<Bgr, Byte> img1 = new Image<Bgr, Byte>("MyImage.jpg");
Creating image from Bitmap
It is also possible to create an Image<TColor, TDepth> from a .Net Bitmap object. The code in C# would be
Image<Bgr, Byte> img = new Image<Bgr, Byte>(bmp); //where bmp is a Bitmap
Automatic Garbage Collection
The Image<TColor, TDepth> class automatically take care of the memory management and garbage collection.
Once the garbage collector decided that there is no more reference to the Image<TColor, TDepth> object, it will call the Disposed
method, which release the unmanaged IplImage structure.
The time of when garbage collector decides to dispose the image is not guaranteed. When working with large image, it is recommend to call the Dispose()
method to explicitly release the object. Alternatively, use the using keyword in C# to limit the scope of the image
using (Image<Gray, Single> image = new Image<Gray, Single>(1000, 800))
{
... //do something here in the image
} //The image will be disposed here and memory freed
Getting or Setting Pixels
The safe (slow) way
- Suppose you are working on an Image<Bgr, Byte>. You can obtain the pixel on the y-th row and x-th column by calling
Bgr color = img[y, x];
- Setting the pixel on the y-th row and x-th column is also simple
img[y,x] = color;
The fast way
- The Image pixels values are stored in the Data property, a 3D array. Use this property if you need to iterate through the pixel values of the image.
Methods
Naming Convention
- Method
XYZ
in Image<TColor, TDepth> class corresponds to the OpenCV functioncvXYZ
. For example, Image<TColor, TDepth>.Not() function corresponds tocvNot
function with the resulting image being returned. - Method
_XYZ
is usually the same as MethodXYZ
except that the operation is performed inplace rather than returning a value. For example, Image<TColor, TDepth>._Not() function performs the bit-wise inversion inplace.
Operators Overload
The operators + - * /
has been overloaded (version > 1.2.2.0) such that it is perfectly legal to write codes like:
Image<Gray, Byte> image3 = (image1 + image2 - 2.0) * 0.5;
Generic Operation
One of the advantage of using Emgu CV is the ability to perform generic operations.
It's best if I demonstrate this with an example. Suppose we have an grayscale image of bytes
Image<Gray, Byte> img1 = new Image<Gray, Byte>(400, 300, new Gray(30));
To invert all the pixels in this image we can call the Not function
Image<Gray, Byte> img2 = img1.Not();
As an alternative, we can also use the generic method Convert
available from the Image<TColor, TDepth> class
Image<Gray, Byte> img3 = img1.Convert<Byte>( delegate(Byte b) { return (Byte) (255-b); } );
The resulting image img2
and img3
contains the same value for each pixel.
At first glance it wouldn't seems to be a big gain when using generic operations. In fact, since OpenCV already has an implementation of the Not
function and performance-wise it is better than the generic version of the equivalent Convert
function call. However, there comes to cases when generic functions provide the flexibility with only minor performance penalty.
Let's say you have an Image<Gray, Byte> img1
with pixels set. You wants to create a single channel floating point image of the same size, where each pixel of the new image, correspond to the old image, described with the following delegate
delegate(Byte b) { return (Single) Math.cos( b * b / 255.0); }
This operation can be completed as follows in Emgu CV
Image<Gray, Single> img4 = img1.Convert<Single>( delegate(Byte b) { return (Single) Math.cos( b * b / 255.0); } );
The syntax is simple and meaningful. On the other hand, this operation in OpenCV is hard to perform since equivalent function such as Math.cos
is not available.
Drawing Objects on Image
The Draw( )
method in Image< Color, Depth> can be used to draw different types of objects, including fonts, lines, circles, rectangles, boxes, ellipses as well as contours. Use the documentation and intellisense as a guideline to discover the many functionality of the Draw
function.
Color and Depth Conversion
Converting an Image<TColor, TDepth> between different colors and depths are simple. For example, if you have Image<Bgr, Byte> img1
and you wants to convert it to a grayscale image of Single, all you need to do is
Image<Gray, Single> img2 = img1.Convert<Gray, Single>();
Displaying Image
Using ImageBox
Emgu CV recommends the use of ImageBox control for display purpose, for the following reasons
- ImageBox is a high performance control for displaying image. Whenever possible, it displays a Bitmap that shares memory with the Image object, therefore no memory copy is needed (very fast).
- The user will be able to examine the image pixel values, video frame rates, color types when the image is being displayed.
- It is convenient to perform simple image operations with just a few mouse clicks.
Converting to Bitmap
The Image class has a ToBitmap()
function that return a Bitmap object, which can easily be displayed on a PictureBox control using Windows Form.
XML Serialization
Why do I care?
One of the future of Emgu CV is that Image<TColor, TDepth> can be XML serializated. You might ask why we need to serialization an Image. The answer is simple, we wants to use it in a web service!
Since the Image<TColor, TDepth> class implements ISerializable, when you work in WCF (Windows Communication Fundation), you are free to use Image<TColor, TDepth> type as parameters or return value of a web service.
This will be ideal, for example, if you are building a cluster of computers to recognize different groups of object and have a central computer to coordinate the tasks. I will also be useful if your wants to implement remote monitoring software that constantly query image from a remote server, which use the Capture
class in Emgu CV to capture images from camera.
Conversion to XML
You can use the following code to convert an Image<Bgr, Byte> image
to XmlDocument
:
StringBuilder sb = new StringBuilder();
(new XmlSerializer(typeof(Image<Bgr, Byte>))).Serialize(new StringWriter(sb), o);
XmlDocument xDoc = new XmlDocument();
xDoc.LoadXml(sb.ToString());
Conversion from XML
You can use the following code to convert a XmlDocument xDoc
to Image<Bgr,Byte>
Image<Bgr, Byte> image = (Image<Bgr, Byte>)
(new XmlSerializer(typeof(Image<Bgr, Byte>))).Deserialize(new XmlNodeReader(xDoc));