Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features

March, 2013 Agric Eng Int: CIGR Journal Open access at http://www.cigrjournal.org Vol. 15, No.1 211 Detection of unhealthy region of plant leaves ...
Author: Gerard Elliott
2 downloads 1 Views 371KB Size
March, 2013

Agric Eng Int: CIGR Journal

Open access at http://www.cigrjournal.org

Vol. 15, No.1 211

Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features S. Arivazhagan, R. Newlin Shebiah*, S. Ananthi, S. Vishnu Varthini (Department of Electronics and Communication Engineering, Mepco Schlenk Engineering College, Sivakasi Tamilnadu, 626 005, India) Abstract: Plant diseases have turned into a dilemma as it can cause significant reduction in both quality and quantity of agricultural products. Automatic detection of plant diseases is an essential research topic as it may prove benefits in monitoring large fields of crops, and thus automatically detect the symptoms of diseases as soon as they appear on plant leaves. The proposed system is a software solution for automatic detection and classification of plant leaf diseases.

The developed

processing scheme consists of four main steps, first a color transformation structure for the input RGB image is created, then the green pixels are masked and removed using specific threshold value followed by segmentation process, the texture statistics are computed for the useful segments, finally the extracted features are passed through the classifier. The proposed algorithm’s efficiency can successfully detect and classify the examined diseases with an accuracy of 94%. Experimental results on a database of about 500 plant leaves confirm the robustness of the proposed approach. Keywords: HSI, color co-occurrence matrix, texture, SVM, plant leaf diseases Citation: S.Arivazhagan, R. Newlin Shebiah, S.Ananthi, S.Vishnu Varthini. 2013. leaves and classification of plant leaf diseases using texture features.

Detection of unhealthy region of plant

Agric Eng Int: CIGR Journal, 15(1): 211-217.

Introduction

plant diseases were devastating, some of the crop

Images form important data and information in

plant disease losses in Georgia (USA) is approximately

biological sciences. Digital image processing and image

$653.06 million (Jean, 2009). In India no estimation has

analysis

in

been made but it is more than USA because the preventive

microelectronics and computers has many applications in

steps taken to protect our crops are not even one-tenth of

biology and it circumvents the problems that are

that in USA.

1

technology

cultivation has been abandoned. It is estimated that 2007

based

on

the

advances

associated with traditional photography. This new tool

The naked eye observation of experts is the main

helps to improve the images from microscopic to

approach

telescopic range and also offers a scope for their analysis.

identification of plant diseases.

It, therefore, has many applications in biology (Rastogi

continuous monitoring of experts which might be

and Chadda, 1989).

prohibitively expensive in large farms. Further, in some

.

developing countries, farmers may have to go long

Plant diseases cause periodic outbreak of diseases

which leads to large scale death and famine.

adopted

in

practice

for

detection

and

But, this requires

It is

distances to contact experts, this makes consulting experts

estimated that the outbreak of helminthosporiose of rice in

too expensive and time consuming (Al-Hiary et al., 2011)

north eastern India in 1943 caused a heavy loss of food

and moreover farmers are unaware of non-native diseases.

grains and death of a million people. Since the effects of

Automatic detection of plant diseases is an important research topic as it may prove benefits in monitoring large

Received date: 2012-11-23 Accepted date: 2013-02-21 * Corresponding author: R. Newlin Shebiah, Email: [email protected].

fields of crops, and thus automatically detect the diseases from the symptoms that appear on the plant leaves. This enables machine vision that is to provide image based

212

March

Agric Eng Int: CIGR Journal

Open access at http://www.cigrjournal.org

Vol. 15, No.1

automatic inspection, process control and robot guidance.

may be broadly classified into three types.

Comparatively, visual identification is labor intensive, less

bacterial, fungal and viral diseases.

accurate and can be done only in small areas. Kim et al. (2009) have classified the grape fruit peel diseases using color texture features analysis.

2

They are

Proposed methodology First, the images of various leaves are acquired using a

The

texture features are calculated from the Spatial Gray-level

digital camera.

Dependence Matrices (SGDM) and the classification is

applied to the acquired images to extract useful features

done using squared distance technique. Grape fruit peel

that are necessary for further analysis. After that, several

might be infected by several diseases like canker, copper

analytical techniques are used to classify the images

burn, greasy spot, melanose and wind scar (Kim et al.,

according to the specific problem at hand.

2009).

depicts the basic procedure of the proposed vision-based

Helly et al. (2003) developed a new method in which

Then image-processing techniques are

Figure 1

detection algorithm in this paper.

Hue Saturation Intensity (HIS) - transformation is applied to the input image, then it is segmented using Fuzzy C-mean algorithm. Feature extraction stage deals with the color, size and shape of the spot and finally classification is done using neural networks (Helly et al.,

Figure 1 Block diagram of proposed approach

2003). Real time specific weed discrimination technique using multilevel wavelet decomposition was proposed by Siddiqil et al. (2009). In this histogram equalization is

In the initial step, the RGB images of all the leaf samples were picked up.

used for preprocessing. Features are extracted from

The step-by-step procedure of the proposed system:

wavelet decomposition and finally classified by Euclidean

1) RGB image acquisition;

distance method (Siddiqil et.al, 2009)

2) Convert the input image from RGB to HSI format;

Al-Bashish et al. (2011) developed a fast and accurate

3) Masking the green-pixels;

method in which the leaf diseases are detected and

4) Removal of masked green pixels;

classified using k-means based segmentation and neural

5) Segment the components;

networks based classification.

6) Obtain the useful segments;

Automatic classification

of leaf diseases is done based on high resolution multispectral and stereo images (Bauer et al., 2011).

7) Computing the texture features using Color-CoOccurrence methodology;

Sugar beet leaves are used in this approach. Segmentation is the process that is carried out to extract the diseased region and the plant diseases are

8) Configuring the Neural Networks for Recognition. 2.1

Color transformation structure First, the RGB images of leaves are converted into HSI

graded by calculating the quotient of disease spot and leaf

color space representation.

areas. An optimal threshold value for segmentation can

space is to facilitate the specification of colors in some

be obtained using weighted Parzen-window (Jun and

standard, generally accepted way. HSI (hue, saturation,

Wang, 2008).

This reduces the computational burden

intensity) color model is a popular color model because it

and storage requirements without degrading the final

is based on human perception (Gonzalez and Woods,

segmentation results.

2008). Hue is a color attribute that refers to the dominant

The purpose of the color

In this paper, detection and classification of leaf

color as perceived by an observer. Saturation refers to

diseases has been proposed, this method is based on

the relative purity or the amount of white light added to

masking and removing of green pixels, applying a specific

hue and intensity refers to the amplitude of the light.

threshold to extract the infected region and computing the

Color spaces can be converted from one space to another

texture statistics to evaluate the diseases. Plant diseases

easily.

After the transformation process, the H

March, 2013 Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features Vol. 15, No.1 213

component is taken into account for further analysis. S

information.

and I components are dropped since it does not give extra

components.

a. Input image infected by bacterial brown spot

b. Hue component

Figure 2 shows the H, S and I

c. Saturation component

d. Intensity component

Figure 2 HSV components of a image infected by brown spots

2.1.1

gray-levels occur in relation to other gray levels (Argenti

Masking green pixels

In this step we identify the mostly green colored

et al., 2008).

These matrices measure the probability

After that, based on specified threshold value

that a pixel at one particular gray level will occur at a

that is computed for these pixels, the mostly green pixels

distinct distance and orientation from any pixel given that

are masked as follows: if the green component of the

pixel has a second particular gray level.

pixel intensity is less than the pre-computed threshold

are represented by the function P(i, j, d, θ) where i

value, the red, green and blue components of the this

represent the gray level of the location (x, y), and j

pixel is assigned to a value of zero.

represents the gray level of the pixel at a distance d from

pixels.

This is done in

sense that the green colored pixels mostly represent the

location (x, y) at an orientation angle of θ.

healthy areas of the leaf and they do not add any valuable

generated for H image.

weight to disease identification.

2.4

Furthermore this

In this step, the pixels with zeros red, green, blue gives

more

accurate

disease

This is helpful as it classification

and

significantly reduces the processing time. 2.2

From the above steps, the infected portion of the leaf The infected region is then segmented into

a number of patches of equal size.

The size of the patch

is chosen in such a way that the significant information is not lost. taken. Not

computed for the H image as given in Equations (1) to (5).

Contrast   i , j  0 (i, j ) 2 C (i, j )

(1)

Energy   i , j  0 C (i, j ) 2

(2)

N 1

N 1

Segmentation:

is extracted.

Texture features

homogeneity, Cluster shade and cluster prominence are

Removing the masked cells

values were completely removed.

In this approach patch size of 32×32pixels is

Local Homogeneity   i , j  0 C (i, j ) / (1  (i  j )2 )

(3)

Cluster Shade   i , j  0 (i  M x  j  M y )3 C (i, j )

(4)

N 1

N 1

Cluster Prominence   i , j  0 (i  M x  j  M y )4 C (i, j ) N 1

(5)

The next step is to extract the useful segments. all

segments

contain

significant

amount

From the texture features, the plant diseases are

of

information. So the patches which are having more than

classified into various types.

fifty percent of the information are taken into account for

2.5

the further analysis.

2.5.1 Minimum distance criterion

2.3

SGDM’s are

Texture features like Contrast, Energy, Local

significantly reduces the processing time. 2.1.2

The SGDM’s

Classifier

In the classification phase, the co-occurrence features

Color co-occurrence method The color co-occurrence texture analysis method is

for the leaves are extracted and compared with the

The gray level

corresponding feature values stored in the feature library.

co-occurrence methodology is a statistical way to

The classification is first done using the Minimum

describe shape by statistically sampling the way certain

Distance Criterion - (Arivazhagan et al., 2010).

developed through the SGDM.

The

214

March

Agric Eng Int: CIGR Journal

Open access at http://www.cigrjournal.org

Vol. 15, No.1

success of classification is measured using the classification

Classification of new instances for the one-versus-all case

gain (G) and is calculated using Equation (6).

is done by a winner-takes-all strategy, in which the

G (%) 

Ccorr  100 M

(6)

where, Ccorr is the number of images correctly classified and M is the total number of images belonging to the

class (Chamasemani and Singh, 2011).

3

Results and discussion About 500 plant leaves of 30 different native plant

particular texture group. 2.5.2

classifier with the highest output function assigns the

species of Tamil Nadu have been collected for our

SVM classifier

Support vector machines (SVMs) are a set of related

approach.

The acquired leaf images are converted into

supervised learning methods used for classification and

HSI format.

regression.

Supervised learning involves analyzing a

energy, local homogeneity, shade and prominence are

given set of labeled observations (the training set) so as to

derived from the co-occurrence matrix. With these set

predict the labels of unlabelled future data (the test set).

of co-occurrence features the plant diseases are detected.

Specifically, the goal is to learn some function that

Samples of leaves with various diseases like early scorch,

describes the relationship between observations and their

yellow spots, brown spots, late scorch, bacterial and

labels (Chi & Lin, 2002).

fungal diseases are shown in Figure 3.

More formally, a support

The co-occurrence features like contrast,

vector machine constructs a hyper plane or set of hyper planes in a high- or infinite-dimensional space, which can be used for classification, regression, or other tasks. Intuitively, a good separation is achieved by the hyper plane that has the largest distance to the nearest training data point of any class (so-called functional margin), in

a. Bacterial disease in rose and beans leaf

general the larger the functional margin the lower the generalization error of the classifier. In the case of support vector machines, a data point is viewed as a p-dimensional vector (a list of p numbers), and we want to know whether we can separate such points with a (p − 1)-dimensional hyper plane.

This is

b. Sun burn disease in lemon leaf

c. Early scorch disease in banana leaf

d. Late scorch disease in beans leaf

e. Fungal disease in beans leaf

called a linear classifier. There are many hyper planes that might classify the data.

One reasonable choice as

the best hyper plane is the one that represents the largest separation, or margin, between the two classes.

So we

choose the hyper plane so that the distance from it to the nearest data point on each side is maximized.

Figure 3

Sample images of infected leaves

Multiclass SVM aims to assign labels to instances by using support vector machines, where the labels are drawn from a finite set of several elements.

The

As a sample, a rose leaf that is infected by bacterial disease is given as input to the algorithm.

Color

dominant approach for doing so is to reduce the single

transformation structure on the input image is performed.

multiclass problem into multiple binary classification

Then the green pixels are masked and removed using a

problems.

Common methods for such reduction include:

specific threshold value. Then the R, G, B components

building binary classifiers which distinguish between (i)

are mapped to the thresholded image. These steps are

one of the labels and the rest (one-versus-all) or (ii)

shown in Figure 4.

between

affected by various diseases.

every

pair

of

classes

(one-versus-one).

Table 1 lists the set of leaves that are

March, 2013 Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features Vol. 15, No.1 215

a. Input image

b. Hue content

Figure 4

c. Threshold image

d. R component mapped output

Detection of infected region for a rose leaf

Table 1 Detected diseased region of various leaves Plant species

Input image

Hue content

Thresholded image

R component mapped output

Beans

Lemon

Banana

Guava

After mapping the R, G, B components of the input image to the thresholded image, the co-occurrence features are calculated.

used for training, testing and classification gain for each type of leaves is shown in Table 2.

The co-occurrence features for

The classification gain obtained by Minimum

the leaves are extracted and compared with the

Distance Criterion is 86.77%. The detection accuracy is

corresponding feature values stored in the feature library.

improved to 94.74% by SVM classifier.

The classification is first done using the Minimum

and the testing sets for each type of leaf along with their

Distance Criterion.

detection accuracy is shown in Table 2.

The leaf images are divided into

training and testing set, where 5% of the leaf images from each group are used to train the system and the remaining images serves as the testing set.

The number of images

The training

From the results it can be seen that the detection accuracy is enhanced with SVM classifier. The two class problem is then extended to multiclass

216

March

Agric Eng Int: CIGR Journal

Open access at http://www.cigrjournal.org

Vol. 15, No.1

problem where the detected leaf diseases are then

testing sets for each type of leaf disease along with their

classified into various categories.

detection accuracy is shown in Table 3.

Table 2

Training and the

Comparison of results by minimum distance classifier and support vector machine Detection accuracy/%

No. of images used for training

No. of images used for testing

Banana

10

10

82.25

90

Beans

10

12

96.43

91.66 92.86

Plant species

MDC

SVM

Guava

10

14

78.95

Jackfruit

10

10

82.35

100

Lemon

10

30

94.4

96.66

Mango

10

17

80

94.12

Potato

10

10

78.57

100

Tomato

10

27

95.24

92.59

86.77

94.74

Overall accuracy

Table 3 Plant Species Banana

Beans

Guava

Jackfruit

Lemon

Mango

Potato

Sapota

Tomato

Results of leaf disease recognition system

Category

No. of Images used for Training

No. of Images used for Testing

Good

5

6

Late scorch

5

7

Good

4

9

Bacterial spot

2

4

Fungal spot

4

11

Good

5

7

Chocolate spot

5

7

Good

4

5

Bacterial disease

4

3

Fungal disease

2

2

Good

4

5

Bacterial disease

4

20

Sun burn

2

3 6

Good

3

Bacterial disease

3

4

Sooty mold

4

8

Good

4

7

Early blight

3

9

Late blight

3

12

Good

4

5

Scorch

3

3

Ashen mold

3

2

Good

4

5

Bacterial disease

4

19

Leaf lesion

2

4

Detection Accuracy 84.60%

87.50%

92.86%

90%

82.14%

83.33%

96.43%

80%

82.15%

Overall Accuracy

4 Conclusion

87.66%

lemon, mango, potato, tomato, and sapota.

The diseases

specific to those plants were taken for our approach.

An application of texture analysis in detecting and

The experimental results indicate the proposed approach

classifying the plant leaf diseases has been explained in

can recognize and classify the leaf diseases with a little

this paper. Thus the proposed algorithm was tested on

computational effort.

ten species of plants namely banana, beans, jackfruit,

can be identified at the initial stage itself and the pest

By this method, the plant diseases

March, 2013 Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features Vol. 15, No.1 217

control tools can be used to solve pest problems while

deletion), also the taken feature identification vectors

minimizing risks to people and the environment.

need to further optimized.

The

In order to improve disease

reasons for misclassification are as follows: the

identification rate at various stages, the training samples

symptoms of the diseased plant leaves vary (at the

can be increased and shape feature and color feature

beginning, tiny, dark brown to black spots, at later time, it

along with the optimal features can be given as input

has the phenomena of withered leaf, black or part leaf

condition of disease identification.

References Al-Bashish, D., M. Braik, and S. Bani-Ahmad. 2011. Detection and classification of leaf diseases using K-means-based segmentation and neural networks based classification. Information Technology Journal, 10(2): 267-275. 2011.

Fast and accurate detection and

classification of plant diseases.

International Journal of

Computer Applications, 17(1): 31-38. Fast algorithms

for texture analysis using co-occurrence matrices.

IEEE

proceedings, 137, (6): 443-448. 2010.

Classification of citrus and non-citrus fruits Computing Communication and

Networking Technologies, ICCCNT-2010. Bauer, S. D., F. Korc, W. Forstner.

An

and diagnosis. in Proc. IICAI,

pp.1182-1195.

Jean, W. 2009. Extension plant pathologist, georgia plant disease loss estimates, www.caes.uga.edu/publications. parzen window estimation.

Journal of Applied Sciences,

8(5):772-779. Kim, D. G., T. F. Burks, J. Qin, and D. M. Bulanon.

Arivazhagan, S., R. N. Shebiah, S. S. Nidhyanandhan, and L. using texture features.

Neural Networks, 13(2): 415-425. Helly, M. E., A. Rafea, and Salwa-El-Gammal. 2003.

Jun, W., and S. Wang. 2008. Image thresholding using weighted

Argenti, F., L. Alparone, and G. Benelli. 1990.

Ganesan.

IEEE Transactions on

integrated image processing system for leaf disease detection

Al-Hiary, H., S. Bani-Ahmad, M. Reyalat, M. Braik, and Z. AlRahamneh.

multiclass support vector machines.

2011.

Classification of grapefruit peel diseases using color texture feature analysis.

International Journal on Agriculture and

Biological Engineering, 2(3): 41-50. Rastogi, R., and V. K. Chadda.

The potential of

automatic methods of classification to identify leaf diseases from multispectral images. Precision Agriculture, 12: 361-377.

2009.

Applications of image

processing in biology and agriculture J. K. Sainis, Molecular Biology and Agriculture Division, BARC newsletter. Siddiqil, M. H., S. Sulaiman, I. Faye, and I. Ahmad.

2009. A

Chamasemani, F. F., and Y. P. Singh. Malaysia, Bio-Inspired

real time specific weed discrimination system using multi-level

Computing: Theories and Applications (BIC-TA), 2011 Sixth

wavelet decomposition. International Journal of Agriculture

International Conference.

Faculty of Information Technology,

MultiMedia University. Cyberjaya, Malaysia. Chih-Wei, H., and C. Lin.

2002. A comparison of methods for

and Biology, 11(5): 559-565. Gonzalez, R., R. E. Woods. 2008 Digital image processing. Third edition, Pearson Education, Prentice-Hall, Inc.

Suggest Documents