Data
wilt

wilt

active ARFF Publicly available Visibility: public Uploaded 01-06-2015 by Rafael G. Mantovani
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • study_14 study_1 study_1396 study_1782 study_2981 study_3013 study_4542 study_5657 study_8366 study_12660 study_12756 study_947 study_3858 study_6622 study_8366 study_1081 study_1544 study_4677 study_4948 study_8366 study_10410 study_11425 study_1981 study_2751 study_3484 study_6835 study_8366 study_10740 study_11684 study_402 study_1256 study_1876 study_4630 study_6997 study_7135 study_8366 study_422 study_705 study_8366 study_754 study_1212 study_1256 study_2225 study_7415 study_10340 study_11425 study_360 study_2637 study_4562 study_7329 study_10930 study_11684 study_237 study_504 study_638 study_658 study_1212 study_2079 study_4890 study_6861 study_11237 study_3504 study_6835 study_202 study_266 study_2751 study_3940 study_4613 study_11425 study_137 study_1061 study_1505
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Author: Brian Johnson Source: [UCI] (https://archive.ics.uci.edu/ml/datasets/Wilt) Please cite: Johnson, B., Tateishi, R., Hoan, N., 2013. A hybrid pansharpening approach and multiscale object-based image analysis for mapping diseased pine and oak trees. International Journal of Remote Sensing, 34 (20), 6969-6982. * Dataset: Wilt Data Set * Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from stratified random sample of image. * Source: Brian Johnson; Institute for Global Environmental Strategies; 2108-11 Kamiyamaguchi, Hayama, Kanagawa,240-0115 Japan; Email: Johnson '@' iges.or.jp * Data Set Information: This data set contains some training and testing data from a remote sensing study by Johnson et al. (2013) that involved detecting diseased trees in Quickbird imagery. There are few training samples for the 'diseased trees' class (74) and many for 'other land cover' class (4265). The data set consists of image segments, generated by segmenting the pansharpened image. The segments contain spectral information from the Quickbird multispectral image bands and texture information from the panchromatic (Pan) image band. The testing data set is for the row with “Segmentation scale 15” segments and “original multi-spectral image” Spectral information in Table 2 of the reference (i.e. row 5). Please see the reference below for more information on the data set, and please cite the reference if you use this data set. Enjoy! * Attribute Information: class: 'w' (diseased trees), 'n' (all other land cover) GLCM_Pan: GLCM mean texture (Pan band) Mean_G: Mean green value Mean_R: Mean red value Mean_NIR: Mean NIR value SD_Pan: Standard deviation (Pan band) * Relevant Papers: Johnson, B., Tateishi, R., Hoan, N., 2013. A hybrid pansharpening approach and multiscale object-based image analysis for mapping diseased pine and oak trees. International Journal of Remote Sensing, 34 (20), 6969-6982.

6 features

Class (target)nominal2 unique values
0 missing
V1numeric4777 unique values
0 missing
V2numeric4234 unique values
0 missing
V3numeric4145 unique values
0 missing
V4numeric4646 unique values
0 missing
V5numeric4802 unique values
0 missing

107 properties

4839
Number of instances (rows) of the dataset.
6
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
5
Number of numeric attributes.
1
Number of nominal attributes.
Average mutual information between the nominal attributes and the target attribute.
0.3
Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes
70.39
First quartile of means among attributes of the numeric type.
0.95
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1
0.02
Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.03
Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3
0.79
Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001
An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.
1
Number of binary attributes.
First quartile of mutual information between the nominal attributes and the target attribute.
0.02
Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1
0.8
Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3
0.92
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001
2
Average number of distinct values among the attributes of the nominal type.
-0.28
First quartile of skewness among attributes of the numeric type.
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1
0.93
Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0
Standard deviation of the number of distinct values among attributes of the nominal type.
0.02
Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001
3.37
Mean skewness among attributes of the numeric type.
12.09
First quartile of standard deviation of attributes of the numeric type.
0.95
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2
0.02
Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.65
Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk
0.79
Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001
61.21
Mean standard deviation of attributes of the numeric type.
Second quartile (Median) of entropy among attributes.
0.02
Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2
0.8
Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.05
Error rate achieved by the landmarker weka.classifiers.lazy.IBk
94.61
Percentage of instances belonging to the most frequent class.
Minimal entropy among attributes.
17.19
Second quartile (Median) of kurtosis among attributes of the numeric type.
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2
0.3
Entropy of the target attribute values.
0.4
Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk
4578
Number of instances belonging to the most frequent class.
-0.02
Minimum kurtosis among attributes of the numeric type.
126.86
Second quartile (Median) of means among attributes of the numeric type.
0.95
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3
0.67
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump
Maximum entropy among attributes.
24.48
Minimum of means among attributes of the numeric type.
Second quartile (Median) of mutual information between the nominal attributes and the target attribute.
0.02
Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3
0.05
Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump
119.74
Maximum kurtosis among attributes of the numeric type.
Minimal mutual information between the nominal attributes and the target attribute.
2.5
Second quartile (Median) of skewness among attributes of the numeric type.
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3
0
Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump
525.8
Maximum of means among attributes of the numeric type.
2
The minimal number of distinct values among attributes of the nominal type.
16.67
Percentage of binary attributes.
62.01
Second quartile (Median) of standard deviation of attributes of the numeric type.
0.87
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1
0
Number of attributes divided by the number of instances.
Maximum mutual information between the nominal attributes and the target attribute.
-0.75
Minimum skewness among attributes of the numeric type.
0
Percentage of instances having missing values.
Third quartile of entropy among attributes.
0.03
Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1
Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.
2
The maximum number of distinct values among attributes of the nominal type.
10.73
Minimum standard deviation of attributes of the numeric type.
0
Percentage of missing values.
107.68
Third quartile of kurtosis among attributes of the numeric type.
0.97
Average class difference between consecutive instances.
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1
0.92
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001
7.73
Maximum skewness among attributes of the numeric type.
156.58
Maximum standard deviation of attributes of the numeric type.
5.39
Percentage of instances belonging to the least frequent class.
83.33
Percentage of numeric attributes.
378.61
Third quartile of means among attributes of the numeric type.
0.93
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.87
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2
0.02
Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001
Average entropy of the attributes.
261
Number of instances belonging to the least frequent class.
16.67
Percentage of nominal attributes.
Third quartile of mutual information between the nominal attributes and the target attribute.
0.02
Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.03
Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2
0.79
Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001
48.01
Mean kurtosis among attributes of the numeric type.
0.84
Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes
First quartile of entropy among attributes.
7.46
Third quartile of skewness among attributes of the numeric type.
0.8
Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.75
Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2
0.92
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001
204.97
Mean of means among attributes of the numeric type.
0.11
Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes
3.75
First quartile of kurtosis among attributes of the numeric type.
109.92
Third quartile of standard deviation of attributes of the numeric type.
0.93
Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
0.87
Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3
0.02
Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

11 tasks

3 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class
0 runs - estimation_procedure: Test on Training Data - target_feature: Class
0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - target_feature: Class
0 runs - estimation_procedure: 10% Holdout set - target_feature: Class
0 runs - estimation_procedure: 33% Holdout set - target_feature: Class
0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - target_feature: Class
0 runs - estimation_procedure: 20% Holdout (Ordered) - target_feature: Class
0 runs - estimation_procedure: Leave one out - target_feature: Class
0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: Class
0 runs - estimation_procedure: 10 times 10-fold Learning Curve - target_feature: Class
0 runs - estimation_procedure: Interleaved Test then Train - target_feature: Class
Define a new task