OpenML


segment

active ARFF Publicly available Visibility: public Uploaded 06-04-2014 by Jan van Rijn

study_14 study_1 study_15 study_17 study_29 study_32 study_48 study_69 study_90 study_92 study_94 study_96 study_98 study_127 study_141 study_169 study_195 study_429 study_105 study_441 study_2 study_310 study_110 study_212 study_66


Author:
Source: Unknown
Please cite:

1. Title: Image Segmentation data

2. Source Information
   -- Creators: Vision Group, University of Massachusetts
   -- Donor: Vision Group (Carla Brodley, brodley@cs.umass.edu)
   -- Date: November, 1990

3. Past Usage: None yet published

4. Relevant Information: The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region.

5. Number of Instances: Training data: 210. Test data: 2100.

6. Number of Attributes: 19 continuous attributes

7. Attribute Information:
   1. region-centroid-col: the column of the center pixel of the region.
   2. region-centroid-row: the row of the center pixel of the region.
   3. region-pixel-count: the number of pixels in a region = 9.
   4. short-line-density-5: the result of a line extraction algorithm that counts how many lines of length 5 (any orientation) with low contrast, less than or equal to 5, go through the region.
   5. short-line-density-2: same as short-line-density-5 but counts lines of high contrast, greater than 5.
   6. vedge-mean: measures the contrast of horizontally adjacent pixels in the region. There are 6 such pairs; the mean and standard deviation are given. This attribute is used as a vertical edge detector.
   7. vegde-sd: (see 6)
   8. hedge-mean: measures the contrast of vertically adjacent pixels. Used for horizontal line detection.
   9. hedge-sd: (see 8)
   10. intensity-mean: the average over the region of (R + G + B)/3
   11. rawred-mean: the average over the region of the R value.
   12. rawblue-mean: the average over the region of the B value.
   13. rawgreen-mean: the average over the region of the G value.
   14. exred-mean: measures the excess red: (2R - (G + B))
   15. exblue-mean: measures the excess blue: (2B - (G + R))
   16. exgreen-mean: measures the excess green: (2G - (R + B))
   17. value-mean: 3-d nonlinear transformation of RGB. (Algorithm can be found in Foley and VanDam, Fundamentals of Interactive Computer Graphics)
   18. saturation-mean: (see 17)
   19. hue-mean: (see 17)

8. Missing Attribute Values: None

9. Class Distribution: Classes: brickface, sky, foliage, cement, window, path, grass. 30 instances per class for training data. 300 instances per class for test data.

Relabeled values in attribute class:
   From: 1 To: brickface
   From: 2 To: sky
   From: 3 To: foliage
   From: 4 To: cement
   From: 5 To: window
   From: 6 To: path
   From: 7 To: grass
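The colour attributes (10 to 19) are simple functions of the RGB values in a region. The following is a minimal sketch of how they could be computed, not the original extraction code. The function name `region_color_features` is hypothetical, the 0-255 channel scale is an assumption, and Python's `colorsys.rgb_to_hsv` is used as a stand-in for the Foley and Van Dam transformation, which may differ in scaling. Hue is averaged naively here, which ignores its circular nature.

```python
import colorsys
from statistics import mean


def region_color_features(pixels):
    """Compute colour features for one region.

    pixels: list of (R, G, B) tuples, assumed to be on a 0-255 scale
    (9 tuples for a 3x3 region).
    """
    pixels = list(pixels)
    # Stand-in for the "3-d nonlinear transformation of RGB" (assumption).
    hsv = [colorsys.rgb_to_hsv(r / 255, g / 255, b / 255) for r, g, b in pixels]
    return {
        "intensity-mean": mean([(r + g + b) / 3 for r, g, b in pixels]),
        "rawred-mean": mean([r for r, _, _ in pixels]),
        "rawblue-mean": mean([b for _, _, b in pixels]),
        "rawgreen-mean": mean([g for _, g, _ in pixels]),
        "exred-mean": mean([2 * r - (g + b) for r, g, b in pixels]),
        "exblue-mean": mean([2 * b - (g + r) for r, g, b in pixels]),
        "exgreen-mean": mean([2 * g - (r + b) for r, g, b in pixels]),
        "value-mean": mean([v for _, _, v in hsv]),
        "saturation-mean": mean([s for _, s, _ in hsv]),
        "hue-mean": mean([h for h, _, _ in hsv]),
    }


# A uniform 3x3 region with R=10, G=20, B=30.
features = region_color_features([(10, 20, 30)] * 9)
print(features["intensity-mean"], features["exred-mean"])
```

Note that the three excess-colour values always sum to zero for any pixel, so only two of them carry independent information.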

20 features

class (target)	nominal	7 unique values 0 missing
rawred-mean	numeric	681 unique values 0 missing
hue-mean	numeric	1922 unique values 0 missing
saturation-mean	numeric	1899 unique values 0 missing
value-mean	numeric	785 unique values 0 missing
exgreen-mean	numeric	377 unique values 0 missing
exblue-mean	numeric	636 unique values 0 missing
exred-mean	numeric	430 unique values 0 missing
rawgreen-mean	numeric	691 unique values 0 missing
rawblue-mean	numeric	781 unique values 0 missing
region-centroid-col	numeric	253 unique values 0 missing
intensity-mean	numeric	1271 unique values 0 missing
hedge-sd	numeric	1180 unique values 0 missing
hedge-mean	numeric	262 unique values 0 missing
vegde-sd	numeric	1082 unique values 0 missing
vedge-mean	numeric	234 unique values 0 missing
short-line-density-2	numeric	3 unique values 0 missing
short-line-density-5	numeric	4 unique values 0 missing
region-pixel-count	numeric	1 unique values 0 missing
region-centroid-row	numeric	238 unique values 0 missing


107 properties

NumberOfInstances

2310

Number of instances (rows) of the dataset.

NumberOfFeatures

20

Number of attributes (columns) of the dataset.

NumberOfClasses

7

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

0

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

0

Number of instances with at least one value missing.

NumberOfNumericFeatures

19

Number of numeric attributes.

NumberOfSymbolicFeatures

1

Number of nominal attributes.

AutoCorrelation

0.15

Average class difference between consecutive instances.

CfsSubsetEval_DecisionStumpAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpKappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W
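The Kappa values reported for the landmarkers can be related to their error rates. The sketch below assumes the Kappa coefficient is Cohen's kappa computed from a confusion matrix; the helper names are hypothetical. The second function additionally assumes that both the true classes and the predictions are spread evenly over the classes, which holds for the true classes of this dataset but is only an approximation for the predictions.

```python
def cohen_kappa(confusion):
    """Cohen's kappa for a square confusion matrix (rows: true, columns: predicted)."""
    n = sum(sum(row) for row in confusion)
    observed = sum(confusion[i][i] for i in range(len(confusion))) / n
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    return (observed - expected) / (1 - expected)


def kappa_from_error_rate(error_rate, n_classes):
    """Kappa under the assumption of balanced classes and balanced predictions."""
    chance = 1 / n_classes
    return ((1 - error_rate) - chance) / (1 - chance)


print(round(kappa_from_error_rate(0.04, 7), 2))  # 0.95
```

Under that assumption, an error rate of 0.04 on 7 balanced classes corresponds to a kappa of about 0.95, which agrees with the values listed for the CfsSubsetEval landmarkers.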

CfsSubsetEval_NaiveBayesAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesKappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NKappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

ClassEntropy

2.81

Entropy of the target attribute values.
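The ClassEntropy value follows directly from the class distribution. As a small worked example, assuming entropy is measured in bits (base-2 logarithm), seven classes of 330 instances each give log2(7), which rounds to 2.81. The function name `class_entropy` is illustrative.

```python
from collections import Counter
from math import log2


def class_entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    counts = Counter(labels)
    n = sum(counts.values())
    return -sum((c / n) * log2(c / n) for c in counts.values())


classes = ["brickface", "sky", "foliage", "cement", "window", "path", "grass"]
labels = [name for name in classes for _ in range(330)]
print(round(class_entropy(labels), 2))  # 2.81
```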

DecisionStumpAUC

0.77

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpErrRate

0.72

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpKappa

0.17

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

Dimensionality

0.01

Number of attributes divided by the number of instances.

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

J48.00001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.0001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MajorityClassPercentage

14.29

Percentage of instances belonging to the most frequent class.

MajorityClassSize

330

Number of instances belonging to the most frequent class.
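Both majority-class properties can be reproduced from the label counts. A minimal sketch, with a hypothetical helper name, using the distribution of 330 instances for each of the 7 classes:

```python
from collections import Counter


def majority_class_stats(labels):
    """Return (size, percentage) of the most frequent class."""
    counts = Counter(labels)
    _, size = counts.most_common(1)[0]
    return size, 100 * size / len(labels)


classes = ["brickface", "sky", "foliage", "cement", "window", "path", "grass"]
labels = [name for name in classes for _ in range(330)]
size, percentage = majority_class_stats(labels)
print(size, round(percentage, 2))  # 330 14.29
```

Because the classes are perfectly balanced, the minority-class figures are identical.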

MaxAttributeEntropy

Maximum entropy among attributes.

MaxKurtosisOfNumericAtts

339.22

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

124.91

Maximum of means among attributes of the numeric type.

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MaxSkewnessOfNumericAtts

16.9

Maximum skewness among attributes of the numeric type.
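Skewness and kurtosis are computed per numeric attribute and then summarised (maximum, mean, minimum, quartiles). The sketch below uses population moments and excess kurtosis (a normal distribution scores 0); this is an assumption, since the exact estimator behind these properties is not stated here, although the negative minimum kurtosis listed on this page suggests an excess-kurtosis convention. The function name is hypothetical.

```python
from statistics import fmean


def skewness_and_excess_kurtosis(values):
    """Population skewness and excess kurtosis of one numeric attribute."""
    n = len(values)
    mu = fmean(values)
    m2 = sum((x - mu) ** 2 for x in values) / n
    m3 = sum((x - mu) ** 3 for x in values) / n
    m4 = sum((x - mu) ** 4 for x in values) / n
    if m2 == 0:
        # A constant attribute has no spread, so both statistics are undefined.
        raise ValueError("attribute is constant")
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3


skew, kurt = skewness_and_excess_kurtosis([1, 2, 3, 4, 5])
print(round(skew, 2), round(kurt, 2))
```

A constant attribute such as region-pixel-count (1 unique value) has zero standard deviation, so it has to be excluded or handled separately.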

MaxStdDevOfNumericAtts

72.96

Maximum standard deviation of attributes of the numeric type.

MeanAttributeEntropy

Average entropy of the attributes.

MeanKurtosisOfNumericAtts

38.52

Mean kurtosis among attributes of the numeric type.

MeanMeansOfNumericAtts

24.63

Mean of means among attributes of the numeric type.

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.
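The two derived properties, EquivalentNumberOfAtts and MeanNoiseToSignalRatio, are defined in terms of entropy and mutual information of nominal attributes. The sketch below shows one way to compute them, assuming base-2 entropy; all function names and the toy data are hypothetical and are not taken from this dataset, which has no nominal attributes other than the class.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of nominal values."""
    counts = Counter(values)
    n = sum(counts.values())
    return -sum((c / n) * log2(c / n) for c in counts.values())


def mutual_information(attribute, target):
    """I(A; C) = H(A) + H(C) - H(A, C)."""
    joint = list(zip(attribute, target))
    return entropy(attribute) + entropy(target) - entropy(joint)


def equivalent_number_of_atts(class_entropy, mean_mutual_information):
    return class_entropy / mean_mutual_information


def noise_to_signal_ratio(mean_attribute_entropy, mean_mutual_information):
    return (mean_attribute_entropy - mean_mutual_information) / mean_mutual_information


# Toy data: the attribute determines the class exactly, so I(A; C) = H(C) = 1 bit.
attribute = ["a", "a", "b", "b"]
target = ["x", "x", "y", "y"]
print(mutual_information(attribute, target))
```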

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

MeanSkewnessOfNumericAtts

3.32

Mean skewness among attributes of the numeric type.

MeanStdDevOfNumericAtts

25.31

Mean standard deviation of attributes of the numeric type.

MinAttributeEntropy

Minimal entropy among attributes.

MinKurtosisOfNumericAtts

-1.22

Minimum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

-12.69

Minimum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

-0.89

Minimum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

Minimum standard deviation of attributes of the numeric type.

MinorityClassPercentage

14.29

Percentage of instances belonging to the least frequent class.

MinorityClassSize

330

Number of instances belonging to the least frequent class.

NaiveBayesAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesErrRate

0.2

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesKappa

0.77

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NumberOfBinaryFeatures

0

Number of binary attributes.

PercentageOfBinaryFeatures

0

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

0

Percentage of instances having missing values.

PercentageOfMissingValues

0

Percentage of missing values.

PercentageOfNumericFeatures

95

Percentage of numeric attributes.

PercentageOfSymbolicFeatures

5

Percentage of nominal attributes.

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

0.09

First quartile of kurtosis among attributes of the numeric type.

Quartile1MeansOfNumericAtts

0.01

First quartile of means among attributes of the numeric type.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

Quartile1SkewnessOfNumericAtts

0.69

First quartile of skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

1.55

First quartile of standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

0.69

Second quartile (Median) of kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

8.24

Second quartile (Median) of means among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

1.3

Second quartile (Median) of skewness among attributes of the numeric type.

Quartile2StdDevOfNumericAtts

19.57

Second quartile (Median) of standard deviation of attributes of the numeric type.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

Quartile3KurtosisOfNumericAtts

34.34

Third quartile of kurtosis among attributes of the numeric type.

Quartile3MeansOfNumericAtts

37.05

Third quartile of means among attributes of the numeric type.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

Quartile3SkewnessOfNumericAtts

5.37

Third quartile of skewness among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

43.53

Third quartile of standard deviation of attributes of the numeric type.
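The quartile properties summarise a per-attribute statistic (mean, standard deviation, skewness, kurtosis) across all numeric attributes. A minimal sketch using Python's `statistics.quantiles` follows. The interpolation method is an assumption, as quartile conventions differ between tools, and the input values are made up for illustration.

```python
from statistics import quantiles


def quartiles(per_attribute_values):
    """Return (Q1, Q2, Q3) of one statistic collected over all numeric attributes."""
    q1, q2, q3 = quantiles(per_attribute_values, n=4, method="inclusive")
    return q1, q2, q3


# Hypothetical per-attribute means, not taken from this dataset.
print(quartiles([1, 2, 3, 4, 5]))  # (2.0, 3.0, 4.0)
```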

REPTreeDepth1AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1ErrRate

0.07

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1Kappa

0.92

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth2AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2ErrRate

0.07

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2Kappa

0.92

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth3AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3ErrRate

0.07

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3Kappa

0.92

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

RandomTreeDepth1AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1ErrRate

0.06

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth2AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2ErrRate

0.06

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth3AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3ErrRate

0.06

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

kNN1NAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk


11 tasks

Supervised Classification on segment

0 runs - estimation_procedure: 20% Holdout (Ordered) - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: Test on Training Data - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: Leave one out - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: 33% Holdout set - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - target_feature: class

Supervised Classification on segment

0 runs - estimation_procedure: 10% Holdout set - target_feature: class

Learning Curve on segment

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on segment

0 runs - estimation_procedure: 10 times 10-fold Learning Curve - target_feature: class

Supervised Data Stream Classification on segment

0 runs - estimation_procedure: Interleaved Test then Train - target_feature: class
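The estimation procedures above differ in how instances are divided between training and testing. The sketch below illustrates two of them: k-fold cross-validation and interleaved test-then-train for data streams. It is an illustration under simple assumptions (unstratified random folds, accuracy as the measure) and does not reproduce the splits that OpenML tasks define; all function names are hypothetical.

```python
import random


def k_fold_indices(n_instances, k=10, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


def interleaved_test_then_train(stream, predict, update):
    """Test on each instance first, then train on it; return the accuracy."""
    correct = total = 0
    for x, y in stream:
        if predict(x) == y:
            correct += 1
        total += 1
        update(x, y)  # the model only sees the label after predicting
    return correct / total if total else 0.0


# With 2310 instances and 10 folds, each test fold holds 231 instances.
for train, test in k_fold_indices(2310, k=10):
    assert len(train) == 2079 and len(test) == 231
```

In the streaming procedure every instance is used for testing exactly once and for training afterwards, so no separate holdout set is needed.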
