OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

anneal

active ARFF Publicly available Visibility: public Uploaded 06-04-2014 by Jan van Rijn
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

ab and new_tag of study_14 the study_1 study_491 study_40 study_298 study_491 study_358 study_491 study_7 study_9 study_16 study_18 study_25 study_28 study_30 study_31 study_36 study_44 study_45 study_55 study_56 study_59 study_62 study_63 study_67 study_74 study_77 study_81 study_85 study_88 study_89 study_91 study_93 study_95 study_97 study_101 study_106 study_107 study_112 study_117 study_122 study_123 study_126 study_128 study_135 study_136 study_148 study_153 study_155 study_160 study_182 study_184 study_188 study_193 study_198 study_204 study_207 study_213 study_219 study_227 study_232 study_239 study_241 study_246 study_251 study_258 study_263 study_268 study_275 study_278 study_283 study_288 study_297 study_303 study_308 study_316 study_320 study_324 study_332 study_338 study_341 study_348 study_353 study_362 study_369 study_375 study_377 study_387 study_391 study_394 study_402 study_404 study_410 study_414 study_422 study_424 study_432 study_439 study_443 study_452 study_453 study_458 study_465 study_474 study_475 study_482 study_7 study_9 study_16 study_18 study_25 study_28 study_30 study_31 study_36 study_44 study_45 study_55 study_56 study_59 study_62 study_63 study_67 study_74 study_77 study_81 study_85 study_88 study_89 study_91 study_93 study_95 study_97 study_101 study_106 study_107 study_112 study_117 study_122 study_123 study_126 study_128 study_135 study_136 study_148 study_153 study_155 study_160 study_182 study_184 study_188 study_193 study_198 study_204 study_207 study_213 study_219 study_227 study_232 study_239 study_241 study_246 study_251 study_258 study_263 study_268 study_275 study_278 study_283 study_288 study_297 study_303 study_308 study_316 study_320 study_324 study_332 study_338 study_341 study_348 study_353 study_362 study_369 study_375 study_377 study_387 study_391 study_394 study_402 study_404 study_410 study_414 study_422 study_424 study_432 study_439 study_443 study_452 study_453 study_458 study_465 study_474 study_475 study_482 study_7 study_9 study_16 study_18 study_25 study_28 study_30 study_31 study_36 study_44 study_45 study_55 study_56 study_59 study_62 study_63 study_67 study_74 study_77 study_81 study_85 study_88 study_89 study_91 study_93 study_95 study_97 study_101 study_106 study_107 study_110 study_112 study_117 study_122 study_123 study_126 study_128 study_135 study_136 study_148 study_153 study_155 study_160 study_182 study_184 study_188 study_193 study_198 study_204 study_207 study_213 study_219 study_227 study_232 study_239 study_241 study_246 study_251 study_258 study_263 study_268 study_275 study_278 study_283 study_288 study_297 study_303 study_308 study_316 study_320 study_324 study_332 study_338 study_341 study_348 study_353 study_362 study_369 study_375 study_377 study_387 study_391 study_394 study_402 study_404 study_410 study_414 study_422 study_424 study_432 study_439 study_443 study_452 study_453 study_458 study_465 study_474 study_475 study_482 study_256 Add tag

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Author: Source: Unknown - Please cite: 1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross Quinlan in 1987 at the 4th Machine Learning Workshop. I'd have to check with Jeff Schlimmer to double check this. 5. Number of Instances: 798 6. Number of Attributes: 38 -- 6 continuously-valued -- 3 integer-valued -- 29 nominal-valued 7. Attribute Information: 1. family: --,GB,GK,GS,TN,ZA,ZF,ZH,ZM,ZS 2. product-type: C, H, G 3. steel: -,R,A,U,K,M,S,W,V 4. carbon: continuous 5. hardness: continuous 6. temper_rolling: -,T 7. condition: -,S,A,X 8. formability: -,1,2,3,4,5 9. strength: continuous 10. non-ageing: -,N 11. surface-finish: P,M,- 12. surface-quality: -,D,E,F,G 13. enamelability: -,1,2,3,4,5 14. bc: Y,- 15. bf: Y,- 16. bt: Y,- 17. bw/me: B,M,- 18. bl: Y,- 19. m: Y,- 20. chrom: C,- 21. phos: P,- 22. cbond: Y,- 23. marvi: Y,- 24. exptl: Y,- 25. ferro: Y,- 26. corr: Y,- 27. blue/bright/varn/clean: B,R,V,C,- 28. lustre: Y,- 29. jurofm: Y,- 30. s: Y,- 31. p: Y,- 32. shape: COIL, SHEET 33. thick: continuous 34. width: continuous 35. len: continuous 36. oil: -,Y,N 37. bore: 0000,0500,0600,0760 38. packing: -,1,2,3 classes: 1,2,3,4,5,U -- The '-' values are actually 'not_applicable' values rather than 'missing_values' (and so can be treated as legal discrete values rather than as showing the absence of a discrete value). 8. Missing Attribute Values: Signified with "?" Attribute: Number of instances missing its value: 1 0 2 0 3 70 4 0 5 0 6 675 7 271 8 283 9 0 10 703 11 790 12 217 13 785 14 797 15 680 16 736 17 609 18 662 19 798 20 775 21 791 22 730 23 798 24 796 25 772 26 798 27 793 28 753 29 798 30 798 31 798 32 0 33 0 34 0 35 0 36 740 37 0 38 789 39 0 9. Distribution of Classes Class Name: Number of Instances: 1 8 2 88 3 608 4 0 5 60 U 34 --- 798

39 features

class (target)	nominal	5 unique values 0 missing
phos	nominal	1 unique values 891 missing
chrom	nominal	1 unique values 872 missing
cbond	nominal	1 unique values 824 missing
marvi	nominal	0 unique values 898 missing
exptl	nominal	1 unique values 896 missing
ferro	nominal	1 unique values 868 missing
corr	nominal	0 unique values 898 missing
blue%2Fbright%2Fvarn%2Fclean	nominal	3 unique values 892 missing
lustre	nominal	1 unique values 847 missing
jurofm	nominal	0 unique values 898 missing
s	nominal	0 unique values 898 missing
p	nominal	0 unique values 898 missing
shape	nominal	2 unique values 0 missing
thick	numeric	50 unique values 0 missing
width	numeric	68 unique values 0 missing
len	numeric	24 unique values 0 missing
oil	nominal	2 unique values 834 missing
bore	nominal	3 unique values 0 missing
packing	nominal	2 unique values 889 missing
surface-finish	nominal	1 unique values 889 missing
product-type	nominal	1 unique values 0 missing
steel	nominal	7 unique values 86 missing
carbon	numeric	10 unique values 0 missing
hardness	numeric	7 unique values 0 missing
temper_rolling	nominal	1 unique values 761 missing
condition	nominal	2 unique values 303 missing
formability	nominal	4 unique values 318 missing
strength	numeric	8 unique values 0 missing
non-ageing	nominal	1 unique values 793 missing
family	nominal	2 unique values 772 missing
surface-quality	nominal	4 unique values 244 missing
enamelability	nominal	2 unique values 882 missing
bc	nominal	1 unique values 897 missing
bf	nominal	1 unique values 769 missing
bt	nominal	1 unique values 824 missing
bw%2Fme	nominal	2 unique values 687 missing
bl	nominal	1 unique values 749 missing
m	nominal	0 unique values 898 missing

Show all 39 features

107 properties

NumberOfInstances

898

Number of instances (rows) of the dataset.

NumberOfFeatures

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

22175

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

898

Number of instances with at least one value missing.

NumberOfNumericFeatures

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

AutoCorrelation

0.61

Average class difference between consecutive instances.

CfsSubsetEval_DecisionStumpAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpErrRate

0.13

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpKappa

0.62

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesErrRate

0.13

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesKappa

0.62

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NErrRate

0.13

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NKappa

0.62

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

ClassEntropy

1.19

Entropy of the target attribute values.

DecisionStumpAUC

0.87

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpErrRate

0.23

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpKappa

0.45

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

Dimensionality

0.04

Number of attributes divided by the number of instances.

EquivalentNumberOfAtts

26.84

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

J48.00001.AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.ErrRate

0.1

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.Kappa

0.7

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.0001.AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.ErrRate

0.1

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.Kappa

0.7

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.001.AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.ErrRate

0.1

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.Kappa

0.7

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MajorityClassPercentage

76.17

Percentage of instances belonging to the most frequent class.

MajorityClassSize

684

Number of instances belonging to the most frequent class.

MaxAttributeEntropy

1.82

Maximum entropy among attributes.

MaxKurtosisOfNumericAtts

13.22

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

1263.09

Maximum of means among attributes of the numeric type.

MaxMutualInformation

0.41

Maximum mutual information between the nominal attributes and the target attribute.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MaxSkewnessOfNumericAtts

3.76

Maximum skewness among attributes of the numeric type.

MaxStdDevOfNumericAtts

1871.4

Maximum standard deviation of attributes of the numeric type.

MeanAttributeEntropy

0.25

Average entropy of the attributes.

MeanKurtosisOfNumericAtts

4.65

Mean kurtosis among attributes of the numeric type.

MeanMeansOfNumericAtts

348.5

Mean of means among attributes of the numeric type.

MeanMutualInformation

0.04

Average mutual information between the nominal attributes and the target attribute.

MeanNoiseToSignalRatio

4.67

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

MeanNominalAttDistinctValues

1.64

Average number of distinct values among the attributes of the nominal type.

MeanSkewnessOfNumericAtts

2.03

Mean skewness among attributes of the numeric type.

MeanStdDevOfNumericAtts

405.17

Mean standard deviation of attributes of the numeric type.

MinAttributeEntropy

-0

Minimal entropy among attributes.

MinKurtosisOfNumericAtts

-0.97

Minimum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

1.2

Minimum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

0.07

Minimum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

0.87

Minimum standard deviation of attributes of the numeric type.

MinorityClassPercentage

0.89

Percentage of instances belonging to the least frequent class.

MinorityClassSize

Number of instances belonging to the least frequent class.

NaiveBayesAUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesErrRate

0.25

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesKappa

0.56

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NumberOfBinaryFeatures

Number of binary attributes.

PercentageOfBinaryFeatures

10.26

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

100

Percentage of instances having missing values.

PercentageOfMissingValues

63.32

Percentage of missing values.

PercentageOfNumericFeatures

15.38

Percentage of numeric attributes.

PercentageOfSymbolicFeatures

84.62

Percentage of nominal attributes.

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

-0.4

First quartile of kurtosis among attributes of the numeric type.

Quartile1MeansOfNumericAtts

3.03

First quartile of means among attributes of the numeric type.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

Quartile1SkewnessOfNumericAtts

0.97

First quartile of skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

10.51

First quartile of standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

1.64

Second quartile (Median) of kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

21.22

Second quartile (Median) of means among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

1.65

Second quartile (Median) of skewness among attributes of the numeric type.

Quartile2StdDevOfNumericAtts

69.85

Second quartile (Median) of standard deviation of attributes of the numeric type.

Quartile3AttributeEntropy

0.24

Third quartile of entropy among attributes.

Quartile3KurtosisOfNumericAtts

12.74

Third quartile of kurtosis among attributes of the numeric type.

Quartile3MeansOfNumericAtts

901.26

Third quartile of means among attributes of the numeric type.

Quartile3MutualInformation

0.02

Third quartile of mutual information between the nominal attributes and the target attribute.

Quartile3SkewnessOfNumericAtts

3.75

Third quartile of skewness among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

771.86

Third quartile of standard deviation of attributes of the numeric type.

REPTreeDepth1AUC

0.96

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1Kappa

0.77

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth2AUC

0.96

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2Kappa

0.77

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth3AUC

0.96

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3Kappa

0.77

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

RandomTreeDepth1AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1Kappa

0.8

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth2AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2Kappa

0.8

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth3AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3ErrRate

0.08

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3Kappa

0.8

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

StdvNominalAttDistinctValues

1.56

Standard deviation of the number of distinct values among attributes of the nominal type.

kNN1NAUC

0.87

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NErrRate

0.06

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NKappa

0.83

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

Show all 107 properties

11 tasks

Supervised Classification on anneal

33 runs - estimation_procedure: 33% Holdout set - target_feature: class

Supervised Classification on anneal

14 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: Leave one out - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: 10% Holdout set - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: Test on Training Data - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: 20% Holdout (Ordered) - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - target_feature: class

Learning Curve on anneal

108 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10 times 10-fold Learning Curve - target_feature: class

Supervised Data Stream Classification on anneal

0 runs - estimation_procedure: Interleaved Test then Train - target_feature: class

Define a new task

Sign in

anneal

39 features

107 properties

11 tasks