OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

gas-drift

active ARFF Publicly available Visibility: public Uploaded 22-05-2015 by Rafael Gomes Mantovani
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Author: Alexander Vergara Source: UCI Please cite: Alexander Vergara and Shankar Vembu and Tuba Ayhan and Margaret A. Ryan and Margie L. Homer and Ramón Huerta, Chemical gas sensor drift compensation using classifier ensembles, Sensors and Actuators B: Chemical (2012) doi: 10.1016/j.snb.2012.01.074. Title: Gas Sensor Array Drift Dataset Data Set Source: Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircutis Institute University of California San Diego San Diego, California, USA Donors of the Dataset: Alexander Vergara (vergara '@' ucsd.edu) Ramon Huerta (rhuerta '@' ucsd.edu) Data Set Information: This archive contains 13910 measurements from 16 chemical sensors utilized in simulations for drift compensation in a discrimination task of 6 gases at various levels of concentrations. The goal is to achieve good performance (or as low degradation as possible) over time, as reported in the paper mentioned below in Section 2: Data collection. The primary purpose of providing this dataset is to make it freely accessible on-line to the chemo-sensor research community and artificial intelligence to develop strategies to cope with sensor/concept drift. The dataset can be used exclusively for research purposes. Commercial purposes are fully excluded. The dataset was gathered within January 2007 to February 2011 (36 months) in a gas delivery platform facility situated at the ChemoSignals Laboratory in the BioCircuits Institute, University of California San Diego. Being completely operated by a fully computerized environment â€”controlled by a LabVIEWâ€“National Instruments software on a PC fitted with the appropriate serial data acquisition boards. The measurement system platform provides versatility for obtaining the desired concentrations of the chemical substances of interest with high accuracy and in a highly reproducible manner, minimizing thereby the common mistakes caused by human intervention and making it possible to exclusively concentrate on the chemical sensors for compensating real drift. The resulting dataset comprises recordings from six distinct pure gaseous substances, namely Ammonia, Acetaldehyde, Acetone, Ethylene, Ethanol, and Toluene, each dosed at a wide variety of concentration values ranging from 5 to 1000 ppmv. See Tables 1 and 2 of the below cited manuscript for details on the gas identity name, concentration values, and time distribution sequence of the measurement recordings considered in this dataset. Batch10.dat was updated on 10/14/2013 to correct some corrupted values in the last 120 lines of the file. An extension of this dataset with the concentration values is available at Gas Sensor Array Drift Dataset at Different Concentrations Data Set [Web Link] Attribute Information: The response of the said sensors is read-out in the form of the resistance across the active layer of each sensor; hence each measurement produced a 16-channel time series, each of which represented by an aggregate of features reflecting all the dynamic processes occurring at the sensor surface in reaction to the chemical substance being evaluated. In particular, two distinct types of features were considered in the creation of this dataset: (i) The so-called steady-state feature (Î”R), defined as the difference of the maximal resistance change and the baseline and its normalized version expressed by the ratio of the maximal resistance and the baseline values when the chemical vapor is present in the test chamber. And (ii), an aggregate of features reflecting the sensor dynamics of the increasing/decaying transient portion of the sensor response during the entire measurement procedure under controlled conditions, namely the exponential moving average (emaÎ±). These aggregate of features is a transform, borrowed from the field of econometrics originally introduced to the chemo-sensing community by Muezzinoglu et al. (2009), that converts the said transient portion into a real scalar, by estimating the maximum value â€”minimum for the decaying portion of the sensor responseâ€” of its exponential moving average (emaÎ±), with an initial condition set to zero and a scalar smoothing parameter of the operator, Î±, that defines both the quality of the feature and the time of its occurrence along the time series the scalar, set to range between 0 and 1. In particular, three different values for Î± were set to obtain three different feature values from the pre-recorded rising portion of the sensor response and three additional features with the same Î± values but for the decaying portion of the sensor response, covering thus the entire sensor response dynamics. For a more detailed analysis and discussion on these features as well as a graphical illustration of them please refer to Section 2.3 and Figure 2, respectively of the annotated manuscript. Once the abovementioned features are calculated, one is to form a feature vector containing the 8 features extracted from each particular sensor multiplied by the 16 sensors considered here. In the end, the resulting 128-dimensional feature vector containing all the features indicated above (8 features Ã— 16 sensors) is organized as follows: Î”R_1, |Î”R|_1, EMAi0.001_1, EMAi0.01_1, EMAi0.1_1, EMAd0.001_1, EMAd0.01_1, EMAd0.1_1, Î”R_2, |Î”R|_2, EMAi0.001_2, EMAi0.01_2, EMAi0.1_2, EMAd0.001_2, EMAd0.01_2, EMAd0.1_2,..., Î”R_16, |Î”R|_16, EMAi0.001_16, EMAi0.01_16, EMAi0.1_16, EMAd0.001_16, EMAd0.01_16, EMAd0.1_16, where: â€œÎ”R_1â€ and â€œ|Î”R|_1â€ is the Î”R and the normalized Î”R feature, respectively, â€œEMAi0.001_1â€, â€œEMAi0.01_1â€, and â€œEMAi0.1_1â€, the emaÎ± of the rising transient portion of the sensor response for Î± equals to 0.001, 0.01, and 0.1, respectively, and â€œEMAd0.001_1â€, â€œEMAd0.01_1â€, and â€œEMAd0.1_1â€, the emaÎ± of the decaying transient portion of the sensor response for Î± equals to 0.001, 0.01, and 0.1, respectively, all corresponding to sensor # 1; â€œÎ”R_2â€ and â€œ|Î”R|_2â€ is the Î”R and the normalized Î”R feature, respectively, â€œEMAi0.001_2â€, â€œEMAi0.01_2â€, and â€œEMAi0.1_2â€, the emaÎ± of the rising transient portion of the sensor response for Î± equals to 0.001, 0.01, and 0.1, respectively, and â€œEMAd0.001_2â€, â€œEMAd0.01_2â€, and â€œEMAd0.1_2â€, the emaÎ± of the decaying transient portion of the sensor response for Î± equals to 0.001, 0.01, and 0.1, respectively, all corresponding to sensor # 2; and so forth up until sensor # 16, forming thus the 128-dimensional feature vector that is to be fetched to the classifiers for training. For processing purposes, the data is organized into ten batches, each containing the number of measurements per class and month indicated in the table below. This reorganization of data was done to ensure having a sufficient and as uniformly distributed as possible number of experiments in each class and month when training the classifier. Dataset organization details. Each row corresponds to months that were combined to form a batch: Batch ID Month IDs Batch 1 Months 1 and 2 Batch 2 Months 3, 4, 8, 9 and 10 Batch 3 Months 11, 12, and 13 Batch 4 Months 14 and 15 Batch 5 Month 16 Batch 6 Months 17, 18, 19, and 20 Batch 7 Month 21 Batch 8 Months 22 and 23 Batch 9 Months 24 and 30 Batch 10 Month 36 The data format follows the same coding style as in libsvm, in which one indicates the class each data point belongs to (1: Ethanol; 2: Ethylene; 3:Ammonia; 4: Acetaldehyde; 5: Acetone; 6: Toluene), and, then, the collection of features in a format x:v, where x stands for the feature number and v for the actual value of the feature. For example, in 1 1:15596.162100 2:1.868245 3:2.371604 4:2.803678 5:7.512213 â€¦ 128:-2.654529 The number â€œ1â€ stands for the class number (in this case Ethanol), whereas the remaining 128 columns list the actual feature values for each measurement recording organized as described above. Finally, to make the results presented in the associated article reproducible for the reader, please use the following parameter values in the training task: â€¢ folds: 10 â€¢ log2c = -5, 10, 1 â€¢ log2g = -10, 5, 1 â€¢ Scale the features in the training set appropriately to lie between -1 and +1. â€¢ And use the following cross validation parameters: Batch C Gamma (É¤) Rate 1 256.0 0.03125 98.8764 2 64.0 0.00390625 99.7588 3 128.0 0.03125 100.0 4 1.0 0.25 100.0 5 2.0 0.015625 99.4924 6 256.0 0.0009765625 99.5217 7 64.0 0.0625 99.9723 8 1024.0 0.0078125 99.6599 9 2.0 0.00390625 100.0

129 features

Class (target)	nominal	6 unique values 0 missing
V1	numeric	13904 unique values 0 missing
V2	numeric	13890 unique values 0 missing
V3	numeric	13904 unique values 0 missing
V4	numeric	13905 unique values 0 missing
V5	numeric	13904 unique values 0 missing
V6	numeric	13897 unique values 0 missing
V7	numeric	13895 unique values 0 missing
V8	numeric	13907 unique values 0 missing
V9	numeric	13897 unique values 0 missing
V10	numeric	13888 unique values 0 missing
V11	numeric	13905 unique values 0 missing
V12	numeric	13909 unique values 0 missing
V13	numeric	13906 unique values 0 missing
V14	numeric	13906 unique values 0 missing
V15	numeric	13902 unique values 0 missing
V16	numeric	13908 unique values 0 missing
V17	numeric	13910 unique values 0 missing
V18	numeric	13892 unique values 0 missing
V19	numeric	13896 unique values 0 missing
V20	numeric	13903 unique values 0 missing
V21	numeric	13909 unique values 0 missing
V22	numeric	13883 unique values 0 missing
V23	numeric	13903 unique values 0 missing
V24	numeric	13899 unique values 0 missing
V25	numeric	13896 unique values 0 missing
V26	numeric	13885 unique values 0 missing
V27	numeric	13891 unique values 0 missing
V28	numeric	13892 unique values 0 missing
V29	numeric	13893 unique values 0 missing
V30	numeric	13872 unique values 0 missing
V31	numeric	13886 unique values 0 missing
V32	numeric	13891 unique values 0 missing
V33	numeric	13904 unique values 0 missing
V34	numeric	13874 unique values 0 missing
V35	numeric	13855 unique values 0 missing
V36	numeric	13894 unique values 0 missing
V37	numeric	13886 unique values 0 missing
V38	numeric	13835 unique values 0 missing
V39	numeric	13869 unique values 0 missing
V40	numeric	13891 unique values 0 missing
V41	numeric	13908 unique values 0 missing
V42	numeric	13877 unique values 0 missing
V43	numeric	13864 unique values 0 missing
V44	numeric	13891 unique values 0 missing
V45	numeric	13894 unique values 0 missing
V46	numeric	13820 unique values 0 missing
V47	numeric	13859 unique values 0 missing
V48	numeric	13882 unique values 0 missing
V49	numeric	13908 unique values 0 missing
V50	numeric	13898 unique values 0 missing
V51	numeric	13906 unique values 0 missing
V52	numeric	13908 unique values 0 missing
V53	numeric	13907 unique values 0 missing
V54	numeric	13893 unique values 0 missing
V55	numeric	13903 unique values 0 missing
V56	numeric	13903 unique values 0 missing
V57	numeric	13909 unique values 0 missing
V58	numeric	13897 unique values 0 missing
V59	numeric	13900 unique values 0 missing
V60	numeric	13905 unique values 0 missing
V61	numeric	13906 unique values 0 missing
V62	numeric	13902 unique values 0 missing
V63	numeric	13901 unique values 0 missing
V64	numeric	13904 unique values 0 missing
V65	numeric	13899 unique values 0 missing
V66	numeric	13889 unique values 0 missing
V67	numeric	13902 unique values 0 missing
V68	numeric	13906 unique values 0 missing
V69	numeric	13907 unique values 0 missing
V70	numeric	13891 unique values 0 missing
V71	numeric	13907 unique values 0 missing
V72	numeric	13906 unique values 0 missing
V73	numeric	13904 unique values 0 missing
V74	numeric	13887 unique values 0 missing
V75	numeric	13904 unique values 0 missing
V76	numeric	13903 unique values 0 missing
V77	numeric	13905 unique values 0 missing
V78	numeric	13897 unique values 0 missing
V79	numeric	13898 unique values 0 missing
V80	numeric	13900 unique values 0 missing
V81	numeric	13908 unique values 0 missing
V82	numeric	13888 unique values 0 missing
V83	numeric	13906 unique values 0 missing
V84	numeric	13906 unique values 0 missing
V85	numeric	13905 unique values 0 missing
V86	numeric	13892 unique values 0 missing
V87	numeric	13899 unique values 0 missing
V88	numeric	13903 unique values 0 missing
V89	numeric	13908 unique values 0 missing
V90	numeric	13900 unique values 0 missing
V91	numeric	13903 unique values 0 missing
V92	numeric	13905 unique values 0 missing
V93	numeric	13903 unique values 0 missing
V94	numeric	13886 unique values 0 missing
V95	numeric	13896 unique values 0 missing
V96	numeric	13902 unique values 0 missing
V97	numeric	13902 unique values 0 missing
V98	numeric	13882 unique values 0 missing
V99	numeric	13872 unique values 0 missing
V100	numeric	13905 unique values 0 missing
V101	numeric	13902 unique values 0 missing
V102	numeric	13854 unique values 0 missing
V103	numeric	13882 unique values 0 missing
V104	numeric	13895 unique values 0 missing
V105	numeric	13910 unique values 0 missing
V106	numeric	13885 unique values 0 missing
V107	numeric	13876 unique values 0 missing
V108	numeric	13894 unique values 0 missing
V109	numeric	13895 unique values 0 missing
V110	numeric	13850 unique values 0 missing
V111	numeric	13875 unique values 0 missing
V112	numeric	13875 unique values 0 missing
V113	numeric	13905 unique values 0 missing
V114	numeric	13898 unique values 0 missing
V115	numeric	13903 unique values 0 missing
V116	numeric	13908 unique values 0 missing
V117	numeric	13906 unique values 0 missing
V118	numeric	13898 unique values 0 missing
V119	numeric	13903 unique values 0 missing
V120	numeric	13907 unique values 0 missing
V121	numeric	13909 unique values 0 missing
V122	numeric	13898 unique values 0 missing
V123	numeric	13903 unique values 0 missing
V124	numeric	13907 unique values 0 missing
V125	numeric	13903 unique values 0 missing
V126	numeric	13898 unique values 0 missing
V127	numeric	13905 unique values 0 missing
V128	numeric	13907 unique values 0 missing

Show first 100 features

107 properties

NumberOfInstances

13910

Number of instances (rows) of the dataset.

NumberOfFeatures

129

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

128

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

AutoCorrelation

0.59

Average class difference between consecutive instances.

CfsSubsetEval_DecisionStumpAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

ClassEntropy

2.55

Entropy of the target attribute values.

DecisionStumpAUC

0.71

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpErrRate

0.61

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpKappa

0.22

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

Dimensionality

0.01

Number of attributes divided by the number of instances.

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

J48.00001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.0001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MajorityClassPercentage

21.63

Percentage of instances belonging to the most frequent class.

MajorityClassSize

3009

Number of instances belonging to the most frequent class.

MaxAttributeEntropy

Maximum entropy among attributes.

MaxKurtosisOfNumericAtts

13909.09

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

57340.1

Maximum of means among attributes of the numeric type.

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MaxSkewnessOfNumericAtts

117.93

Maximum skewness among attributes of the numeric type.

MaxStdDevOfNumericAtts

69844.79

Maximum standard deviation of attributes of the numeric type.

MeanAttributeEntropy

Average entropy of the attributes.

MeanKurtosisOfNumericAtts

1037.15

Mean kurtosis among attributes of the numeric type.

MeanMeansOfNumericAtts

2791.46

Mean of means among attributes of the numeric type.

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

MeanSkewnessOfNumericAtts

4.62

Mean skewness among attributes of the numeric type.

MeanStdDevOfNumericAtts

2729.31

Mean standard deviation of attributes of the numeric type.

MinAttributeEntropy

Minimal entropy among attributes.

MinKurtosisOfNumericAtts

-0.07

Minimum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

-72.75

Minimum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

-87.65

Minimum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

0.53

Minimum standard deviation of attributes of the numeric type.

MinorityClassPercentage

11.8

Percentage of instances belonging to the least frequent class.

MinorityClassSize

1641

Number of instances belonging to the least frequent class.

NaiveBayesAUC

0.84

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesErrRate

0.43

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesKappa

0.49

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NumberOfBinaryFeatures

Number of binary attributes.

PercentageOfBinaryFeatures

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

PercentageOfMissingValues

Percentage of missing values.

PercentageOfNumericFeatures

99.22

Percentage of numeric attributes.

PercentageOfSymbolicFeatures

0.78

Percentage of nominal attributes.

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

4.23

First quartile of kurtosis among attributes of the numeric type.

Quartile1MeansOfNumericAtts

-4.77

First quartile of means among attributes of the numeric type.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

Quartile1SkewnessOfNumericAtts

-2.29

First quartile of skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

4.36

First quartile of standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

10.32

Second quartile (Median) of kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

5.37

Second quartile (Median) of means among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

1.3

Second quartile (Median) of skewness among attributes of the numeric type.

Quartile2StdDevOfNumericAtts

9.63

Second quartile (Median) of standard deviation of attributes of the numeric type.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

Quartile3KurtosisOfNumericAtts

80.89

Third quartile of kurtosis among attributes of the numeric type.

Quartile3MeansOfNumericAtts

15.19

Third quartile of means among attributes of the numeric type.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

Quartile3SkewnessOfNumericAtts

2.54

Third quartile of skewness among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

24.79

Third quartile of standard deviation of attributes of the numeric type.

REPTreeDepth1AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth2AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth3AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

RandomTreeDepth1AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth2AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth3AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

kNN1NAUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NErrRate

0.01

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NKappa

0.99

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

Show all 107 properties

11 tasks

Supervised Classification on gas-drift

0 runs - estimation_procedure: 20% Holdout (Ordered) - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: 5 times 2-fold Crossvalidation - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: Leave one out - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: 10% Holdout set - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: 33% Holdout set - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: Test on Training Data - target_feature: Class

Supervised Classification on gas-drift

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class

Learning Curve on gas-drift

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: Class

Learning Curve on gas-drift

0 runs - estimation_procedure: 10 times 10-fold Learning Curve - target_feature: Class

Supervised Data Stream Classification on gas-drift

0 runs - estimation_procedure: Interleaved Test then Train - target_feature: Class

Define a new task

Sign in

gas-drift

129 features

107 properties

11 tasks