47
ada_agnostic
1
**Author**:
**Source**: Unknown - Date unknown
**Please cite**:
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch)
Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php
Modified by TunedIT (converted to ARFF format)
ADA is a marketing database.
The task of ADA is to identify high-income individuals from census data. This is a two-class classification problem. The raw data from the census bureau is known as the Adult database in the UCI machine-learning repository. The 14 original attributes (features) include age, workclass, education, marital status, occupation, native country, etc. The data contains continuous, binary and categorical features. This dataset is from the "agnostic learning track", i.e. participants have access to a preprocessed numeric representation that eliminates categorical variables, but the identity of the features is not revealed.
Data type: non-sparse
Number of features: 48
Number of examples and check-sums:
Split   Pos_ex  Neg_ex  Tot_ex  Check_sum
Train   1029    3118    4147    6798109.00
Valid   103     312     415     681151.00
This dataset contains samples from both the training and validation sets (4562 examples in total).
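The class balance implied by the counts above can be checked with a short Python sketch. Only the counts come from the table; the dictionary layout and the function names `positive_fraction` and `summarize` are illustrative assumptions, not part of the original dataset files.

```python
# Counts copied from the table above: (positive, negative, total) per split.
COUNTS = {
    "Train": (1029, 3118, 4147),
    "Valid": (103, 312, 415),
}


def positive_fraction(pos, neg):
    """Return the share of positive examples in a split."""
    return pos / (pos + neg)


def summarize(counts):
    """Check each row's total and return the positive fraction per split."""
    summary = {}
    for split, (pos, neg, tot) in counts.items():
        if pos + neg != tot:
            raise ValueError(f"{split}: {pos} + {neg} != {tot}")
        summary[split] = positive_fraction(pos, neg)
    return summary


if __name__ == "__main__":
    for split, frac in summarize(COUNTS).items():
        print(f"{split}: {frac:.1%} positive")
```

Both splits are roughly one quarter positive, so a classifier evaluated on this data should be compared against the majority-class baseline rather than against 50% accuracy.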
1
ARFF
2014-10-06T23:56:15
Public https://test.openml.org/data/v1/download/47/ada_agnostic.arff
47 label study_14 public active
2024-01-10 13:51:02 bdff0c6d2cfd9ce53a823464e136b3bd
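If the 32-character hexadecimal string in the record above is the MD5 digest of the ARFF file (an assumption; the record does not label the field), a downloaded copy could be verified roughly as follows. The function names and the local file name in the usage note are hypothetical.

```python
import hashlib

# Assumed to be the MD5 digest of the ARFF file; copied from the record above.
EXPECTED_MD5 = "bdff0c6d2cfd9ce53a823464e136b3bd"


def md5_of_bytes(data):
    """Return the hex MD5 digest of an in-memory byte string."""
    return hashlib.md5(data).hexdigest()


def md5_of_file(path, chunk_size=1 << 16):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches("ada_agnostic.arff")` would return `True` only if the local file (hypothetical path) hashes to the digest listed in the record.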