47 ada_agnostic 1 **Author**: **Source**: Unknown - Date unknown **Please cite**: Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF format) ADA is the marketing database The task of ADA is to discover high revenue people from census data. This is a two-class classification problem. The raw data from the census bureau is known as the Adult database in the UCI machine-learning repository. The 14 original attributes (features) include age, workclass, education, marital status, occupation, native country, etc. It contains continuous, binary and categorical features. This dataset is from the "agnostic learning track", i.e. has access to a preprocessed numeric representation eliminating categorical variables, but the identity of the features is not revealed. Data type: non-sparse Number of features: 48 Number of examples and check-sums: Pos_ex Neg_ex Tot_ex Check_sum Train 1029 3118 4147 6798109.00 Valid 103 312 415 681151.00 This dataset contains samples from both training and validation datasets. 1 ARFF 2014-10-06T23:56:15 Public https://test.openml.org/data/v1/download/47/ada_agnostic.arff 47 label study_14 public active 2024-01-10 13:51:02 bdff0c6d2cfd9ce53a823464e136b3bd