Study
Test-Suite

Test-Suite

Created 11-01-2024 by Test Test Visibility: public
Search these data sets in more detail
Data from the Kaggle Amazon Employee Access Challenge: https://www.kaggle.com/c/amazon-employee-access-challenge When an employee at any company starts work, they first need to obtain the computer…
0 runs0 likes0 downloads0 reach11 impact
32769 instances - 10 features - 2 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
0 runs0 likes0 downloads0 reach11 impact
48842 instances - 15 features - 2 classes - 6465 missing values
* Title: Breast Cancer Wisconsin (Diagnostic) Data Set (WDBC) * Abstract: Diagnostic Wisconsin Breast Cancer Database * Source: Creators: 1. Dr. William H. Wolberg, General Surgery Dept. University of…
0 runs0 likes0 downloads0 reach11 impact
569 instances - 31 features - 2 classes - 0 missing values
1. One-hundred plant species leaves data set (class = shape). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images are…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
1 . Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were…
0 runs0 likes0 downloads0 reach11 impact
2534 instances - 73 features - 2 classes - 0 missing values
Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty is…
0 runs0 likes0 downloads0 reach11 impact
2600 instances - 501 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs0 likes0 downloads0 reach11 impact
15545 instances - 6 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
0 runs0 likes0 downloads0 reach11 impact
3468 instances - 971 features - 2 classes - 0 missing values
The following are data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices…
65 runs0 likes0 downloads0 reach11 impact
500 instances - 22 features - 15 classes - 0 missing values
The Monk's Problems: Problem 2 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach11 impact
601 instances - 7 features - 2 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identification code deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs0 likes0 downloads0 reach0 impact
189 instances - 10 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of a F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
0 runs0 likes0 downloads0 reach0 impact
9517 instances - 7 features - 0 classes - 0 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs0 likes0 downloads0 reach0 impact
61 instances - 3 features - 0 classes - 0 missing values
1. Title: Tic-Tac-Toe Endgame database 2. Source Information -- Creator: David W. Aha (aha@cs.jhu.edu) -- Donor: David W. Aha (aha@cs.jhu.edu) -- Date: 19 August 1991 3. Known Past Usage: 1.…
0 runs0 likes0 downloads0 reach0 impact
958 instances - 10 features - 2 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
0 runs0 likes0 downloads0 reach0 impact
5620 instances - 65 features - 10 classes - 0 missing values
1. Title: Car Evaluation Database 2. Sources: (a) Creator: Marko Bohanec (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past Usage: The…
0 runs0 likes0 downloads0 reach3 impact
1728 instances - 7 features - 4 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 217 features - 10 classes - 0 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values