Study
Test-Suite

Test-Suite

Created 17-10-2024 by Test Test Visibility: public
Search these data sets in more detail
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
0 runs0 likes0 downloads0 reach11 impact
1055 instances - 42 features - 2 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
Source: D. Lucas (ddlucas .at. alum.mit.edu), Lawrence Livermore National Laboratory; R. Klein (rklein .at. astron.berkeley.edu), Lawrence Livermore National Laboratory & U.C. Berkeley; J. Tannahill…
0 runs0 likes0 downloads0 reach11 impact
540 instances - 21 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
0 runs0 likes0 downloads0 reach11 impact
6598 instances - 168 features - 2 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
0 runs0 likes0 downloads0 reach11 impact
10885 instances - 22 features - 2 classes - 25 missing values
The Monk's Problems: Problem 3 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach12 impact
554 instances - 7 features - 2 classes - 0 missing values
The Monk's Problems: Problem 1 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach11 impact
556 instances - 7 features - 2 classes - 0 missing values
This data set was generated as follows. 150 subjects spoke the name of each letter of the alphabet twice. Hence, we have 52 training examples from each speaker. The speakers are grouped into sets of…
0 runs0 likes0 downloads0 reach11 impact
7797 instances - 618 features - 26 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques…
0 runs0 likes0 downloads0 reach0 impact
108 instances - 6 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
0 runs0 likes0 downloads0 reach0 impact
13 instances - 14 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! All nominal attributes and instances with missing values are deleted. Price treated as the class attribute. As used by…
0 runs0 likes0 downloads0 reach0 impact
159 instances - 16 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs0 likes0 downloads0 reach0 impact
52 instances - 3 features - 0 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
0 runs0 likes0 downloads0 reach0 impact
6430 instances - 37 features - 6 classes - 0 missing values
1. Title: SPAM E-mail Database 2. Sources: (a) Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304 (b) Donor: George Forman…
0 runs0 likes0 downloads0 reach0 impact
4601 instances - 58 features - 2 classes - 0 missing values
1. Title: Image Segmentation data 2. Source Information -- Creators: Vision Group, University of Massachusetts -- Donor: Vision Group (Carla Brodley, brodley@cs.umass.edu) -- Date: November, 1990 3.…
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 20 features - 7 classes - 0 missing values
1. Title of Database: Pen-Based Recognition of Handwritten Digits 2. Source: E. Alpaydin, F. Alimoglu Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 17 features - 10 classes - 0 missing values
1. Title: Credit Approval 2. Sources: (confidential) Submitted by quinlan@cs.su.oz.au 3. Past Usage: See Quinlan, * "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. *…
38 runs0 likes0 downloads0 reach0 impact
690 instances - 16 features - 2 classes - 67 missing values