OpenML
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
This dataset represents a set of possible advertisements on Internet pages. The features encode the geometry of the image (if available) as well as phrases occurring in the URL, the image's URL and…
Horsepower treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
61 instances - 3 features - 0 classes - 0 missing values
* Title: Breast Cancer Wisconsin (Diagnostic) Data Set (WDBC) * Abstract: Diagnostic Wisconsin Breast Cancer Database * Source: Creators: 1. Dr. William H. Wolberg, General Surgery Dept. University of…
569 instances - 31 features - 2 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
846 instances - 19 features - 4 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
10885 instances - 22 features - 2 classes - 25 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
303 instances - 14 features - 0 classes - 6 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
4562 instances - 49 features - 2 classes - 0 missing values
Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
303 instances - 14 features - 0 classes - 6 missing values
Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty is…
2600 instances - 501 features - 2 classes - 0 missing values
The data directory contains the binary images (masks) of the leaf samples. The colour images are not included. There are three features: Shape, Margin and Texture. As discussed in the paper(s) above.…
1599 instances - 65 features - 100 classes - 0 missing values
* Dataset Title: MicroMass - Pure (pure spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
571 instances - 1301 features - 20 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 48 features - 10 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
19020 instances - 11 features - 2 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 65 features - 10 classes - 0 missing values
1. Title: Credit Approval 2. Sources: (confidential) Submitted by quinlan@cs.su.oz.au 3. Past Usage: See Quinlan, * "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. *…
690 instances - 16 features - 2 classes - 67 missing values
1. Title: Mushroom Database 2. Sources: (a) Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf (b) Donor:…
8124 instances - 23 features - 2 classes - 2480 missing values
* Title: Tamilnadu Electricity Board Hourly Readings Data Set * Abstract: This data can be effectively produced the result to fewer parameter of the Load profile can be reduced in the Database *…
45781 instances - 4 features - 20 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
39948 instances - 10 features - 2 classes - 0 missing values
This data set was generated as follows. 150 subjects spoke the name of each letter of the alphabet twice. Hence, we have 52 training examples from each speaker. The speakers are grouped into sets of…
7797 instances - 618 features - 26 classes - 0 missing values
Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction. Attribute Information: > 1. timestamp:…
540 instances - 38 features - 2 classes - 999 missing values
Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
398 instances - 8 features - 0 classes - 6 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 7 features - 10 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
1109 instances - 22 features - 2 classes - 0 missing values
Description of the German credit dataset. 1. Title: German Credit data 2. Source Information Professor Dr. Hans Hofmann Institut für Statistik und Ökonometrie Universität Hamburg FB…
1000 instances - 21 features - 2 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
1055 instances - 42 features - 2 classes - 0 missing values
No data.
150 instances - 5 features - 4 classes - 31 missing values
1. Title: SPAM E-mail Database 2. Sources: (a) Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304 (b) Donor: George Forman…
4601 instances - 58 features - 2 classes - 0 missing values
Donor: G. Towell, M. Noordewier, and J. Shavlik Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. All examples taken from Genbank 64.1. Categories "ei" and "ie"…
3190 instances - 61 features - 3 classes - 0 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
898 instances - 39 features - 5 classes - 22175 missing values
In my work on context-sensitive learning, I used the "Deterding Vowel Recognition Data", but I found it necessary to reformulate the data. Implicit in the original data is contextual information on…
990 instances - 13 features - 11 classes - 0 missing values
Source: 1. Bendi Venkata Ramana, ramana.bendi '@' gmail.com Associate Professor, Department of Information Technology, Aditya Institute of Technology and Management, Tekkali - 532201, Andhra Pradesh,…
583 instances - 11 features - 2 classes - 0 missing values
Title: Blood Transfusion Service Center Data Set Abstract: Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem.…
748 instances - 5 features - 2 classes - 0 missing values
Dataset artificially generated by using first order theory which describes structure of ten capital letters of English alphabet
10218 instances - 8 features - 10 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
15545 instances - 6 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
2109 instances - 22 features - 2 classes - 0 missing values
libSVM, AAD group. Dataset from the LIBSVM data repository. Preprocessing: transform to two-class
862 instances - 3 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
797 instances - 5 features - 6 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
5620 instances - 65 features - 10 classes - 0 missing values
Additionally, the authors require a citation to one or more publications from those cited as relevant papers. Source: Creators: Renata Cristina Barros Madeo (Madeo, R. C. B.) Priscilla Koch Wagner…
9873 instances - 33 features - 5 classes - 0 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
14980 instances - 15 features - 2 classes - 0 missing values
* Dataset: Wilt Data Set * Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from…
4839 instances - 6 features - 2 classes - 0 missing values
Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
195 instances - 11 features - 0 classes - 2 missing values
This data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
8192 instances - 9 features - 0 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
3468 instances - 971 features - 2 classes - 0 missing values
Data from the Kaggle Amazon Employee Access Challenge: https://www.kaggle.com/c/amazon-employee-access-challenge When an employee at any company starts work, they first need to obtain the computer…
32769 instances - 10 features - 2 classes - 0 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Sydney, Australia. 1987. sick, negative. | classes age: continuous. sex: M, F. on…
3772 instances - 30 features - 2 classes - 6064 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
672 instances - 10 features - 2 classes - 1200 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4…
5456 instances - 25 features - 4 classes - 0 missing values
The Monk's Problems: Problem 2 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
601 instances - 7 features - 2 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
194 instances - 33 features - 0 classes - 0 missing values
The problem is to learn a regression equation/rule/tree to predict the activity from the descriptive structural attributes. The data and methodology are described in detail in: - King, Ross D., Hurst,…
186 instances - 61 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the effects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
40 instances - 7 features - 0 classes - 0 missing values
Title: Human Activity Recognition Using Smartphones Abstract: Human Activity Recognition database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a…
10299 instances - 562 features - 6 classes - 0 missing values
Title: Gas Sensor Array Drift Dataset Data Set Source: Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircuits Institute University of California San Diego San Diego, California, USA Donors of…
13910 instances - 129 features - 6 classes - 0 missing values
Scene recognition dataset Source: Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757-1771, 2004. 1:…
2407 instances - 300 features - 2 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 77 features - 10 classes - 0 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
898 instances - 39 features - 5 classes - 22175 missing values
Tattile, Via Gaetano Donizetti, 1-3-5, 25030 Mairano (Brescia), Italy. * Title: Semeion Handwritten Digit Data Set * Abstract: 1593 handwritten digits from around 80 persons were scanned, stretched in a…
1593 instances - 257 features - 10 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
1600 instances - 65 features - 100 classes - 0 missing values
Source: Owner of database: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe, volker.lohweg '@' hs-owl.de) Donor of database: Helene Doerksen (University of Applied Sciences,…
1372 instances - 5 features - 2 classes - 0 missing values
* Source: Marques de Sá, J.P., jpmdesa '@' gmail.com, Biomedical Engineering Institute, Porto, Portugal. Bernardes, J., joaobern '@' med.up.pt, Faculty of Medicine, University of Porto, Portugal.…
2126 instances - 36 features - 10 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques…
108 instances - 6 features - 0 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
48842 instances - 15 features - 2 classes - 6465 missing values
Source: D. Lucas (ddlucas .at. alum.mit.edu), Lawrence Livermore National Laboratory; R. Klein (rklein .at. astron.berkeley.edu), Lawrence Livermore National Laboratory & U.C. Berkeley; J. Tannahill…
540 instances - 21 features - 2 classes - 0 missing values
b
150 instances - 4 features - 3 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
20000 instances - 17 features - 26 classes - 0 missing values
This is the famous Australian dataset, retrieved 2014-11-14 from the libSVM site. It was normalized. The original version is from…
690 instances - 15 features - 2 classes - 0 missing values
1. Title: Tic-Tac-Toe Endgame database 2. Source Information -- Creator: David W. Aha (aha@cs.jhu.edu) -- Donor: David W. Aha (aha@cs.jhu.edu) -- Date: 19 August 1991 3. Known Past Usage: 1.…
958 instances - 10 features - 2 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 241 features - 10 classes - 0 missing values
Abstract: Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
1080 instances - 82 features - 8 classes - 1396 missing values
Identifier attribute deleted. NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
125 instances - 5 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
841 instances - 71 features - 4 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
1563 instances - 38 features - 2 classes - 0 missing values
All nominal attributes and instances with missing values are deleted. Price treated as the class attribute. As used by…
159 instances - 16 features - 0 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating; (c) its normalized losses in use as…
159 instances - 16 features - 0 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
6598 instances - 168 features - 2 classes - 0 missing values
1. Source: Lee Graham (lee '@' stellaralchemy.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer Science Intelligent Systems Research Unit 1125 Colonel By…
1212 instances - 101 features - 2 classes - 0 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
898 instances - 39 features - 5 classes - 22175 missing values
1. Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were…
2534 instances - 73 features - 2 classes - 0 missing values
Irish Educational Transitions Data Below are shown data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984),…
500 instances - 6 features - 2 classes - 32 missing values
No data.
45312 instances - 9 features - 2 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
6430 instances - 37 features - 6 classes - 0 missing values
Identification code deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
189 instances - 10 features - 0 classes - 0 missing values
Relevant Papers: Laurent Candillier and Vincent Lemaire. Design and Analysis of the Nomao Challenge - Active Learning in the Real-World. In: Proceedings of the ALRA : Active Learning in Real-world…
34465 instances - 119 features - 2 classes - 0 missing values
No data.
699 instances - 10 features - 2 classes - 16 missing values
The Monk's Problems: Problem 3 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
554 instances - 7 features - 2 classes - 0 missing values
Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
418 instances - 19 features - 0 classes - 1239 missing values
This data set is also obtained from the task of controlling the ailerons of a F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
9517 instances - 7 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
52 instances - 3 features - 0 classes - 0 missing values
1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
5500 instances - 41 features - 11 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
2000 instances - 217 features - 10 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
736 instances - 20 features - 5 classes - 448 missing values
1. Title of Database: Pen-Based Recognition of Handwritten Digits 2. Source: E. Alpaydin, F. Alimoglu Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
10992 instances - 17 features - 10 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
1458 instances - 38 features - 2 classes - 0 missing values
This file contains 9 sets of sanitized user data drawn from the command histories of 8 UNIX computer users at Purdue over the course of up to 2 years (USER0 and USER1 were generated by the same…
9100 instances - 3 features - 9 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
13 instances - 14 features - 0 classes - 0 missing values
This data consists of synthetically generated control charts. This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are…
600 instances - 61 features - 6 classes - 0 missing values