OpenML
Filter results by:
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
0 runs0 likes0 downloads0 reach11 impact
9961 instances - 15 features - 9 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
841 instances - 71 features - 4 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
0 runs0 likes0 downloads0 reach11 impact
10885 instances - 22 features - 2 classes - 25 missing values
This data consists of synthetically generated control charts. This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are…
0 runs0 likes0 downloads0 reach11 impact
600 instances - 61 features - 6 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
0 runs0 likes0 downloads0 reach11 impact
1109 instances - 22 features - 2 classes - 0 missing values
Source: Owner of database: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe, volker.lohweg '@' hs-owl.de) Donor of database: Helene Doerksen (University of Applied Sciences,…
0 runs0 likes0 downloads0 reach11 impact
1372 instances - 5 features - 2 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
* Dataset Title: MicroMass - Pure (pure spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
0 runs0 likes0 downloads0 reach11 impact
571 instances - 1301 features - 20 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
0 runs0 likes0 downloads0 reach11 impact
3468 instances - 971 features - 2 classes - 0 missing values
* Dataset: Wilt Data Set * Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from…
0 runs0 likes0 downloads0 reach11 impact
4839 instances - 6 features - 2 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
0 runs0 likes0 downloads0 reach11 impact
48842 instances - 15 features - 2 classes - 6465 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
0 runs0 likes0 downloads0 reach11 impact
2796 instances - 34 features - 6 classes - 68100 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
0 runs0 likes0 downloads0 reach11 impact
14395 instances - 217 features - 2 classes - 0 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs0 likes0 downloads0 reach11 impact
15545 instances - 6 features - 2 classes - 0 missing values
Title: Gas Sensor Array Drift Dataset Data Set Source: Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircutis Institute University of California San Diego San Diego, California, USA Donors of…
0 runs0 likes0 downloads0 reach11 impact
13910 instances - 129 features - 6 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4…
0 runs0 likes0 downloads0 reach11 impact
5456 instances - 25 features - 4 classes - 0 missing values
####1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
0 runs0 likes0 downloads0 reach11 impact
5500 instances - 41 features - 11 classes - 0 missing values
The following are data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices…
65 runs0 likes0 downloads0 reach11 impact
500 instances - 22 features - 15 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
65 runs0 likes0 downloads0 reach11 impact
522 instances - 22 features - 2 classes - 0 missing values
(www.semeion.it) * Title: Steel Plates Faults Data Set * Abstract: A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern…
0 runs0 likes0 downloads0 reach12 impact
1941 instances - 34 features - 2 classes - 0 missing values
The Monk's Problems: Problem 1 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach12 impact
556 instances - 7 features - 2 classes - 0 missing values
Source: Rami Mustafa A Mohammad ( University of Huddersfield, rami.mohammad '@' hud.ac.uk, rami.mustafa.a '@' gmail.com) Lee McCluskey (University of Huddersfield,t.l.mccluskey '@' hud.ac.uk ) Fadi…
0 runs0 likes0 downloads0 reach12 impact
11055 instances - 31 features - 2 classes - 0 missing values
Scene recognition dataset Source: Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757-1771, 2004. 1:…
0 runs0 likes0 downloads0 reach12 impact
2407 instances - 300 features - 2 classes - 0 missing values