Data
Filter results by:
* Source: Marques de Sá, J.P., jpmdesa '@' gmail.com, Biomedical Engineering Institute, Porto, Portugal. Bernardes, J., joaobern '@' med.up.pt, Faculty of Medicine, University of Porto, Portugal.…
0 runs0 likes0 downloads0 reach11 impact
2126 instances - 36 features - 10 classes - 0 missing values
Title: Gas Sensor Array Drift Dataset Data Set Source: Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircutis Institute University of California San Diego San Diego, California, USA Donors of…
0 runs0 likes0 downloads0 reach11 impact
13910 instances - 129 features - 6 classes - 0 missing values
Source: 1. Bendi Venkata Ramana, ramana.bendi '@' gmail.com Associate Professor, Department of Information Technology, Aditya Instutute of Technology and Management, Tekkali - 532201, Andhra Pradesh,…
0 runs0 likes0 downloads0 reach11 impact
583 instances - 11 features - 2 classes - 0 missing values
* Title: Tamilnadu Electricity Board Hourly Readings Data Set * Abstract: This data can be effectively produced the result to fewer parameter of the Load profile can be reduced in the Database *…
0 runs0 likes0 downloads0 reach11 impact
45781 instances - 4 features - 20 classes - 0 missing values
In my work on context-sensitive learning, I used the "Deterding Vowel Recognition Data", but I found it necessary to reformulate the data. Implicit in the original data is contextual information on…
0 runs0 likes0 downloads0 reach11 impact
990 instances - 13 features - 11 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4…
0 runs0 likes0 downloads0 reach11 impact
5456 instances - 25 features - 4 classes - 0 missing values
Data from the Kaggle Amazon Employee Access Challenge: https://www.kaggle.com/c/amazon-employee-access-challenge When an employee at any company starts work, they first need to obtain the computer…
0 runs0 likes0 downloads0 reach11 impact
32769 instances - 10 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
0 runs0 likes0 downloads0 reach11 impact
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs0 likes0 downloads0 reach11 impact
39948 instances - 10 features - 2 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
0 runs0 likes0 downloads0 reach11 impact
1458 instances - 38 features - 2 classes - 0 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
0 runs0 likes0 downloads0 reach11 impact
6118 instances - 52 features - 6 classes - 0 missing values
The data directory contains the binary images (masks) of the leaf samples. The colour images are not included. There are three features: Shape, Margin and Texture. As discussed in the paper(s) above.…
0 runs0 likes0 downloads0 reach11 impact
1599 instances - 65 features - 100 classes - 0 missing values
This is the famous Australian dataset, retrieved 2014-11-14 from the libSVM site. It was normalized. The original version is from…
0 runs0 likes0 downloads0 reach11 impact
690 instances - 15 features - 2 classes - 0 missing values
The Monk's Problems: Problem 2 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach11 impact
601 instances - 7 features - 2 classes - 0 missing values
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers. The data was collected for examining our newly developed classifier for multidimensional curves…
0 runs0 likes0 downloads0 reach11 impact
9961 instances - 15 features - 9 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
0 runs0 likes0 downloads0 reach11 impact
10885 instances - 22 features - 2 classes - 25 missing values
Source: Patrick Marques Ciarelli, pciarelli '@' lcad.inf.ufes.br, Department of Electrical Engineering, Federal University of Espirito Santo Elias Oliveira, elias '@' lcad.inf.ufes.br, Department of…
0 runs0 likes0 downloads0 reach11 impact
1080 instances - 857 features - 9 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
0 runs0 likes0 downloads0 reach11 impact
1055 instances - 42 features - 2 classes - 0 missing values
Additionally, the authors require a citation to one or more publications from those cited as relevant papers. Source: Creators: Renata Cristina Barros Madeo (Madeo, R. C. B.) Priscilla Koch Wagner…
0 runs0 likes0 downloads0 reach11 impact
9873 instances - 33 features - 5 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. * Title: Semeion Handwritten Digit Data Set * Abstract: 1593 handwritten digits from around 80 persons were scanned, stretched in a…
0 runs0 likes0 downloads0 reach11 impact
1593 instances - 257 features - 10 classes - 0 missing values
* Title: Breast Cancer Wisconsin (Diagnostic) Data Set (WDBC) * Abstract: Diagnostic Wisconsin Breast Cancer Database * Source: Creators: 1. Dr. William H. Wolberg, General Surgery Dept. University of…
0 runs0 likes0 downloads0 reach11 impact
569 instances - 31 features - 2 classes - 0 missing values
Abstract: Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
0 runs0 likes0 downloads0 reach11 impact
1080 instances - 82 features - 8 classes - 1396 missing values
1. Title of Database: LED display domain 2. Sources: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group: Belmont,…
0 runs0 likes0 downloads0 reach11 impact
500 instances - 8 features - 10 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
0 runs0 likes0 downloads0 reach11 impact
4562 instances - 49 features - 2 classes - 0 missing values
1. Source: Lee Graham (lee '@' stellaralchemy.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer Science Intelligent Systems Research Unit 1125 Colonel By…
0 runs0 likes0 downloads0 reach11 impact
1212 instances - 101 features - 2 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
0 runs0 likes0 downloads0 reach11 impact
48842 instances - 15 features - 2 classes - 6465 missing values
Scene recognition dataset Source: Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757-1771, 2004. 1:…
0 runs0 likes0 downloads0 reach11 impact
2407 instances - 300 features - 2 classes - 0 missing values
This data set was generated as follows. 150 subjects spoke the name of each letter of the alphabet twice. Hence, we have 52 training examples from each speaker. The speakers are grouped into sets of…
0 runs0 likes0 downloads0 reach11 impact
7797 instances - 618 features - 26 classes - 0 missing values
1 . Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were…
0 runs0 likes0 downloads0 reach11 impact
2534 instances - 73 features - 2 classes - 0 missing values
1. One-hundred plant species leaves data set (class = shape). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images are…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
Data from the Kaggle Bioresponse challenge: https://www.kaggle.com/c/bioresponse The objective of the competition is to help us build as good a model as possible so that we can, as optimally as this…
0 runs0 likes0 downloads0 reach11 impact
3751 instances - 1777 features - 2 classes - 0 missing values
(www.semeion.it) * Title: Steel Plates Faults Data Set * Abstract: A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern…
0 runs0 likes0 downloads0 reach11 impact
1941 instances - 34 features - 2 classes - 0 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales.Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
0 runs0 likes0 downloads0 reach11 impact
500 instances - 13 features - 2 classes - 835 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
797 instances - 5 features - 6 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach11 impact
841 instances - 71 features - 4 classes - 0 missing values
Available at: [pdf] http://hdl.handle.net/1822/14838 [bib] http://www3.dsi.uminho.pt/pcortez/bib/2011-esm-1.txt 1. Title: Bank Marketing 2. Sources Created by: Paulo Cortez (Univ. Minho) and Sérgio…
0 runs0 likes0 downloads0 reach11 impact
45211 instances - 17 features - 2 classes - 0 missing values
This data consists of synthetically generated control charts. This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are…
0 runs0 likes0 downloads0 reach11 impact
600 instances - 61 features - 6 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
0 runs0 likes0 downloads0 reach11 impact
672 instances - 10 features - 2 classes - 1200 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
0 runs0 likes0 downloads0 reach11 impact
2109 instances - 22 features - 2 classes - 0 missing values
Relevant Papers: Laurent Candillier and Vincent Lemaire. Design and Analysis of the Nomao Challenge - Active Learning in the Real-World. In: Proceedings of the ALRA : Active Learning in Real-world…
0 runs0 likes0 downloads0 reach11 impact
34465 instances - 119 features - 2 classes - 0 missing values
* Title: Phoneme dataset * Abstract: The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). The class distribution is 3,818 samples in class 0 and 1,586 samples…
0 runs0 likes0 downloads0 reach11 impact
5404 instances - 6 features - 2 classes - 0 missing values
* Dataset: Wilt Data Set * Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from…
0 runs0 likes0 downloads0 reach11 impact
4839 instances - 6 features - 2 classes - 0 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
64 runs0 likes0 downloads0 reach11 impact
14980 instances - 15 features - 2 classes - 0 missing values
The following are data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices…
31 runs0 likes0 downloads0 reach11 impact
500 instances - 22 features - 15 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
31 runs0 likes0 downloads0 reach11 impact
522 instances - 22 features - 2 classes - 0 missing values
The Monk's Problems: Problem 3 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach12 impact
554 instances - 7 features - 2 classes - 0 missing values