Data
Filter results by:
Data from the Kaggle Bioresponse challenge: https://www.kaggle.com/c/bioresponse The objective of the competition is to help us build as good a model as possible so that we can, as optimally as this…
8 runs0 likes0 downloads0 reach11 impact
3751 instances - 1777 features - 2 classes - 0 missing values
* Dataset Title: MicroMass - Pure (pure spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
9 runs0 likes0 downloads0 reach11 impact
571 instances - 1301 features - 20 classes - 0 missing values
testing upload criteria
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
5 instances - 1143 features - classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
8 runs0 likes0 downloads0 reach11 impact
3468 instances - 971 features - 2 classes - 0 missing values
Source: Patrick Marques Ciarelli, pciarelli '@' lcad.inf.ufes.br, Department of Electrical Engineering, Federal University of Espirito Santo Elias Oliveira, elias '@' lcad.inf.ufes.br, Department of…
11 runs0 likes0 downloads0 reach11 impact
1080 instances - 857 features - 9 classes - 0 missing values
This data set was generated as follows. 150 subjects spoke the name of each letter of the alphabet twice. Hence, we have 52 training examples from each speaker. The speakers are grouped into sets of…
4 runs0 likes0 downloads0 reach11 impact
7797 instances - 618 features - 26 classes - 0 missing values
Title: Human Activity Recognition Using Smartphones Abstract: Human Activity Recognition database built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a…
5 runs0 likes0 downloads0 reach11 impact
10299 instances - 562 features - 6 classes - 0 missing values
Abstract: MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty is…
8 runs0 likes0 downloads0 reach11 impact
2600 instances - 501 features - 2 classes - 0 missing values
XXX
0 runs0 likes0 downloads0 reach0 impact
4209 instances - 377 features - 0 classes - 0 missing values
Scene recognition dataset Source: Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. Learning multi-label scene classification. Pattern Recognition, 37(9):1757-1771, 2004. 1:…
17 runs0 likes0 downloads0 reach11 impact
2407 instances - 300 features - 2 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. * Title: Semeion Handwritten Digit Data Set * Abstract: 1593 handwritten digits from around 80 persons were scanned, stretched in a…
7 runs0 likes0 downloads0 reach11 impact
1593 instances - 257 features - 10 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 217 features - 10 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
5 runs0 likes0 downloads0 reach0 impact
2000 instances - 217 features - 10 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
12 runs0 likes0 downloads0 reach11 impact
14395 instances - 217 features - 2 classes - 0 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
8 runs0 likes0 downloads0 reach11 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
11 runs0 likes0 downloads0 reach10 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
10 runs0 likes0 downloads0 reach11 impact
6598 instances - 168 features - 2 classes - 0 missing values
Title: Gas Sensor Array Drift Dataset Data Set Source: Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircutis Institute University of California San Diego San Diego, California, USA Donors of…
13 runs0 likes0 downloads0 reach11 impact
13910 instances - 129 features - 6 classes - 0 missing values
1. Source: Lee Graham (lee '@' stellaralchemy.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer Science Intelligent Systems Research Unit 1125 Colonel By…
7 runs0 likes0 downloads0 reach11 impact
1212 instances - 101 features - 2 classes - 0 missing values
Relevant Papers: Laurent Candillier and Vincent Lemaire. Design and Analysis of the Nomao Challenge - Active Learning in the Real-World. In: Proceedings of the ALRA : Active Learning in Real-world…
5 runs0 likes0 downloads0 reach11 impact
34465 instances - 119 features - 2 classes - 0 missing values
Abstract: Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
17 runs0 likes0 downloads0 reach11 impact
1080 instances - 82 features - 8 classes - 1396 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
28 runs0 likes0 downloads0 reach0 impact
2000 instances - 77 features - 10 classes - 0 missing values
1 . Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were…
7 runs0 likes0 downloads0 reach11 impact
2534 instances - 73 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
14 runs0 likes0 downloads0 reach11 impact
841 instances - 71 features - 4 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
7 runs0 likes0 downloads0 reach0 impact
2000 instances - 65 features - 10 classes - 0 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
6 runs0 likes0 downloads0 reach0 impact
5620 instances - 65 features - 10 classes - 0 missing values
The data directory contains the binary images (masks) of the leaf samples. The colour images are not included. There are three features: Shape, Margin and Texture. As discussed in the paper(s) above.…
8 runs0 likes0 downloads0 reach11 impact
1599 instances - 65 features - 100 classes - 0 missing values
1. One-hundred plant species leaves data set (class = shape). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images are…
11 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
17 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
The problem is to learn a regression equation/rule/tree to predict the activity from the descriptive structural attributes. The data and methodology is described in detail in: - King, Ross .D., Hurst,…
0 runs0 likes0 downloads0 reach0 impact
186 instances - 61 features - 0 classes - 0 missing values
This data consists of synthetically generated control charts. This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are…
12 runs0 likes0 downloads0 reach11 impact
600 instances - 61 features - 6 classes - 0 missing values
1. Title: SPAM E-mail Database 2. Sources: (a) Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304 (b) Donor: George Forman…
9 runs0 likes0 downloads0 reach0 impact
4601 instances - 58 features - 2 classes - 0 missing values
Source: James P Bridge, Sean B Holden and Lawrence C Paulson University of Cambridge Computer Laboratory William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD UK +44 (0)1223 763500…
4 runs0 likes0 downloads0 reach11 impact
6118 instances - 52 features - 6 classes - 0 missing values
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 49 features - 0 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
15 runs0 likes0 downloads0 reach11 impact
4562 instances - 49 features - 2 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
7 runs0 likes0 downloads0 reach0 impact
2000 instances - 48 features - 10 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
10 runs0 likes0 downloads0 reach11 impact
1055 instances - 42 features - 2 classes - 0 missing values
1. Title: Waveform Database Generator (written in C) 2. Source: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
9 runs0 likes0 downloads0 reach0 impact
5000 instances - 41 features - 3 classes - 0 missing values
####1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
11 runs0 likes0 downloads0 reach11 impact
5500 instances - 41 features - 11 classes - 0 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs0 likes0 downloads0 reach0 impact
1460 instances - 80 features - 0 classes - 6965 missing values
XXX
0 runs0 likes0 downloads0 reach0 impact
1460 instances - 80 features - 0 classes - 6965 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
11 runs0 likes0 downloads0 reach11 impact
1563 instances - 38 features - 2 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
8 runs0 likes0 downloads0 reach11 impact
1458 instances - 38 features - 2 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
7 runs0 likes0 downloads0 reach0 impact
6430 instances - 37 features - 6 classes - 0 missing values
* Source: Marques de Sá, J.P., jpmdesa '@' gmail.com, Biomedical Engineering Institute, Porto, Portugal. Bernardes, J., joaobern '@' med.up.pt, Faculty of Medicine, University of Porto, Portugal.…
10 runs0 likes0 downloads0 reach11 impact
2126 instances - 36 features - 10 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
0 runs0 likes0 downloads0 reach0 impact
194 instances - 33 features - 0 classes - 0 missing values
(www.semeion.it) * Title: Steel Plates Faults Data Set * Abstract: A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern…
11 runs0 likes0 downloads0 reach11 impact
1941 instances - 34 features - 2 classes - 0 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Additionally, the authors require a citation to one or more publications from those cited as relevant papers. Source: Creators: Renata Cristina Barros Madeo (Madeo, R. C. B.) Priscilla Koch Wagner…
6 runs0 likes0 downloads0 reach11 impact
9873 instances - 33 features - 5 classes - 0 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
4 runs0 likes0 downloads0 reach11 impact
2796 instances - 34 features - 6 classes - 68100 missing values
* Title: Breast Cancer Wisconsin (Diagnostic) Data Set (WDBC) * Abstract: Diagnostic Wisconsin Breast Cancer Database * Source: Creators: 1. Dr. William H. Wolberg, General Surgery Dept. University of…
321 runs0 likes0 downloads0 reach11 impact
569 instances - 31 features - 2 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4…
10 runs0 likes0 downloads0 reach11 impact
5456 instances - 25 features - 4 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 22 features - 0 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
12 runs0 likes0 downloads0 reach11 impact
1109 instances - 22 features - 2 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
6 runs0 likes0 downloads0 reach11 impact
2109 instances - 22 features - 2 classes - 0 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
5 runs0 likes0 downloads0 reach11 impact
10885 instances - 22 features - 2 classes - 25 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
7 runs0 likes0 downloads0 reach11 impact
522 instances - 22 features - 2 classes - 0 missing values
Source: D. Lucas (ddlucas .at. alum.mit.edu), Lawrence Livermore National Laboratory; R. Klein (rklein .at. astron.berkeley.edu), Lawrence Livermore National Laboratory & U.C. Berkeley; J. Tannahill…
11 runs0 likes0 downloads0 reach11 impact
540 instances - 21 features - 2 classes - 0 missing values
1. Title: Image Segmentation data 2. Source Information -- Creators: Vision Group, University of Massachusetts -- Donor: Vision Group (Carla Brodley, brodley@cs.umass.edu) -- Date: November, 1990 3.…
9 runs0 likes0 downloads0 reach0 impact
2310 instances - 20 features - 7 classes - 0 missing values
The following are data used in an analysis of the Brown and Frown corpora for my doctoral dissertation titled ``Variations in Written English: Characterizing Authors' Rhetorical Language Choices…
11 runs0 likes0 downloads0 reach11 impact
500 instances - 22 features - 15 classes - 0 missing values
Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction. Attribute Information: > 1. timestamp:…
10 runs0 likes0 downloads0 reach11 impact
540 instances - 38 features - 2 classes - 999 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
8 runs0 likes0 downloads0 reach0 impact
846 instances - 19 features - 4 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values