OpenML
Filter results by:
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
0 runs0 likes0 downloads0 reach10 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales.Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
0 runs0 likes0 downloads0 reach11 impact
500 instances - 13 features - 2 classes - 835 missing values
####1. Summary This database was generated by the Laboratory of Image Processing and Pattern Recognition (INPG-LTIRF) in the development of the Esprit project ELENA No. 6891 and the Esprit working…
0 runs0 likes0 downloads0 reach11 impact
5500 instances - 41 features - 11 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
0 runs0 likes0 downloads0 reach11 impact
3468 instances - 971 features - 2 classes - 0 missing values
1 . Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were…
0 runs0 likes0 downloads0 reach11 impact
2534 instances - 73 features - 2 classes - 0 missing values
1. Title: Image Segmentation data 2. Source Information -- Creators: Vision Group, University of Massachusetts -- Donor: Vision Group (Carla Brodley, brodley@cs.umass.edu) -- Date: November, 1990 3.…
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 20 features - 7 classes - 0 missing values
Source: Rami Mustafa A Mohammad ( University of Huddersfield, rami.mohammad '@' hud.ac.uk, rami.mustafa.a '@' gmail.com) Lee McCluskey (University of Huddersfield,t.l.mccluskey '@' hud.ac.uk ) Fadi…
0 runs0 likes0 downloads0 reach11 impact
11055 instances - 31 features - 2 classes - 0 missing values
1. One-hundred plant species leaves data set (class = margin). 2. Sources: (a) Original owners of colour Leaves Samples: James Cope, Thibaut Beghin, Paolo Remagnino, Sarah Barman. The colour images…
0 runs0 likes0 downloads0 reach11 impact
1600 instances - 65 features - 100 classes - 0 missing values
The multi-feature digit dataset ------------------------------- Oowned and donated by: ---------------------- Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 217 features - 10 classes - 0 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable,…
0 runs0 likes0 downloads0 reach11 impact
1458 instances - 38 features - 2 classes - 0 missing values
1. Title: Mushroom Database 2. Sources: (a) Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf (b) Donor:…
0 runs0 likes0 downloads0 reach0 impact
8124 instances - 23 features - 2 classes - 2480 missing values
This data set was generated as follows. 150 subjects spoke the name of each letter of the alphabet twice. Hence, we have 52 training examples from each speaker. The speakers are grouped into sets of…
0 runs0 likes0 downloads0 reach11 impact
7797 instances - 618 features - 26 classes - 0 missing values
The Monk's Problems: Problem 2 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach11 impact
601 instances - 7 features - 2 classes - 0 missing values
1. Title: SPAM E-mail Database 2. Sources: (a) Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304 (b) Donor: George Forman…
0 runs0 likes0 downloads0 reach0 impact
4601 instances - 58 features - 2 classes - 0 missing values
1. Title: Waveform Database Generator (written in C) 2. Source: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 41 features - 3 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
0 runs0 likes0 downloads0 reach0 impact
736 instances - 20 features - 5 classes - 448 missing values
Irish Educational Transitions Data Below are shown data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984),…
0 runs0 likes0 downloads0 reach11 impact
500 instances - 6 features - 2 classes - 32 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
0 runs0 likes0 downloads0 reach11 impact
2109 instances - 22 features - 2 classes - 0 missing values
(www.semeion.it) * Title: Steel Plates Faults Data Set * Abstract: A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern…
0 runs0 likes0 downloads0 reach11 impact
1941 instances - 34 features - 2 classes - 0 missing values
* Title: Tamilnadu Electricity Board Hourly Readings Data Set * Abstract: This data can be effectively produced the result to fewer parameter of the Load profile can be reduced in the Database *…
0 runs0 likes0 downloads0 reach11 impact
45781 instances - 4 features - 20 classes - 0 missing values
1. Title of Database: LED display domain 2. Sources: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group: Belmont,…
0 runs0 likes0 downloads0 reach11 impact
500 instances - 8 features - 10 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
45312 instances - 9 features - 2 classes - 0 missing values
Available at: [pdf] http://hdl.handle.net/1822/14838 [bib] http://www3.dsi.uminho.pt/pcortez/bib/2011-esm-1.txt 1. Title: Bank Marketing 2. Sources Created by: Paulo Cortez (Univ. Minho) and Sérgio…
0 runs0 likes0 downloads0 reach11 impact
45211 instances - 17 features - 2 classes - 0 missing values
This is the famous Australian dataset, retrieved 2014-11-14 from the libSVM site. It was normalized. The original version is from…
0 runs0 likes0 downloads0 reach11 impact
690 instances - 15 features - 2 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
0 runs0 likes0 downloads0 reach11 impact
672 instances - 10 features - 2 classes - 1200 missing values
The Monk's Problems: Problem 1 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs0 likes0 downloads0 reach11 impact
556 instances - 7 features - 2 classes - 0 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
0 runs0 likes0 downloads0 reach11 impact
2796 instances - 34 features - 6 classes - 68100 missing values
%-*- text -*- %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
0 runs0 likes0 downloads0 reach11 impact
1109 instances - 22 features - 2 classes - 0 missing values
Relevant Papers: Laurent Candillier and Vincent Lemaire. Design and Analysis of the Nomao Challenge - Active Learning in the Real-World. In: Proceedings of the ALRA : Active Learning in Real-world…
0 runs0 likes0 downloads0 reach11 impact
34465 instances - 119 features - 2 classes - 0 missing values
* Title: Breast Cancer Wisconsin (Diagnostic) Data Set (WDBC) * Abstract: Diagnostic Wisconsin Breast Cancer Database * Source: Creators: 1. Dr. William H. Wolberg, General Surgery Dept. University of…
0 runs0 likes0 downloads0 reach11 impact
569 instances - 31 features - 2 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
0 runs0 likes0 downloads0 reach0 impact
16 instances - 7 features - 0 classes - 0 missing values