OpenML
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
797 instances - 5 features - 6 classes - 0 missing values
Source: Owner of database: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe, volker.lohweg '@' hs-owl.de) Donor of database: Helene Doerksen (University of Applied Sciences,…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
1372 instances - 5 features - 2 classes - 0 missing values
Title: Blood Transfusion Service Center Data Set Abstract: Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem.…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
748 instances - 5 features - 2 classes - 0 missing values
Identifier attribute deleted. NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
125 instances - 5 features - 0 classes - 0 missing values
1. Title: Balance Scale Weight & Distance Database 2. Source Information: (a) Source: Generated to model psychological experiments reported by Siegler, R. S. (1976). Three Aspects of Cognitive…
28 runs - 0 likes - 0 downloads - 0 reach - 0 impact
625 instances - 5 features - 3 classes - 0 missing values
Unit test should be deleted
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 3 classes - 0 missing values
Testing dataset upload when the data is a list of lists
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 6 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5 instances - 6 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
15545 instances - 6 features - 2 classes - 0 missing values
Irish Educational Transitions Data Below are shown data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984),…
10 runs - 0 likes - 0 downloads - 0 reach - 11 impact
500 instances - 6 features - 2 classes - 32 missing values
* Title: Phoneme dataset * Abstract: The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). The class distribution is 3,818 samples in class 0 and 1,586 samples…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
5404 instances - 6 features - 2 classes - 0 missing values
* Dataset: Wilt Data Set * Abstract: High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
4839 instances - 6 features - 2 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
108 instances - 6 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the effects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
40 instances - 7 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16 instances - 7 features - 0 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin Department of Applied Physics Delft University of Technology P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 7 features - 10 classes - 0 missing values
1. Title: Car Evaluation Database 2. Sources: (a) Creator: Marko Bohanec (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past Usage: The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1728 instances - 7 features - 4 classes - 0 missing values
The Monk's Problems: Problem 1 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs - 0 likes - 0 downloads - 0 reach - 12 impact
556 instances - 7 features - 2 classes - 0 missing values
The Monk's Problems: Problem 2 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
601 instances - 7 features - 2 classes - 0 missing values
The Monk's Problems: Problem 3 This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks.…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
554 instances - 7 features - 2 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of an F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9517 instances - 7 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
62 instances - 8 features - 0 classes - 12 missing values
Dataset artificially generated using a first-order theory that describes the structure of ten capital letters of the English alphabet.
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
10218 instances - 8 features - 10 classes - 0 missing values
1. Title of Database: LED display domain 2. Sources: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group: Belmont,…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
500 instances - 8 features - 10 classes - 0 missing values
Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
398 instances - 8 features - 0 classes - 6 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45312 instances - 9 features - 2 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
441 runs - 0 likes - 0 downloads - 0 reach - 0 impact
768 instances - 9 features - 2 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4177 instances - 9 features - 28 classes - 0 missing values
This data set is concerned with the forward kinematics of an 8-link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8192 instances - 9 features - 0 classes - 0 missing values
Data from the Kaggle Amazon Employee Access Challenge: https://www.kaggle.com/c/amazon-employee-access-challenge When an employee at any company starts work, they first need to obtain the computer…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
32769 instances - 10 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
39948 instances - 10 features - 2 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1473 instances - 10 features - 3 classes - 0 missing values
1. Title: Tic-Tac-Toe Endgame database 2. Source Information -- Creator: David W. Aha (aha@cs.jhu.edu) -- Donor: David W. Aha (aha@cs.jhu.edu) -- Date: 19 August 1991 3. Known Past Usage: 1.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
958 instances - 10 features - 2 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
672 instances - 10 features - 2 classes - 1200 missing values
Identification code deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
189 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
699 instances - 10 features - 2 classes - 16 missing values
Diabetes dataset: Ten baseline variables (age, sex, body mass index, average blood pressure, and six blood serum measurements) were obtained for each of n = 442…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
442 instances - 11 features - 0 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
19020 instances - 11 features - 2 classes - 0 missing values
Source: 1. Bendi Venkata Ramana, ramana.bendi '@' gmail.com Associate Professor, Department of Information Technology, Aditya Institute of Technology and Management, Tekkali - 532201, Andhra Pradesh,…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
583 instances - 11 features - 2 classes - 0 missing values
Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
195 instances - 11 features - 0 classes - 2 missing values
In my work on context-sensitive learning, I used the "Deterding Vowel Recognition Data", but I found it necessary to reformulate the data. Implicit in the original data is contextual information on…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
990 instances - 13 features - 11 classes - 0 missing values
1. Summary: This dataset contains attributes of dresses and their recommendations according to their sales. Sales are monitored on alternate days. The attributes analyzed are:…
0 runs - 0 likes - 0 downloads - 0 reach - 11 impact
500 instances - 13 features - 2 classes - 835 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 14 features - 0 classes - 0 missing values