Data
Filter results by:
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
8 runs0 likes0 downloads0 reach11 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
11 runs0 likes0 downloads0 reach10 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Data on tree growth used in the Case Study published in the September, 1995 issue of the Canadian Journal of Statistics. This data set was been provided by Dr. Fernando Camacho, Ontario Hydro…
4 runs0 likes0 downloads0 reach11 impact
2796 instances - 34 features - 6 classes - 68100 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
899 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's…
0 runs0 likes0 downloads0 reach0 impact
1460 instances - 80 features - 0 classes - 6965 missing values
XXX
0 runs0 likes0 downloads0 reach0 impact
1460 instances - 80 features - 0 classes - 6965 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
10 runs0 likes0 downloads0 reach12 impact
48842 instances - 15 features - 2 classes - 6465 missing values
; ; Thyroid disease records supplied by the Garavan Institute and J. Ross ; Quinlan, New South Wales Institute, Syndney, Australia. ; ; 1987. ; sick, negative. | classes age: continuous. sex: M, F. on…
5 runs0 likes0 downloads0 reach0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
1. Title: Mushroom Database 2. Sources: (a) Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf (b) Donor:…
13 runs0 likes0 downloads0 reach1 impact
8124 instances - 23 features - 2 classes - 2480 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
3 runs0 likes0 downloads0 reach2 impact
683 instances - 36 features - 19 classes - 2337 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
Abstract: Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
17 runs0 likes0 downloads0 reach11 impact
1080 instances - 82 features - 8 classes - 1396 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
0 runs0 likes0 downloads0 reach0 impact
418 instances - 19 features - 0 classes - 1239 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
10 runs0 likes0 downloads0 reach11 impact
672 instances - 10 features - 2 classes - 1200 missing values
Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction. Attribute Information: > 1. timestamp:…
10 runs0 likes0 downloads0 reach11 impact
540 instances - 38 features - 2 classes - 999 missing values
This is a test dataset
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - 0 classes - 866 missing values
This is a test dataset
0 runs0 likes0 downloads0 reach0 impact
891 instances - 12 features - 0 classes - 866 missing values
####1. Summary This dataset contain attributes of dresses and their recommendations according to their sales.Sales are monitor on the basis of alternate days. The attributes present analyzed are:…
9 runs0 likes0 downloads0 reach11 impact
500 instances - 13 features - 2 classes - 835 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
10 runs0 likes0 downloads0 reach0 impact
736 instances - 20 features - 5 classes - 448 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
Asteroid Dataset
0 runs0 likes0 downloads0 reach0 impact
126131 instances - 34 features - 2 classes - 99 missing values
1. Title: Credit Approval 2. Sources: (confidential) Submitted by quinlan@cs.su.oz.au 3. Past Usage: See Quinlan, * "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. *…
39 runs0 likes0 downloads0 reach0 impact
690 instances - 16 features - 2 classes - 67 missing values
Irish Educational Transitions Data Below are shown data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984),…
15 runs0 likes0 downloads0 reach11 impact
500 instances - 6 features - 2 classes - 32 missing values
No data.
0 runs0 likes0 downloads0 reach1 impact
150 instances - 5 features - 4 classes - 31 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
5 runs0 likes0 downloads0 reach11 impact
10885 instances - 22 features - 2 classes - 25 missing values
No data.
10 runs0 likes0 downloads0 reach0 impact
699 instances - 10 features - 2 classes - 16 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs0 likes0 downloads0 reach0 impact
62 instances - 8 features - 0 classes - 12 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs0 likes0 downloads0 reach0 impact
398 instances - 8 features - 0 classes - 6 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
0 runs0 likes0 downloads0 reach0 impact
195 instances - 11 features - 0 classes - 2 missing values
Synthetic dataset created from a NumPy array
0 runs0 likes0 downloads0 reach0 impact
3 instances - 4 features - 0 classes - 0 missing values
Synthetic dataset created from a NumPy array
0 runs0 likes0 downloads0 reach0 impact
3 instances - 4 features - 0 classes - 0 missing values
Synthetic dataset created from a NumPy array
0 runs0 likes0 downloads0 reach0 impact
3 instances - 4 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 2 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Testing dataset upload when the data is a list of lists
0 runs0 likes0 downloads0 reach0 impact
14 instances - 6 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 4 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 4 features - 2 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach0 impact
442 instances - 11 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 2 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Testing dataset upload when the data is a list of lists
0 runs0 likes0 downloads0 reach0 impact
14 instances - 6 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 6 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 4 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 2 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
Testing dataset upload when the data is a list of lists
0 runs0 likes0 downloads0 reach0 impact
14 instances - 6 features - 2 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 6 features - 2 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach0 impact
442 instances - 11 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 26 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 6 features - 2 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 4 features - 2 classes - 0 missing values
.. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442…
0 runs0 likes0 downloads0 reach0 impact
442 instances - 11 features - 0 classes - 0 missing values
Dataset representing the XOR operation
0 runs0 likes0 downloads0 reach0 impact
4 instances - 3 features - 0 classes - 0 missing values
Synthetic dataset created from a Pandas DataFrame
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - 0 classes - 0 missing values