Jan van Rijn
Jan's datasets

No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5810 instances - 3 features - 3894 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 4 classes - 31 missing values
Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
195 instances - 11 features - 0 classes - 2 missing values
This data set concerns the study of the factors affecting patterns of insulin-dependent diabetes mellitus in children. The objective is to investigate the dependence of the level of serum C-peptide on…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
43 instances - 3 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) The infamous Longley data, "An appraisal of least-squares programs from the point of view of the user", JASA, 62(1967) p819-841. Variables are: Number of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16 instances - 7 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) These data are those collected in a cloud-seeding experiment in Tasmania between mid-1964 and January 1971. Their analysis, using regression techniques…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
108 instances - 6 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
22 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2178 instances - 4 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 14 features - 0 classes - 0 missing values
All nominal attributes and instances with missing values are deleted. Price treated as the class attribute. As used by…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
159 instances - 16 features - 0 classes - 0 missing values
The problem is to learn a regression equation/rule/tree to predict the activity from the descriptive structural attributes. The data and methodology are described in detail in: King, Ross D., Hurst,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
186 instances - 61 features - 0 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
62 instances - 8 features - 0 classes - 12 missing values
Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
303 instances - 14 features - 0 classes - 6 missing values
Identification code deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
189 instances - 10 features - 0 classes - 0 missing values
Horsepower treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
15000 instances - 49 features - 0 classes - 0 missing values
Case number deleted. X treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
418 instances - 19 features - 0 classes - 1239 missing values
Identifier attribute deleted. NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
125 instances - 5 features - 0 classes - 0 missing values
This data set is also obtained from the task of controlling the ailerons of an F16 aircraft, although the target variable and attributes are different from the ailerons domain. The target variable here…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9517 instances - 7 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8192 instances - 22 features - 0 classes - 0 missing values
Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
398 instances - 8 features - 0 classes - 6 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics; (b) its assigned insurance risk rating; (c) its normalized losses in use as…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
159 instances - 16 features - 0 classes - 0 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
303 instances - 14 features - 0 classes - 6 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) SUMMARY: Data from an experiment on the effects of machine adjustments on the time to count bolts. Data appear as the STATS (Issue 10) Challenge. DATA:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
40 instances - 7 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
52 instances - 3 features - 0 classes - 0 missing values
1. Title: Wisconsin Prognostic Breast Cancer (WPBC) 2. Source Information a) Creators: Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin, Clinical Sciences Center, Madison, WI…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
194 instances - 33 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
61 instances - 3 features - 0 classes - 0 missing values
This data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8192 instances - 9 features - 0 classes - 0 missing values
The objective was to determine which seedlots in a species are best for soil conservation in seasonally dry hill country. Determination is found by measurement of height, diameter by height, survival,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
736 instances - 20 features - 5 classes - 448 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4177 instances - 9 features - 28 classes - 0 missing values
The database consists of the multi-spectral values of pixels in 3x3 neighbourhoods in a satellite image, and the classification associated with the central pixel in each neighbourhood. The aim is to…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6430 instances - 37 features - 6 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45312 instances - 9 features - 2 classes - 0 missing values
This data set consists of the petal and sepal length of 3 different types of irises (Setosa, Versicolour, and Virginica), stored in a 150x4 numpy.ndarray
10 runs - 0 likes - 0 downloads - 0 reach - 2 impact
150 instances - 5 features - 3 classes - 0 missing values
1. Title: Waveform Database Generator (written in C) 2. Source: (a) Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 41 features - 3 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
846 instances - 19 features - 4 classes - 0 missing values
1. Title: Tic-Tac-Toe Endgame database 2. Source Information -- Creator: David W. Aha (aha@cs.jhu.edu) -- Donor: David W. Aha (aha@cs.jhu.edu) -- Date: 19 August 1991 3. Known Past Usage: 1.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
958 instances - 10 features - 2 classes - 0 missing values
Donor: G. Towell, M. Noordewier, and J. Shavlik Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. All examples taken from Genbank 64.1. Categories "ei" and "ie"…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3190 instances - 61 features - 3 classes - 0 missing values
1. Title: SPAM E-mail Database 2. Sources: (a) Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304 (b) Donor: George Forman…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4601 instances - 58 features - 2 classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
683 instances - 36 features - 19 classes - 2337 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Sydney, Australia. 1987. Classes: sick, negative. Attributes: age: continuous. sex: M, F. on…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
337 runs - 0 likes - 0 downloads - 0 reach - 0 impact
768 instances - 9 features - 2 classes - 0 missing values
1. Title: Image Segmentation data 2. Source Information -- Creators: Vision Group, University of Massachusetts -- Donor: Vision Group (Carla Brodley, brodley@cs.umass.edu) -- Date: November, 1990 3.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2310 instances - 20 features - 7 classes - 0 missing values
1. Title of Database: Pen-Based Recognition of Handwritten Digits 2. Source: E. Alpaydin, F. Alimoglu Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10992 instances - 17 features - 10 classes - 0 missing values
Description of the German credit dataset. 1. Title: German Credit data 2. Source Information Professor Dr. Hans Hofmann Institut für Statistik und Ökonometrie Universität Hamburg FB…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 21 features - 2 classes - 0 missing values
1. Title: Credit Approval 2. Sources: (confidential) Submitted by quinlan@cs.su.oz.au 3. Past Usage: See Quinlan, * "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. *…
38 runs - 0 likes - 0 downloads - 0 reach - 0 impact
690 instances - 16 features - 2 classes - 67 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5620 instances - 65 features - 10 classes - 0 missing values
1. Title: Mushroom Database 2. Sources: (a) Mushroom records drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf (b) Donor:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8124 instances - 23 features - 2 classes - 2480 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1473 instances - 10 features - 3 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 48 features - 10 classes - 0 missing values
1. Title: Car Evaluation Database 2. Sources: (a) Creator: Marko Bohanec (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past Usage: The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1728 instances - 7 features - 4 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 241 features - 10 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 7 features - 10 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 65 features - 10 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
699 instances - 10 features - 2 classes - 16 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 77 features - 10 classes - 0 missing values
The multi-feature digit dataset. Owned and donated by: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology, P.O. Box…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 217 features - 10 classes - 0 missing values
1. Title: Balance Scale Weight & Distance Database 2. Source Information: (a) Source: Generated to model psychological experiments reported by Siegler, R. S. (1976). Three Aspects of Cognitive…
24 runs - 0 likes - 0 downloads - 0 reach - 0 impact
625 instances - 5 features - 3 classes - 0 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20000 instances - 17 features - 26 classes - 0 missing values
1. Title: Chess End-Game -- King+Rook versus King+Pawn on a7 (usually abbreviated KRKPA7). The pawn on a7 means it is one square away from queening. It is the King+Rook's side (white) to move. 2.…
26 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3196 instances - 37 features - 2 classes - 0 missing values
1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross…
148 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 5 classes - 22175 missing values
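
Each entry in this listing ends with a summary line of the form "N instances - N features - N classes - N missing values". As a rough illustration of how those lines could be read programmatically, here is a minimal Python sketch. The names `parse_summary` and `missing_fraction` are hypothetical helpers written for this example, and the missing-value share assumes the count refers to individual cells of an instances-by-features table, which the listing itself does not state.

```python
import re

# Pattern for a summary line such as
# "418 instances - 19 features - 0 classes - 1239 missing values".
SUMMARY_RE = re.compile(
    r"(\d+) instances - (\d+) features - (\d+) classes - (\d+) missing values"
)


def parse_summary(line):
    """Return the four counts of a summary line as a dict of ints."""
    match = SUMMARY_RE.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    instances, features, classes, missing = map(int, match.groups())
    return {
        "instances": instances,
        "features": features,
        "classes": classes,
        "missing": missing,
    }


def missing_fraction(summary):
    """Share of missing cells, ASSUMING an instances x features table."""
    cells = summary["instances"] * summary["features"]
    return summary["missing"] / cells if cells else 0.0


summary = parse_summary(
    "418 instances - 19 features - 0 classes - 1239 missing values"
)
print(summary)
print(round(missing_fraction(summary), 3))
```

Under that assumption, the ratio gives a quick way to compare how sparse the entries are, for example the 898-instance entry with 22175 missing values against entries that report none.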