We believe that machine learning should be as open and accessible as possible, and the OpenML team and the active community of contributors have invested countless hours of time and resources in OpenML to make this dream come true.
You are free to use OpenML and all empirical data and metadata under the CC-BY licence, requesting appropriate credit if you do.
The code of the OpenML platform and libraries is BSD licenced. Please check the corresponding GitHub repo's.
If you have used OpenML in a scientific publication, we would appreciate citations to the following paper:
Joaquin Vanschoren, Jan N. van Rijn, Bernd Bischl, and Luis Torgo. OpenML: networked science in machine learning. SIGKDD Explorations 15(2), pp 49-60, 2013.
Show BibTeX - Read on arXiv
If you have used the OpenML Python package, please also cite:
Matthias Feurer, Jan N. van Rijn, Arlind Kadra, Pieter Gijsbers, Neeratyoy Mallik, Sahithya Ravi, Andreas Mueller, Joaquin Vanschoren, Frank Hutter.
OpenML-Python: an extensible Python API for OpenML. arXiv:1911.02490 [cs.LG], 2019
Show BibTeX - Read on arXiv
If you have used the OpenML R package, please also cite:
Giuseppe Casalicchio, Jakob Bossek, Michel Lang, Dominik Kirchhoff, Pascal Kerschke, Benjamin Hofner, Heidi Seibold, Joaquin Vanschoren, Bernd Bischl.
OpenML: An R package to connect to the machine learning platform OpenML. Computational Statistics 32 (3), pp 1-15, 2017
Show BibTeX - Read on arXiv
Thank You!
Sharing data and code is crucial for reproducibility and scientific progress, and should be rewarded. If you are reusing any of the shared data sets, flows or runs/studies, please honor their respective licences and citation requests. OpenML cleary shows these requests when they apply.