PythonDataScienceHandbook/README.md

59 lines
3.6 KiB
Markdown
Raw Permalink Normal View History

2016-11-18 14:12:38 +08:00
# Python Data Science Handbook
2016-08-12 06:49:18 +08:00
2017-11-14 05:31:26 +08:00
[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)
2018-08-29 04:05:27 +08:00
[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)
2017-11-14 03:08:47 +08:00
This repository contains the entire [Python Data Science Handbook](http://shop.oreilly.com/product/0636920034919.do), in the form of (free!) Jupyter notebooks.
2016-08-12 06:49:18 +08:00
2017-11-14 03:08:47 +08:00
![cover image](notebooks/figures/PDSH-cover.png)
2017-11-14 03:14:55 +08:00
## How to Use this Book
2017-08-15 05:54:22 +08:00
2017-11-14 03:14:55 +08:00
- Read the book in its entirety online at https://jakevdp.github.io/PythonDataScienceHandbook/
2017-11-14 03:08:47 +08:00
2017-11-14 03:14:55 +08:00
- Run the code using the Jupyter notebooks available in this repository's [notebooks](notebooks) directory.
2017-11-14 03:11:35 +08:00
2018-08-29 04:13:35 +08:00
- Launch executable versions of these notebooks using [Google Colab](http://colab.research.google.com): [![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)
2018-08-29 04:05:27 +08:00
2017-11-14 05:31:26 +08:00
- Launch a live notebook server with these notebooks using [binder](https://beta.mybinder.org/): [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)
2017-11-14 03:08:47 +08:00
2017-11-14 03:14:55 +08:00
- Buy the printed book through [O'Reilly Media](http://shop.oreilly.com/product/0636920034919.do)
2017-11-14 03:11:35 +08:00
## About
2016-08-12 06:49:18 +08:00
2017-11-14 03:08:47 +08:00
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
2016-11-22 03:31:25 +08:00
2017-01-10 23:20:04 +08:00
The book introduces the core libraries essential for working with data in Python: particularly [IPython](http://ipython.org), [NumPy](http://numpy.org), [Pandas](http://pandas.pydata.org), [Matplotlib](http://matplotlib.org), [Scikit-Learn](http://scikit-learn.org), and related packages.
Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project,
[A Whirlwind Tour of Python](https://github.com/jakevdp/WhirlwindTourOfPython): it's a fast-paced introduction to the Python language aimed at researchers and scientists.
2017-08-15 05:54:22 +08:00
See [Index.ipynb](http://nbviewer.jupyter.org/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb) for an index of the notebooks available to accompany the text.
2016-08-12 06:44:38 +08:00
2017-11-14 03:11:35 +08:00
## Software
2016-11-24 01:37:18 +08:00
The code in the book was tested with Python 3.5, though most (but not all) will also work correctly with Python 2.7 and other older Python versions.
2017-01-28 04:11:38 +08:00
The packages I used to run the code in the book are listed in [requirements.txt](requirements.txt) (Note that some of these exact version numbers may not be available on your platform: you may have to tweak them for your own use).
To install the requirements using [conda](http://conda.pydata.org), run the following at the command-line:
2016-11-24 01:37:18 +08:00
```
$ conda install --file requirements.txt
```
To create a stand-alone environment named ``PDSH`` with Python 3.5 and all the required package versions, run the following:
```
$ conda create -n PDSH python=3.5 --file requirements.txt
```
You can read more about using conda environments in the [Managing Environments](http://conda.pydata.org/docs/using/envs.html) section of the conda documentation.
2016-08-12 06:44:38 +08:00
## License
2016-11-24 01:42:16 +08:00
### Code
The code in this repository, including all code samples in the notebooks listed above, is released under the [MIT license](LICENSE-CODE). Read more at the [Open Source Initiative](https://opensource.org/licenses/MIT).
### Text
2016-11-18 14:12:38 +08:00
The text content of the book is released under the [CC-BY-NC-ND license](LICENSE-TEXT). Read more at [Creative Commons](https://creativecommons.org/licenses/by-nc-nd/3.0/us/legalcode).