This repository contains the entire Python Data Science Handbook, in the form of (free!) Jupyter notebooks. https://jakevdp.github.io/PythonDataScienceHandbook/
 
Latest commit: cecbc43f8f by Jake VanderPlas, 2016-11-30 14:46:11 -08:00 (add temporary files to gitignore)


Python Data Science Handbook

Jupyter notebook content for my O'Reilly book, the Python Data Science Handbook.

cover image

This repository will contain the full listing of IPython notebooks used to create the book, including all text and code. The code was tested with Python 3.5, though most (but not all) snippets will work correctly in Python 2.7.

See also the free companion project, A Whirlwind Tour of Python: a fast-paced introduction to the Python language aimed at researchers and scientists.

I am currently doing the final edits of these notebooks, and will post them as I make my way through. See the content here:


Preface

1. IPython: Beyond Normal Python

2. Introduction to NumPy

3. Data Manipulation with Pandas

4. Visualization with Matplotlib

5. Machine Learning (coming soon)

Appendix: Figure Code


Required Packages

The code in the book was tested with Python 3.5, though most (but not all) snippets will also work correctly with Python 2.7 and other older Python versions.
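
As a quick way to see where your interpreter stands relative to the tested version, here is a minimal sketch. The helper name `compatibility_note` and its messages are illustrative assumptions, not part of the book's code; only the 3.5 baseline comes from the text above.

```python
import sys

# Version the README says the book's code was tested with.
TESTED = (3, 5)


def compatibility_note(version_info=None):
    """Compare a (major, minor) version with the tested one (hypothetical helper)."""
    major, minor = tuple(version_info or sys.version_info)[:2]
    if (major, minor) == TESTED:
        return "matches the tested version"
    if (major, minor) < TESTED:
        # The README only promises that most, not all, snippets work here.
        return "older than tested: most, but not all, snippets are expected to work"
    return "newer than tested: not covered by the README"


print("Python {}.{}: {}".format(sys.version_info[0], sys.version_info[1],
                                compatibility_note()))
```

The check deliberately reports rather than raises, since the README describes partial compatibility instead of a hard requirement.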

The packages required to run the code in the book are listed in requirements.txt. To install them with conda, run the following at the command line:

$ conda install --file requirements.txt

To create a stand-alone environment named PDSH with Python 3.5 and all the required package versions, run the following:

$ conda create -n PDSH python=3.5 --file requirements.txt
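
After either command, you may want to confirm that the environment matches the pinned versions. The sketch below is one way to do that, under stated assumptions: it assumes each line of requirements.txt has the form `name`, `name=version`, or `name==version`; the helper names and the `SAMPLE` text are hypothetical and not the repository's actual file; and `importlib.metadata` needs Python 3.8 or newer, so the lookup simply returns `None` on older interpreters such as 3.5.

```python
def parse_requirements(text):
    """Parse requirement lines into (name, version) pairs.

    Assumes `name`, `name=version`, or `name==version` per line;
    blank lines and `#` comments are skipped. version is None if unpinned.
    """
    pairs = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        name, _, version = line.partition("=")
        pairs.append((name.strip(), version.lstrip("=").strip() or None))
    return pairs


def installed_version(name):
    """Return the installed version of a distribution, or None if unknown."""
    try:
        from importlib.metadata import PackageNotFoundError, version
    except ImportError:  # Python older than 3.8
        return None
    try:
        return version(name)
    except PackageNotFoundError:
        return None


# Hypothetical sample; in practice use open("requirements.txt").read().
SAMPLE = "numpy=1.11.1\n# a comment\n\njupyter\npandas==0.18.1\n"

for name, wanted in parse_requirements(SAMPLE):
    print(name, "wanted:", wanted or "any", "installed:", installed_version(name))
```

Note that conda package names do not always match the distribution names that `importlib.metadata` knows about, so a `None` result is a prompt to check by hand rather than proof that a package is missing.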

You can read more about using conda environments in the Managing Environments section of the conda documentation.

Code Listings

The notebooks above are still being edited. In the meantime, you can see the code from an earlier draft of the book in code_listings. That directory will be removed once the full notebooks are posted.

License

Code

The code in this repository, including all code samples in the notebooks listed above, is released under the MIT license. Read more at the Open Source Initiative.

Text

The text content of the book is released under the CC-BY-NC-ND license. Read more at Creative Commons.