awesome-pandas/README.md

160 lines
15 KiB
Markdown
Raw Normal View History

2017-12-30 04:24:30 +08:00
# awesome-pandas
A collection of resources for pandas (Python) and related subjects.
2017-12-30 06:39:09 +08:00
2017-12-30 17:33:55 +08:00
**Table of contents**
* pandas resources
* Videos
* Cheat-sheets
* Tutorials
* Books
* Data analysis with Python resources
* Videos
* Cheat-sheets
* Tutorials
* Books
* Miscellaneous related resources
* Videos
* Cheat-sheets
* Tutorials
* Books
2017-12-30 06:39:09 +08:00
--------------------------------------------------------------------------------
## pandas resources
### Videos
2017-12-30 15:47:56 +08:00
:snake: - Little programming
:snake: :snake: Some programming
:snake: :snake: :snake: -Advanced
2017-12-30 15:36:46 +08:00
| Title | Speaker | Uploader | Duration | Views | Keywords | Year | Level |
| ----- | ------- | -------- | -------- | ----- | -------- | ---- | ----- |
2017-12-30 15:40:56 +08:00
| [How to become a Data Scientist in 6 months](https://www.youtube.com/watch?v=rIofV14c0tc) | Tetiana Ivanova | PyData | 0:56 | 148000 | misc | 2016 | :snake: |
2017-12-30 15:47:56 +08:00
| [Introduction Into Pandas](https://www.youtube.com/watch?v=-NR-ynQg0YM) | Daniel Chen | Python Tutorial | 1:28 | 46000 | pandas | 2017 | :snake: |
| [So you want to be a Python expert?](https://www.youtube.com/watch?v=cKPlPJyQrt4) | James Powell | PyData | 1:54 | 28000 | python | 2017 | :snake::snake::snake: |
| [NumPy Beginner](https://www.youtube.com/watch?v=gtejJ3RCddE) [[repo](https://github.com/enthought/Numpy-Tutorial-SciPyConf-2016)] | Alexandre Chabot LeClerc | Enthought | 2:47 | 56000 | NumPy | 2016 | :snake: :snake: |
2017-12-30 16:00:26 +08:00
| [Introduction To Data Analytics With Pandas](https://www.youtube.com/watch?v=5XGycFIe8qE) | Quentin Caudron | Python Tutorial | 1:51 | 25000 | pandas | 2017 | :snake: |
| [Transforming Code into Beautiful, Idiomatic Python](https://www.youtube.com/watch?v=OSGv2VnC0go) | Raymond Hettinger | Next Day Video | 0:48 | 340000 | python | 2013 | :snake: |
| [Machine Learning](https://www.youtube.com/watch?v=OB1reY6IX-o) | Andreas Mueller & Sebastian Raschka | Enthought | 3:03 | 47000 | sklearn | 2016 | :snake: :snake: |
| [Pandas From The Ground Up](https://www.youtube.com/watch?v=5JnMutdy6Fw) [[repo](https://github.com/brandon-rhodes/pycon-pandas-tutorial)] | Brandon Rhodes | PyCon 2015 | 2:24 | 91000 | pandas | 2015 | :snake: :snake: |
| [Modern Dictionaries](https://www.youtube.com/watch?v=p33CVV29OG8) | Raymond Hettinger | SF Python | 1:07 | 44000 | python | 2016 | :snake: :snake: |
| [The Python Visualization Landscape](https://www.youtube.com/watch?v=FytuB8nFHPQ) | Jake VanderPlas | PyCon 2017 | 0:33 | 21000 | python | 2017 | :snake: |
| [Keynote on Concurrency](https://www.youtube.com/watch?v=9zinZmE3Ogk) | Raymond Hettinger | SF Python | 1:13 | 15000 | python | 2017 | :snake::snake: |
| [Pandas for Data Analysis](https://www.youtube.com/watch?v=oGzU688xCUs) [[repo](https://github.com/chendaniely/scipy-2017-tutorial-pandas)] | Daniel Chen | Enthought | 3:45 | 13000 | pandas | 2017 | :snake::snake: |
2017-12-30 16:21:30 +08:00
| [JupyterLab: Building Blocks for Interactive Computing](https://www.youtube.com/watch?v=Ejh0ftSjk6g) | Brian Granger | Enthought | 0:29 | 28000 | jupyter | 2016 | :snake: |
| [Optimizing Pandas Code for Speed and Efficiency](https://www.youtube.com/watch?v=HN5d490_KKk) | Sofia Heisler | PyCon 2017 | 0:29 | 12000 | pandas | 2017 | :snake: :snake: |
| [A Visual Guide To Pandas](https://www.youtube.com/watch?v=9d5-Ti6onew) | Jason Wirth | Next Day Video | 0:26 | 49000 | pandas | 2015 | :snake: |
| [Machine Learning with Scikit Learn](https://www.youtube.com/watch?v=80fZrVMurPM) [[repo](https://github.com/amueller/scipy_2015_sklearn_tutorial)] | Andreas Mueller & Kyle Kastner | Enthought | 3:22 | 48000 | sklearn | 2015 | :snake: :snake: |
| [Machine Learning for Time Series Data in Python](https://www.youtube.com/watch?v=ZgHGCfwExw0) | Brett Naul | Enthought | 0:24 | 24000 | cesium | 2016 | :snake: |
| [The Fun of Reinvention](https://www.youtube.com/watch?v=js_0wjzuMfc) | David Beazley | David Beazley | 0:52 | 11000 | python | 2017 | :snake::snake::snake: |
2017-12-30 16:45:15 +08:00
| [Analyzing and Manipulating Data with Pandas](https://www.youtube.com/watch?v=6ohWS7J1hVA) [[repo](https://github.com/jonathanrocher/pandas_tutorial)] | Jonathan Rocher | Enthought | 3:33 | 22000 | pandas | 2016 | :snake: |
| [Computational Statistics](https://www.youtube.com/watch?v=He9MCbs1wgE) [[repo](https://github.com/AllenDowney/CompStats)] | Allen Downey | Enthought | 2:05 | 10000 | scipy | 2017 | :snake: :snake: |
| [Being a Core Developer in Python](https://www.youtube.com/watch?v=voXVTjwnn-U) | Raymond Hettinger | SF Python | 1:02 | 19000 | python | 2016 | :snake: |
| [Time Series Analysis](https://www.youtube.com/watch?v=zmfe2RaX-14) [[repo](https://github.com/ikding/pycon_time_series)] | Aileen Nielsen | PyCon 2017 | 3:11 | 9000 | pandas | 2017 | :snake: :snake: |
| [Learning TensorFlow](https://www.youtube.com/watch?v=bvHgESVuS6Q) | Robert Layton | PyCon Australia | 0:40 | 18000 | tensorflow | 2016 | :snake: :snake: |
| [JupyterHub: Deploying Jupyter Notebooks](https://www.youtube.com/watch?v=gSVvxOchT8Y) | Min Ragan Kelley & Thomas Kluyver | PyData | 1:36 | 17000 | jupyter | 2016 | :snake: |
| [Applied Time Series Econometrics](https://www.youtube.com/watch?v=tJ-O3hk1vRw) | Jeffrey Yau | PyData | 1:39 | 17000 | statsmodels | 2016 | :snake: :snake: |
| [Machine Learning with scikit learn](https://www.youtube.com/watch?v=2kT6QOVSgSg) [[repo](https://github.com/amueller/scipy-2017-sklearn)] | Andreas Mueller & Alexandre Gram | Enthought | 3:10 | 8000 | sklearn | 2017 | :snake: :snake: |
| [Introduction to Numerical Computing with NumPy](https://www.youtube.com/watch?v=lKcwuPnSHIQ) | Dillon Niederhut | Enthought | 2:27 | 8000 | NumPy | 2017 | :snake: |
2017-12-30 17:33:55 +08:00
| [Dask - A Pythonic Distributed Data Science Framework](https://www.youtube.com/watch?v=RA_2qdipVng) | Matthew Rocklin | PyCon 2017 | 0:46 | 7000 | dask | 2017 | :snake: :snake: |
| [Introduction to Statistical Modeling with Python](https://www.youtube.com/watch?v=TMmSESkhRtI) [[repo](https://github.com/fonnesbeck/intro_stat_modeling_2017)] | Christopher Fonnesbeck | PyCon 2017 | 3:19 | 7000 | scipy | 2017 | :snake: :snake: |
| [Fully Convolutional Networks for Image Segmentation](https://www.youtube.com/watch?v=-lXfsWP7DJ8) | Daniil Pakhomov | Enthought | 0:20 | 7000 | scipy | 2017 | :snake: |
| [Exploratory data analysis in python](https://www.youtube.com/watch?v=W5WE9Db2RLU) [[repo](https://github.com/cmawer/pycon-2017-eda-tutorial)] | Chloe Mawer & Jonathan Whitmore | PyCon 2017 | 2:54 | 7000 | scipy | 2017 | :snake: |
| [Visualizing Geographic Data](https://www.youtube.com/watch?v=ZIEyHdvF474) | Christopher Roach | PyData | 0:31 | 14000 | python | 2016 | :snake: |
| [Builtin Superheroes](https://www.youtube.com/watch?v=j6VSAsKAj98) | David Beazley | David Beazley | 0:44 | 12000 | python | 2016 | :snake: :snake: |
| [Python's Class Development Toolkit](https://www.youtube.com/watch?v=HTLu2DFOdTg) | Raymond Hettinger | Next Day Video | 0:45 | 80000 | python | 2013 | :snake: :snake: |
| [Libraries for Deep Learning with Sequences](https://www.youtube.com/watch?v=E92jDCmJNek) | Alex Rubinsteyn | PyData | 0:44 | 23000 | scipy | 2015 | :snake: :snake: |
| [The Other Async (Threads + Async = ❤️) - YouTube](https://www.youtube.com/watch?v=x1ndXuw7S0s) | David Beazley | David Beazley | 0:47 | 5000 | python | 2017 | :snake: :snake: :snake:|
| [Numba - Tell Those C++ Bullies to Get Lost](https://www.youtube.com/watch?v=1AwG0T4gaO0) [[repo](https://github.com/gforsyth/numba_tutorial_scipy2017)] | Gil Forsyth & Lorena Barba | Enthought | 2:25 | 5000 | numba | 2017 | :snake: :snake: |
| [Deploying Interactive Jupyter Dashboards](https://www.youtube.com/watch?v=8Jktm-Imt-I) | Philipp Rudiger | Enthought | 0:18 | 5000 | jupyter | 2017 | :snake: :snake: |
| [Data Science Using Functional Python](https://www.youtube.com/watch?v=ThS4juptJjQ) | Joel Grus | PyData | 0:44 | 18000 | python | 2015 | :snake: :snake: |
| [Pandas from the Inside](https://www.youtube.com/watch?v=CowlcrtSyME) | Stephen Simmons | PyData | 1:20 | 9000 | pandas | 2016 | :snake: :snake: :snake: |
| [Anatomy of matplotlib](https://www.youtube.com/watch?v=MKucn8NtVeI) [[repo](https://github.com/matplotlib/AnatomyOfMatplotlib)] | Benjamin Root & Joe Kington | Enthought | 3:18 | 18000 | matplotlib | 2015 | :snake: :snake: |
| [Anatomy of matplotlib](https://www.youtube.com/watch?v=rARMKS8jE9g) [[repo](https://github.com/matplotlib/AnatomyOfMatplotlib)] | Benjamin Root | Enthought | 3:02 | 4000 | matplotlib | 2017 | :snake: :snake: |
| [Data Science is Software](https://www.youtube.com/watch?v=EKUy0TSLg04) [[repo](https://github.com/drivendata/data-science-is-software)] | Peter Bull & Isaac Slavitt | Enthought | 2:12 | 9000 | jupyter | 2016 | :snake: |
2017-12-30 15:36:46 +08:00
| [Jake VanderPlas: Machine Learning with Scikit Learn - YouTube](https://www.youtube.com/watch?v=HC0J_SPm9co) | NAME | PyData | DURATION | 16000 | None | 2015 | Novice |
| [Using Jupyter notebooks to develop and share interactive data displays - YouTube](https://www.youtube.com/watch?v=aXR2d9k9-h4) | NAME | PyCon Australia | DURATION | 8000 | None | 2016 | Novice |
| [Parallel Python: Analyzing Large Datasets Intermediate SciPy 2016 Tutorial Matthew Rocklin & Mi - YouTube](https://www.youtube.com/watch?v=5Md_sSsN51k) | NAME | Enthought | DURATION | 7000 | None | 2016 | Novice |
| [Functional Programming with Python - YouTube](https://www.youtube.com/watch?v=Ta1bAMOMFOI) | NAME | Next Day Video | DURATION | 44000 | None | 2013 | Novice |
| [Predicting sports winners using data analytics with pandas and scikit-learn by Robert Layton - YouTube](https://www.youtube.com/watch?v=k7hSD_-gWMw) | NAME | PyCon Australia | DURATION | 13000 | None | 2015 | Novice |
| [Keynote: Project Jupyter SciPy 2016 Brian Granger - YouTube](https://www.youtube.com/watch?v=v5mrwq7yJc4) | NAME | Enthought | DURATION | 7000 | None | 2016 | Novice |
| [matplotlib (Python Plotting Library) Beginner SciPy 2016 Tutorial Nicolas Rougier - YouTube](https://www.youtube.com/watch?v=p7Mj-4kASmI) | NAME | Enthought | DURATION | 6000 | None | 2016 | Novice |
| [Awesome Big Data Algorithms - YouTube](https://www.youtube.com/watch?v=jKBwGlYb13w) | NAME | Next Day Video | DURATION | 41000 | None | 2013 | Novice |
| [Stephen Simmons - Pandas from the Inside / "Big Pandas" - YouTube](https://www.youtube.com/watch?v=YGk09nK_xnM) | NAME | PyData | DURATION | 3000 | None | 2017 | Novice |
| [Fear and Awaiting in Async (Screencast) - YouTube](https://www.youtube.com/watch?v=Bm96RqNGbGo) | NAME | David Beazley | DURATION | 5000 | None | 2016 | Novice |
| [Brian Granger: All About Jupyter - YouTube](https://www.youtube.com/watch?v=GMKZD1Ohlzk) | NAME | PyData | DURATION | 11000 | None | 2015 | Novice |
| [Anusua Trivedi: An example of Predictive Analytics: Building a Recommendation Engine using Python - YouTube](https://www.youtube.com/watch?v=E9XTOnEgqRY) | NAME | PyData | DURATION | 11000 | None | 2015 | Novice |
| [Sarah Guido The Wild West of Data Wrangling PyCon 2017 - YouTube](https://www.youtube.com/watch?v=xn9sTXR3Cp8) | NAME | PyCon 2017 | DURATION | 3000 | None | 2017 | Novice |
| [Adventures in scikit-learn's Random Forest by Gregory Saunders - YouTube](https://www.youtube.com/watch?v=YkVscKsV_qk) | NAME | PyCon Australia | DURATION | 9000 | None | 2015 | Novice |
| [Doing Math with Python - YouTube](https://www.youtube.com/watch?v=XJOt4QQgx0A) | NAME | PyCon Australia | DURATION | 5000 | None | 2016 | Novice |
| [Iterations of Evolution: The Unauthorized Biography of the For-Loop - YouTube](https://www.youtube.com/watch?v=2AXuhgid7E4) | NAME | David Beazley | DURATION | 2000 | None | 2017 | Novice |
| [Alex Martelli, ""Good Enough" IS Good Enough!", PyBay2016 - YouTube](https://www.youtube.com/watch?v=_Ek3A2b-nHU) | NAME | SF Python | DURATION | 4000 | None | 2016 | Novice |
| [Renee Chu - Python for Social Scientists: Cleaning and Prepping Data - PyCon 2016 - YouTube](https://www.youtube.com/watch?v=u682UpVrMVM) | NAME | PyCon 2016 | DURATION | 3000 | None | 2016 | Novice |
| [PyMC: Markov Chain Monte Carlo in Python SciPy 2014 Chris Fonnesbeck - YouTube](https://www.youtube.com/watch?v=XbxIo7ScVzc) | NAME | Enthought | DURATION | 9000 | None | 2014 | Novice |
| [Alex Martelli, ""The Tower of Abstraction", PyBay2016 - YouTube](https://www.youtube.com/watch?v=zhpWhkW8kcc) | NAME | SF Python | DURATION | 3000 | None | 2016 | Novice |
| [Jupyter Advanced Topics Tutorial SciPy 2015 Tutorial Jonathan Frederic, Matthias Bussonier & Tho - YouTube](https://www.youtube.com/watch?v=38R7jiCspkw) | NAME | Enthought | DURATION | 4000 | None | 2015 | Novice |
| [Rachel Thomas, "Using randomness to make code much faster", PyBay2017 - YouTube](https://www.youtube.com/watch?v=7i6kBz1kZ-A) | NAME | SF Python | DURATION | 1000 | None | 2017 | Novice |
| [Mahmoud Hashemi, Python Profiling & Performance: elementary to enterprise, PyBay2016 - YouTube](https://www.youtube.com/watch?v=Dgnp28Ijm_M) | NAME | SF Python | DURATION | 1000 | None | 2016 | Novice |
| [Dillon Niederhut, "What to do when your data is large, but not big", PyBay2016 - YouTube](https://www.youtube.com/watch?v=g-YCaX3ml2Q) | NAME | SF Python | DURATION | 0 | None | 2016 | Novice |
| [Cynthia Lin, "Opening Up to Open Source", PyBay2017 - YouTube](https://www.youtube.com/watch?v=rfdzOZkDqYk) | NAME | SF Python | DURATION | 0 | None | 2017 | Novice |
2017-12-30 06:49:10 +08:00
2017-12-30 06:39:09 +08:00
* (Brandon Rhodes - Pandas From The Ground Up - PyCon 2015) https://www.youtube.com/watch?v=5JnMutdy6Fw
* (A Visual Guide To Pandas) https://www.youtube.com/watch?v=9d5-Ti6onew
* (Stephen Simmons | Pandas from the Inside) https://www.youtube.com/watch?v=CowlcrtSyME
* (Stephen Simmons - Pandas from the Inside / "Big Pandas") https://www.youtube.com/watch?v=YGk09nK_xnM
* (Pandas for Data Analysis | SciPy 2017 Tutorial | Daniel Chen) https://www.youtube.com/watch?v=oGzU688xCUs
* (Introduction To Data Analytics With Pandas) https://www.youtube.com/watch?v=5XGycFIe8qE
* (Introduction Into Pandas: Python Tutorial) https://www.youtube.com/watch?v=-NR-ynQg0YM
### Cheat-sheets
* https://github.com/pandas-dev/pandas/blob/master/doc/cheatsheet/Pandas_Cheat_Sheet.pdf
* https://s3.amazonaws.com/quandl-static-content/Documents/Quandl+-+Pandas,+SciPy,+NumPy+Cheat+Sheet.pdf
* http://www.webpages.uidaho.edu/~stevel/504/Pandas%20DataFrame%20Notes.pdf
* https://assets.datacamp.com/blog_assets/PandasPythonForDataScience.pdf
* https://s3.amazonaws.com/assets.datacamp.com/blog_assets/Python_Pandas_Cheat_Sheet_2.pdf
### Tutorials
* https://github.com/jorisvandenbossche/pandas-tutorial
* https://github.com/guipsamora/pandas_exercises
* https://github.com/brandon-rhodes/pycon-pandas-tutorial
* https://github.com/jadianes/winerama-recommender-tutorial
* https://github.com/jonathanrocher/pandas_tutorial
* https://github.com/chendaniely/scipy-2017-tutorial-pandas
* https://github.com/tdpetrou/Learn-Pandas
* https://github.com/adeshpande3/Pandas-Tutorial
* https://github.com/GaelVaroquaux/sklearn_pandas_tutorial
* https://github.com/vi3k6i5/pandas_basics
* https://github.com/california-civic-data-coalition/first-python-notebook
### Books
## Data analysis with Python resources
### Videos
* (Introduction to NumPy | SciPy 2015 Tutorial | Eric Jones) https://www.youtube.com/watch?v=1zmV8lZsHF4
### Cheat-sheets
* http://datasciencefree.com/numpy.pdf
* https://s3.amazonaws.com/assets.datacamp.com/blog_assets/Scikit_Learn_Cheat_Sheet_Python.pdf
### Tutorials
### Books
## Data analysis resources
### Books
### Papers
###