🤓 General: Within this repository is a directory of jupyter notebooks, each of which contain examples of many different concepts that one might use when working with Python in the field of Data Science or Analytics, as well as many fundamentals of Python programming. *
- 📓 Notebooks for each section of the
Table of Contentsbelow are stored in thenotebooks/directory see here and are named to reflect the general information contained within (i.e. the notebook atnotebooks/sampling.ipynbcontains information and examples pertaining to statistical sampling in Python). - Any data sets necessary for particular notebooks are stored in the
data/directory within a folder whose name is the same as the notebook to which it belongs (i.e. the data for the notebook atnotebooks/sampling.ipynbcan be found atdata/sampling/).- NOTE: Not all data has been uploaded to the repository to conserve space. For notebooks where the necessary dataset(s) has not been uploaded, the corresponding folder within the
data/directory contains a Markdown (.md) file where a link to the dataset(s) can be found for downloading; simply download the dataset(s) to the directory where the Markdown file is located, and everything will work for you.
- NOTE: Not all data has been uploaded to the repository to conserve space. For notebooks where the necessary dataset(s) has not been uploaded, the corresponding folder within the
- The notebooks in this repository use many GPU accelerated libraries and tools, such as RAPIDS (specifically
cudfinstead of usingpandas) andjax.numpy(imported asjnp) - I have included, for the most part, examples of both non-GPU accelerated code and GPU accelerated code
- If you are going to use GPU acceleration you will need to run this to get jax to work:
pip install --upgrade "jax[cuda13-local]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html- NOTE: the install script above is for Cuda version 13, if you have another version of Cuda, just change the number in
cuda13-localto your specific version
- NOTE: the install script above is for Cuda version 13, if you have another version of Cuda, just change the number in
💻 PURPOSE: This repository is mostly for my own reference and memorization, but it is also available for anyone else to either learn Python and the Data Science tools of Python or just as a reference/cookbook to look back to when something isn't coming to mind immediately--or whatever else you want to use it for: I don't care 
- 1. PYTHON & PANDAS FUNDAMENTALS
- 1.a The Basics
- 1.b List/Dictionary Comprehensions and Generators
- 1.c
- 1.d
- 1.e
- 1.f
- 1.g
- 2.
- 2.a
- 2.b
- 2.c
- 2.d
- 2.e
- 3.
