Skip to content

A repository containing a folder of jupyter notebooks that go through many different Python Data Science concepts and techniques

Notifications You must be signed in to change notification settings

wanderduck/Python_DataScience_Examples

Repository files navigation

🦆 Python Data Science Examples 🦆



Wanderduck at Desk


🏗️THIS IS CURRENTLY A WORK IN PROGRESS AND WILL BECOME MORE COMPLETE OVER TIME👷‍♂️


   🤓 General: Within this repository is a directory of jupyter notebooks, each of which contain examples of many different concepts that one might use when working with Python in the field of Data Science or Analytics, as well as many fundamentals of Python programming. *


  • 📓 Notebooks for each section of the Table of Contents below are stored in the notebooks/ directory see here and are named to reflect the general information contained within (i.e. the notebook at notebooks/sampling.ipynb contains information and examples pertaining to statistical sampling in Python).

  • Any data sets necessary for particular notebooks are stored in the data/ directory within a folder whose name is the same as the notebook to which it belongs (i.e. the data for the notebook at notebooks/sampling.ipynb can be found at data/sampling/).
    • NOTE: Not all data has been uploaded to the repository to conserve space. For notebooks where the necessary dataset(s) has not been uploaded, the corresponding folder within the data/ directory contains a Markdown (.md) file where a link to the dataset(s) can be found for downloading; simply download the dataset(s) to the directory where the Markdown file is located, and everything will work for you.

  • The notebooks in this repository use many GPU accelerated libraries and tools, such as RAPIDS (specifically cudf instead of using pandas) and jax.numpy (imported as jnp)
  • I have included, for the most part, examples of both non-GPU accelerated code and GPU accelerated code
  • If you are going to use GPU acceleration you will need to run this to get jax to work:
    • pip install --upgrade "jax[cuda13-local]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
      • NOTE: the install script above is for Cuda version 13, if you have another version of Cuda, just change the number in cuda13-local to your specific version

  💻 PURPOSE: This repository is mostly for my own reference and memorization, but it is also available for anyone else to either learn Python and the Data Science tools of Python or just as a reference/cookbook to look back to when something isn't coming to mind immediately--or whatever else you want to use it for: I don't care :trollface:



📖 Table of Contents 📖

  • 1. PYTHON & PANDAS FUNDAMENTALS
    • 1.a The Basics
    • 1.b List/Dictionary Comprehensions and Generators
    • 1.c
    • 1.d
    • 1.e
    • 1.f
    • 1.g
  • 2.
    • 2.a
    • 2.b
    • 2.c
    • 2.d
    • 2.e
  • 3.

About

A repository containing a folder of jupyter notebooks that go through many different Python Data Science concepts and techniques

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages