This repository contains machine learning projects that focus on regression and classification tasks using the NumPy, scikit-learn and Tensorflow libraries. Various datasets are used for these projects. Datasets used are mentioned in the project description.
-
Wine quality classification and regression
- Datasets: Red Wine Quality, White Wine Quality
- Description: In this project, the goal is to classify wine quality as 'bad' or 'good' using classification model. It demonstrates classification algorithms with scikit-learn. a regression model is also used to determine sample quility.
-
- high energy gamma particles in a ground-based atmospheric Cherenkov gamma telescope
- Datasets: MAGIC Gamma Telescope
-
- This Jupyter Notebook demonstrates a multi-task learning setup using TensorFlow and Keras. Multi-task learning involves training a model to perform multiple related tasks simultaneously. In this example, the model is trained to predict both the digit label and the color of a handwritten digit image.
- The MNIST dataset is used in this example. The dataset consists of grayscale images of handwritten digits (0 to 9). Each digit is associated with a label indicating the digit itself, and in this case, a color label (0 for red, 1 for green) is also introduced.
- The model architecture involves shared convolutional layers followed by separate branches for digit and color predictions. The shared layers capture general features from the input image, and the separate branches learn task-specific predictions.
-
- Trade-off between Overfitting and Underfitting is investigated
$$\ t = \sin(4\pi x + \frac{\pi}{2}) + \epsilon $$
-
binary classification logistic regression
- breast cancer wisconsin dataset (classification)
- ROC and PR curve are graphed
-
- K Nearest Neighbor is used on boston housing dataset
- Best K value is determined
-
Neural Network Classifier with Two Hidden Layers
- Early stop, SGD
- Number of hidden layer units is changed and compared to find the optimal n1, n2.
-
- Image processing and compression
- Effect of random initialization is observed
- Python 3.x
- NumPy (install using
pip install numpy) - Pandas (install using
pip install pandas) - scikit-learn (install using
pip install scikit-learn) - TensorFlow (install using
pip install tensorflow) - mine is version 2.12.0 - Other libraries when necessary, see project import.
This project is licensed under the MIT License
