Skip to content
View JustaKris's full-sized avatar

Block or report JustaKris

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JustaKris/README.md

Hey there! πŸ‘‹ I'm Krystiyan

Data Scientist | AI/ML Engineer

Building smart tools, untangling messy data, automating the boring bits and turning it all into production-ready AI/ML solutions

LinkedIn Email


πŸ‘¨β€πŸ’» About Me

I'm a curious and driven Data Scientist | AI/ML Engineer who enjoys figuring out the "why" behind anything and everything, with a Data Science background, focused on building practical, production-ready ML/AI systems and modern cloud-native applications.

I enjoy turning messy data, legacy workflows, and vague business problems into clean Python pipelines, deployable ML services, and useful AI-powered tools. I'm especially interested in the intersection of Machine Learning, Generative AI, and software engineering β€” where models actually ship and create real impact. My philosophy: understand the why behind a model, not just the how.

πŸ”­ What I'm Currently Working On

My recent work focuses on modernizing legacy analytical systems and building end-to-end AI/ML solutions. While most of this work lives in private company repositories, it includes:

  • πŸ—οΈ Designing and maintaining scalable Python pipelines for data processing and ML workflows
  • πŸ”„ Re-engineering legacy statistical projects (SPSS/R) into modern, maintainable Python systems
  • πŸ€– Building and deploying ML models for real business use cases β€” forecasting, imputation, TV viewing prediction, and classification
  • πŸš€ Deploying containerized applications on AWS (Lambda, ECS/Fargate, S3, ALB) with CI/CD pipelines
  • πŸ’¬ Prototyping RAG-based chatbot systems using modern LLM stacks (embeddings, vector search, prompt engineering)
  • 🧠 Creating internal AI tools for database exploration, automation, and analytics support

🌱 Recent Projects & AI Work

  • RAG-based Chatbot Systems: End-to-end chatbot using LangChain, embeddings, RAG and vector search for contextual document Q&A and workflow automation
  • Production ML Models: Profitability forecasting using Scikit-learn and PyTorch, TV viewing prediction systems, and data imputation pipelines
  • Python Automation: Automated data workflows using Python scripting and job scheduling for routine tasks
  • Box Office Analytics: Movie and TV show analysis incorporating API integration and web scraping
  • Medical AI: CNN-based Chest X-ray image classification with deployment focus
  • Continuous Learning: Expanding knowledge in Cloud Technologies, Machine Learning, MLOps, and Generative AI

πŸ’‘ What Drives Me

  • πŸš€ Building ML systems that actually ship and create impact
  • πŸ“Š Understanding the "why" behind it all β€” defining problems clearly and turning complex data into actionable insights
  • ⚑ Making AI tools usable in real workflows, not just notebooks
  • βš™οΈ Automating repetitive tasks to focus on what matters
  • 🎸 Balancing tech with music, fitness, and too many movies, series, and games
  • 🀝 Always open to collaboration, fun side projects, or chatting tech!

πŸš€ Tech Stack

🧠 Core AI/ML Stack

Python Pandas NumPy scikit-learn PyTorch TensorFlow Keras FastAPI PyTest

πŸ€– LLMs & Generative AI

OpenAI Anthropic Google Gemini LangChain Hugging Face RAG

πŸ’Ύ Data & Cloud

PostgreSQL MySQL SQL Server SQLAlchemy Snowflake AWS Azure Docker Linux

πŸ”§ CI/CD & Version Control

Git GitHub GitLab GitHub Actions GitLab CI

πŸ’» Development Tools

VSCode PyCharm Jupyter


🧭 Current Focus

  • πŸ”¨ Building production-ready ML pipelines and deploying them on AWS/Azure
  • πŸ’¬ Developing RAG-based chatbot systems with LangChain and vector databases
  • πŸ“¦ Modernizing legacy statistical projects (SPSS/R) into scalable Python systems
  • πŸ€– Creating ML models for forecasting, imputation, and classification tasks
  • πŸ§ͺ Experimenting with semantic search, embeddings, and prompt engineering
  • πŸ“š Learning MLOps best practices and advanced RAG architectures
  • ☁️ Exploring cloud-native ML deployments and agentic AI systems
  • πŸ”§ Just enough DevOps to be dangerous πŸ˜„

πŸ“« Let's Connect

If you're working on something exciting or have a challenge that needs a curious mind β€” feel free to reach out!
Always happy to chat about data, tech, or interesting side quests.


Thanks for stopping by! ✨

Pinned Loading

  1. CNN-Classification-of-Chest-X-Ray-Images CNN-Classification-of-Chest-X-Ray-Images Public

    A Deep Learning project leveraging MobileNet for multi-class Chest X-Ray image classification. Features a scalable Flask web application for user input and predictions with GRAD-CAM heatmaps, fully…

    Jupyter Notebook

  2. Trump-Rally-Speeches-NLP-Chatbot Trump-Rally-Speeches-NLP-Chatbot Public

    Production-ready RAG-powered NLP API with Retrieval-Augmented Generation, sentiment analysis, and entity analytics. FastAPI + ChromaDB + Google Gemini.

    Jupyter Notebook

  3. Titanic-Machine-Learning-from-Disaster Titanic-Machine-Learning-from-Disaster Public

    A portfolio ML project: Titanic survival prediction system with 85%+ accuracy, ensemble methods, SHAP explainability, Flask REST API, Docker deployment, and comprehensive testing. Production-grade …

    Jupyter Notebook

  4. Are-You-Not-Entertained Are-You-Not-Entertained Public

    Machine learning pipeline for predicting movie box office success. Includes web scraping, data pipelines, feature engineering, ML models, MLflow tracking, and a web interface for predictions.

    Jupyter Notebook

  5. XYZ-Marketing-Campaign XYZ-Marketing-Campaign Public

    Data science analysis of advertising campaign effectiveness. EDA, feature engineering, predictive modeling, and actionable recommendations for marketing optimization.

    Jupyter Notebook

  6. Deep-Learning-SoftUni Deep-Learning-SoftUni Public

    A repository to keep track of my Deep Learning course in SoftUni

    Jupyter Notebook