[π§π· PortuguΓͺs] [π¬π§ English]
Dictionary-Based Feature Grouping for LLM/AI Pipelines
Institution: Pontifical Catholic University of SΓ£o Paulo (PUC-SP)
School: Faculty of Interdisciplinary Studies
Program: Humanistic AI and Data Science
Semester: 2nd Semester 2025
Professor: Professor Doctor in Mathematics Daniel Rodrigues da Silva
Important
- Projects and deliverables may be made publicly available whenever possible.
- The course emphasizes practical, hands-on experience with real datasets to simulate professional consulting scenarios in the fields of Data Analysis and Data Mining for partner organizations and institutions affiliated with the university.
- All activities comply with the academic and ethical guidelines of PUC-SP.
- Any content not authorized for public disclosure will remain confidential and securely stored in private repositories.
πΆ Prelude Suite no.1 (J. S. Bach) - Sound Design Remix
Statistical.Measures.and.Banking.Sector.Analysis.at.Bovespa.mp4
πΊ For better resolution, watch the video on YouTube.
Tip
This repository is a review of the Statistics course from the undergraduate program Humanities, AI and Data Science at PUC-SP.
β Access Data Mining Main Repository
<br
-
Chen, X., et al. (2024). LLM-based feature generation from text for interpretable machine learning. arXiv preprint. Retrieved from arxiv.org/html/2409.07132v2
-
DataCamp. (2024). Pandas GroupBy Explained: Syntax, Examples, and Tips. Retrieved from datacamp.com/tutorial/pandas-groupby
-
GeeksforGeeks. (2024). Pandas dataframe.groupby() Method. Retrieved from geeksforgeeks.org/pandas/python-pandas-dataframe-groupby
-
Machine Learning Mastery. (2024). Feature Engineering with LLM Embeddings: Enhancing Scikit-learn Models. Retrieved from machinelearningmastery.com
-
McKinney, W. (2017). Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython (2nd ed.). OβReilly Media.
-
Pandas Documentation. (2024). Group by: split-apply-combine. Retrieved from pandas.pydata.org/docs/user_guide/groupby.html
-
VanderPlas, J. (2016). Python Data Science Handbook: Essential Tools for Working with Data. OβReilly Media.
πΈΰΉ My Contacts Hub
ββββββββββββββ πβ ββββββββββββββ
β£β’β€ Back to Top
Copyright 2026 Quantum Software Development. Code released under the MIT License license.