The Final Project of Introduction to Data Science Course - HCMUS - Oct 2024.
- Class: 22_21.
- Theory Class Lecturer: Mr. Le Ngoc Thanh.
- Practice Class Lecturer: Mr. Le Nhut Nam.
- Group ID: 9.
- Member's information: ID - Name
- 22120412 - Nguyễn Anh Tường.
- 22120384 - Nguyễn Đình Trí.
- 22120398 - Vũ Hoàng Nhật Trường.
- 22120424 - Phạm Ngọc Bảo Uyên.
- 22120449 - Lê Nguyễn Huyền Vy.
- Time: Oct 01, 2024 - Dec 24, 2024.
- To run: DataCollection.ipynb.
- Datasets: Dataset_child/
- Note: Here I will divide the data collection process into several parts with different data sets. One of the main reasons is that this kitchenart website will block us if we do data scraping too many times ( Unusual access to the website ).
- To run: Data Cleaning and Normalizing.ipynb
- Datasets:cleaned_recipes_translated.txt, cleaned_recipes_2_translated.txt
- Note:This section will demonstrate how to clean and normalize data, from a plain text file converted into a binary dataframe, with rows being the names of dishes, and columns being the names of ingredients present in the original data.
- To run: DataExploration.ipynb
- Datasets:ingredients.csv
- Note: This section will present the information you need to know about the cooking field.
- Step 01: Install LMStudtio.
- Step 02: Download a model.
- Step 03: Start LMStudio Server.
- To run: Context_Machine.ipynb
- Datasets:
- Context: ingredients_analysis.csv
- Context Cleaned: cleaned_ingredients_data.csv
- Context Encoded: numerical_context_data.csv
- Note: You must check clearly what port number LMStudio provides, what model you are using and edit the code accordingly.
- To run: Datamodeling_01.ipynb
- How to use: Replace the below code with the ingredients you currently have and are missing.
current_ingredients = ['active yeast', 'agave nectar', 'all-purpose flour']
missing_ingredients = ['brown sugar', 'bacon']
recommendations = recommend_replacements(current_ingredients, missing_ingredients)
recommendations- Note: Here I have used additional context for the ingredients, so you must complete the above step (00).
- To run: Datamodeling_02.ipynb
- How to use: Please click "Run All," and a GUI will appear shortly afterward. Insert one recipe at a time in the text box and click ENTER to view the suggested substitutes.
- To run: DataModeling_03.ipynb
- How to use: Please click "Run All," and a GUI will appear shortly afterward. Insert your desired ingredients (separated by comma) in the text box and click the ENTER button to see a list of recommended recipes.
Link here: Notion