B9BA103: A dessert parlour wants to use the above dataset to train a model that could automatically label: Data Mining Report, DBS, Ireland

Text Mining Assignment
Dataset
The dataset Yelp Restaurant Reviews.csv contains customer reviews of different restaurants. In addition to customer reviews, customer ratings (scale 15) are also provided. It is assumed that a customer would recommend a given restaurant to others if the rating given by that customer is 4 or 5. Otherwise, it is assumed that the customer would not recommend the restaurant to others.
This dataset is taken from https://www.kaggle.com/datasets/farukalam/yelprestaurantreviews
Task
A dessert parlour wants to use the above dataset to train a model that could automatically label its customer reviews to indicate whether a given customer would recommend it to others or not. Your task is to help dessert parlour create such a model by trying any two classification algorithms (that you deem appropriate) in Python.
Stuck in Completing this Assignment and feeling stressed ? Take our Private Writing Services
Get A Free Quote

Deliverables & Assessment Rubric
Put the following in a zipped folder and upload to Moodle:

1.
Python Code (.py) (40% Weighting)

Key Assessment Areas: Does the Python code demonstrate the functionality required to implement text classification effectively on a given dataset (i.e., data preparation; model training, evaluation)?
Is the code running without errors? Does the code come with copious explanatory comments for the various steps involved?

2.
Report (.pdf) (60% Weighting)

Key assessment areas: Evidence of critical analysis. A max 2000word report should providecritical analysis of the following points in context of the given task:

a)
Text cleaning Has the learner discussed the approach used to clean text before its conversion to numerical format? Has the learner discussed limitations of the used approach?

b)
Creation of structured data Has the learner discussed the approach used to convert clean text to a structured numerical format? Has the learner discussed limitations of the used approach?

c)
Model performance evaluation Has the learner discussed the choice of model evaluation metric? Has the learner interpreted the performance of each model and discussed why they performed the way they did? Has the learner recommended any model for use by the dessert parlour?
Are You Searching Answer of this Question? Request Ireland Writers to Write a plagiarism Free Copy for You.
Get A Free Quote

Reference no: EM132069492

WhatsApp
Hello! Need help with your assignments? We are here

GRAB 25% OFF YOUR ORDERS TODAY

X