Assignment Task
1. (a) Describe the dataset using appropriate plots, curves, or charts.
(b) Choose one of the continuous attributes and compute its measures of central tendency and variability.
(c) For a particular variable of the dataset, apply Chebyshev’s rule and propose a one-sigma interval. Based on your proposed interval, identify the outliers, if any.
(d) Explain how the box-plot technique can be used to detect outliers. Apply this technique to one attribute of the dataset.
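
A minimal Python sketch of parts 1(c) and 1(d), using only the standard library. The function names and the sample values are illustrative assumptions, not part of the assignment. Chebyshev's inequality guarantees that at least 1 - 1/k^2 of the values lie within k standard deviations of the mean, which is a trivial bound (0) for k = 1, so the one-sigma interval here is simply the mean plus or minus one sample standard deviation. The box-plot rule flags values more than 1.5 x IQR beyond the quartiles; quartile conventions differ between tools, and this sketch assumes the "inclusive" method of statistics.quantiles.

```python
import statistics


def k_sigma_interval(values, k=1.0):
    """Return (low, high) = mean -/+ k sample standard deviations."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return mean - k * sd, mean + k * sd


def chebyshev_bound(k):
    """Minimum fraction of values within k standard deviations of the mean."""
    return max(0.0, 1.0 - 1.0 / k**2)


def iqr_fences(values):
    """Return the box-plot fences (Q1 - 1.5*IQR, Q3 + 1.5*IQR)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def outside(values, low, high):
    """Return the values that fall strictly outside [low, high]."""
    return [v for v in values if v < low or v > high]


if __name__ == "__main__":
    # Made-up measurements standing in for one continuous attribute.
    sample = [12.0, 13.5, 14.0, 14.2, 15.1, 15.3, 16.0, 16.8, 17.5, 40.0]

    low, high = k_sigma_interval(sample, k=1.0)
    print("one-sigma interval:", (low, high))
    print("outside one-sigma interval:", outside(sample, low, high))
    print("Chebyshev bound for k=2:", chebyshev_bound(2))

    f_low, f_high = iqr_fences(sample)
    print("box-plot fences:", (f_low, f_high))
    print("box-plot outliers:", outside(sample, f_low, f_high))
```

A one-sigma interval will usually flag many ordinary values, so it is worth comparing its result with the box-plot fences before calling a value an outlier.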
2. (a) Select four variables of the dataset and propose an appropriate probability model to quantify the uncertainty of each variable.
(b) For each model in part (a), estimate the parameters of the model.
(c) Explain how each model can be used for predictive analytics, then find the prediction for each attribute.
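
One possible way to approach parts 2(b) and 2(c) is sketched below in Python, using only the standard library. The choice of a normal model and an exponential model is an assumption made for illustration; the appropriate model depends on the variable. The function names and sample values are hypothetical. The prediction interval is a plug-in interval that treats the estimated parameters as exact, so it ignores estimation uncertainty.

```python
from statistics import NormalDist, fmean


def fit_normal(values):
    """Estimate a normal model (sample mean and sample standard deviation)."""
    return NormalDist.from_samples(values)


def fit_exponential_rate(values):
    """Maximum-likelihood estimate of an exponential rate: 1 / sample mean."""
    return 1.0 / fmean(values)


def normal_prediction_interval(model, coverage=0.95):
    """Plug-in central interval expected to contain a new observation."""
    tail = (1.0 - coverage) / 2.0
    return model.inv_cdf(tail), model.inv_cdf(1.0 - tail)


if __name__ == "__main__":
    # Made-up values standing in for a roughly symmetric continuous variable.
    heights = [158.0, 162.5, 165.0, 167.2, 170.1, 171.4, 174.0, 180.3]
    model = fit_normal(heights)
    print("estimated mu, sigma:", model.mean, model.stdev)
    print("point prediction:", model.mean)
    print("95% prediction interval:", normal_prediction_interval(model))

    # Made-up waiting times standing in for a positive, right-skewed variable.
    waits = [0.4, 1.1, 0.7, 2.9, 0.2, 1.6]
    rate = fit_exponential_rate(waits)
    print("estimated rate:", rate, "predicted mean wait:", 1.0 / rate)
```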
3. (a) Consider two categorical variables of the dataset and develop a binary decision-making strategy to check whether the two variables are independent at the significance level alpha=0.01. To do so,
i. State the hypotheses.
ii. Find the test statistic and the critical value(s).
iii. Explain your decision and interpret the results.
(b) Consider one categorical variable and apply a goodness-of-fit test to evaluate whether a candidate set of probabilities is appropriate for quantifying the uncertainty of the class frequencies at the significance level alpha=0.05.
(c) Consider one continuous variable in the dataset and apply a test of the mean for a proposed candidate value of μ at the significance level alpha=0.05.
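
The three tests in question 3 can be sketched in Python with the standard library alone. Everything named here is an assumption for illustration: the function names, the made-up counts, and the choice of a large-sample z-test for part (c) (a t-test is usually preferred for small samples). The standard library has no chi-square quantile function, so the chi-square critical values are hard-coded from a standard table: 6.635 for 1 degree of freedom at alpha=0.01 and 5.991 for 2 degrees of freedom at alpha=0.05.

```python
import math
from statistics import NormalDist, fmean, stdev


def chi2_independence(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


def chi2_goodness_of_fit(observed, probs):
    """Chi-square statistic and degrees of freedom against candidate probabilities."""
    n = sum(observed)
    stat = sum((o - n * p) ** 2 / (n * p) for o, p in zip(observed, probs))
    return stat, len(observed) - 1


def z_test_mean(values, mu0, alpha=0.05):
    """Two-sided large-sample z-test of H0: mean == mu0."""
    n = len(values)
    z = (fmean(values) - mu0) / (stdev(values) / math.sqrt(n))
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    return z, z_crit, abs(z) > z_crit


if __name__ == "__main__":
    # (a) H0: the two categorical variables are independent (made-up 2 x 2 counts).
    stat, dof = chi2_independence([[30, 10], [15, 25]])
    crit = 6.635  # table value, dof = 1, alpha = 0.01
    print("independence:", stat, dof, "reject H0" if stat > crit else "fail to reject H0")

    # (b) H0: class probabilities equal the candidate set (made-up counts).
    stat, dof = chi2_goodness_of_fit([48, 35, 17], [0.5, 0.3, 0.2])
    crit = 5.991  # table value, dof = 2, alpha = 0.05
    print("goodness of fit:", stat, dof, "reject H0" if stat > crit else "fail to reject H0")

    # (c) H0: mu = 5.0 (made-up values).
    z, z_crit, reject = z_test_mean([4.8, 5.1, 5.6, 4.9, 5.3, 5.2, 5.0, 5.4], 5.0)
    print("mean test:", z, z_crit, "reject H0" if reject else "fail to reject H0")
```

In each case the decision rule is the same: reject the null hypothesis when the statistic exceeds the critical value, and otherwise conclude that the data do not provide enough evidence against it.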