Data Warehousing and Business Intelligence

1.1. Describe your data (give an overview summary of your data set)
1.2. Identify your input and class variables (which variable are you going to use as your class variable)
1.3. Analyse your variables (for each variable, you need to discuss the variable type, calculate relevant summary statistics and visually display the data)
1.4. Discuss any anomalies in the data (for each variable you need to discuss missing values, outliers etc.)
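
As a rough illustration of the kind of analysis that 1.3 and 1.4 ask for, the sketch below computes summary statistics for one numeric variable, counts its missing values and flags outliers with the common 1.5 x IQR rule. The function name `summarise`, the `ages` column and its values are all made up for the example, and treating `None` as "missing" is an assumption; your own data set and tools may call for a different approach.

```python
import statistics

def summarise(values):
    """Summarise one numeric column; None is assumed to mark a missing value."""
    present = [v for v in values if v is not None]
    # Quartiles of the observed values (statistics.quantiles default method).
    q1, _, q3 = statistics.quantiles(present, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return {
        "count": len(present),
        "missing": len(values) - len(present),
        "mean": statistics.mean(present),
        "median": statistics.median(present),
        "stdev": statistics.stdev(present),
        # Values outside the 1.5 x IQR fences are flagged, not removed.
        "outliers": [v for v in present if v < low or v > high],
    }

# Hypothetical column with two missing values and one suspicious entry.
ages = [23, 25, 31, None, 28, 27, 95, 30, None, 26]
summary = summarise(ages)
print(summary)
```

A flagged value is only a candidate anomaly; whether it is an error or a genuine extreme case still has to be argued in the discussion.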

2. Pre-process the Data (Group) (15 marks)

2.1. Discuss and carry out the appropriate handling of any anomalies identified in section 1.4
2.2. Do you need to discard any of your input variables? Justify your reasons.
2.3. Carry out appropriate Correlation Coefficient Analysis of the variables
2.4. Carry out appropriate pre-processing of the data set
2.5. Carry out any appropriate transformation of the input variables
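
The steps in 2.3 and 2.5 could be sketched as follows, assuming numeric input variables with no missing values left after 2.1. The example uses the Pearson correlation coefficient and min-max scaling as one possible choice each; the brief does not prescribe either, and the helper names and sample values are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / math.sqrt(var_x * var_y)

def min_max_scale(xs):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(xs), max(xs)
    if hi == lo:
        raise ValueError("cannot scale a constant variable")
    return [(x - lo) / (hi - lo) for x in xs]

# Hypothetical input variables.
income = [20, 35, 50, 65, 80]
spend = [5, 9, 12, 17, 20]
r = pearson(income, spend)
scaled_income = min_max_scale(income)
print(round(r, 3), scaled_income)
```

Two inputs with a coefficient close to 1 or -1 carry largely the same information, which is one possible justification for discarding one of them under 2.2.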

3. Data Mining (Individual)

3.1. Appropriate use of data-mining algorithms and software (you need to use at least 2 different techniques)
3.2. Appropriate selection and presentation of results
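
To show what "at least 2 different techniques" and a side-by-side presentation of results might look like, here is a minimal sketch that compares a 1-nearest-neighbour classifier with a nearest-centroid classifier on a held-out test set. Both techniques, the tiny data set and the class labels "low" and "high" are chosen purely for illustration; in practice you would use your own data and, most likely, a dedicated data-mining package.

```python
import math
from collections import defaultdict

def nearest_neighbour(train, point):
    """1-NN: return the label of the closest training row."""
    _, label = min(train, key=lambda row: math.dist(row[0], point))
    return label

def nearest_centroid(train, point):
    """Return the label whose class mean (centroid) is closest to the point."""
    groups = defaultdict(list)
    for features, label in train:
        groups[label].append(features)
    centroids = {
        label: [sum(col) / len(rows) for col in zip(*rows)]
        for label, rows in groups.items()
    }
    return min(centroids, key=lambda label: math.dist(centroids[label], point))

def accuracy(classify, train, test):
    """Fraction of test rows whose predicted label matches the true label."""
    hits = sum(classify(train, features) == label for features, label in test)
    return hits / len(test)

# Made-up (features, class) rows, split by hand into training and test sets.
train = [((1.0, 1.0), "low"), ((1.5, 2.0), "low"), ((2.0, 1.0), "low"),
         ((6.0, 6.0), "high"), ((7.0, 5.5), "high"), ((6.5, 7.0), "high")]
test = [((1.2, 1.4), "low"), ((6.8, 6.1), "high"), ((2.2, 1.8), "low")]

results = {
    "1-NN": accuracy(nearest_neighbour, train, test),
    "nearest centroid": accuracy(nearest_centroid, train, test),
}
for name, score in results.items():
    print(f"{name}: {score:.2f}")
```

Evaluating every technique on the same test set is what makes the figures comparable, both between your own techniques and with those of other group members in section 4.1.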

4. Analysis of Results (Individual)
4.1. Discussion and interpretation of the data-mining results (you need to compare the results you get from the data mining with the results of other members of your group)
4.2. Discussion of the business intelligence that can be obtained from the results.

5. Presentation (Group with each group member contributing individually)

