Ata Model and Data Warehouse Design Architecture & XML

Exercise 1 (50%). Data Model and Data Warehouse Design Architecture
You are expected to propose a design architecture for a large data warehouse, i.e. data model, database schema, database integration framework, of your chosen business environment. In order to carry out the tasks, you may wish to visit a company, or an industry or any service provider that you know in order to find out how they operate. Alternatively, you may wish to research data warehousing implementations and case studies on the Internet.
Once you have identified a business or company for which you propose to design a data warehouse, gather requirements and formulate a business scenario prior to the design work. Based on the business scenario that you have developed, start designing your data model for the department that you are responsible for. Select an appropriate model for the representation of your data model. It is desired that you apply Extended ER or UML modelling technique to represent the data model and should be underpinned with appropriate theory and practices. The data model should represent all the relevant business rules, integrities and validation procedures. Now convert the diagram into appropriate schema. Provide appropriate rationale on the database normalisation process and show any discrepancies between top down and bottom up approaches. The report can be 2-5,000 words (typically, 5-15 A4 pages) excluding the appendices. You can use any drawing tool of your choice, except the ER or UML generator. You should also include dimensional modelling and representation of dimensional modelling using visualisation approach, namely star schema.
Your work should be documented in the form of a brief report. The report should contain the following:
1. Business scenario
2. Detail EER/Class diagram of the proposed system, plus normalisation
3. Data warehouse model (including star schema) and process
4. How the company/industry can benefit from the proposed implementation. What are the possible threats of implementing such a data warehouse.
5. Possible recommendations on strategies to overcome future challenges, for example, data mining approaches. Plus a bibliography/references section.
Each section above is worth 10 marks, giving a total of 5x10 50 marks
You must include all relevant citations through out your report. You are required to list books, sources, articles etc. in the bibliography section of your report.


Exercise 2 (50%): XML
You are required to represent your previous term s weekly university timetable in XML and display it using XSL/t.
” Submit all files saved on a compact/floppy disc.
” Also submit printouts (hardcopies) of all files, plus the XSL output and the original printed timetable.
” Correct format; increasingly there seems to be many problems with CD formats. Make sure that your CD works on a standard CD/DVD reader, there are many recording formats that cannot be read.
Step 1: XML representation. Complete the following objectives:
” Print a copy of your university timetable (if no timetable exists in the system, or even a system, then please create a copy of your timetable for a typical week).
” Make a copy of your student timetable. This copy is to be submitted with the coursework and therefore it is an important part of the assessment.
” Think about how to represent that information. Look for hierarchical relationships.
” Create an XML instance that will represent the knowledge in the student timetable. The XML file should include descriptive elements, appropriate attributes and accurate data.
” It is unacceptable to copy data from other students: this will be considered as plagiarism.
Step 2: Validation
” Create a DTD fille that validates the file created in the previous step.
” Marks are awarded for appropriate use of DTD element types (empty, container and data).
” Marks are awarded for appropriate use of Attributes.
” Marks are awarded for the appropriate use of Namespaces and entities.
” Marks are awarded for the flexibility of the design.
Step 3: Transformation
” Create an XSL/t file that transforms the data in the XML file to an XHTML file.
” Marks are awarded for use of XSL/t commands.
” Marks are awarded for the similarity between the structure of the XHTML file and the original timetable.
Step 4: Preparing for Submitting Coursework
” Tidy up your code, using indentation and comments.
” Complete the frontsheet. Include a copy of the original timetable; part of the assessment criteria is assessing data representation and similarity between transformation of the data and original timetable.
” Make two copies of the coursework: The first copy on a CD to be submitted to the student office. The second copy should be on any format preferred by yourself and should be kept in a safe place should the first copy go missing.
Ensure that you have the following files on your disc:
” XML files
” DTD for above files
” XSL file to transform to Listing


Assessment Criteria
Data Representation (8%)
1. Flexibility Is the design flexible enough for other users e.g.
use of postal code and zip codes (2)
2. Generic ensuring that other instances can be represented
and there are no specific tags. Multiple levels of data (2)
3. Appropriate names of elements (2)
4. Use of idref and id (2)
DTD & Validation (24%)
1. Malformed (-5) (this is a tough penalty, but, it has been emphasised throughout the module that malformed code is not XML).
2. Doctype and validation (4)
3. Attributes: required, implied and fixed (3)
4. Empty tags (1)
5. Use of entity and entities (2)
6. Use of namespaces (5)
7. Use of enumerator types (3)
8. Use of relationships ?,*,+, (4)
9. Data and Container Elements (2)
Level of complexity chosen is a coefficient from:
1. easy0.5,
2. medium0.75,
3. difficult1.0 and multiplied by the sum above.
XSL/t (16%)
1. Correct use of XPaths (2)
2. Iteration (2)
3. Selection (2)
4. xpath predicates (2)
5. Comments and indentation (2)
6. Appropriate Template use and transform independency i.e.
reliant on data in xml instance and not static(2)
7. Appropriate Attribute extraction (2)
8. Appropriate Element extraction (2)
Similarity of transformation (2%)
1. Compare XHTML output file to media format chosen and
compare similarity (2)
TOTAL FOR EXERCISE 2: 50%