logo for information-management-architect.com
Home
Strategy
Framework
Business Case Analysis
Project Planning
Requirements Analysis
Architecture & Design
Build Phase
Quality Assurance
Transition to Production
Management Information
Business Intelligence
Data Warehouse
Tools
Jobs
leftimage for information-management-architect.com

Data Requirements

Need to evolve data requirements for an information management requirements specification and want practical timesaving suggestions?


Task Objectives
Data Requirements
The objective of this task is to define requirements for:
  • Organizing, storing, moving and protecting data;
  • Ensuring data quality; and
  • Using data for business intelligence, analytical and reporting purposes.
Why is this important?

Information management, or data warehousing projects,  are data centric. This means that most of the requirements will be data related and require highly skilled IT professional specialists early in the project, to define requirements, and thru-out the project, to ensure requirements traceability and satisfaction of requirements.

Clearly defined requirements will go a long way towards ensuring project success

Structure and design

An entity relationship diagram documents the logical data model and metadata e.g. a definition for each entity and attribute. This model represents logical business rules e.g. the model might show a business rule which states “a person can own many automobiles, and an automobile must be owned by one person at one time”  The users can look at the model and agree that the logic is correct. The technical team can use the model as the basis to ensure that the data base design will not permit an automobile to be owned by more than one person at one time, or, in this case, not permit storage of an automobile without an  owner.

A dimensional model documents the facts and dimensions that will be used for analytic and reporting purposes. Think of a fact is something that has happened, e.g. an invoice.

Think of a dimension as all of the things that relate to the fact e.g. a product dimension, a customer dimension, a sales territory, and a date dimension. These dimensions are used to analyze details about the fact e.g. it helps answer questions like which products were sold, by which sales territory, to what customers and when.

Data storage

Data storage requirements are concerned with data volumes and expected growth rates e.g. if we know that we have 100,000 customers today and are expecting a growth of 10% by month, then these data requirements need to be documented to provide direction to the architecture and design team for data base sizing and capacity planning.

Data movement

Data movement requirements confirm the source data interface information, which will provide guidance to the architecture and design team. If the solution builds a data warehouse, which feeds other systems, then the data movement requirements will also include interface information for the “down-stream” systems.

It is very important to get user input on the data movement requirements e.g. sales data might be derived from a sales application or from a billing system. The user team needs to confirm which system is the “system of record” for the sales data i.e. the system that is  “trusted” to provide the most accurate data.

Data security

Data security sounds like a non-functional requirement but needs to involve users.

Suppose you have a human resource application that contains sensitive  executive salary and bonus information. This application is used by the HR department but will now be loaded into a data warehouse which can be used by a variety of other departments—It is important to identify the security data requirements and consider who can read the data, who can extract it for “downstream” systems, and how to prevent unauthorized access to test data.

Data quality

Data quality requirements are critical to a successful information management project—it is important to identify how data quality issues will be handled. Things to consider include should we load bad data? Should we reject it? Should we store it until a data steward corrects it? Who should be notified when data quality issues occur?

Information usage

Business intelligence and analytical reporting requirements should specify which data metric requirements will be used and provide a definition for each metric e.g. if invoice count is defined as a count of all invoices sent, then it is clear that it does not include invoices that are still in progress.  This definition helps the architecture and design team build a solution that only extracts invoices with a status of “sent”.

Information usage should also consider:
  • What types of query profiles will be required? This identifies possibly data requirements, which should be reflected in the dimensional model;
  • Is there a requirement to convert current reports to the new system?
  • Are there are any audit reporting requirements or reconciliation reporting requirements e.g. if we obtain data from the sales application and the billing system, and the amount billed does not match the order amount, how do we report this?
Summary…

Information management projects are data centric. Most of the requirements are data related and require highly skilled IT professional specialists early in the project, to define requirements, and thru-out the project, to ensure requirements traceability and satisfaction of requirements.



footer for Information management page