Pages

Wednesday, November 17, 2010

OLAP

OLAP SYSTEM


OLAP (online analytical processing) is computer processing that enables a userto easily and selectively extract and view data from different points of view.For example, a user can request that data be analyzed to display a spreadsheet showing all of a company's products sold in Chennai in the month of July, compare revenue with those
for the same products in September, and then see a comparison of other product sales in Chennai in thesame time period. (On Line Analytical Processing) Decision support software that allows the user to quickly analyze information that has been summarized into multidimensional views and hierarchies.
Online: Although most OLAP tools and applications enable development of reports that can be savedand printed when not connected to live data, OLAP emphasizes live access to data rather than static reporting. Analytic queries are submitted against the database in real time, and the results are returned in real time.
Analytical processing: This is the key concept with OLAP. End users can:
• Easily navigate multidimensional data to perform unpredictable ad hoc queries and to display
the results
• Manage business rules across dimensions and cubes
• Drill through levels of detail to uncover significant aspects of data
• Rapidly and efficiently obtain the results of sophisticated data calculation and selection across
multiple dimensions of data
A standard transactional report or query might ask, “When did order 84305 ship?” This query reflects
the basic mechanics of doing business. It involves simple data selection and little or no calculation
processing. It can be answered directly from the transactional system, probably without impacting other operations. Every organisation needs this basic level of information.
In contrast, OLAP systems are typically deployed to extend and enhance an organization’s ability to answer a much broader range of business questions about the data they are collecting in their transactional systems:
• How do sales for our top 10 most profitable products across Europe for this quarter compare
with sales a year ago?
• What are the differences in the product-sales mix between the regions, relative to the global
sales mix?
• What are our forecast units, unit price per service, unit cost per product, sales, cost trends, and
profit for the next 12 months?
• In what ways does the mix vary by salesperson, and what is the relative performance of our
salespeople?
• What are the products making up 40% of our profit for each region over time?
These questions are more analytical and complex, and the answer to one question often leads
immediately to another question as the user follows a train of thought in researching a business
problem or opportunity.
Although OLAP applications are found in widely divergent functional areas, they all require the
following key features:
· Multidimensional views of data
· Calculation-intensive capabilities
· Time intelligence
OLAP System can handle:
· Reports over time
· Summarized data
· Aggregated data
· Data used for trend analysis
· Non time sensitive information
OLAP systems can’t handle
· Transactional forms and reports used in day-to-day business
· Detailed transactions, unless there is a specific drill down reporting need.
OLAP Advantages:
• On-Line Analytical Processing (OLAP) provides customers with tools that can be used to perform
"multidimensional analysis" on data to discover hidden information within the database.
• BI Technology: A special technologies are far better tuned to dealing with summarization, slice
‘n’ dicing and decision support solutions than a stock standard relational database.
• Demoralized data:Data in the OLAP database can be demoralized to include submersed data.
This should drastically cut down computational time for many reports. The number of joins is
reduced, thus reducing look ups, hashes and merging.
• Index optimization:Many indexes can be placed on the data which are geared for report processing
performance.As data is not being added all the time, there will be no conflict with OLTP
systems.
• Lock reduction:the databaseis brought to an absolute minimum on the database as it is SELECT
only store, the tables gladly run with multiple “read locks”.
Types of OLAP Systems
• MOLAP - Multidimensional Online Analytical Processing or MOLAP is considered to be main
form of OLAP and thus sometimes just referred as OLAP. It is known for its fast query performance
which is because of optimized storage, multidimensional indexing . The main disadvantage
is that it introduces data redundancy.
Advantages of MOLAP
• Fast query performance due to optimized storage, multidimensional indexing .
• Automated computation of higher level aggregates of the data.
• It is very compact for low dimension data sets.
Disadvantages of MOLAP
• Within some MOLAP Solutions the processing step (data load) can be quite lengthy, especially
on large data volumes.
• MOLAP tools traditionally have difficulty querying models with dimensions with very high
cardinality(i.e., millions of members).
• Some MOLAP products have difficulty updating and querying models.
• Some MOLAP methodologies introduce data redundancy.
• ROLAP - Relational Online Analytical Processing is an alternative to the MOLAP and works
directly with relational databases. The main difference between MOLAP and ROLAP is that it
doesn't require storage of information. One advantage of ROLAP is that data stored in database
can be accessed through SQL tool. But it is considered to be slower than MOLAP tools by
industry people.
Advantages of ROLAP
• ROLAP is considered to be more scalable in handling large data volumes, especially models
with dimensions with very high cardinality (i.e. millions of members).
• The data is stored in a standard relational database and can be accessed by any sql tool. .
Disadvantages of ROLAP
• There is a consensus in the industry that ROLAP tools have slower performance than MOLAP
tools.
• When the step of creating aggregate tables is skipped, the query performance then suffers
because the larger detailed tables must be queried. This can be partially remedied by adding
additional aggregate tables, however it is still not practical to create aggregate tables for all
combinations of dimensions/attributes.
• HOLAP -This is a combination of the above two. This architecture has the MOLAP storage of
summary data and this summary data has links to the detailed transaction level data in Data
Warehouse RDBMS to enable user to drill down to the lowest level of detail. Apart from that it
also allows direct access to the data in Data Warehouse, which is not considered worthwhile to
be translated into OLAP cubes.

Tuesday, November 16, 2010

OLTP

OLTP SYSTEM

Databases must often allow the real-time processing of SQL transactions to support real time time-critical applications. This type of processing is known as online transaction processing (OLTP). Enterprise-level relational database management software, such as SQL Server, was designed originally to centrally store the data generated by the daily transactions of large companies or government organizations. Over the decades, these databases have grown to be highly efficient systems for recording the data required to perform the daily operations of the enterprise. Because the system is based on computers and records the business transactions of the enterprise, these systems are known as online transaction processing (OLTP) systems.


The data in OLTP systems is organized primarily to support transactions, such as:

  • Recording an order from a point-of-sale(store) terminal or entered through a Web site.
  • Placing an order for more supplies when inventory levels drop to a defined level.
  • Recording employee data.
Individual transactions are completed quickly and access relatively small amounts of data. OLTP systems are designed and tuned to process hundreds or thousands of transactions being entered at the same time.Although OLTP systems excel at recording the data required to support daily operations,OLTP data is not organized in a manner that easily provides the information required by managers to plan the work of their organizations. Managers need summary information from which they can analyze trends that affect their organization or team. They need to find the critical factors affecting the success of their organization, and how best to adjust those factors to improve the success of the enterprise. They need to find how the workload of their enterprise is affected by seasonal and yearlytrends so that they can predict how many employees and resources will be required to perform future work.

OLTP systems can handle:
  • Manual transaction capturing
  • Batch transaction processing of current data
  • Transactional information
  • Current business activities
  • · Current workflow including stage and state
  • · Search and find functionality of recent business transactional activity
  • · Document creation for the purpose of carrying out a transaction, e.g. Invoices, Delivery notes,
  • Job cards.
  • Historical detailed transactions
  • Time sensitive information
OLTP systems can’t handle:
  • Reporting
  • Management information
  • Decision support systems
  • Aggregated and summarized information
Advantages:
  • It provides faster and more accurate forecast for revenues and expenses.
  • It provides a concrete foundation for a stable organization because of timely modification of all transactions.
  • It makes the transactions much easier on behalf of the customers by allowing them to make the payments according to their choice.
  • It broadens the customer base for an organization by simplifying and speeding up
individual processes.

  • Small current footprint of data: The OLTP database should be optimized with a small footprint of information that allows the business to continue trading today. This will allow DBA'sto manage backups more efficiently where only current data is backed up. Historicaltransactions could be scheduled over quite times.
  • Normalized:The database focuses on highly normalized tables with strong database referential backing. This will assist in the drive to ensure the current data is accurate and notcompromised.
  • Performance tuned:The tables will have a minimal set of indexes on it as to support fast transactional processing
  • Historical transaction splitting: In very large databases, historical data is normally partitionedinto separate tables, thus allowing for smaller current tables with current and active data. In the event that historical transactions need to be queried, it can be used as the front-end to access the history table.
  • Data locks: As the active data set will be smaller, lock durations may bedecreased.
  • Lock contention: As the OLTP solution will be in a separate database compared to OLAP data,long running reports will not affect the current transactional processing tasks by holding “readlocks” on database for a long period of time.
  • Deadlock reduction: Although deadlocks are created by inconsistent data access, many OLTP versus OLAP deadlocks could be avoided. Deadlocks are caused by Insert, Update and Delete actions conflicting with other Insert, Update, Delete and Select statements on the same database.

Disadvantages:
Online Transaction Processing (OLTP) has the following disadvantages:
• The data is not properly organized.
• The data is highly volatile.
• The OLTP system does not maintain history.
• Data is in highly normalized form
• This data is maintained at different platforms with different data structures