Essbase

From Wikipedia, the free encyclopedia

Jump to: navigation, search
Essbase
Latest release 11.1.1.1.0 / Nov 2008
Operating system Microsoft Windows, Linux, AIX, HP-UX, Solaris
Type Multidimensional database
License Proprietary
Website www.oracle.com/bi

Essbase is a multidimensional database management system (MDBMS) that provides a multidimensional database platform upon which to build analytic applications. Essbase, whose name derives from "Extended Spread Sheet dataBASE", was originally developed by Arbor Software, which merged with Hyperion Software in 1998. Hyperion Solutions Corporation was acquired by Oracle Corporation in 2007, so Essbase is now available as Oracle Essbase. Until late 2005, it was also marketed by IBM as DB2 OLAP Server. [1]

The term "on-line analytical processing" (OLAP) was coined by database researcher E. F. Codd in a whitepaper that set out twelve rules for analytic systems, an allusion to his earlier famous set of twelve rules defining the relational model. This whitepaper, published by Computerworld, was somewhat explicit in its reference to Essbase features, and when it was later discovered that Codd had been sponsored by Arbor Software, Computerworld withdrew the paper.[2]

By comparison with "on-line transaction processing" (OLTP), OLAP defines a database technology that is optimized for processing human queries rather than transactions. The results of this orientation was that MDBMS oriented their performance requirements around a different set of benchmarks (Analytic Performance Benchmark, APB-1) than that of RDBMS (Transaction Processing Performance Council (TPC)).

Many Hyperion products were renamed in 2005, giving Essbase an official name of Hyperion System 9 BI+ Analytic Services, but the new name was largely ignored by practitioners. The Essbase brand was later returned to the official product name for marketing purposes, but the server software still carried the "Analytic Services" title until it was incorporated into Oracle's Business Intelligence product suite. [3]

Essbase was named as one of the 10 most influential technology innovations of the last 10 years by Information Age magazine in its 10th anniversary issue, along with Netscape, Blackberry, Google, virtualization, Voice Over IP (VOIP), Linux, XML, the Pentium processor and ADSL. Editor Kenny MacIver said: "Hyperion Essbase was the multi-dimensional database technology that put online analytical processing on the business intelligence map. It has spurred the creation of scores of rival OLAP products – and billions of OLAP cubes".

Contents

[edit] History and motivation

Although Essbase has been categorised as a general-purpose multidimensional database, it was originally developed to address the scalability issues associated with spreadsheets such as Lotus 1-2-3 and Microsoft Excel. Indeed, the patent covering Essbase uses spreadsheets as a motivating example to illustrate the need for such a system. [1]

In this context, "multi-dimensional" refers to the representation of financial data in spreadsheet format. A typical spreadsheet may display time intervals along column headings, and account names on row headings. For example:

Jan Feb Mar Total
Quantity 1000 2000 3000 6000
Sales $100 $200 $300 $600
Expenses $80 $160 $240 $480
Profit $20 $40 $60 $120

If a user wants to break down these values by Region, for example, this typically involves the duplication of this table on multiple spreadsheets:

North
Jan Feb Mar Total
Quantity 240 1890 50 2180
Sales $24 $189 $5 $218
Expenses $20 $150 $3 $173
Profit $4 $39 $2 $45
South
Jan Feb Mar Total
Quantity 760 110 2950 3820
Sales $76 $11 $295 $382
Expenses $60 $10 $237 $307
Profit $16 $1 $58 $75
Total Region
Jan Feb Mar Total
Quantity 1000 2000 3000 6000
Sales $100 $200 $300 $600
Expenses $80 $160 $240 $480
Profit $20 $40 $60 $120

An alternative representation of this structure would be a three-dimensional spreadsheet grid, giving rise to the idea that "Time", "Account", and "Region" are dimensions. As further dimensions are added to the system, it becomes very difficult to maintain spreadsheets that correctly represent the multi-dimensional values. Multidimensional databases such as Essbase provide a data store for values that exist, at least conceptually, in a multi-dimensional "hypercube".

[edit] Sparsity

A technical problem faced by multidimensional databases is the physical representation of data as the number and size of dimensions increases. Say the above example was extended to add a "Customer" and "Product" dimension:

Dimension Number of dimension values
Accounts 4
Time 4
Region 3
Customer 10,000
Product 5,000

If the multidimensional database reserved storage space for every possible value, it would need to store 2,400,000,000 (4 × 4 × 3 × 10,000 × 5,000) cells. If each cell is represented as a 64-bit floating point value, this equates to a memory requirement of at least 17 gigabytes. In practice, of course, the number of combinations of "Customer" and "Product" that contain meaningful values will be a tiny subset of the total space. This property of multi-dimensional spaces is referred to as sparsity.

[edit] Aggregation

OLAP systems generally provide for multiple levels of detail within each dimension by arranging the members of each dimension into one or more hierarchies. A Time dimension, for example, may be represented as a hierarchy starting with "Total Time", and breaking down into multiple years, then quarters, then months. An Accounts dimension may start with "Profit", which breaks down into "Sales" and "Expenses", and so on.

In the example above, if "Product" represents individual product SKUs, analysts may want to also be able to report using aggregations such as "Product Group", "Product Family", "Product Line", etc. Similarly, for "Customer", natural aggregations may arrange customers according to geographic location or industry.

The number of aggregate values implied by a set of input data can be surprisingly large. If the Customer and Product dimensions are each in fact six "generations" deep, then 36 (6 × 6) aggregate values are affected by a single data point. It follows that if all these aggregate values are to be stored, the amount of space required is proportional to the product of the depth of all aggregating dimensions. For large databases, this can cause the effective storage requirements to be many hundred times the size of the data being aggregated.

[edit] Block storage (Essbase Analytics)

Since version 7, Essbase has supported two "storage options" which take advantage of sparsity to minimize the amount of physical memory and disk space required to represent large multidimensional spaces. The Essbase patent[1] describes the original method, which aimed to reduce the amount of physical memory required without increasing the time required to look up closely-related values. With the introduction of alternative storage options, this was named Block Storage Option (Essbase BSO), and later referred to as Essbase Analytics in marketing material.

Put briefly, Essbase requires the developer to tag dimensions as "dense" or "sparse". The system then arranges data to represent the hypercube into "blocks", where each block is multi-dimensional array made up of "dense" dimensions, and space is allocated for every potential cell in that block. Sparsity is exploited because the system only creates blocks when required. In the example above, say the developer has tagged "Accounts" and "Time" as "dense", and "Region", "Customer", and "Product" as "sparse". If there are, say, 12,000 combinations of Region, Customer and Product that contain data, then only 12,000 blocks will be created, each block large enough to store every possible combination of Accounts and Time. The number of cells stored is therefore 192000 (4 × 4 × 12000), requiring under 2 megabytes of memory, plus the size of the index used to look up the appropriate blocks.

Because this implementation is hidden from front-end tools (i.e., a report that attempts to retrieve data from non-existent cells merely sees "null" values), the full hypercube can be navigated naturally, and it is possible to load values into any cell interactively.

[edit] Calculation engine

Calculations in Essbase BSO can be specified as:

  • the aggregation of values through dimensional hierarchies;
  • stored calculations on dimension members;
  • "dynamically calculated" dimension members; or
  • procedural "calculation scripts" that act on values stored in the database.

The first method (dimension aggregation) is implicitly performed through addition, or by selectively tagging branches of the hierarchy to be subtracted, multiplied, divided or ignored. Also, the result of this aggregation can be stored in the database, or calculated dynamically on demand—members must be tagged as "Stored" or "Dynamic Calc." to specify which method is to be used.

The second method (stored calculations) uses a formula against each calculated dimension member—when Essbase calculates that member, the result is stored against that member just like a data value.

The third method (dynamic calculation) is specified in exactly the same format as stored calculations, but are calculated when a value addressed by that member is accessed by a user, and are not stored.

The fourth method (calculation scripts) uses a procedural programming language specific to the Essbase calculation engine. This type of calculation may act upon any data value in the hypercube, and can therefore be used to perform calculations that cannot be expressed as a simple formula.

A calculation script must also be executed to trigger the calculation of aggregated values or stored calculations as described above—a built-in calculation script (called the "default calculation") can be used to execute this type of calculation.

[edit] Aggregate storage (Enterprise Analytics)

Although block storage effectively minimizes storage requirements without impacting retrieval time, it is limited by its treatment of aggregate data in large applications, motivating the introduction of a second storage engine, named Aggregate Storage Option (Essbase ASO) or more recently, Enterprise Analytics. This storage option makes the database behave much more similarly to OLAP databases like SQL Server Analysis Services.

Following a data load, Essbase ASO does not store any aggregate values, but instead calculates them on demand. For large databases, where the time required to generate these values is inconvenient, the database can materialize one or more aggregate "views", made up of one aggregate level from each dimension (for example, the database may calculate all combinations of the fifth generation of Product with the third generation of Customer), and these views are then used to generate other aggregate values where possible. This process can be partially automated, where the administrator specifies the amount of disk space that may be used, and the database generates views according to actual usage.

The major drawback of this approach is that the cube cannot be treated for calculation purposes as a single large hypercube, because aggregate values cannot be directly controlled, so write-back from front-end tools is limited, and complex calculations that cannot be expressed as MDX expressions are not possible.

[edit] Calculation engine

Calculations in Essbase ASO can be specified as:

  • the aggregation of values through dimensional hierarchies; or
  • dynamically calculated dimension members.

The first method (dimension aggregation) is basically the same as for Essbase BSO.

The second method (dynamic calculations) evaluates MDX expressions against dimension members.

[edit] User interface

The most widely known user interface to Essbase is an add-in for Microsoft Excel (previously also Lotus 1-2-3). The add-in adds a menu to the spreadsheet application that can be used to connect to Essbase databases, retrieve data, and navigate the cube's dimensions ("Zoom in", "Pivot", etc).[2]

With the release of System 9, Hyperion provided a new user interface add-in for Essbase called "SmartView for Microsoft Office". SmartView provides access to Essbase and other System 9 content for Microsoft Powerpoint, Microsoft Word, Microsoft Outlook as well as supplanting the previous add-in for Microsoft Excel.

In 2005, Hyperion began to offer a visualization tool called Tableau under the name "Hyperion Visual Explorer".[3] Tableau was originally developed at Stanford University as a government-sponsored research project to investigate new ways for users to interact with relational and OLAP databases.

Other user-facing applications with support for Essbase databases are:

The previous offerings from Hyperion are offered with new names as given below:

Hyperion's previous offerings Hyperion System 9 BI+ offerings
Hyperion Essbase ASO Enterprise Analytics
Hyperion Essbase BSO Essbase Analytics
Hyperion Analyzer Web Analysis
Hyperion Reports Financial Reporting
Hyperion Intelligence Interactive Reporting
Hyperion SQR Production Reporting
Hyperion Metrics Builder Enterprise Metrics

An API is available for C, Visual Basic and Java, and embedded scripting support is available for Perl. The standardised XML for Analysis protocol can be used to query Essbase data sources using the MDX language.

In 2007, Oracle began bundling Hyperion BI tools into Oracle Business Intelligence Enterprise Edition Plus.

[edit] Administrative interface

A number of standard interfaces are provided for the administration of Essbase applications:

  • ESSCMD, the original command line interface for administration commands;
  • MaxL, a "multi-dimensional database access language" which provides both a superset of ESSCMD commands, but with a syntax more akin to SQL, as well as support for MDX queries;
  • Essbase Application Manager, the original Microsoft Windows GUI administration client, compatible with versions of Essbase before 7.0; and
  • Essbase Administration Services, later renamed Analytic Administration Services, and then back to 'Essbase Administration Services' in v. 9.3.1, the currently-supported GUI administration client.
  • Essbase Integration Server for maintaining the structure and content of Essbase databases based on data models derived from relational or file-based data sources.

[edit] Competitors

There are several significant competitors among the OLAP, analytics products to that of Essbase (HOLAP/MOLAP) on the market, among them Microsoft SQL Server Analysis Services (MOLAP, HOLAP, ROLAP), IBM Cognos BI (MOLAP/ROLAP/HOLAP), IBM Cognos Planning/Applix TM1 (MOLAP),Microstrategy (ROLAP). With the acqusition of Hyperion, Oracle OLAP (ROLAP/MOLAP) is now an internal alternative/competitor product to Essbase.

Also note that of the above competitors, including Essbase, all use heterogenous relational (SQL Server, Oracle, IBM DB/2, TeraData, Access, etc.) or non-relational data sourcing (Excel, text Files, CSV Files, etc.) to feed the cubes (facts and dimensional data), except for Oracle OLAP which may only use Oracle relational sourcing.

[edit] Export and/or Product Migration of Essbase

There are currently two options available for exportation of Essbase cubes into other formats.

CubePort is a commercial conversion application that converts Essbase cubes to the Microsoft SQL Server Analysis Services product. This product performs an object-to-object translation that make up an Essbase cube, including: outline, member formulas, calc scripts, data loading (load rules), report scripts to MDX queries, substitution variables, and security model. It can extract from any platform version of Essbase, including Oracle/Hyperion Essbase on Windows, Unix, AIX, HP UX, Solaris, IBM DB/2 OLAP, or AS/400 Showcase Essbase.

Outline Extractor is another option that performs a pure, rudimentary, export of the outline, though is does not directly create any new objects. The output is a simple text file that can be pulled indirectly into other OLAP products, among other uses, such as sychronizing outlines.

[edit] See also

[edit] References

  1. ^ a b Earle, Robert J. (1992) "Method and apparatus for storing and retrieving multi-dimensional data in computer memory". United States Patent 5,359,724 assigned to Arbor Software Corporation.
  2. ^ Hyperion Solutions Corporation (2006). Essbase Database Administrator's Guide.
  3. ^ Tableau Software (2005). Tableau Software Lands Global OEM Deal with Hyperion. Press release.

[edit] External links

Personal tools
Languages