
Removing Mondrian's 'high cardinality dimension' feature

2nd June 2011, 10:43 AM
News Bot
 

I would like to remove the 'high cardinality dimension' feature in mondrian 4.0.

To specify that a dimension is high-cardinality, you set the highCardinality attribute of the Dimension element to true. This causes mondrian to scan over the dimension's members, rather than trying to load all of the children of a given parent member into memory.

The goal is a worthy one, but the implementation (making iterators look like lists) has a number of architectural problems: it duplicates code; because it allows only a fixed amount of backtracking, it works with small dimensions but fails unpredictably with larger ones; and because the lists are based on iterators, restarting an iteration multiple times (e.g. from within a crossjoin) can re-execute complex SQL statements.
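To make the last two problems concrete, here is a minimal sketch of a list-like view over an iterator with a fixed backtrack window. It is an illustration only: WindowedList, its window parameter, and the opens counter are hypothetical names, not mondrian's actual classes, and the Supplier stands in for whatever would execute the SQL statement.

```java
import java.util.ArrayDeque;
import java.util.Iterator;
import java.util.List;
import java.util.function.Supplier;

/** Hypothetical illustration; not mondrian's actual implementation. */
class WindowedList<T> {
    private final Supplier<Iterator<T>> source; // stands in for "execute the SQL"
    private final int window;                   // how many recent elements are retained
    private final ArrayDeque<T> buffer = new ArrayDeque<>();
    private Iterator<T> cursor;
    private int consumed;                       // elements read from the cursor so far
    int opens;                                  // how often the source was (re-)opened

    WindowedList(Supplier<Iterator<T>> source, int window) {
        if (window < 1) throw new IllegalArgumentException("window must be >= 1");
        this.source = source;
        this.window = window;
        rewind();
    }

    /** Restarting the iteration re-opens the source, i.e. re-runs the query. */
    void rewind() {
        cursor = source.get();
        opens++;
        consumed = 0;
        buffer.clear();
    }

    /** List-style access that only works within the retained window. */
    T get(int index) {
        if (index < consumed - buffer.size()) {
            throw new IllegalStateException(
                "index " + index + " fell out of the backtrack window");
        }
        while (consumed <= index) {
            if (!cursor.hasNext()) throw new IndexOutOfBoundsException("index " + index);
            buffer.addLast(cursor.next());
            if (buffer.size() > window) buffer.removeFirst(); // forget the oldest element
            consumed++;
        }
        // Walk the buffer to the requested position.
        Iterator<T> it = buffer.iterator();
        for (int i = consumed - buffer.size(); i < index; i++) it.next();
        return it.next();
    }

    public static void main(String[] args) {
        WindowedList<Integer> small =
            new WindowedList<>(() -> List.of(10, 20, 30).iterator(), 4);
        small.get(2);
        System.out.println(small.get(0)); // still inside the window
        small.rewind();                   // a second "query execution"
        System.out.println(small.opens);
    }
}
```

With a three-element source and a window of four, every backward access succeeds, which is why the problem stays hidden on small dimensions. Once the source is longer than the window, a call such as get(0) after get(50) throws, and each rewind() pays for the query again.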

There are other architectural features designed to help with large dimensions. Many functions can operate in an 'iterable' mode (except that here the iterators are explicit). And for many of the most data-intensive operators, such as crossjoin, filter, semijoin (non-empty), and topcount, we can push down the operator to SQL, and thereby reduce the number of records coming out of the RDBMS.
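The effect of pushing an operator down can be sketched with a toy example. Everything here is an assumption made for illustration: FakeDb is an in-memory stand-in for the RDBMS, rowsShipped counts the records that leave it, and the two topCount methods are hypothetical helpers, not mondrian's API. In a real system the pushed-down form would correspond to adding an ORDER BY and a row limit to the generated SQL.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Hypothetical illustration of operator push-down; not mondrian's API. */
class PushDownSketch {
    static final class Row {
        final String member;
        final double sales;
        Row(String member, double sales) { this.member = member; this.sales = sales; }
    }

    /** In-memory stand-in for the RDBMS. */
    static final class FakeDb {
        private final List<Row> table = new ArrayList<>();
        int rowsShipped = 0; // records sent back to the caller

        FakeDb(int size) {
            for (int i = 0; i < size; i++) table.add(new Row("Customer " + i, i));
        }

        /** orderBy == null means unordered; limit < 0 means no limit. */
        List<Row> query(Comparator<Row> orderBy, int limit) {
            List<Row> result = new ArrayList<>(table);
            if (orderBy != null) result.sort(orderBy);
            if (limit >= 0 && limit < result.size()) {
                result = new ArrayList<>(result.subList(0, limit));
            }
            rowsShipped += result.size();
            return result;
        }
    }

    static final Comparator<Row> BY_SALES_DESC =
        Comparator.comparingDouble((Row r) -> r.sales).reversed();

    /** Fetch every member, then sort and truncate on the client side. */
    static List<Row> topCountInMemory(FakeDb db, int n) {
        List<Row> all = db.query(null, -1);
        all.sort(BY_SALES_DESC);
        return new ArrayList<>(all.subList(0, Math.min(n, all.size())));
    }

    /** Let the database sort and truncate, so only n rows are returned. */
    static List<Row> topCountPushedDown(FakeDb db, int n) {
        return db.query(BY_SALES_DESC, n);
    }
}
```

Both methods return the same members, but the in-memory version ships the whole table to the client while the pushed-down version ships only n rows. That reduction in records, rather than the sorting itself, is the point of the sketch.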

It's always hard to remove a feature. But over the years we have seen numerous inconsistencies, and if we removed this feature in mondrian 4.0, we could better focus our resources.

If you are using this feature and getting a significant performance benefit, I would like to hear from you. I would like to understand your use case, and either direct you to another feature that solves the problem or try to develop an alternative solution in mondrian 4.0. The best place to comment on these use cases is the Jira case MONDRIAN-949.


More from Julian Hyde on Open Source OLAP Blog ...