<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Star Schema Bechmark: InfoBright, InfiniDB and LucidDB</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/</link>
	<description>Percona&#039;s MySQL &#38; InnoDB performance and scalability blog</description>
	<lastBuildDate>Sat, 11 Feb 2012 16:45:54 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Jaimie Sirovich</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-868363</link>
		<dc:creator>Jaimie Sirovich</dc:creator>
		<pubDate>Mon, 02 Jan 2012 20:05:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-868363</guid>
		<description>@Michael Fuery

MongoDB is even less like a column store than a row store is because there are no joins and all data must be stored in one place (without said joins).

Column Store = Each column is a file that can be scanned (oversimplification). Joins can be used to put data together.
Row Store = Each column is part of a file containing all columns (again, oversimplification). Joins can be used to put data together.
MongoDB = There is only one table period, and all columns and joins must be in one file unless you do a fake-join in your application.

For most purposes, column stores behave like row stores.  Joins work, etc.  Some of the performance characteristics will be different, though.  Clearly if you&#039;re using/selecting most columns, they won&#039;t help much.  They might even hurt!  Which columns you filter by or select in a column store will make a big difference.  

What you get out of MongoDB is map-reduce, and a natural ability to use a non-declarative (i.e. SQL) language that can be trivially paralellized without any planner magic.  In other words, even if the physical layout sucks, you can throw hardware at it and expect that to help.</description>
		<content:encoded><![CDATA[<p>@Michael Fuery</p>
<p>MongoDB is even less like a column store than a row store is because there are no joins and all data must be stored in one place (without said joins).</p>
<p>Column Store = Each column is a file that can be scanned (oversimplification). Joins can be used to put data together.<br />
Row Store = Each column is part of a file containing all columns (again, oversimplification). Joins can be used to put data together.<br />
MongoDB = There is only one table period, and all columns and joins must be in one file unless you do a fake-join in your application.</p>
<p>For most purposes, column stores behave like row stores.  Joins work, etc.  Some of the performance characteristics will be different, though.  Clearly if you&#8217;re using/selecting most columns, they won&#8217;t help much.  They might even hurt!  Which columns you filter by or select in a column store will make a big difference.  </p>
<p>What you get out of MongoDB is map-reduce, and a natural ability to use a non-declarative (i.e. SQL) language that can be trivially paralellized without any planner magic.  In other words, even if the physical layout sucks, you can throw hardware at it and expect that to help.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Fuery</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-821907</link>
		<dc:creator>Michael Fuery</dc:creator>
		<pubDate>Mon, 12 Sep 2011 18:21:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-821907</guid>
		<description>Thanks for the real-world case study. It is very insightful! I am wondering where MongoDB fits into this picture?</description>
		<content:encoded><![CDATA[<p>Thanks for the real-world case study. It is very insightful! I am wondering where MongoDB fits into this picture?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Justin Swanhart</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-746528</link>
		<dc:creator>Justin Swanhart</dc:creator>
		<pubDate>Sun, 11 Apr 2010 19:55:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-746528</guid>
		<description>Pedro,

Also, the SSB schema is based on the TPC-H schema, which is denormalized.</description>
		<content:encoded><![CDATA[<p>Pedro,</p>
<p>Also, the SSB schema is based on the TPC-H schema, which is denormalized.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Justin Swanhart</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-746527</link>
		<dc:creator>Justin Swanhart</dc:creator>
		<pubDate>Sun, 11 Apr 2010 19:49:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-746527</guid>
		<description>SSB does not prohibit using materialized views with query re-write to achieve performance, but you have to account for this in the loading time according to the rules.

So for example, on Oracle you could build materialized views to answer the queries, and allow Oracle to rewrite the SSB queries transparently to use your views, but this wouldn&#039;t gain you much since each SSB query is executed only once.

A database engine might build a materialized view (or a specialized index) automatically behind the scenes on foreign key constraint definitions in order to improve the performance of the join.  I believe that LucidDB builds such a structure on FK constraints, as does ScaleDB, Kickfire, and other database engines.  Other engines focus on distributed hash-joins instead of materialization.  Ideally, such an engine will have a high-performance way of building such a structure.

As I recall, the SSB benchmark was designed with a specialized type of DB2 materialized cube in mind, but I&#039;d have to read the paper again to verify that.</description>
		<content:encoded><![CDATA[<p>SSB does not prohibit using materialized views with query re-write to achieve performance, but you have to account for this in the loading time according to the rules.</p>
<p>So for example, on Oracle you could build materialized views to answer the queries, and allow Oracle to rewrite the SSB queries transparently to use your views, but this wouldn&#8217;t gain you much since each SSB query is executed only once.</p>
<p>A database engine might build a materialized view (or a specialized index) automatically behind the scenes on foreign key constraint definitions in order to improve the performance of the join.  I believe that LucidDB builds such a structure on FK constraints, as does ScaleDB, Kickfire, and other database engines.  Other engines focus on distributed hash-joins instead of materialization.  Ideally, such an engine will have a high-performance way of building such a structure.</p>
<p>As I recall, the SSB benchmark was designed with a specialized type of DB2 materialized cube in mind, but I&#8217;d have to read the paper again to verify that.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Perfczar</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-745642</link>
		<dc:creator>Perfczar</dc:creator>
		<pubDate>Fri, 09 Apr 2010 14:04:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-745642</guid>
		<description>In many products, it is possible to change the physical schema without changing the logical schema.  This can improve the performance of queries with dimension table predicates, especially when data comes from disk, because the matching rows are easy to identify and are collocated.  Full denormalization doesn&#039;t help as much; the joins to retrieve dimension values are fast because the dimension tables are relatively small.</description>
		<content:encoded><![CDATA[<p>In many products, it is possible to change the physical schema without changing the logical schema.  This can improve the performance of queries with dimension table predicates, especially when data comes from disk, because the matching rows are easy to identify and are collocated.  Full denormalization doesn&#8217;t help as much; the joins to retrieve dimension values are fast because the dimension tables are relatively small.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pedro Magalhaes</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-745631</link>
		<dc:creator>Pedro Magalhaes</dc:creator>
		<pubDate>Fri, 09 Apr 2010 13:00:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-745631</guid>
		<description>Nicholas/all

I saw that LucidDB and InfoBright have a high compression rate. So if we make the SSB denormalization then probabilly the queries perfomance will grow up. Is that right?

Does anyone already made this kind of stuff?

Thanks</description>
		<content:encoded><![CDATA[<p>Nicholas/all</p>
<p>I saw that LucidDB and InfoBright have a high compression rate. So if we make the SSB denormalization then probabilly the queries perfomance will grow up. Is that right?</p>
<p>Does anyone already made this kind of stuff?</p>
<p>Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nicholas Goodman</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-745630</link>
		<dc:creator>Nicholas Goodman</dc:creator>
		<pubDate>Fri, 09 Apr 2010 12:43:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-745630</guid>
		<description>Pedro,

Speaking for LucidDB it&#039;s unlikely you&#039;d see a large increase if you denormalized to a flat table.  Since we store column by column you&#039;d see some change in size but it wouldn&#039;t be substantial.</description>
		<content:encoded><![CDATA[<p>Pedro,</p>
<p>Speaking for LucidDB it&#8217;s unlikely you&#8217;d see a large increase if you denormalized to a flat table.  Since we store column by column you&#8217;d see some change in size but it wouldn&#8217;t be substantial.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pedro Magalhaes</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-745529</link>
		<dc:creator>Pedro Magalhaes</dc:creator>
		<pubDate>Fri, 09 Apr 2010 06:36:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-745529</guid>
		<description>Vadim/all,

I was taking a look at SSB and then i was thinking if is possible to denormalize all tables from the SSB?

So database size will increase a lot but potencially the performance will grow up. Is that right? Is It possible?

Thanks and sorry for my poor englise
Pedro</description>
		<content:encoded><![CDATA[<p>Vadim/all,</p>
<p>I was taking a look at SSB and then i was thinking if is possible to denormalize all tables from the SSB?</p>
<p>So database size will increase a lot but potencially the performance will grow up. Is that right? Is It possible?</p>
<p>Thanks and sorry for my poor englise<br />
Pedro</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim Tommaney</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-727283</link>
		<dc:creator>Jim Tommaney</dc:creator>
		<pubDate>Fri, 19 Feb 2010 21:08:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-727283</guid>
		<description>Vadim/all,

We have some metrics from running a SF-10,000 to share for those interested. Notes on the paramters tweaked from the defaults are included as well.  The dbgen issue with keys being re-used referenced by Marcin Z does show up at this scale. 

The basic scenario; load a SF-10,000 and a SF-1,000 (60 billion and 6 billion facts respectively) into one InfiniDB instance on a 6 sever (8 cores, 16GB) stack attached to 4 FC attached LSI trays.  Then run the queries in a stream against each and compare the relative time against each other.

Overall, the results were as expected; the overall processing behavior didn&#039;t change as we scaled the fact table.  Anyway, further details are available at:   http://www.infinidb.org/myblog-admin/mysql-parallel-query-processing-of-60-billion-rows-via-infinidb.html . 

Thanks - Jim Tommaney</description>
		<content:encoded><![CDATA[<p>Vadim/all,</p>
<p>We have some metrics from running a SF-10,000 to share for those interested. Notes on the paramters tweaked from the defaults are included as well.  The dbgen issue with keys being re-used referenced by Marcin Z does show up at this scale. </p>
<p>The basic scenario; load a SF-10,000 and a SF-1,000 (60 billion and 6 billion facts respectively) into one InfiniDB instance on a 6 sever (8 cores, 16GB) stack attached to 4 FC attached LSI trays.  Then run the queries in a stream against each and compare the relative time against each other.</p>
<p>Overall, the results were as expected; the overall processing behavior didn&#8217;t change as we scaled the fact table.  Anyway, further details are available at:   <a href="http://www.infinidb.org/myblog-admin/mysql-parallel-query-processing-of-60-billion-rows-via-infinidb.html" rel="nofollow">http://www.infinidb.org/myblog-admin/mysql-parallel-query-processing-of-60-billion-rows-via-infinidb.html</a> . </p>
<p>Thanks &#8211; Jim Tommaney</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vadim</title>
		<link>http://www.mysqlperformanceblog.com/2010/01/07/star-schema-bechmark-infobright-infinidb-and-luciddb/comment-page-1/#comment-723783</link>
		<dc:creator>Vadim</dc:creator>
		<pubDate>Fri, 12 Feb 2010 18:20:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=1955#comment-723783</guid>
		<description>Matthieu,

I just use &quot;du -sh *&quot; on datadir, that&#039;s all.</description>
		<content:encoded><![CDATA[<p>Matthieu,</p>
<p>I just use &#8220;du -sh *&#8221; on datadir, that&#8217;s all.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

