<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: MySQL: Data Storage or Data Processing</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/</link>
	<description>Everything about MySQL Performance</description>
	<lastBuildDate>Sat, 21 Nov 2009 05:23:57 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Gaurav Singh</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-281705</link>
		<dc:creator>Gaurav Singh</dc:creator>
		<pubDate>Fri, 18 Apr 2008 19:31:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-281705</guid>
		<description>how i can switch mysql to use other partition of my hard disk to store data.
i am a new kid for mysql. assist me for the same.</description>
		<content:encoded><![CDATA[<p>how i can switch mysql to use other partition of my hard disk to store data.<br />
i am a new kid for mysql. assist me for the same.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-203609</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Sat, 24 Nov 2007 19:54:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-203609</guid>
		<description>Well Google is just an example. It could be Amazon, FaceBook or Yahoo... a lot of smaller second tier web sites would qualify as well.

Web system which have to deal with user generated content have to deal with massive amount of such. 

Speaking about Google - this is deliberate choice of search application. There are many services using same underlying infrastructure such as bigtable which do not have this problem.

You do not really need to have shared storage to give consistent results, you can have consistent shared systems either.  There are of course various implications - like if you get to deal with multiple data centers you have to choice either synchronous propagation (meaning you can run into the trouble if connection between them has problems) or asynchronous propagation, meaning you will get a stale copy.  Of course you can structure the application so stale copy is never used but it normally comes with other performance challenges.</description>
		<content:encoded><![CDATA[<p>Well Google is just an example. It could be Amazon, FaceBook or Yahoo&#8230; a lot of smaller second tier web sites would qualify as well.</p>
<p>Web system which have to deal with user generated content have to deal with massive amount of such. </p>
<p>Speaking about Google &#8211; this is deliberate choice of search application. There are many services using same underlying infrastructure such as bigtable which do not have this problem.</p>
<p>You do not really need to have shared storage to give consistent results, you can have consistent shared systems either.  There are of course various implications &#8211; like if you get to deal with multiple data centers you have to choice either synchronous propagation (meaning you can run into the trouble if connection between them has problems) or asynchronous propagation, meaning you will get a stale copy.  Of course you can structure the application so stale copy is never used but it normally comes with other performance challenges.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gints Plivna</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-201873</link>
		<dc:creator>Gints Plivna</dc:creator>
		<pubDate>Thu, 22 Nov 2007 16:07:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-201873</guid>
		<description>It is not quite correct to compare &quot;normal&quot; databases with google. Google is very special with special needs. For example a while ago I was searching google using some keywords to test my homepage ranking in search results. I was quite surprised for a phenomenon that simultaneous searches in my home computer and work computer (which physically are away ~4 km from each other) returned different results. Obviously they used different google sites and one of them had not updated it&#039;s index. So can you imagine a relational database that will return different results for SUM of your company salaries depending on which node you&#039;ll execute your query? :)
I have Oracle background and it&#039;s RAC (real application clusters) option is a step towards your proposed solution, but it uses the same storage and it always gives back consistent results, not depending on which node you run your statement. It is both it&#039;s strength and weakness. Strength because it gives back on all nodes the same result, weakness because that means all nodes have to get most recent versions of database blocks and in case of frequent changes it puts a quite big load on interconnect and waits are introduced.</description>
		<content:encoded><![CDATA[<p>It is not quite correct to compare &#8220;normal&#8221; databases with google. Google is very special with special needs. For example a while ago I was searching google using some keywords to test my homepage ranking in search results. I was quite surprised for a phenomenon that simultaneous searches in my home computer and work computer (which physically are away ~4 km from each other) returned different results. Obviously they used different google sites and one of them had not updated it&#8217;s index. So can you imagine a relational database that will return different results for SUM of your company salaries depending on which node you&#8217;ll execute your query? <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
I have Oracle background and it&#8217;s RAC (real application clusters) option is a step towards your proposed solution, but it uses the same storage and it always gives back consistent results, not depending on which node you run your statement. It is both it&#8217;s strength and weakness. Strength because it gives back on all nodes the same result, weakness because that means all nodes have to get most recent versions of database blocks and in case of frequent changes it puts a quite big load on interconnect and waits are introduced.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bluecells &#187; Blog Archive &#187; MySQL: Data Storage or Data Processing</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-198715</link>
		<dc:creator>bluecells &#187; Blog Archive &#187; MySQL: Data Storage or Data Processing</dc:creator>
		<pubDate>Tue, 20 Nov 2007 00:35:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-198715</guid>
		<description>[...] can read the full story here   Filed under: [...]</description>
		<content:encoded><![CDATA[<p>[...] can read the full story here   Filed under: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-197751</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Mon, 19 Nov 2007 09:27:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-197751</guid>
		<description>AlexN,

This is not the thing I&#039;m talking about. The proper cursors allow you to stream date easily but they do not store the main problem.  They are still not parallel.</description>
		<content:encoded><![CDATA[<p>AlexN,</p>
<p>This is not the thing I&#8217;m talking about. The proper cursors allow you to stream date easily but they do not store the main problem.  They are still not parallel.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AlexN</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-197703</link>
		<dc:creator>AlexN</dc:creator>
		<pubDate>Mon, 19 Nov 2007 08:04:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-197703</guid>
		<description>There is, for example, Mondrian (http://mondrian.pentaho.org/), the OLAP tool that uses MySQL
as a DB engine for ROLAP. About 50% of all processing job is done on server. It is not very fast,
however. From my experience, the main advantage in data processing on server using MSSQL (not MySQL)
is gained form its cursors, that use row level locks and do not interfere with data load and retrieval.
Ideal for background processes. In MySQL cursor syntax is obscure, and I think that many people
still are not using them at all.</description>
		<content:encoded><![CDATA[<p>There is, for example, Mondrian (<a href="http://mondrian.pentaho.org/)" rel="nofollow">http://mondrian.pentaho.org/)</a>, the OLAP tool that uses MySQL<br />
as a DB engine for ROLAP. About 50% of all processing job is done on server. It is not very fast,<br />
however. From my experience, the main advantage in data processing on server using MSSQL (not MySQL)<br />
is gained form its cursors, that use row level locks and do not interfere with data load and retrieval.<br />
Ideal for background processes. In MySQL cursor syntax is obscure, and I think that many people<br />
still are not using them at all.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-197091</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Sun, 18 Nov 2007 18:15:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-197091</guid>
		<description>Yeah. Good quote though I do not think so.  I think the current technologies are not mature enough and may be they will develop to lead the next wave but it is also possible some other leaders pop up.</description>
		<content:encoded><![CDATA[<p>Yeah. Good quote though I do not think so.  I think the current technologies are not mature enough and may be they will develop to lead the next wave but it is also possible some other leaders pop up.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Aker</title>
		<link>http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/comment-page-1/#comment-197067</link>
		<dc:creator>Brian Aker</dc:creator>
		<pubDate>Sun, 18 Nov 2007 17:53:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/11/18/mysql-data-storage-or-data-processing/#comment-197067</guid>
		<description>Hi!

You are probably not surprised to know that some of us are advocating this. I&#039;ll probably blog what my MySQL internal thoughts sometime soon :)

A quote I heard recently &quot;Hadoop will replace the database vendors&quot;. Good quote aye?

Cheers,
   -Brian</description>
		<content:encoded><![CDATA[<p>Hi!</p>
<p>You are probably not surprised to know that some of us are advocating this. I&#8217;ll probably blog what my MySQL internal thoughts sometime soon <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>A quote I heard recently &#8220;Hadoop will replace the database vendors&#8221;. Good quote aye?</p>
<p>Cheers,<br />
   -Brian</p>
]]></content:encoded>
	</item>
</channel>
</rss>
