<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Computing 95 percentile in MySQL</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2008/11/28/computing-95-percentile-in-mysql/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2008/11/28/computing-95-percentile-in-mysql/</link>
	<description>Everything about MySQL Performance</description>
	<lastBuildDate>Sat, 21 Nov 2009 05:23:57 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/11/28/computing-95-percentile-in-mysql/comment-page-1/#comment-404838</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Mon, 08 Dec 2008 16:20:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=549#comment-404838</guid>
		<description>Histograms are nice, I agree.  We&#039;ve built histograms into Maatkit&#039;s new log analysis tool, mk-log-parser.  We are also going to add an indication of time-clustering, e.g. &quot;this query always runs at 4AM on Tuesday&quot;.

I might write up a post on histograms -- with LOG() it is actually not too hard to do, though it&#039;s not terribly efficient (filesort/temp table).</description>
		<content:encoded><![CDATA[<p>Histograms are nice, I agree.  We&#8217;ve built histograms into Maatkit&#8217;s new log analysis tool, mk-log-parser.  We are also going to add an indication of time-clustering, e.g. &#8220;this query always runs at 4AM on Tuesday&#8221;.</p>
<p>I might write up a post on histograms &#8212; with LOG() it is actually not too hard to do, though it&#8217;s not terribly efficient (filesort/temp table).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2008/11/28/computing-95-percentile-in-mysql/comment-page-1/#comment-396601</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Sun, 30 Nov 2008 02:20:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=549#comment-396601</guid>
		<description>Dale,

A histogram is good, though it takes more work to get in SQL, and it is harder to define goals with one.  It is much easier to say that the 95th percentile should be within one second and the 99th percentile within 3 seconds than to define goals in terms of a histogram.</description>
		<content:encoded><![CDATA[<p>Dale,</p>
<p>A histogram is good, though it takes more work to get in SQL, and it is harder to define goals with one.  It is much easier to say that the 95th percentile should be within one second and the 99th percentile within 3 seconds than to define goals in terms of a histogram.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dale Lancaster</title>
		<link>http://www.mysqlperformanceblog.com/2008/11/28/computing-95-percentile-in-mysql/comment-page-1/#comment-396452</link>
		<dc:creator>Dale Lancaster</dc:creator>
		<pubDate>Sat, 29 Nov 2008 23:42:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=549#comment-396452</guid>
		<description>Thanks for the article.  For me, having a histogram of access/load times would be more meaningful.  I am not sure what a 99th or 95th percentile ranking would mean to me.  If anything, it would lead me to generate more statistics, and the one I would find most useful is a histogram of the counts of the load times.  This would require rounding the data a bit so that only the first couple of decimal places of precision are used; otherwise you get a lot of noise.

To generate a histogram with SQL, I usually just do a count with a GROUP BY on the load times, and you are good to go.  So in your example, if you did the GROUP BY with an ORDER BY on the load time, I would be most interested in how often the really large load times occur.

Thanks.</description>
		<content:encoded><![CDATA[<p>Thanks for the article.  For me, having a histogram of access/load times would be more meaningful.  I am not sure what a 99th or 95th percentile ranking would mean to me.  If anything, it would lead me to generate more statistics, and the one I would find most useful is a histogram of the counts of the load times.  This would require rounding the data a bit so that only the first couple of decimal places of precision are used; otherwise you get a lot of noise.</p>
<p>To generate a histogram with SQL, I usually just do a count with a GROUP BY on the load times, and you are good to go.  So in your example, if you did the GROUP BY with an ORDER BY on the load time, I would be most interested in how often the really large load times occur.</p>
<p>Thanks.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
