<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: EuroOSCON 2006 &#8211; High Performance FullText Search</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/</link>
	<description>Everything about MySQL Performance</description>
	<lastBuildDate>Sat, 07 Nov 2009 18:35:44 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Sphinx and Lucene search engines, first impressions &#124; PageTracer &#124; Webmasters mashup</title>
		<link>http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/comment-page-1/#comment-242695</link>
		<dc:creator>Sphinx and Lucene search engines, first impressions &#124; PageTracer &#124; Webmasters mashup</dc:creator>
		<pubDate>Thu, 14 Feb 2008 23:42:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/#comment-242695</guid>
		<description>[...] MySQL full text search. I will show you only one slide from this presentation which you can find on MySQL Performance Blog to demonstrate the speed difference between MySQL, Lucene and Sphinx on a 5GB test database (of [...]</description>
		<content:encoded><![CDATA[<p>[...] MySQL full text search. I will show you only one slide from this presentation which you can find on MySQL Performance Blog to demonstrate the speed difference between MySQL, Lucene and Sphinx on a 5GB test database (of [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Apachez</title>
		<link>http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/comment-page-1/#comment-2730</link>
		<dc:creator>Apachez</dc:creator>
		<pubDate>Sat, 23 Sep 2006 13:09:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/#comment-2730</guid>
		<description>I agree to that Peter.

The reason for why I created TBGsearch was not to build a complete standalone search engine solution but to find a replacement of mysql fulltext for the forum at http://www.tbg.nu

In my case the problem with mysql fulltext were that my searches performed:

* Using boolean search.
* No stopword-list.
* Sorting hits by date and not &quot;relevance&quot; since relevance were irrelevant in this case :-)

All the three reasons above made mysql fulltext to virtually puke when a client performed a search which contained several commonly existing word.

I assume the reason for that is that the mysql fulltext internally will first find all hits for each word (since it was boolean search), then sort the result on date and finally return top 200 matches (instead of having some optimization to loop one hit at a time until 200 hits is collected or similar to that). Ending up taking sometimes 10 minutes or more to complete a search...

So I started to look for a fix or a replacement and quickly found sphinx which I created a perl api for (since I&#039;m using perl on my site). However sphinx had some limitations such as no support for wildcards, no liveupdates and when a new sphinx version were released the old perl api was no longer compatible.

So instead of putting time into updating the perl api I tried to see how large the difference would be if I created a search engine written only in perl with mysql as backend for the storage and this ended up in TBGseach which I have released as open-source at http://www.tbg.nu/tbgsearch (for download http://www.tbg.nu/tbgsearch/tbgsearch.zip).

In case someone has some tips or other questions regarding TBGsearch you can email me at the address which is written in http://www.tbg.nu/tbgsearch/readme.txt</description>
		<content:encoded><![CDATA[<p>I agree to that Peter.</p>
<p>The reason for why I created TBGsearch was not to build a complete standalone search engine solution but to find a replacement of mysql fulltext for the forum at <a href="http://www.tbg.nu" rel="nofollow">http://www.tbg.nu</a></p>
<p>In my case the problem with mysql fulltext were that my searches performed:</p>
<p>* Using boolean search.<br />
* No stopword-list.<br />
* Sorting hits by date and not &#8220;relevance&#8221; since relevance were irrelevant in this case <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>All the three reasons above made mysql fulltext to virtually puke when a client performed a search which contained several commonly existing word.</p>
<p>I assume the reason for that is that the mysql fulltext internally will first find all hits for each word (since it was boolean search), then sort the result on date and finally return top 200 matches (instead of having some optimization to loop one hit at a time until 200 hits is collected or similar to that). Ending up taking sometimes 10 minutes or more to complete a search&#8230;</p>
<p>So I started to look for a fix or a replacement and quickly found sphinx which I created a perl api for (since I&#8217;m using perl on my site). However sphinx had some limitations such as no support for wildcards, no liveupdates and when a new sphinx version were released the old perl api was no longer compatible.</p>
<p>So instead of putting time into updating the perl api I tried to see how large the difference would be if I created a search engine written only in perl with mysql as backend for the storage and this ended up in TBGseach which I have released as open-source at <a href="http://www.tbg.nu/tbgsearch" rel="nofollow">http://www.tbg.nu/tbgsearch</a> (for download <a href="http://www.tbg.nu/tbgsearch/tbgsearch.zip)" rel="nofollow">http://www.tbg.nu/tbgsearch/tbgsearch.zip)</a>.</p>
<p>In case someone has some tips or other questions regarding TBGsearch you can email me at the address which is written in <a href="http://www.tbg.nu/tbgsearch/readme.txt" rel="nofollow">http://www.tbg.nu/tbgsearch/readme.txt</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/comment-page-1/#comment-2725</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Sat, 23 Sep 2006 08:02:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/#comment-2725</guid>
		<description>Kevin, 

I can&#039;t agree with you on final conclusion.  First you have to identify your performance goals and than check solutions.  For many data sizes or search requirements MySQL FullText search performance is unfortunately showstopper.  

There are some examples and benchmarks in presentation :)

For tiny data size, ie search for your personal DVD collection MySQL FullText search should be used of course as it is simplest to use.    

You also can check sphinx which goes between Lucene and MySQL FullText Search in features and ease of use but which is the fastest. 

Note: It might sounds as I&#039;m too negative to MySQL Full Text Search but just have seen too many people being stuck trying to scale it when their data size grew.</description>
		<content:encoded><![CDATA[<p>Kevin, </p>
<p>I can&#8217;t agree with you on final conclusion.  First you have to identify your performance goals and than check solutions.  For many data sizes or search requirements MySQL FullText search performance is unfortunately showstopper.  </p>
<p>There are some examples and benchmarks in presentation <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>For tiny data size, ie search for your personal DVD collection MySQL FullText search should be used of course as it is simplest to use.    </p>
<p>You also can check sphinx which goes between Lucene and MySQL FullText Search in features and ease of use but which is the fastest. </p>
<p>Note: It might sounds as I&#8217;m too negative to MySQL Full Text Search but just have seen too many people being stuck trying to scale it when their data size grew.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Burton</title>
		<link>http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/comment-page-1/#comment-2723</link>
		<dc:creator>Kevin Burton</dc:creator>
		<pubDate>Sat, 23 Sep 2006 04:43:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2006/09/22/eurooscon-2006-high-performance-fulltext-search/#comment-2723</guid>
		<description>yeah.... turn it into an article here.......

I&#039;ve worked with both Lucene and MySQL full-text search and both are fairly decent.  MySQL really doesn&#039;t scale as much as Lucene but it&#039;s really easy to implement and support.  Much better to just use MySQL if you&#039;re already a MySQL shop then to have to deal with Lucene.

Kevin</description>
		<content:encoded><![CDATA[<p>yeah&#8230;. turn it into an article here&#8230;&#8230;.</p>
<p>I&#8217;ve worked with both Lucene and MySQL full-text search and both are fairly decent.  MySQL really doesn&#8217;t scale as much as Lucene but it&#8217;s really easy to implement and support.  Much better to just use MySQL if you&#8217;re already a MySQL shop then to have to deal with Lucene.</p>
<p>Kevin</p>
]]></content:encoded>
	</item>
</channel>
</rss>
