<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: How to load large files safely into InnoDB with LOAD DATA INFILE</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/</link>
	<description>Percona&#039;s MySQL &#38; InnoDB performance and scalability blog</description>
	<lastBuildDate>Sat, 11 Feb 2012 16:45:54 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Will</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-848635</link>
		<dc:creator>Will</dc:creator>
		<pubDate>Wed, 30 Nov 2011 01:33:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-848635</guid>
		<description>Old post, but very helpful.  We were doing an ignore into load which caused a lot of issues on our production transactions.  By splitting up the import into chunks, it eliminated the impact on our production load.</description>
		<content:encoded><![CDATA[<p>Old post, but very helpful.  We were doing an ignore into load which caused a lot of issues on our production transactions.  By splitting up the import into chunks, it eliminated the impact on our production load.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-798270</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Tue, 15 Feb 2011 18:01:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-798270</guid>
		<description>That&#039;s better to ask on the forum, so here&#039;s a pointer to the forum :) http://forum.percona.com</description>
		<content:encoded><![CDATA[<p>That&#8217;s better to ask on the forum, so here&#8217;s a pointer to the forum <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  <a href="http://forum.percona.com" rel="nofollow">http://forum.percona.com</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-798261</link>
		<dc:creator>John</dc:creator>
		<pubDate>Tue, 15 Feb 2011 16:59:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-798261</guid>
		<description>Hi,

I have run into a little problem with this

i have create a bash script to allow me to pass in table name a file to load in data with, this works fine, but if i use the replace option on the load data infile, i get errors of duplicates

ERROR 1062 (23000) at line 1: Duplicate entry &#039;85694e353d34b4ab284970f22e3bcd66&#039; for key &#039;idx_code&#039;

any pointers would be really helpful


John</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>I have run into a little problem with this</p>
<p>i have create a bash script to allow me to pass in table name a file to load in data with, this works fine, but if i use the replace option on the load data infile, i get errors of duplicates</p>
<p>ERROR 1062 (23000) at line 1: Duplicate entry &#8217;85694e353d34b4ab284970f22e3bcd66&#8242; for key &#8216;idx_code&#8217;</p>
<p>any pointers would be really helpful</p>
<p>John</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-660233</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Wed, 30 Sep 2009 13:35:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-660233</guid>
		<description>Nishant, I would suspect that you&#039;re seeing the tablespace grow because of the insert buffer.  This can be controlled in Percona-patched versions of InnoDB, and in XtraDB.</description>
		<content:encoded><![CDATA[<p>Nishant, I would suspect that you&#8217;re seeing the tablespace grow because of the insert buffer.  This can be controlled in Percona-patched versions of InnoDB, and in XtraDB.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nishant Deshpande</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-660167</link>
		<dc:creator>Nishant Deshpande</dc:creator>
		<pubDate>Wed, 30 Sep 2009 10:45:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-660167</guid>
		<description>Baron,

Thanks for the blog as always. I was wondering if this suggests a solution to my problem, namely the shared tablespace (ibdata) file growing even when i have file_per_table and indeed all my tables are created correctly as separate files.

when i occasionally do &#039;insert into new_bigtable select * from bigtable&#039;... i notice that the ibdata file grows huge (unfortunately i haven&#039;t run controlled experiements given i only notice this for really large tables 100GB+). i think this also happens when i do a &#039;load data infile&#039; again we&#039;re talking 100GB+ files.

Can I make sure I understand your two points above, namely:

&gt;&gt; lots of undo log entries collect in the tablespace...
from here (http://dev.mysql.com/doc/refman/5.1/en/multiple-tablespaces.html) i see that the undo log entries are kept in the shared tablespace (i&#039;m not sure if you meant that in (1) it wasn&#039;t clear to me)

so basically if i conduct a transaction on 100GB of data, the ibdata file will necessarily grow to be approximately this size just because i&#039;m doing this as a transaction. once the transaction commits, the undo logs will be &#039;discarded&#039; but the ibdata file will remain at 100GB. and now i have no way of shrinking this back (unless i do a mysqldump and load which for a large db is prohibitively expensive). as i understand it i can&#039;t just copy my .ibd / .frm files and then put them on a new mysql instance.

is there any way of avoiding the ibdata file from growing to be as large as the largest transaction effectively? for me the largest transactions would be a data load which would be huge and that means ibdata would be swallowing 20% or more of my disk.

Nishant</description>
		<content:encoded><![CDATA[<p>Baron,</p>
<p>Thanks for the blog as always. I was wondering if this suggests a solution to my problem, namely the shared tablespace (ibdata) file growing even when i have file_per_table and indeed all my tables are created correctly as separate files.</p>
<p>when i occasionally do &#8216;insert into new_bigtable select * from bigtable&#8217;&#8230; i notice that the ibdata file grows huge (unfortunately i haven&#8217;t run controlled experiements given i only notice this for really large tables 100GB+). i think this also happens when i do a &#8216;load data infile&#8217; again we&#8217;re talking 100GB+ files.</p>
<p>Can I make sure I understand your two points above, namely:</p>
<p>&gt;&gt; lots of undo log entries collect in the tablespace&#8230;<br />
from here (<a href="http://dev.mysql.com/doc/refman/5.1/en/multiple-tablespaces.html" rel="nofollow">http://dev.mysql.com/doc/refman/5.1/en/multiple-tablespaces.html</a>) i see that the undo log entries are kept in the shared tablespace (i&#8217;m not sure if you meant that in (1) it wasn&#8217;t clear to me)</p>
<p>so basically if i conduct a transaction on 100GB of data, the ibdata file will necessarily grow to be approximately this size just because i&#8217;m doing this as a transaction. once the transaction commits, the undo logs will be &#8216;discarded&#8217; but the ibdata file will remain at 100GB. and now i have no way of shrinking this back (unless i do a mysqldump and load which for a large db is prohibitively expensive). as i understand it i can&#8217;t just copy my .ibd / .frm files and then put them on a new mysql instance.</p>
<p>is there any way of avoiding the ibdata file from growing to be as large as the largest transaction effectively? for me the largest transactions would be a data load which would be huge and that means ibdata would be swallowing 20% or more of my disk.</p>
<p>Nishant</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-655040</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Mon, 21 Sep 2009 01:37:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-655040</guid>
		<description>Hi Gadi, thanks for your comment.  I&#039;ve updated the incorrect code listing and added a bold comment below it.</description>
		<content:encoded><![CDATA[<p>Hi Gadi, thanks for your comment.  I&#8217;ve updated the incorrect code listing and added a bold comment below it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gadi Naveh</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-654873</link>
		<dc:creator>Gadi Naveh</dc:creator>
		<pubDate>Sun, 20 Sep 2009 07:26:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-654873</guid>
		<description>Some comments - while the fifo as facility works, it is not obvious from the page that in the loop, the load command must reference the fifo file and NOT the original. it actually says - mysql -e &quot;..... same as above.... &quot; which is misleading.

I suggest putting together a step-by-step directions for this page, including a bold comment about which file to use in the load.

chrz</description>
		<content:encoded><![CDATA[<p>Some comments &#8211; while the fifo as facility works, it is not obvious from the page that in the loop, the load command must reference the fifo file and NOT the original. it actually says &#8211; mysql -e &#8220;&#8230;.. same as above&#8230;. &#8221; which is misleading.</p>
<p>I suggest putting together a step-by-step directions for this page, including a bold comment about which file to use in the load.</p>
<p>chrz</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-359384</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Fri, 03 Oct 2008 13:28:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-359384</guid>
		<description>Only MyISAM supports full-text indexing in MySQL.  If you have a lot of content to index (bigger than your available memory) and you need high performance, you probably need an external solution such as Sphinx or Lucene.  Sphinx has a storage engine for easy integration with MySQL.</description>
		<content:encoded><![CDATA[<p>Only MyISAM supports full-text indexing in MySQL.  If you have a lot of content to index (bigger than your available memory) and you need high performance, you probably need an external solution such as Sphinx or Lucene.  Sphinx has a storage engine for easy integration with MySQL.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: LeRoy Grubbs</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-359223</link>
		<dc:creator>LeRoy Grubbs</dc:creator>
		<pubDate>Thu, 02 Oct 2008 21:24:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-359223</guid>
		<description>What&#039;s the best way to load lots of large and small files for full text indexing?   Which database engine is best suited for FTI? of large files?</description>
		<content:encoded><![CDATA[<p>What&#8217;s the best way to load lots of large and small files for full text indexing?   Which database engine is best suited for FTI? of large files?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Baron Schwartz</title>
		<link>http://www.mysqlperformanceblog.com/2008/07/03/how-to-load-large-files-safely-into-innodb-with-load-data-infile/comment-page-1/#comment-336167</link>
		<dc:creator>Baron Schwartz</dc:creator>
		<pubDate>Mon, 28 Jul 2008 13:13:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/?p=424#comment-336167</guid>
		<description>Hi Kye,

Please use the Contact Us form on our website http://www.percona.com, as this goes into our ticketing system.</description>
		<content:encoded><![CDATA[<p>Hi Kye,</p>
<p>Please use the Contact Us form on our website <a href="http://www.percona.com" rel="nofollow">http://www.percona.com</a>, as this goes into our ticketing system.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

