<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Wishes for mysqldump</title>
	<atom:link href="http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/</link>
	<description>Everything about MySQL Performance</description>
	<pubDate>Sun, 07 Sep 2008 08:30:14 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Thomas</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-306613</link>
		<dc:creator>Thomas</dc:creator>
		<pubDate>Thu, 29 May 2008 19:41:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-306613</guid>
		<description>3. I wrote a simple perl script that does just that... I only tested with 16 threads so far (on a 16-proc server) and IO seek seemed to me a big problem (the restore was probably between 50 and 200 times slower! - I just aborted it). It's not a production RAID (only for local C drive, "value" kind of hardware...) and I may do more tests on a real RAID.

My script is a bit crappy; it buffers tables in memory then send them in one block to child processes, and block all threads on db changes. I know how I can make it better by using queues and non-blocking I/O but I'm unlikely to to this anytime soon due to time constraints. It's fairly simple to do with Event::Lib though.

Also I'm not sure if the specs for dumps are published... I rely solely on comments in the dump and on the fact that global parameters are only present at the start of it. Also I based my work on a specific database dump; If there's a spec somewhere it could help.</description>
		<content:encoded><![CDATA[<p>3. I wrote a simple perl script that does just that&#8230; I only tested with 16 threads so far (on a 16-proc server) and IO seek seemed to me a big problem (the restore was probably between 50 and 200 times slower! - I just aborted it). It&#8217;s not a production RAID (only for local C drive, &#8220;value&#8221; kind of hardware&#8230;) and I may do more tests on a real RAID.</p>
<p>My script is a bit crappy; it buffers tables in memory then send them in one block to child processes, and block all threads on db changes. I know how I can make it better by using queues and non-blocking I/O but I&#8217;m unlikely to to this anytime soon due to time constraints. It&#8217;s fairly simple to do with Event::Lib though.</p>
<p>Also I&#8217;m not sure if the specs for dumps are published&#8230; I rely solely on comments in the dump and on the fact that global parameters are only present at the start of it. Also I based my work on a specific database dump; If there&#8217;s a spec somewhere it could help.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Introducing MySQL Parallel Dump at Xaprb</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-173985</link>
		<dc:creator>Introducing MySQL Parallel Dump at Xaprb</dc:creator>
		<pubDate>Mon, 01 Oct 2007 02:18:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-173985</guid>
		<description>[...] while ago Peter Zaitsev wrote about his wishes for mysqldump. These included multi-threaded dumps and &#8220;safe&#8221; dumps that would wait for a server to [...]</description>
		<content:encoded><![CDATA[<p>[...] while ago Peter Zaitsev wrote about his wishes for mysqldump. These included multi-threaded dumps and &#8220;safe&#8221; dumps that would wait for a server to [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Callaghan</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130656</link>
		<dc:creator>Mark Callaghan</dc:creator>
		<pubDate>Thu, 24 May 2007 17:15:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130656</guid>
		<description>Chip released compact_innodb.py many months ago. Despite the _compact_innodb_, it should work with more storage engines. It does parallel load and restore by forking mysqldump and mysql processes. The project page for the code is http://code.google.com/p/google-mysql-tools.</description>
		<content:encoded><![CDATA[<p>Chip released compact_innodb.py many months ago. Despite the _compact_innodb_, it should work with more storage engines. It does parallel load and restore by forking mysqldump and mysql processes. The project page for the code is <a href="http://code.google.com/p/google-mysql-tools" rel="nofollow">http://code.google.com/p/google-mysql-tools</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fr3nd</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130645</link>
		<dc:creator>fr3nd</dc:creator>
		<pubDate>Thu, 24 May 2007 16:05:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130645</guid>
		<description>Hey,

I've released a new version of mysqlpdump whith some important improvements:

    * Store dumps to files directly instead to stdout
    * Can compress files
    * Dump each table in its own file
    * Can pass parameters directly to mysqldump

http://www.fr3nd.net/projects/mysqlpdump

Next thing I want to implement is to store all the replication data to a separate .sql file.

Any suggestion?</description>
		<content:encoded><![CDATA[<p>Hey,</p>
<p>I&#8217;ve released a new version of mysqlpdump whith some important improvements:</p>
<p>    * Store dumps to files directly instead to stdout<br />
    * Can compress files<br />
    * Dump each table in its own file<br />
    * Can pass parameters directly to mysqldump</p>
<p><a href="http://www.fr3nd.net/projects/mysqlpdump" rel="nofollow">http://www.fr3nd.net/projects/mysqlpdump</a></p>
<p>Next thing I want to implement is to store all the replication data to a separate .sql file.</p>
<p>Any suggestion?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: flupps</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130570</link>
		<dc:creator>flupps</dc:creator>
		<pubDate>Thu, 24 May 2007 03:56:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130570</guid>
		<description>Brian, Larry:

There's been a patch around for this since 2005, unfortunately never got pushed into the mysqldump...
http://bugs.mysql.com/file.php?id=959</description>
		<content:encoded><![CDATA[<p>Brian, Larry:</p>
<p>There&#8217;s been a patch around for this since 2005, unfortunately never got pushed into the mysqldump&#8230;<br />
<a href="http://bugs.mysql.com/file.php?id=959" rel="nofollow">http://bugs.mysql.com/file.php?id=959</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130497</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Wed, 23 May 2007 17:36:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130497</guid>
		<description>Thank Brian, 

Really I like tab format a lot.  LOAD DATA INFILE should be easier to optimize for storage engines internally if they like to.  Right now I think it is underused because you need to process each database separately.</description>
		<content:encoded><![CDATA[<p>Thank Brian, </p>
<p>Really I like tab format a lot.  LOAD DATA INFILE should be easier to optimize for storage engines internally if they like to.  Right now I think it is underused because you need to process each database separately.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Aker</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130464</link>
		<dc:creator>Brian Aker</dc:creator>
		<pubDate>Wed, 23 May 2007 16:51:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130464</guid>
		<description>Hi!

I've been thinking about a --clone option for a bit now... allow you to get the slave options plus take the name of a machine to feed the data to. Bypass the need to pump data via:
   mysqldump [args]  &#124; mysql --host --port  

Which is something I do all of the time to make new slaves. If I get a spare minute I'll look at your idea on parallel dump with locks. If I do it though, I would be interested in it via a tab dump.  

Cheers,
   -Brian</description>
		<content:encoded><![CDATA[<p>Hi!</p>
<p>I&#8217;ve been thinking about a &#8211;clone option for a bit now&#8230; allow you to get the slave options plus take the name of a machine to feed the data to. Bypass the need to pump data via:<br />
   mysqldump [args]  | mysql &#8211;host &#8211;port  </p>
<p>Which is something I do all of the time to make new slaves. If I get a spare minute I&#8217;ll look at your idea on parallel dump with locks. If I do it though, I would be interested in it via a tab dump.  </p>
<p>Cheers,<br />
   -Brian</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fr3nd</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130442</link>
		<dc:creator>fr3nd</dc:creator>
		<pubDate>Wed, 23 May 2007 16:21:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130442</guid>
		<description>I wrote a quick and dirty wrapper in python that launches many threads with mysqldump. Each thread stores the result in memory and waits for the preceding one. That's the reason why it could be so memory consuming with databases with large tables.

You can find it here: http://www.fr3nd.net/projects/mysqlpdump

It's pretty raw at the moment, but it can be used as skeleton to build something more complex.

Any comments will be appreciated :)</description>
		<content:encoded><![CDATA[<p>I wrote a quick and dirty wrapper in python that launches many threads with mysqldump. Each thread stores the result in memory and waits for the preceding one. That&#8217;s the reason why it could be so memory consuming with databases with large tables.</p>
<p>You can find it here: <a href="http://www.fr3nd.net/projects/mysqlpdump" rel="nofollow">http://www.fr3nd.net/projects/mysqlpdump</a></p>
<p>It&#8217;s pretty raw at the moment, but it can be used as skeleton to build something more complex.</p>
<p>Any comments will be appreciated <img src='http://www.mysqlperformanceblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: peter</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130380</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Wed, 23 May 2007 11:56:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130380</guid>
		<description>Larry, 

Absolutely.   This is why in MMM backup tools we store both SHOW SLAVE STATUS and SHOW MASTER STATUS info so you can when use it to restore master and do point in time recovery using binary log or restore slave and get it going.</description>
		<content:encoded><![CDATA[<p>Larry, </p>
<p>Absolutely.   This is why in MMM backup tools we store both SHOW SLAVE STATUS and SHOW MASTER STATUS info so you can when use it to restore master and do point in time recovery using binary log or restore slave and get it going.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Larry Damman</title>
		<link>http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130222</link>
		<dc:creator>Larry Damman</dc:creator>
		<pubDate>Wed, 23 May 2007 08:56:01 +0000</pubDate>
		<guid isPermaLink="false">http://www.mysqlperformanceblog.com/2007/05/22/wishes-for-mysqldump/#comment-130222</guid>
		<description>While we're requesting features, I have one that I'd love to see implemented.  It would be similar to --master-data=1 except it would get the Exec_Master_Log_Pos from the "SHOW SLAVE STATUS" command.  This way you could (easily) create a sibling of the dumped db instead of a child.  This would come in handy when you take your backups/snapshots off of a slave in order to keep the master running smoothly.</description>
		<content:encoded><![CDATA[<p>While we&#8217;re requesting features, I have one that I&#8217;d love to see implemented.  It would be similar to &#8211;master-data=1 except it would get the Exec_Master_Log_Pos from the &#8220;SHOW SLAVE STATUS&#8221; command.  This way you could (easily) create a sibling of the dumped db instead of a child.  This would come in handy when you take your backups/snapshots off of a slave in order to keep the master running smoothly.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
