<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: On SSDs, rotations and I/O</title>
	<atom:link href="http://dom.as/2008/11/09/on-ssd-io/feed/" rel="self" type="application/rss+xml" />
	<link>http://dom.as/2008/11/09/on-ssd-io/</link>
	<description></description>
	<lastBuildDate>Sat, 17 Dec 2011 20:14:59 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Log Buffer #123: a Carnival of the Vanities for DBAs</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1376</link>
		<dc:creator><![CDATA[Log Buffer #123: a Carnival of the Vanities for DBAs]]></dc:creator>
		<pubDate>Fri, 14 Nov 2008 17:54:16 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1376</guid>
		<description><![CDATA[[...] Mituzas has some thoughts of his own on SSDs, rotations and I/O, which begin, &#8220;Every time anyone mentions SSDs, I have a feeling of futility and being [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Mituzas has some thoughts of his own on SSDs, rotations and I/O, which begin, &#8220;Every time anyone mentions SSDs, I have a feeling of futility and being [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Burton</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1375</link>
		<dc:creator><![CDATA[Kevin Burton]]></dc:creator>
		<pubDate>Tue, 11 Nov 2008 18:37:17 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1375</guid>
		<description><![CDATA[I&#039;ve been working on a long blog post about the pros/cons of in-kernel caching vs in-process caching.

It&#039;s a very complicated issue that I&#039;ve been thinking about for a long time.

If you can get your data on SSD, and avoid caching altogether, these issues totally vanish.

The problem is that InnoDB/MySQL is not very good when you put it on an ultra-fast IO subsystem (as Peter mentioned).

All of our SSD tests were CPU bound in InnoDB.  In a number of them MyISAM crushed SSD because it wasn&#039;t using any CPU.

Hopefully some of the new performance patches will fix things but I think the data/IO path will need to be optimized.

We just simply didn&#039;t have access to devices with these IO capabilities in the past.......

Kevin]]></description>
		<content:encoded><![CDATA[<p>I&#8217;ve been working on a long blog post about the pros/cons of in-kernel caching vs in-process caching.</p>
<p>It&#8217;s a very complicated issue that I&#8217;ve been thinking about for a long time.</p>
<p>If you can get your data on SSD, and avoid caching altogether, these issues totally vanish.</p>
<p>The problem is that InnoDB/MySQL is not very good when you put it on an ultra-fast IO subsystem (as Peter mentioned).</p>
<p>All of our SSD tests were CPU bound in InnoDB.  In a number of them MyISAM crushed SSD because it wasn&#8217;t using any CPU.</p>
<p>Hopefully some of the new performance patches will fix things but I think the data/IO path will need to be optimized.</p>
<p>We just simply didn&#8217;t have access to devices with these IO capabilities in the past&#8230;&#8230;.</p>
<p>Kevin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Domas Mituzas</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1374</link>
		<dc:creator><![CDATA[Domas Mituzas]]></dc:creator>
		<pubDate>Mon, 10 Nov 2008 16:36:10 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1374</guid>
		<description><![CDATA[Øystein, PBXT per-thread log reduces lock contention (otherwise per-table logs would be appended). It gets at least one part of operation to be lock-less.]]></description>
		<content:encoded><![CDATA[<p>Øystein, PBXT per-thread log reduces lock contention (otherwise per-table logs would be appended). It gets at least one part of operation to be lock-less.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Øystein Grøvlen</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1373</link>
		<dc:creator><![CDATA[Øystein Grøvlen]]></dc:creator>
		<pubDate>Mon, 10 Nov 2008 10:40:02 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1373</guid>
		<description><![CDATA[&quot;Log-based storage designs like PBXT will make much more sense&quot;

I do not understand the reasoning behind this.  With SSD, the advantage of doing sequential writes instead of random writes will be almost eliminated.  Since one of the main motivations of log-based storage is to reduce the amount of random writes, I do not see how the log-based approach will make more sense with SSD.]]></description>
		<content:encoded><![CDATA[<p>&#8220;Log-based storage designs like PBXT will make much more sense&#8221;</p>
<p>I do not understand the reasoning behind this.  With SSD, the advantage of doing sequential writes instead of random writes will be almost eliminated.  Since one of the main motivations of log-based storage is to reduce the amount of random writes, I do not see how the log-based approach will make more sense with SSD.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Domas Mituzas</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1372</link>
		<dc:creator><![CDATA[Domas Mituzas]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 19:21:05 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1372</guid>
		<description><![CDATA[The most expensive thing in I/O reads cpu-wise is checksumming, everything else is lightweight compared...]]></description>
		<content:encoded><![CDATA[<p>The most expensive thing in I/O reads cpu-wise is checksumming, everything else is lightweight compared&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Callaghan</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1371</link>
		<dc:creator><![CDATA[Mark Callaghan]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 18:27:33 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1371</guid>
		<description><![CDATA[The rate limiting in InnoDB is braindead. The current limit is 100 page writes per second regardless of the server IO capacity. And when there are too many dirty pages the server switches from writing too few to writing too many. But that is easy to fix and both Percona and Google have published patches for it.

Vote for these bugs:
http://bugs.mysql.com/bug.php?id=40603
http://bugs.mysql.com/bug.php?id=40604]]></description>
		<content:encoded><![CDATA[<p>The rate limiting in InnoDB is braindead. The current limit is 100 page writes per second regardless of the server IO capacity. And when there are too many dirty pages the server switches from writing too few to writing too many. But that is easy to fix and both Percona and Google have published patches for it.</p>
<p>Vote for these bugs:<br />
<a href="http://bugs.mysql.com/bug.php?id=40603" rel="nofollow">http://bugs.mysql.com/bug.php?id=40603</a><br />
<a href="http://bugs.mysql.com/bug.php?id=40604" rel="nofollow">http://bugs.mysql.com/bug.php?id=40604</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Zaitsev</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1370</link>
		<dc:creator><![CDATA[Peter Zaitsev]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 17:57:00 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1370</guid>
		<description><![CDATA[... At the same time.

Yes SSD and Flash in general will change landscape dramatically.  Though I think of the Flash more than SSD - when you&#039;re looking at Flash based systems it does not really make sense to go through the disk interface.  Solutions connecting to the bus directly (such as FusionIO) are likely to be more interesting.  And as you mention RAID - for the same reason there is less requirement for hot swap too]]></description>
		<content:encoded><![CDATA[<p>&#8230; At the same time.</p>
<p>Yes SSD and Flash in general will change landscape dramatically.  Though I think of the Flash more than SSD &#8211; when you&#8217;re looking at Flash based systems it does not really make sense to go through the disk interface.  Solutions connecting to the bus directly (such as FusionIO) are likely to be more interesting.  And as you mention RAID &#8211; for the same reason there is less requirement for hot swap too</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Zaitsev</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1369</link>
		<dc:creator><![CDATA[Peter Zaitsev]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 17:51:36 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1369</guid>
		<description><![CDATA[Couple of comments

1) Getting rid of Buffer Pool is not going to happen even with SSD.  Take a look at the code path for date in cache vs data on disk for Innodb (I have done work running MySQL on RAM drive and Violin Memory system)

2) Rate limiting makes sense. If you need to flush 10000 pages and do it as fast as possible you will see less bandwith available for reads and will see the dip in results.   Though Innodb is doing it wrong. You can check DBT2 graph or any other flush intensive benchmarks and you will see dips still... because Innodb either flush not fast enough or starts flushing like a crazy and unable to do anything else.   Adaptive and configurable limit would make much more sense (and this is what we&#039;re doing in Percona patches)]]></description>
		<content:encoded><![CDATA[<p>Couple of comments</p>
<p>1) Getting rid of Buffer Pool is not going to happen even with SSD.  Take a look at the code path for date in cache vs data on disk for Innodb (I have done work running MySQL on RAM drive and Violin Memory system)</p>
<p>2) Rate limiting makes sense. If you need to flush 10000 pages and do it as fast as possible you will see less bandwith available for reads and will see the dip in results.   Though Innodb is doing it wrong. You can check DBT2 graph or any other flush intensive benchmarks and you will see dips still&#8230; because Innodb either flush not fast enough or starts flushing like a crazy and unable to do anything else.   Adaptive and configurable limit would make much more sense (and this is what we&#8217;re doing in Percona patches)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Domas Mituzas</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1368</link>
		<dc:creator><![CDATA[Domas Mituzas]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 16:56:58 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1368</guid>
		<description><![CDATA[Mark, I think any static rate limiting is not needed - even if rate limiting is done, it could be expressed with some time-based exponential pressure (more you&#039;re lagging, more you&#039;re pressuring the writes to I/O).

Maria also does background LRU flushing (see http://forge.mysql.com/worklog/task.php?id=3261), but doesn&#039;t have adaptive hash/insert buffer steps.

As for memory use - I&#039;d have to show full profile (this one is a bit outdated and 4.0 based: http://noc.wikimedia.org/~midom/mem.pdf ), and indeed it may be possible to change that just by bumping up the page size (does anyone do that in production with nice results?). Last time I looked it was wasteful enough :)]]></description>
		<content:encoded><![CDATA[<p>Mark, I think any static rate limiting is not needed &#8211; even if rate limiting is done, it could be expressed with some time-based exponential pressure (more you&#8217;re lagging, more you&#8217;re pressuring the writes to I/O).</p>
<p>Maria also does background LRU flushing (see <a href="http://forge.mysql.com/worklog/task.php?id=3261" rel="nofollow">http://forge.mysql.com/worklog/task.php?id=3261</a>), but doesn&#8217;t have adaptive hash/insert buffer steps.</p>
<p>As for memory use &#8211; I&#8217;d have to show full profile (this one is a bit outdated and 4.0 based: <a href="http://noc.wikimedia.org/~midom/mem.pdf" rel="nofollow">http://noc.wikimedia.org/~midom/mem.pdf</a> ), and indeed it may be possible to change that just by bumping up the page size (does anyone do that in production with nice results?). Last time I looked it was wasteful enough :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Callaghan</title>
		<link>http://dom.as/2008/11/09/on-ssd-io/#comment-1367</link>
		<dc:creator><![CDATA[Mark Callaghan]]></dc:creator>
		<pubDate>Sun, 09 Nov 2008 16:41:15 +0000</pubDate>
		<guid isPermaLink="false">http://dammit.lt/?p=266#comment-1367</guid>
		<description><![CDATA[What is &#039;braindead&#039; about the page-flush design for InnoDB? What is the page-flush design for Maria?

It is possible to significantly shrink the per buffer cache page overhead from InnoDB, but that will take time. Is the current overhead really 2kb per page? I thought it was a few hundred bytes per page and the ratio is reduced if you boost the Innodb page size from 16kb to 32kb or 64kb.]]></description>
		<content:encoded><![CDATA[<p>What is &#8216;braindead&#8217; about the page-flush design for InnoDB? What is the page-flush design for Maria?</p>
<p>It is possible to significantly shrink the per buffer cache page overhead from InnoDB, but that will take time. Is the current overhead really 2kb per page? I thought it was a few hundred bytes per page and the ratio is reduced if you boost the Innodb page size from 16kb to 32kb or 64kb.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

