Percona will be in Chicago and London the week of April 8th delivering two 2-day MySQL workshops. For our MPB readers, we are offering a 15% discount. Just use the code MPB15A when purchasing your tickets to one or both MySQL workshops. Scaling and Optimization for MySQL on Monday & Tuesday, April 8 & 9: This MySQL workshop [...]
Webinar: Building a highly scaleable distributed row, document or column store with MySQL and Shard-Query
On Friday, February 15, 2013, at 10:00am Pacific Standard Time, I will be delivering a webinar entitled “Building a highly scaleable distributed row, document or column store with MySQL and Shard-Query”. The first part of this webinar will focus on why distributed databases are needed, and on the techniques employed by Shard-Query to implement a distributed [...]
Thank you for joining us at Percona Live, NYC 2012
Over 200 people attended last week’s Percona Live NY event. This year we structured the event differently than in 2011, with a Tutorial Day allowing for in-depth 3-hour presentations for those looking to dive deep into specific topics. We also added an Expo Hall, which allowed a lot of MySQL ecosystem participants to meet their prospective [...]
Meet Percona Team at OSCON
Yes, we will be at OSCON next week. I will be talking about Optimizing MySQL Configuration and hosting a BOF on MySQL Sharding, Replication and Clustering. If you’re interested in any of these technologies, please come by and share your story. I would love to see both users and technology vendors working in this field. [...]
An update on Percona Live MySQL Conference & Expo 2012
We announced a while back that we were going to continue the traditional MySQL conference in Santa Clara, because O’Reilly wasn’t doing it anymore. But we haven’t given an update in a while. Here’s the current status: We created a conference committee. We created a conference website that allows people to create an account and [...]
Using any general purpose computer as a special purpose SIMD computer
Often, from a computing perspective, one must run a function on a large amount of input. Frequently the same function must be run on many pieces of input, and this is a very expensive process unless the work can be done in parallel. Shard-Query introduces set-based processing, which on the surface appears [...]
Distributed Set Processing with Shard-Query
Can Shard-Query scale to 20 nodes? Peter asked this question in a comment on my previous Shard-Query benchmark. Actually, he asked if it could scale to 50, but testing 20 was all I could do due to EC2 and time limits. I think the results at 20 nodes are very useful for understanding the performance: [...]
Shard-Query EC2 images available
Infobright and InnoDB AMI images are now available. There are now demonstration AMI images for Shard-Query. Each image comes pre-loaded with the data used in the previous Shard-Query blog post. The data in each image is split into 20 “shards”. This blog post will refer to an EC2 instance as a node from here [...]
MongoDB Approach to Availability
Another thing I find interesting about MongoDB is its approach to Durability, Data Consistency and Availability. It is very relaxed and will not work for some applications, but for others it can be usable in its current form. Let me explain some concepts and compare it to technologies in the MySQL space. First I think MongoDB is [...]
“Shard early, shard often”
I wrote a post a while back that said why you don’t want to shard. In that post I tried to explain that hardware advances, such as 128G of RAM being so cheap, are changing the point at which you need to shard, and that the (often omitted) operational issues created by sharding can [...]

