June 18, 2013

Post: Sphinx search performance optimization: multi-threaded search

… or even COUNT with GROUP BY and if data nodes return large amounts of data to post-process, aggregator may well become…, using div operator is not necessarily the best way to split data, you may want these to be ranges by using a… sql_query_range or something else, depending on how your data looks like. Finishing line I always loved how Sphinx scales…

Comment: PHP Sessions - Files vs Database Based

… contents at same time. And when these frontends push their data back to shared memcached, last one overwrites all previous changes… drop using the generic session handler and instead split data on small enough fragments to reduce risk of threads overwriting each others data.

Post: Is your MySQL Application having Busy IO by Oracle Measures ?

… move to multiple instances and you have to “shard” your data in some way anyway there is no difference whenever you… we often start “sharding” or splitting data other way because operationally working with large table and data sizes in MySQL can be painful…

Comment: Is disk Everything for MySQL Performance ?

… standpoint without a good reason. > I tend to keep Innodb data files (and whole datadir) and log files on the same… also true when data/logs are physically separated (artificially “rate limiting” — may be imagined as “protecting” — io by splitting data/logs) and disk…

Comment: "Shard early, shard often"

… proper consideration. Sharding is usually much easier than re-sharding (splitting) data on an overloaded shard. The plan to shard must include…

Post: Percona XtraDB Cluster: Failure Scenarios with only 2 nodes

…_quorum: Completely ignore quorum calculations. E.g. in case master splits from several slaves it still remains operational. Use with extreme… database inconsistency. We can bypass this behaviour by ignoring the split-brain by adding wsrep_provider_options = “pc.ignore_sb = true… nodes won’t accept queries – if you plan to ignore split-brain or quorum, you risk to have inconsistent data very easily

Post: Shard-Query turbo charges Infobright community edition (ICE)

… large amount of data. Shard-Query includes a loader (loader.php) which can be used to either split a data into multiple files… of data (23GB raw data, compressed to about 2GB total data. then spread over six servers). Due to ICE limitations, the data was split before loading. The splitting

Post: MySQL 5.6 vs MySQL 5.5 and the Star Schema Benchmark

…invariably moves the page onto the hot area, defeating the split.  In order to prevent this from happening, another variable …| | INNODB_BUFFER_POOL_READS | 191571 | | INNODB_DATA_PENDING_READS | 0 | | INNODB_DATA_READ | 3140882432 | | INNODB_DATA_READS | 191581 | | INNODB_PAGES_READ | 191570 | | …

Post: Shard-Query EC2 images available

…-loaded with the data used in the previous Shard-Query blog post. The data in the each image is split into 20 “shards… to hash split up to 50GB/hour of input data on my i970 test machine. It is possible to distribute splitting and/or…`) ) ENGINE=InnoDB DEFAULT CHARSET=latin1 COMMENT=’Data from BTS ontime flight data. Data for Origin and Destination airport data.’; CREATE TABLE `dim_date` ( `Year…

Post: Small things are better

… small tables. Not to mention splitting 500 tables to several servers will likely be easy than splitting one big one. There are… data appropriately, also code gets complicated but for many applications it is worth the trouble At NNSEEK for example we have data split at 256 groups of tables. Current data size is small enough so even single…