May 26, 2012

Post: Data mart or data warehouse?

data warehouse, but they also build data marts for OLAP analysis from the warehouse data. A different approach is to build a relational warehouse from multiple data marts, or the so-called bottom-up approach to data

Post: Multiple MySQL instances on Fusion-io ioDrive

warehouses (220GB of data), 120GB buffer pool 2 MySQL instances, 1200 warehouses (110GB of data) each, 60GB buffer pool each 4 MySQL instances, 600 warehouseswarehouses (220GB of data), 64GB buffer pool 2 MySQL instances, 1200 warehouses (110GB of data) each, 32GB buffer pool each 4 MySQL instances, 600 warehouses

Post: Intro to OLAP

… business intelligence. (this post) Identifying the differences between a data warehouse, and a data mart. Introduction to MDX queries and the kind of… difference between data marts and data warehouses. The former are usually used for OLAP analysis, but they can be fundamentally related to a warehouse.

Post: MySQL Limitations Part 4: One thread per connection

… lovely to have those things, if you’re running a data warehouse on MySQL, and in some cases for other uses too… startup cost for the query). But MySQL isn’t a data warehouse DBMS first and foremost. It’s a general-purpose OLTP… can be used for tons of other things such as… data warehousing. But it isn’t a Netezza or Paraccel, and…

Comment: The case for getting rid of duplicate “sets”

Welcome to data warehouse summary/aggregate/fact tables (just to name them properly).. http://en.wikipedia.org/wiki/Aggregate_(Data_Warehouse)http://en.wikipedia.org/wiki/Fact_table

Post: High-Performance Click Analysis with MySQL

… expensive to solve. Start with the granularity of your data.  What data do you need to make your business run?Â… aggregates, you’re exposed to that bad data again. Any inconsistencies in the atomic data tend to get magnified as it gets… You What you’re really doing here is building a data warehouse.  So you may think you should use traditional DW…

Post: Shard-Query turbo charges Infobright community edition (ICE)

… OLAP/DW data model, since it typically represents a data mart. See also: “Data mart or data warehouse?”. As it is the most common data model… containing all of the data. Before loading, there was approximately 23GB of data. After loading, ICE compressed this data to about 2GB. The…, each shard has about 335MB-350MB of data (23GB raw data, compressed to about 2GB total data. then spread over six servers). Due…

Post: Copying InnoDB tables between servers

…: Target instance is assumed as followings. xtrabackup: innodb_data_home_dir = ./ xtrabackup: innodb_data_file_path = ibdata1:10M:autoextend xtrabackup: innodb_log…. Copying ./tpcc/warehouse.ibd is skipped. Copying ./tpcc/new_orders.ibd is skipped. Copying ./tpcc/order_line.ibd to /data/vadim/mysql…

Post: How Percona does a MySQL Performance Audit

… hearing you say “we are a SaaS hosted application for data mining stock market transactions,” or whatever the case may be… compare it against the server’s variables or whatever other data I need, and write something instructional to the client about… operate. For example, if the workload is a star schema data warehouse, it is important to know a lot about the I…

Post: How many partitions can you have ?

… recently. The customer dealing with large MySQL data warehouse had the table which was had data merged into it with INSERT ON DUPLICATE…) using bulk insert statements – 1000 rows in each. Loading the data was taking 9 seconds for 1 partitions, 10 seconds for… is per row overhead. This test gives other rather interesting data point – it looks like updates in Innodb for CPU bound…