May 23, 2012

Post: Data mart or data warehouse?

data warehouse, but they also build data marts for OLAP analysis from the warehouse data. A different approach is to build a relational warehouse from multiple data… at data warehouse volumes. This is in large part because commercial database software supports hash joins, bitmap indexes, table partitioning, parallel query execution…

Post: Shard-Query turbo charges Infobright community edition (ICE)

…OLAP/DW data model, since it typically represents a data mart. See also: “Data mart or data warehouse?”. As it is the most common data model…quantities of data. Shard-Query can be used with existing data sets easily, improving the performance of queries significantly if they use common query features…

Comment: Quick comparison of MyISAM, Infobright, and MonetDB

…). It is another open-source RDBMS that is specialized for data warehouse queries. John spoke about it at the Percona Performance conference. A… form of MVCC so that I can do long-running queries while continuously updating the database from replication. I get that from InnoDB+MySQL minus the support for fast DW queries. I don’t get that from MyISAM+MySQL. IIRC LucidDB…

Post: Intro to OLAP

data warehouse, and a data mart. Introduction to MDX queries and the kind of SQL which a ROLAP tool must generate to answer those queries… difference between data marts and data warehouses. The former are usually used for OLAP analysis, but they can be fundamentally related to a warehouse.

Post: MySQL Limitations Part 4: One thread per connection

… or intra-query parallelism, for example. It would be lovely to have those things, if you’re running a data warehouse on MySQL, and in some cases for other uses too (note that most databases that do have these query plans… things like the lower startup cost for the query). But MySQL isn’t a data warehouse DBMS first and foremost. It’s a…

Post: High-Performance Click Analysis with MySQL

…, you should think a lot about how you need to query the data.  It is a hard question to answer, and… write your queries to use the most aggregated data possible to save work. Avoid operations that update huge chunks of aggregated data at… You What you’re really doing here is building a data warehouse.  So you may think you should use traditional DW…

Post: How Percona does a MySQL Performance Audit

… have died. The ORM should just be running the query, and if the query fails, then you know from the error code… operate. For example, if the workload is a star schema data warehouse, it is important to know a lot about the I… is off the table. Queries can be rewritten. Indexes and data types and table structures can be changed. Queries can be broken into…

Comment: Why MySQL could be slow with large tables ?

Hi We have a small Data Warehouse with a 50 million fact table (around 12GB). It is … 1.0.6) Some of our queries need to access entire table (full table scan), these queries are deadly slow. Out I… me that the limitation is how MYSQL (or InnoDB) reads data, it is not capable to do scatter reads, every disk…

Comment: MySQL Replication vs DRBD Battles

… friendly multi-master setups. More real-time data warehouse applications that log tons of data and frequently summarize and aggregate will run into… on your application/database environment, your requirements for recovery speed, data loss, etc. Finally, I will leave with this note – we… and a big beefy slave for redundancy and read only queries.

Comment: Top 5 Wishes for MySQL

…. I came from Oracle and SQL Server background, where parallel query and partition give you the performance for multiple CPUs and… large Data warehouse with 16 CPU cores with MySQL. Now I realize that MySQL is just not there in terms of parallel queries