Join me Wednesday, September 18 at 10 a.m. PDT for an hour-long webinar where I will introduce the basic concepts behind column store technology. The webinar’s title is: “Introduction to open source column stores.”
What will be discussed?
This webinar will talk about Infobright, LucidDB, MonetDB, Hadoop (Impala) and other column stores
- I will compare features between major column stores (both open and closed source).
- Some benchmarks will be used to demonstrate the basic performance characteristics of the open source column stores.
- There will be a question and answer session to ask me anything you like about column stores (you can also ask in the comments section below if you prefer to submit them in advance).
Who should attend?
- Anyone interested in analytics or OLAP
- Those new to column store technology who want to find out how they will benefit from using a column store
- Database administrators or users that have to query large amounts of data quickly, especially in aggregate
T-Shirts!
A randomly selected participant in the webinar will be chosen to receive a free Percona t-shirt. You must be present at the end of the webinar during the question and answer section to be eligible for the shirt.
You can also win a Percona t-shirt by answering one or more of the following questions (the first to answer correctly wins, one winner only, sorry. ):
- What is one advantage of columnar storage?
- What is one big drawback of columnar storage?
- Describe (or link to) an application or project that uses a columnar storage
>What is one advantage of columnar storage?
Selecting only needed attributes from table, also compression.
>What is one big drawback of columnar storage?
Row reconstruction.
>Describe (or link to) an application or project that uses a columnar storage
Used columnar database on previous job for storing and analyzing user interactions with ad impressions. Great compression numbers, very good performance. Vertica.
Also, what happend to the infobright website?
Will the webinar be available afterwards too?
Hi Broncko, yes the webinar will be recorded – you’ll be able to view it using the same url used for registration above.
Hi Evgeny,
Correct on all three. The webinar will talk about how column stores save on IO for typical analytic queries that access few columns of a wide table, but how accessing entire rows can cause poor performance.
On the other hand, a column store can easily handle hundreds or thousands of columns when few are accessed at the same time.
I’m not sure what you mean about the Infobright website? You can find information about their enterprise edition and appliance at Infobright.com and their open source community version at Infobright.org
Nice job, Evgeny. I’m Percona’s community manager and I’ll send you an email shortly – please respond with your address and t-shirt size and I’ll get that out to you.
One more project for you in case it isn’t already on your list:
https://blog.twitter.com/2013/dremel-made-simple-with-parquet
Hi Dave,
Thanks for the info. I am including some info about Impala + Parquet.
Justin,
Thanks for the great overview of technology and tools. I was planning to use MonetDB for my next project, but your results are worrisome.
Would it be possible for me to get the dataset you were testing with, so I can try to replicate the problem, and possibly get a resolution for it?
Thanks!
–Leonid
Hello Justin:
Thanks for the comparative webinar. I would like to point out a correction regarding InfiniDB – you mention that there is “no compression” in InfiniDB – InfiniDB has supported compression since 2010.
Thanks,
Dipti Joshi