Thursday 27 Aug at the VLDB 2009 conference

August 27, 2009

Thursday 27 Aug at the VLDB 2009 conference

Good day,

I wanted to blog everyday while attending the VLDB (Very Large Databases) conference, but I did not have time and my power adapter did not work at first.

So I will start from today and work back to Monday.

There are 4 time slots during the day. Each contains a few sessions. There are 700 researchers here.

Time slot 1:
10 year award keynote:

Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct
Peter A. Boncz (CWI, Amsterdam), Stefan Manegold (CWI, Amsterdam), Martin L. Kersten (CWI, Amsterdam)

A few notes/keywords:

Very good presentation. Funny
These are the MonetDB guys
The Started this research in 1985/1994?
Wrote a few papers
Databases hit the memory wall paper in .... influenced their research
They are a COLUMN storage db
Partitioned hash joins
multi pass clustering
RISC relational algebra
Materialized vs pipeline
They created Vectorwise that do pipeline
Read paper "One size fits all: A concept whose time has come and gone" m stonebraker
INGRES is also invoved with Vectorise
They said db2, Oracle are dinosaurs. Also in 2005 it was found that there was a mammal that feeds on dinosaurs and that maybe MonetDb is a mammal......
MonetDB has grow and adapted and for that reason they are still going strong. They are like a fox and Vectorise(new product from them) is a fast shark :)
Read paper: Breaking the memory wall in MonetDB

Time slot 2: 11:00-12:30

Data Processing on FPGAs
Rene Mueller (ETH Zurich), Jens Teubner (ETH Zurich), Gustavo Alonso (ETH Zurich)

Interesting to see what they do wiht FPGS.

HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads
Azza Abouzeid (Yale Univ.), Kamil Bajda-Pawlikowski (Yale Univ.), Daniel Abadi (Yale Univ.), Alexander Rasin (Brown Univ.), Avi Silberschatz (Yale Univ.)

I liked this talk on HadoopDB. Azza (she presents very well) and Kamil presented.

They have taken Hadoop and have used/integrated PostgreSQL into it. They are also using Hive to provide SQL like functionality.

They have a few benchmark results as well.

HadoopDB is the new baby elephant of Hadoop and PostgreSQL and it needs nurturing.

Parallel datbases vs MapReduce
Read: MapReduce: A major step backwards By David DeWitt

They have merged a parallel db and a mapReduce db

Time slot 3+4:
Column oriented Database Systems I+II
Daniel J. Abadi (Yale), Peter A. Boncz (CWI, Amsterdam), Stavros Harizopoulos (HP Labs)

This was a great session. They provided a nice history about column oriented databases.

Stavros: the history
Daniel: The compression algorithms used in these databases to compress the colums
Peter: How column databases fit/work with the CPU architecture

My thoughts: What to take a away from the session is that colum databases are a good option, they have grow a lot in the last few years, it is important to think about cpu caches.

Read up more on NSM, DSM and PAX. PAX has been mentioned a few times.

SIMD's are also mentioned at a few talks.

Search This Blog

Database research, software development and stuff

Thursday 27 Aug at the VLDB 2009 conference

Comments

Post a Comment

Popular Posts

Intellij Idea 13.1.6/14 HTML formatting funny: Uncaught syntax error:unexpected token illegal

Monday 24 Aug at the VLDB 2009 conference