Sunday, November 01, 2009

Random links of the weekend

Still exploring what Cassandra could provide as a clustered mail storage.

Easy read on cassandra data model : http://arin.me/code/wtf-is-a-supercolumn-cassandra-data-model

Using this article, I drafted an early data model :
Conversations : { // CF
INBOX/123-1 : { // Row key is the conv id
timeuuid_1 : messageId1
timeuuid_2 : messageId2
}
INBOX/123-2 : { // Row key is the conv id
timeuuid_3 : messageId3
timeuuid_4 : messageId4
}
}

Folders : { // CF
INBOX : {
timeuuid_1 : INBOX/123-1
timeuuid_3 : INBOX/123-2
}
}

Messages : { // CF
messageId1 : { // row key
subject: bla bla
to: xx
cc: yy
bcc: zz
preview : hello world
body: hello world ! qdqsd
hasAttach: true
hasInvitation: false
attachments: attach_123, attach_456
}
}

Attachments : {
attch_123 : {
filename: fileName1
size: 123456
content: xazeqsdqdqsd_base_64
}
}


Article on lucene/solr + cassandra : http://nicklothian.com/blog/2009/10/27/solr-cassandra-solandra/

As I wanted to store my tests on those technologies somewhere, I created the BigG project on googlecode.

No comments: