Cassius Rosenthal on 5 Sep 2007 15:41:56 -0000

[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: [PhillyOnRails] jruby + hadoop?

If CouchDB makes it out of beta, I would love to use an ActiveRecord implementation of this for very simple web sites like blogs. I think the revisions feature should be made optional, otherwise the storage file could blow up faster than the log files, requiring its own maintenance plan.

This CouchDB approach particularly makes sense for the Rails framework, where validation and relationships are specified in the model code, making foreign key specifications and constraints redundant.

In terms of performance, I would love to see how the map/reduce implementation stacks up to optimized psql statistics, for example.

But for serious applications, I want my RDBMS. Nothing beats having the model data and relationships where they should be -- entirely in the database, optimized by procedural functions -- application agnostic.


Mat Schaffer wrote:
So granted, I'm still reading the map/reduce paper, but this sorta drove it home for me I think. Thought I'd share it with those of us not subscribed to Labnotes already.

Granted, it's not hadoop, but it's website-type data without an RDMBS which put it in perspective pretty nicely. I can sorta imagine this becoming the norm and RDBMS becoming more of a niche market.


On Sep 1, 2007, at 5:34 PM, Michael Bevilacqua-Linn wrote:

Have you read the google paper (doesn't cover hadoop obviously, but a great intro to the concept, has a section at the end that has a few blurbs on what/how google is using it.) It's very straightforward and written in plain English, not academe-ese.


On 9/1/07, *Mat Schaffer* < <>> wrote:

    On Aug 31, 2007, at 9:12 PM, Evan Weaver wrote:
    > PS. Thanks Toby for your June mail about Hadoop; it's totally
    on the
    > table now :) .

    To completely ignore your question and make a selfish comment of my
    own:  I'd love to see some presentational material on how to really
    use hadoop (or map reduce in general).

    I've looked at it a couple of times and sorta get it, but I keep
    failing to get a firm grasp on what problem spaces it's applicable to
    and how.
    To unsubscribe or change your settings, visit:

_______________________________________________ To unsubscribe or change your settings, visit:


To unsubscribe or change your settings, visit:

_______________________________________________ To unsubscribe or change your settings, visit: