That's a great article, Michael. And thanks, Morgen and Karl.
I'm more interested in data scrubbing and merging. Gridworks and
Needlebase do clustering, and you can use Freebase (and now other
datasets via an API) to additionally reconcile stuff from a Gridworks
project.
There's a thread here on reconciling data based on an entire record
(in my case name, address and registration number), not only one
field:
http://lists.freebase.com/pipermail/freebase-discuss/2010-May/001491.html
Any suggestions or thoughts on ways forward? I'm happy to take this
off-list if anyone has expertise.
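To make the whole-record idea concrete, here is a rough Python sketch of one possible approach: score each field separately, then combine the scores with weights. This is not how Gridworks, Needlebase or Freebase reconcile records. The field names, weights and threshold are all assumptions for illustration, and the string comparison is just the standard library's difflib.

```python
from difflib import SequenceMatcher

# Hypothetical field names and weights; tune these for the real data.
WEIGHTS = {"name": 0.4, "address": 0.3, "registration_number": 0.3}


def normalize(value):
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    text = str(value).lower()
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in text)
    return " ".join(cleaned.split())


def field_similarity(a, b):
    """Similarity in [0, 1] between two field values; 0 if either is empty."""
    a, b = normalize(a), normalize(b)
    if not a or not b:
        return 0.0
    return SequenceMatcher(None, a, b).ratio()


def record_similarity(rec_a, rec_b, weights=WEIGHTS):
    """Weighted average of per-field similarities across the whole record."""
    total = sum(weights.values())
    score = sum(
        weight * field_similarity(rec_a.get(field, ""), rec_b.get(field, ""))
        for field, weight in weights.items()
    )
    return score / total


def best_match(record, candidates, threshold=0.85):
    """Return the highest-scoring candidate, or None if none reaches the threshold."""
    scored = [(record_similarity(record, cand), cand) for cand in candidates]
    if not scored:
        return None
    score, cand = max(scored, key=lambda pair: pair[0])
    return cand if score >= threshold else None
```

Calling best_match(record, candidates) with dicts keyed by those three field names returns the closest candidate or None. A pairwise comparison like this is only practical for small datasets; larger ones would need some form of blocking first, for example only comparing records that share a postal code.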
I hope this isn't too off-topic for CA. I figured we could do with
some technical discussion. :) The data I'm working on is Canadian
federal data on non-profits.
Mike
On Tue, Jan 4, 2011 at 9:05 AM, Morgen Peers <
[hidden email]> wrote: