Here's a great talk on visualizing large plain-text document sets via clustering.
http://curiositycounts.com/post/6455747293/jonathan-stray-of-the-associated-press-on
It uses the wikileak war docs for an example, but can be applied to other data sets.
Cheers.
-owen