I am trying to perform a document clustering on a dataset with 4,000 documents. I have been reading about document clustering lately. Here is a list of a number of resources that I have used. The second item is a pretty good book that I recommend. The first one is a survey of document clustering methods.
1- Recent Developments in Document Clustering
2- Clustering and Information Retrieval (Network Theory and Applications)
3- Scatter/Gather: a cluster-based approach to browsing large document collections
This website has a lot of great documents and literature review for document clustering