
Release 3.5.0

Carrot2: open source framework for building search clustering engines
The 3.5.0 release introduces a new physics-inspired tree map visualization called FoamTree, the bisecting k-means clustering algorithm and a number of smaller improvements and bug fixes.
FoamTree visualization
FoamTree: physics-inspired tree map visualization in Carrot2 web application

Release 3.5.0 comes with FoamTree: a tree map visualization with innovative physics-inspired layout algorithms and animations.

Carrot2-branded binaries of FoamTree are available in the Carrot2 repository, the Carrot2 Web Application and the Carrot2 Document Clustering Workbench. Fully brandable binaries are available from Carrot Search.

K-means clustering algorithm
Release 3.5.0 adds a new clustering algorithm: bisecting k-means. Unlike the other algorithms in Carrot2, bisecting k-means creates disjoint clusters: each document belongs to exactly one cluster.
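
To illustrate what "disjoint clusters" means here, the following is a minimal, self-contained sketch of the general bisecting k-means idea: start with one cluster holding everything, then repeatedly split the largest cluster in two with plain 2-means until k clusters exist. It is not Carrot2's implementation and does not use the Carrot2 API. The class name `BisectingKMeansSketch`, its method names, the fixed iteration count, the deterministic seeding and the use of raw numeric vectors instead of documents are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class, not part of Carrot2.
final class BisectingKMeansSketch {

    /** Splits points into at most k disjoint clusters by repeatedly bisecting the largest one. */
    static List<List<double[]>> cluster(List<double[]> points, int k) {
        List<List<double[]>> clusters = new ArrayList<>();
        clusters.add(new ArrayList<>(points));
        while (clusters.size() < k) {
            // Find the largest cluster that still has more than one point.
            int largest = -1;
            for (int i = 0; i < clusters.size(); i++) {
                int size = clusters.get(i).size();
                if (size > 1 && (largest < 0 || size > clusters.get(largest).size())) {
                    largest = i;
                }
            }
            if (largest < 0) {
                break; // only single-point clusters remain, nothing to split
            }
            List<double[]> target = clusters.remove(largest);
            clusters.addAll(bisect(target, 20)); // 20 iterations is an arbitrary cap
        }
        return clusters;
    }

    /** Plain 2-means on a single cluster; every point ends up in exactly one half. */
    static List<List<double[]>> bisect(List<double[]> points, int iterations) {
        // Deterministic seeding (an assumption): first point, and the point farthest from it.
        double[] a = points.get(0).clone();
        double[] b = a;
        double best = -1;
        for (double[] p : points) {
            double d = squaredDistance(a, p);
            if (d > best) {
                best = d;
                b = p.clone();
            }
        }
        List<double[]> left = new ArrayList<>();
        List<double[]> right = new ArrayList<>();
        for (int it = 0; it < iterations; it++) {
            left = new ArrayList<>();
            right = new ArrayList<>();
            for (double[] p : points) {
                if (squaredDistance(p, a) <= squaredDistance(p, b)) {
                    left.add(p);
                } else {
                    right.add(p);
                }
            }
            if (left.isEmpty() || right.isEmpty()) {
                break; // degenerate split, e.g. all points identical
            }
            a = mean(left);
            b = mean(right);
        }
        // Guarantee two non-empty halves so the outer loop always makes progress.
        if (right.isEmpty()) right.add(left.remove(left.size() - 1));
        if (left.isEmpty()) left.add(right.remove(right.size() - 1));
        List<List<double[]>> halves = new ArrayList<>();
        halves.add(left);
        halves.add(right);
        return halves;
    }

    static double squaredDistance(double[] x, double[] y) {
        double sum = 0;
        for (int i = 0; i < x.length; i++) {
            sum += (x[i] - y[i]) * (x[i] - y[i]);
        }
        return sum;
    }

    static double[] mean(List<double[]> points) {
        double[] m = new double[points.get(0).length];
        for (double[] p : points) {
            for (int i = 0; i < m.length; i++) m[i] += p[i];
        }
        for (int i = 0; i < m.length; i++) m[i] /= points.size();
        return m;
    }

    public static void main(String[] args) {
        double[][] raw = {{0, 0}, {0, 1}, {1, 0}, {10, 10}, {10, 11}, {11, 10}, {20, 0}, {20, 1}, {21, 0}};
        List<double[]> points = new ArrayList<>();
        for (double[] p : raw) points.add(p);
        for (List<double[]> c : cluster(points, 3)) {
            System.out.println("cluster of size " + c.size());
        }
    }
}
```

Because each split moves every point into exactly one of the two halves, the resulting clusters never overlap, which is the property the release notes highlight.
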
Resource management improvements
Release 3.5.0 introduces an improved mechanism for loading configuration files such as stop word lists and algorithm attribute files. To make editing more convenient, resources are now read from application-specific locations.
Attribute name changes
Implementing k-means clustering properly required renaming a number of attributes:
  • LingoClusteringAlgorithm.factorizationFactory becomes TermDocumentMatrixReducer.factorizationFactory
  • LingoClusteringAlgorithm.factorizationQuality becomes TermDocumentMatrixReducer.factorizationQuality
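
Code that sets these attributes by name needs to switch to the new names. The sketch below shows one way to translate old names to new ones before handing attributes to the framework. It assumes attributes are kept in a `Map<String, Object>` keyed by exactly the names listed above. The class `AttributeKeyMigration` and its `migrate` method are hypothetical helpers written for this example and are not part of Carrot2.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Carrot2.
final class AttributeKeyMigration {

    // Old name -> new name, as listed in the 3.5.0 release notes.
    static final Map<String, String> RENAMED = new LinkedHashMap<>();
    static {
        RENAMED.put("LingoClusteringAlgorithm.factorizationFactory",
                    "TermDocumentMatrixReducer.factorizationFactory");
        RENAMED.put("LingoClusteringAlgorithm.factorizationQuality",
                    "TermDocumentMatrixReducer.factorizationQuality");
    }

    /** Returns a copy of the attribute map with renamed keys replaced; other keys are kept as-is. */
    static Map<String, Object> migrate(Map<String, Object> attributes) {
        Map<String, Object> migrated = new LinkedHashMap<>();
        for (Map.Entry<String, Object> entry : attributes.entrySet()) {
            String key = RENAMED.getOrDefault(entry.getKey(), entry.getKey());
            migrated.put(key, entry.getValue());
        }
        return migrated;
    }
}
```

The input map is left unmodified, so the helper can be applied to configuration shared with code that still targets an earlier release.
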
Dependency updates
A number of dependencies have been updated:
  • High Performance Primitive Collections to version 0.3.3
  • Google Guava to version r08
  • SimpleXML to version 2.4.1
  • Jackson JSON to version 1.7.4
  • Apache Commons Lang to version 2.6
  • SLF4J to version 1.6.1
  • Ehcache to version 1.7.2
  • Lucene to version 3.1.0
  • Apache HTTP client to version 4.1
  • Apache Commons Codec to version 1.4

For a complete list of improvements and bug fixes, see the JIRA issues fixed in version 3.5.0. For features introduced in earlier releases, see the release 3.4.0 and release 3.3.0 notes.
