Overview
Carrot2 is a Java-based open-source search results clustering engine for automatically clustering small collections of documents such as search results or document abstracts, into thematic categories. Carrot2 offers ready-to-use components for fetching search results from various sources.
In this instructor-led, live training, participants will learn how to set up and use Carrot2 to automatically organize Search results into thematic categories.
By the end of this training, participants will be able to:
- Install and configure Carrot2
- Post-process (cluster, tune, etc.) data generated by third-part search tools such as Bing, ElasticSearch and Solr (Lucene)
- Integrate Carrot2 into Java and non-Java applications
- Expose Carrot2 clustering as a remote service
Audience
- Developers
- System Administrators
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Requirements
- A general understanding of Java
Course Outline
Introduction
Overview of Carrot2 Architecture and Components
Document Sources
Clustering Algorithms
Calling Carrot2 APIs
Running Carrot2 Tools
Calling Carrot2 from C#/.Net Applications
Calling Carrot2 through REST API
Tuning Clustering
Troubleshooting
Real-time text clustering with Carrot Search
Other related projects based on Carrot2
Closing Remarks