InfoCrawler is open source software designed to support knowledge management operations.
One of InfoCrawler's major advantages is its distributed architecture: it is a 100% Java-based service that can run on one or multiple machines. Because its components communicate through XML, the administration, spider, and indexing engine can be installed on separate machines for better performance and scalability.
Managing your InfoCrawler-based collections is straightforward thanks to the software's intuitive web-based administration interface. The interface lets you monitor and administer different collections with ease, which keeps the software simple and flexible to use and reduces total cost of ownership.
Several other features set InfoCrawler apart from other knowledge management solutions. Its multi-threaded architecture supports parallel spidering with multiple threads per collection, which makes crawling more efficient. InfoCrawler also includes the Lucene indexing engine, which lets it index various file types, including HTML files, Microsoft Office documents, PDFs, XML, and more.
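As a rough illustration of what "multiple threads per collection" can look like, the sketch below crawls the URLs of one collection with a fixed-size thread pool from java.util.concurrent. It is not InfoCrawler's actual code: the class name ParallelSpiderSketch, the methods crawlCollection and fetch, and the placeholder fetch step are all assumptions made for this example, and it uses lambda syntax, so it needs a newer Java version than the minimum JRE listed for the software.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch only; InfoCrawler's real spider is not shown here.
public class ParallelSpiderSketch {

    // Placeholder for fetching one URL. A real spider would download and
    // parse the page; this stub just tags the URL so the flow is visible.
    static String fetch(String url) {
        return "fetched:" + url;
    }

    // Crawl every URL of one collection using the given number of worker threads.
    static List<String> crawlCollection(List<String> urls, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<String>> tasks = new ArrayList<>();
            for (String url : urls) {
                tasks.add(() -> fetch(url));
            }
            List<String> results = new ArrayList<>();
            // invokeAll blocks until every task has finished and returns
            // the futures in the same order as the submitted tasks.
            for (Future<String> future : pool.invokeAll(tasks)) {
                results.add(future.get());
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("crawl interrupted", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("fetch failed", e);
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<String> urls = Arrays.asList("http://example.com/a", "http://example.com/b");
        for (String result : crawlCollection(urls, 2)) {
            System.out.println(result);
        }
    }
}
```

Running several collections in parallel would amount to giving each collection its own pool, or sharing one larger pool across collections; which approach InfoCrawler takes is not stated here.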
InfoCrawler is an open technology solution that does not rely on proprietary technology. For example, URLs are maintained in a MySQL database, and Lucene (an open source indexer) serves as the indexing engine. Web administration runs on Apache Tomcat and JSP, communication between the administration component and the spider is done through XML, and the spider component itself is 100% Java-based.
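To make the XML-based communication between components more concrete, here is a minimal sketch of building and reading back an XML command using only the XML APIs that ship with Java. The message format is invented for illustration: the element name spiderCommand, its attributes, and the class SpiderMessageSketch are assumptions, not InfoCrawler's documented protocol.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

// Hypothetical sketch only; the message layout is not InfoCrawler's real format.
public class SpiderMessageSketch {

    // Build an assumed "start crawl" command as an XML string,
    // as an administration component might send it to a spider.
    static String buildCommand(String collection, int threads) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element root = doc.createElement("spiderCommand");
            root.setAttribute("action", "start");
            root.setAttribute("collection", collection);
            root.setAttribute("threads", Integer.toString(threads));
            doc.appendChild(root);

            // Serialize the DOM tree to text with the default transformer.
            StringWriter out = new StringWriter();
            TransformerFactory.newInstance().newTransformer()
                    .transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException("could not build XML command", e);
        }
    }

    // Parse a command back, as the receiving component might,
    // and read one attribute from the root element.
    static String readAttribute(String xml, String name) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            return doc.getDocumentElement().getAttribute(name);
        } catch (Exception e) {
            throw new IllegalStateException("could not parse XML command", e);
        }
    }

    public static void main(String[] args) {
        String xml = buildCommand("intranet-docs", 4);
        System.out.println(xml);
        System.out.println(readAttribute(xml, "collection"));
    }
}
```

Because both sides only exchange text, the sender and receiver can sit on different machines, which is what allows the administration, spider, and indexing components to be deployed separately.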
Finally, InfoCrawler is a flexible solution that integrates easily into larger projects because it works with standard technologies such as HTML, XML, JSP, Java, and JDBC.
To run InfoCrawler, you need Java Runtime Environment (JRE) 5 or higher, as well as any other requirements specified by the software vendor. The latest release also comes with major feature enhancements.
Version 1.1: N/A