April 23, 2008

InfoCrawler is open source software designed to support knowledge management operations.

Version 1.1

License GPL v3

Platform Linux

Supported Languages English

Homepage www.infocrawler.org

Developed by Rafik Kaddouri

If you're looking for an efficient and flexible knowledge management solution, you might want to check out InfoCrawler. This open source software enables you to crawl, index, and query different types of documents from various sources, including Intranets, News groups, FTP sites, public WEB sites, local, and remote file systems.

One of the major advantages of InfoCrawler is its distributed architecture that makes it a 100% Java-based service that can be executed on one or multiple machines. With its components communicating through XML, the software's administration, spider, and indexing engine can be installed on separate machines for better performance and scalability.

Managing your InfoCrawler-based collections is also very user-friendly thanks to the software's intuitive web-based administration interface. This interface helps you monitor and administer different collections with ease, making it simple and flexible to use, thus reducing total cost ownership.

The software also has other unique features that make it stand out from other knowledge management solutions. For instance, it uses a powerful multi-threaded architecture that enables parallel spidering and multiple threads per collection, making crawling more efficient. InfoCrawler is also equipped with Lucene indexing engine, which allows it to index various file types, including HTML files, Microsoft Office documents, PDFs, XML, and more.

InfoCrawler is an open technology solution that does not rely on proprietary technology. For example, it uses URLs that are maintained using MySql database, and it uses Lucene (Open Source Indexer) as the indexing engine. The software's web administration is done using Apache Tomcat and JSP, while communication between the administration and the spider is done through XML. The spider component, on the other hand, is 100% Java-based.

Finally, InfoCrawler is a flexible software solution that can be integrated easily into larger projects as it works with different standards such as HTML, XML, JSP, Java, and JDBC.

If you're looking for a reliable knowledge management solution, you need to have Java Runtime Environment (JRE) 5 or higher, as well as other requirements as specified by the software vendor. The latest release also comes with major feature enhancements.

What's New

Version 1.1: N/A

Free Download 128M

Softpile

Free Downloads

InfoCrawler

Most Popular

Related Downloads