Softpile

Free Downloads

Managing Gigabytes for Java

June 9, 2009
Managing Gigabytes for Java (MG4J) is free Java software that provides full-text indexing for large document collections, so that gigabytes of text can be indexed and searched efficiently.
Version 3.0
License LGPL
Platform Linux
Supported Languages English
Homepage archive4j.dsi.unimi.it
Developed by Sebastiano Vigna
Managing Gigabytes for Java (MG4J) is a free full-text indexing system, written in Java, for large document collections. It also offers several optimised classes, such as fast and compact mutable strings, bit-level I/O, fast unsynchronised buffered streams, and minimal perfect hashing for large string collections. Since version 1.1, MG4J has provided state-of-the-art features, including BM25 scoring and new research algorithms, which make it a highly customisable, high-performance text-indexing system.
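
To give a feel for the BM25 scoring mentioned above, here is a minimal, self-contained sketch of the standard BM25 formula. It is not MG4J's implementation or API: the class name Bm25Sketch, its method names, and the parameter values k1 = 1.2 and b = 0.75 are assumptions chosen for illustration.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative BM25 scorer (hypothetical class, not part of MG4J). */
final class Bm25Sketch {
    // Commonly used default parameters; assumed here, not taken from MG4J.
    static final double K1 = 1.2;
    static final double B = 0.75;

    /** Inverse document frequency, in the "+1" variant that stays positive. */
    static double idf(int numDocs, int docFreq) {
        return Math.log(1.0 + (numDocs - docFreq + 0.5) / (docFreq + 0.5));
    }

    /** BM25 contribution of a single term to a single document's score. */
    static double termScore(int termFreq, int docLength, double avgDocLength,
                            int numDocs, int docFreq) {
        if (termFreq == 0) return 0.0;
        // Length normalisation: longer-than-average documents are penalised.
        double norm = K1 * (1.0 - B + B * docLength / avgDocLength);
        return idf(numDocs, docFreq) * termFreq * (K1 + 1.0) / (termFreq + norm);
    }

    /** Scores a tokenised document against a query by summing per-term scores. */
    static double score(List<String> query, List<String> doc,
                        Map<String, Integer> docFreqs, int numDocs,
                        double avgDocLength) {
        Map<String, Integer> termFreqs = new HashMap<>();
        for (String token : doc) termFreqs.merge(token, 1, Integer::sum);
        double total = 0.0;
        for (String term : query) {
            int tf = termFreqs.getOrDefault(term, 0);
            int df = docFreqs.getOrDefault(term, 0);
            total += termScore(tf, doc.size(), avgDocLength, numDocs, df);
        }
        return total;
    }
}
```

In this sketch a term scores higher when it occurs more often in the document, when the document is shorter than average, and when the term is rare in the collection.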

MG4J supports document collections and factories, which let users analyse, index, and query large document collections in a consistent way. The software also provides easy-to-understand snippets that highlight the relevant passages in retrieved documents.
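
As a rough illustration of what a highlighted snippet is, the sketch below extracts a few words of context around the first query hit in a text and marks the matching words with asterisks. It is a toy under stated assumptions: SnippetSketch, snippet, and normalise are hypothetical names, and MG4J's own snippet generation is not shown here.

```java
import java.util.Locale;
import java.util.Set;

/** Illustrative snippet extractor (hypothetical class, not part of MG4J). */
final class SnippetSketch {
    /**
     * Returns up to `context` words on each side of the first query hit,
     * wrapping every matching word in asterisks. Returns "" if nothing matches.
     */
    static String snippet(String text, Set<String> queryTerms, int context) {
        String[] words = text.trim().split("\\s+");
        int hit = -1;
        for (int i = 0; i < words.length; i++) {
            if (queryTerms.contains(normalise(words[i]))) { hit = i; break; }
        }
        if (hit < 0) return "";
        int from = Math.max(0, hit - context);
        int to = Math.min(words.length, hit + context + 1);
        StringBuilder sb = new StringBuilder();
        if (from > 0) sb.append("... ");          // text was cut on the left
        for (int i = from; i < to; i++) {
            if (i > from) sb.append(' ');
            boolean match = queryTerms.contains(normalise(words[i]));
            sb.append(match ? "*" + words[i] + "*" : words[i]);
        }
        if (to < words.length) sb.append(" ..."); // text was cut on the right
        return sb.toString();
    }

    /** Lower-cases a word and strips everything except letters and digits. */
    static String normalise(String word) {
        return word.toLowerCase(Locale.ROOT).replaceAll("[^\\p{L}\\p{N}]", "");
    }
}
```

Query terms are expected in lower case, because each word of the text is normalised to lower case before it is compared.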

As for efficiency, the software scales to hundreds of millions of documents and can index the TREC GOV2 collection with ease. Rather than displaying meaningless data about indexing speeds, MG4J encourages users to try it themselves.

One distinguishing feature is MG4J's multi-index interval semantics: each query produces a list of intervals satisfying it, which provides the basis for several high-precision scorers and for very efficient read queries. MG4J also provides expressive operators that allow efficient implementation of phrase queries, proximity restrictions, ordered conjunction, and queries combining multiple indices. Each operator is represented internally by an abstract object, making it easy to plug in your favourite syntax.
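
The idea of answering a query with intervals can be sketched for the simplest case: a conjunction of two terms inside one document. Given the sorted positions at which each term occurs, the code below lists the minimal intervals that contain both terms, then derives a proximity restriction and an ordered conjunction by filtering that list. This is an illustration only: IntervalSketch and its methods are hypothetical names and do not reflect MG4J's classes or algorithms. It assumes the two position arrays are sorted and share no position.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative minimal-interval operators (hypothetical class, not part of MG4J). */
final class IntervalSketch {
    /** Closed interval of token positions; inOrder is true when the first term is the left end. */
    static final class Interval {
        final int left, right;
        final boolean inOrder;
        Interval(int left, int right, boolean inOrder) {
            this.left = left; this.right = right; this.inOrder = inOrder;
        }
        int length() { return right - left + 1; }
    }

    /** Minimal intervals containing one occurrence of each term. */
    static List<Interval> conjunction(int[] a, int[] b) {
        List<Interval> out = new ArrayList<>();
        int i = 0, j = 0;
        while (i < a.length && j < b.length) {
            if (a[i] < b[j]) {
                // Move to the last occurrence of the first term before b[j].
                while (i + 1 < a.length && a[i + 1] < b[j]) i++;
                out.add(new Interval(a[i], b[j], true));
                i++;
            } else {
                // Move to the last occurrence of the second term before a[i].
                while (j + 1 < b.length && b[j + 1] < a[i]) j++;
                out.add(new Interval(b[j], a[i], false));
                j++;
            }
        }
        return out;
    }

    /** Proximity restriction: keep intervals spanning at most `window` positions. */
    static List<Interval> within(List<Interval> in, int window) {
        List<Interval> out = new ArrayList<>();
        for (Interval iv : in) if (iv.length() <= window) out.add(iv);
        return out;
    }

    /** Ordered conjunction: keep intervals where the first term precedes the second. */
    static List<Interval> ordered(List<Interval> in) {
        List<Interval> out = new ArrayList<>();
        for (Interval iv : in) if (iv.inOrder) out.add(iv);
        return out;
    }

    /** Two-word phrase: the first term immediately followed by the second. */
    static boolean phrase(int[] a, int[] b) {
        return !within(ordered(conjunction(a, b)), 2).isEmpty();
    }
}
```

Because every operator consumes and produces a list of intervals, the operators compose: a phrase query here is simply an ordered conjunction restricted to a window of two positions.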

Other features include virtual fields, the flexibility to build smaller indices, document collection/factory interfaces through which users can present their own data to MG4J, distributed processing, multithreading, and clustering capabilities.

In conclusion, MG4J is a customisable, high-performance text-indexing system for large document collections that incorporates many advanced features. Its support for distributed processing and multithreading makes it a good fit for organisations that handle large volumes of data and need to execute complex queries.
What's New

Version 3.0: N/A

Free Download 934K

Copyright © 1999-2025 Softpile Free Downloads