• Home
  • Most Popular
  • Submit
  • About Us
  • Contact Us

Softpile

Free Downloads

Categories
  • Home
  • Most Popular
  • Communications
  • Desktop
  • Games & Entertainment
  • Graphic Apps
  • Network & Internet
  • Security & Privacy
  • System Utilities
Alternative to itextsharp 2022.11.10347
IronPDF offers an itextsharp alternative for HTML to PDF conversion with C# code examples, documentation, and ...
VShell Server for Linux and Mac 4.8
VShell is a versatile and secure file transfer server that supports multiple protocols and is compatible ...
PDF Studio PDF Editor for Linux 2022
PDF Studio is a cost-effective PDF editor that delivers full compatibility with the PDF Standard. It's ...
VQ Probe for Linux 1.5
VQ Probe is a comprehensive software tool that enables objective and subjective video quality analysis. The ...
Resilient Server 2.3
This Debian GNU/Linux (Buster) based software has a customized partitioning scheme that enhances robustness against filesystem ...
Valentina Studio for Linux 9.6
Valentina Studio is a cross-platform GUI manager for Mac, Windows, and Linux. It allows users to ...
VPN Lifeguard for Linux 1.0.58
The software monitors VPN connection and automatically terminates apps during connection loss, re-establishes the connection and ...
G_Viewer 0.84
G_Viewer is a Linux software that serves as both a file system and photo/image viewer. It ...
Checksome File Hash Tool for Linux 1.1
This software allows for the generation and verification of file hashes. It is a quick and ...
KeyWrangler Password Manager for Linux 1.2
A password management software that is secure, offline and extensible. It offers military-grade encryption to protect ...
Home Linux Text::Record::Deduper Download

Text::Record::Deduper

April 2, 2009
This software categorizes text records as complete, partial, or near duplicates.
Version 0.05
License Perl Artistic License
Platform Linux
Supported Languages English
Homepage search.cpan.org
Developed by Kim Ryan
If you're looking for a powerful Perl module to help you streamline your duplicate text records, Text::Record::Deduper could be just what you're looking for. This module comes equipped with complete, partial, and near duplicate records, allowing you to customize your deduplication process to your specific needs.

To get started, simply import the module and create a new instance of Text::Record::Deduper. From there, you can use the dedupe_file() method to remove entire lines that are duplicated, or configure the module to dedupe comma separated records based on specific fields.

One of the most valuable features of Text::Record::Deduper is its ability to identify "near" duplicates by allowing for given name aliases. For example, if you have records containing variations of the same name (such as Bob and Robert), you can configure the module to recognize these aliases and group the records accordingly.

Text::Record::Deduper also makes it easy to generate reports and split your records into unique and duplicate files. And with options to ignore case sensitivity and leading/trailing white space, you can be sure that you're finding all of the true duplicates in your data.

Overall, if you're dealing with large sets of text records and need a way to quickly identify duplicates and streamline your data, Text::Record::Deduper is definitely worth checking out.
What's New

Version 0.05: N/A

Free Download 9.9K
294
  • Share on:

Most Popular

  1. Quicksilver Forums 1.4.2
    154
  2. Dvgrab 3.4
    102
  3. DynVPN 1.0
    89
  4. CherryTV 0.1
    81
  5. SlideMap 1.2.2
    80
  6. porm r2
    73
  7. Clewarecontrol 0.8
    72
  8. Java Games 1.0
    72
  9. Swiftfox 3.0b5pre-2
    71
  10. fuseftp 0.8
    71

Related Downloads

xgnokii
A software suite for managing phones.
Font Redate
FontRedate software modifies internal dates in OpenType, PostScript, and TrueType font files. ...
Save Scummer
"Save Scummer" is a Seven Day Roguelike software developed by Jeff Lait, ...
django-cnote
This software is a user notification system that relies on cookies to ...
ClearLisp
ClearLisp is a feature-rich Common Lisp interpreter built using C# language. It ...
trie
Trie is coded in Python and implemented as a prefix tree.
trytond_account
This software includes a financial and accounting module.
PDC
PDC is a programmer-focused desktop calculator following the 'bc' style, delivering advanced ...
AmAvIs
AMaViS-ng is a modular software that is a reimplementation of amavisd and ...
RABL
RABL is an automated real-time blacklisting server designed for statistical filters. It ...
Copyright © 1999-2025 Softpile Free Downloads
  • Most Popular
  • Submit
  • About Us
  • Contact Us
  • Privacy Policy
  • Disclaimer
  • Terms of Use

Can we use your data to tailor ads for you?

Our partners will collect data and use cookies for ad personalization and measurement.

By choosing "I agree", closing this pop-up or clicking on any element on the page, you agree to the use of cookies to help us provide you with a better user experience.

Learn how Softpile and our partners collect and use data.

You can change your choice at any time in our privacy center.

Cookie Settings

Our website stores four types of cookies. At any time you can choose which cookies you accept and which you refuse. You can read more about what cookies are and what types of cookies we store in our Cookie Policy.

are necessary for technical reasons. Without them, this website may not function properly.

are necessary for specific functionality on the website. Without them, some features may be disabled.

allow us to analyse website use and to improve the visitor's experience.

allow us to personalise your experience and to send you relevant content and offers, on this website and other websites.