• Home
  • Most Popular
  • Submit
  • About Us
  • Contact Us

Softpile

Free Downloads

Categories
  • Home
  • Most Popular
  • Communications
  • Desktop
  • Games & Entertainment
  • Graphic Apps
  • Network & Internet
  • Security & Privacy
  • System Utilities
Alternative to itextsharp 2022.11.10347
IronPDF offers an itextsharp alternative for HTML to PDF conversion with C# code examples, documentation, and ...
VShell Server for Linux and Mac 4.8
VShell is a versatile and secure file transfer server that supports multiple protocols and is compatible ...
PDF Studio PDF Editor for Linux 2022
PDF Studio is a cost-effective PDF editor that delivers full compatibility with the PDF Standard. It's ...
VQ Probe for Linux 1.5
VQ Probe is a comprehensive software tool that enables objective and subjective video quality analysis. The ...
Resilient Server 2.3
This Debian GNU/Linux (Buster) based software has a customized partitioning scheme that enhances robustness against filesystem ...
Valentina Studio for Linux 9.6
Valentina Studio is a cross-platform GUI manager for Mac, Windows, and Linux. It allows users to ...
VPN Lifeguard for Linux 1.0.58
The software monitors VPN connection and automatically terminates apps during connection loss, re-establishes the connection and ...
G_Viewer 0.84
G_Viewer is a Linux software that serves as both a file system and photo/image viewer. It ...
Checksome File Hash Tool for Linux 1.1
This software allows for the generation and verification of file hashes. It is a quick and ...
KeyWrangler Password Manager for Linux 1.2
A password management software that is secure, offline and extensible. It offers military-grade encryption to protect ...
Home Linux dedupe Download

dedupe

June 10, 2009
A Python library designed for deduplication purposes, helping to identify and remove duplicate values from datasets efficient enough to handle large datasets.
Version 2009-06-10
License GPL v3
Platform Linux
Supported Languages English
Homepage launchpad.net
Developed by Graham Poulter
Dedupe is an impressive Python library that can help you detect similar rows within a table of records such as a CSV file or database. It can also be used for linking the similar rows between two different tables. The processing of records with Dedupe is straightforward and involves three primary steps.

First, the records are indexed into blocks. Then, the comparison function compares all the pairs of records within each block. Finally, the pairs of records are clustered such that they either belong to a match or a non-match cluster.

In summary, if you have a database or CSV file with records that require similarity detection or linking, Dedupe is a reliable tool to consider as it provides a clean and efficient process with clear and accurate output.
What's New

Version 2009-06-10: N/A

Free Download
327
  • Share on:

Most Popular

  1. Quicksilver Forums 1.4.2
    155
  2. Dvgrab 3.4
    102
  3. DynVPN 1.0
    92
  4. SlideMap 1.2.2
    82
  5. CherryTV 0.1
    81
  6. porm r2
    79
  7. Swiftfox 3.0b5pre-2
    77
  8. Java Games 1.0
    76
  9. Clewarecontrol 0.8
    75
  10. fuseftp 0.8
    74

Related Downloads

Topal
Topal software integrates GnuPG and Pine/Alpine and provides a secure email environment. ...
Murdoc
Murdoc is a tool for documenting the activities and processes of system ...
Apwal
Apwal is a Linux launcher software with an integrated editor. It's a ...
KCalculator
KCalculator is a user-friendly, compact calculator designed for the KDE desktop environment. ...
Java Binary Enhancement Tool
Java Binary Enhancement Tool is a software tool that allows for Java ...
CPAN::Unpack
This software can unpack CPAN distributions quickly and easily, allowing users to ...
QGRUBEditor
QGRUBEditor is a tool that enables users to view and modify the ...
Linux Kernel Spinlock Metering
Linux Kernel Spinlock Metering patch facilitates building i386, ia64, Alpha, Sparc64, or ...
Babel Router
Babel Router is a protocol which provides loop-free and distance-vector routing, allowing ...
PFStmo
PFStmo project has the latest tone mapping techniques implemented in its software.
Copyright © 1999-2025 Softpile Free Downloads
  • Most Popular
  • Submit
  • About Us
  • Contact Us
  • Privacy Policy
  • Disclaimer
  • Terms of Use

Can we use your data to tailor ads for you?

Our partners will collect data and use cookies for ad personalization and measurement.

By choosing "I agree", closing this pop-up or clicking on any element on the page, you agree to the use of cookies to help us provide you with a better user experience.

Learn how Softpile and our partners collect and use data.

You can change your choice at any time in our privacy center.

Cookie Settings

Our website stores four types of cookies. At any time you can choose which cookies you accept and which you refuse. You can read more about what cookies are and what types of cookies we store in our Cookie Policy.

are necessary for technical reasons. Without them, this website may not function properly.

are necessary for specific functionality on the website. Without them, some features may be disabled.

allow us to analyse website use and to improve the visitor's experience.

allow us to personalise your experience and to send you relevant content and offers, on this website and other websites.