jPDFText is a Java software that can retrieve text from PDF files.

This powerful tool is built upon Qoppas proprietary PDF technology, eliminating the need for third-party software or drivers. As an added bonus, jPDFText is platform-independent, meaning you can run it on any system that supports the Java runtime environment, including Windows, Linux, Unix (Solaris, HP UX, IBM AIX), and Mac OS X.
jPDFText boasts a robust set of features, including the ability to load PDF documents from files, network drives, URLs, or input streams, extract text in the logical reading order, and extract words as a vector of strings. Plus, because it's written in Java, you can be confident that your application will be compatible with a wide range of systems.
Deploying jPDFText is a breeze, with no need to install or configure additional drivers or software. It has been rigorously tested on JDK 1.4.2 and above, ensuring a smooth and seamless experience for users.
If you require additional information or have any questions about the capabilities of jPDFText, don't hesitate to reach out to the experts at Qoppa at [email protected]. And if you're interested in recognizing text in scanned PDFs or PDFs containing images, be sure to check out our Java OCR feature!
Version 2021R1:
Java 9 Support
Rich Text and Non-Latin Unicode Support in Form Fields