December 12, 2008

This Perl extension allows the extraction of terms from large data sets, along with providing a syntactic analysis in a head-modifier format, resulting in a more accessible and organized analysis of text data.

Version 0.5

License Perl Artistic License

Platform Linux

Supported Languages English

Homepage search.cpan.org

Developed by Thierry Hamon

Lingua::YaTeA is a powerful software tool designed to aid in the extraction of relevant terms from large language corpora. As a Perl extension, Lingua::YaTeA makes it easy to retrieve terms and provides a syntactic analysis of these terms in a head-modifier format.

Its SYNOPSIS provides a quick example of how to utilize Lingua::YaTeA and its features through the use of a simple Perl script. The module is the primary component of YaTeA, which targets extraction of noun phrases closely resembling terms from a given corpus.

To utilize this module's capabilities, the software requires a pre-processed corpus that is segmented into words and sentences, lemmatized, and tagged with part-of-speech (POS) information. Data provided with Lingua::YaTeA allows for seamless term extraction from English and French texts. However, more linguistic features can be integrated to extract terms from other languages, even for sub-language or tagset modification.

The analysis strategy for term candidate identification is drawn from parsing patterns and endogenous disambiguation. Additionally, external resources, such as testified term lists, can be used for exogenous disambiguation for term candidate identification and analysis.

Overall, Lingua::YaTeA is a great tool for any linguist, researcher, or developer looking to extract terms and gain a better understanding of the syntactical analysis of the corpus. Its extensive features and flexibility make it an essential software tool for anyone working with large corpora.

What's New

Version 0.5: N/A

Free Download 248K

Softpile

Free Downloads

Lingua::YaTeA

Most Popular

Related Downloads