TLGU is software that converts TLG or PHI input files to Unicode (UTF-8) for easier use.
The software includes several options, including the -b option which inserts a form feed and citation information on every "book" citation change. By default, the program outputs line feeds only (see also the -p option). The -r option forces a change to roman text after each citation block is encountered for primarily Roman text, such as in doccan1.txt and doccan2.txt.
Other options include:
- -v: highest-level reference citation is included before each text line (v-level)
- -w: reference citation is included before each text line (w-level)
- -x: reference citation is included before each text line (x-level)
- -y: reference citation is included before each text line (y-level)
- -z: lowest-level reference citation is included before each text line (z-level)
Additionally, users can create a custom citation format string with the -Z option, e.g. "%A/%B/%x/%y/%zt", which outputs the contents of the A, B citation description levels, followed by x, y, z citation reference levels, followed by a TAB character. The -e option allows users to define a custom blank citation string, such as "-" or "[NONE]".
Other options include the -B option, which inserts blank space (a tab) before each and every line, and the -C, -S, -V, and -W options for outputting citation debug information, special code debug information, block processing information, and outputting each work (book) as a separate file in the form output_file-xxx.txt, respectively.
In the latest release, TLGU now compiles under gcc 4.x without warnings, and the accompanying Hellenic Polytonic HOWTO has been updated. Overall, TLGU offers a comprehensive solution for converting TLG and PHI representation beta code text and citation information into Unicode (UTF-8) with several useful options for customization.
Version 1.4: N/A