Personalchef is a Python tool that analyzes webserver logs and outputs a sorted list of search words that lead to the page.
Version: 0.3Personalchef is a Python tool that analyzes webserver logs and outputs a sorted list of search words that lead to the page the log belongs to. A bit rudimentary, but very fast and Psyco enabled.
Operating System: Linux
Usage: ./personalchef.py < logfile >
Personalchef is a tool that parses logfiles in order to find out what search-words lead to your page. Currently it supports all logfiles where the Referrer is logged in the eleventh field when counting fields as whitespace-seperated strings of non-whitespace. Personalchef does not care about quotes (i.e. "GET /foo.html") counts as two fields.
Commandline should be obvious. Same goes for querytags. If you find new patterns, I'd be glad to hear bout it.
Thanks go out to towo for the nice name and Patrick for his idea on how to increase speed by factor 4...
From version 0.3 onward, personalchef makes use of the Psyco optimizing Python module if it's installed. If you have Psyco installed but don't want personalchef to use it, remove or comment out the line "import psycho" near the top of the program.
Note that Psyco yields about 30% speed increase but nearly doubles RAM usage. On an Athlon-1833, analyzing a 250k-line log file this means 11s and 6M versus 7s and 11M.