I found a function on PHP.net that allows extraction of text in a PDF, however it returns a whole mess of strange characters, copyrights about fonts, verisign, microsoft,Arial fonts, etc.
Here's the function:function pdf2string($sourcefile)
{
$fp = fopen($sourcefile, 'rb');
$content =...