------------------------------------------------------------------------
pdftotext
------------------------------------------------------------------------

I found this useful when dealing with tables.

-raw ... keep the text in content stream order. This is a hack which
         often "undoes" column formatting, etc. Use of raw mode is no longer 
         recommended.