Firstly, I was not involved in preparing this document ( should say have zero knowledge about PageMaker also ) and my aim to load some information from that into relational database. I am reluctant to parse PageMaker binary file directly , so I am trying to convert it to something else.
Two possible formats are HTML and ASCII ( either with layout or tagged text )
Images are separate objects and identified on right mouse click as a link to TIF file.
When converted to HTML this links are referenced but from other hand any layout lost. By losing layout I mean the original table-like structure where positions are driven by tabs is not preserved.
HTML output
<P> 09A <IMG SRC="../../INBOUN~1/html/LITRE13.JPG" WIDTH="3" HEIGHT="7"
ALIGN="BOTTOM"> . . Other
<P>s 2203.00.22 _ Containing more than 2.5 % vol., but not more than <BR>
4.35 % vol. <I>per l
al</I> $21.096<SUP>2</SUP> AU $21.096<SUP>2</SUP>
I can't give a proper example for ASCII because it is going to be wrapped anyway , but I am able to do parsing based on certain positions for the cells when layout is preserved.
Document is about 3MB.
I did exporting from story editor selecting all stories ( in this case I have an additional problem footnotes which are separate stories don't follow the page itself ) and directly from file which preserves layout , but references to external files lost ( <IMG SRC="../../INBOUN~1/html/LITRE13... ) plus footnote link ($21.096<SUP>2</SUP>) .
After installing trial edition, at the first sight, I didn't find in InDesign more options for exporting.