Hello! I have a large number of Word documents (> 100,000) that have to be processed.
1. Read the .doc file
2. Extract account number, name and date
3. convert to RTF format
4. write HL7 messages around the rtf component
5. write the file
I have read the previous notes and tried using Word automation but it is very slow. Is there a faster way to read a DOC file into a string variable, and then read it and convert it from there? Is there a tool kit I can use?
Any help or pointers gratefully accepted.
Thanks!
David
1. Read the .doc file
2. Extract account number, name and date
3. convert to RTF format
4. write HL7 messages around the rtf component
5. write the file
I have read the previous notes and tried using Word automation but it is very slow. Is there a faster way to read a DOC file into a string variable, and then read it and convert it from there? Is there a tool kit I can use?
Any help or pointers gratefully accepted.
Thanks!
David