Hello everyone. I need a way to parse Rich Text Format (RTF) so that I can extract the plain text from it. As I understand it, RTFEditorKit is meant for this sort of thing, but I ran into some problems:
1) It sometimes messes up the formatting (in my case, table structure).
2) It throws
java.io.IOException: Too many close-groups in RTF text
which, as I understand it, means some } has no matching {, possibly because of constructs from the newer Word 2000 "standard".
3) It doesn't accept fragments of a file. I try to parse
{\lang1033\langfe1049\langnp1033 Some }{\b\fs28\lang1033\langfe1049\langnp1033 formated}{\lang1033\langfe1049\langnp1033 text }
and get nothing out of it.
4) I have some non-ASCII characters in the text, and I need to translate them in a way similar to how RTF does it.
The main problems are 3 and 4, but answers to any of them would be appreciated.
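For problems 3 and 4, here is a minimal sketch of the kind of thing I am attempting. It rests on two assumptions that I have not confirmed: that RTFEditorKit only collects text inside a group opened with \rtf, so a bare fragment has to be wrapped in {\rtf1\ansi ...} first, and that "translating" non-ASCII characters means writing them as RTF \uN? escapes. The class and method names (RtfToText, escape, toPlainText) are made up for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import javax.swing.text.BadLocationException;
import javax.swing.text.DefaultStyledDocument;
import javax.swing.text.rtf.RTFEditorKit;

// Hypothetical helper class, not part of any library.
final class RtfToText {

    // Escapes plain text for embedding in RTF.
    // Backslash and braces get a leading backslash; every UTF-16 code unit
    // above 127 becomes \uN? where N is the signed 16-bit value and '?' is
    // the fallback character for readers that do not understand \u.
    static String escape(String plain) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < plain.length(); i++) {
            char c = plain.charAt(i);
            if (c == '\\' || c == '{' || c == '}') {
                sb.append('\\').append(c);
            } else if (c < 128) {
                sb.append(c);
            } else {
                sb.append("\\u").append((int) (short) c).append('?');
            }
        }
        return sb.toString();
    }

    // Wraps an RTF fragment in a minimal document header (assumption: the
    // fragment is brace-balanced on its own) and returns the text that
    // RTFEditorKit extracts from it.
    static String toPlainText(String fragment)
            throws IOException, BadLocationException {
        String rtf = "{\\rtf1\\ansi " + fragment + "}";
        RTFEditorKit kit = new RTFEditorKit();
        DefaultStyledDocument doc = new DefaultStyledDocument();
        byte[] bytes = rtf.getBytes(StandardCharsets.US_ASCII);
        kit.read(new ByteArrayInputStream(bytes), doc, 0);
        return doc.getText(0, doc.getLength());
    }
}
```

Calling RtfToText.toPlainText(...) with the fragment from point 3 is the case I care about. The result may carry trailing whitespace or a newline, so trimming it before comparing seems sensible. This does nothing for the table problem in point 1 or the unbalanced braces in point 2.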