How can I separate non-Latin characters (Greek) from a UTF-8 text file and replace them with a string?
I need to replace each whole non-Latin sentence, not every character or word, so the function / routine / algorithm must be able to recognize a full run of non-Latin characters as one sentence and replace it.
<tr valign="top" bgcolor="ffffff">
<td bgcolor="446699">Here is the first non latin chars</td>
</tr>
<tr>
<td>Here are some more non latin chars</td>
</tr>
etc...
I need code to "scan" all the UTF-8 sentences and replace each one with whatever I want, for example:
<tr valign="top" bgcolor="ffffff">
<td bgcolor="446699"><REPLACED WITH FIRST CUSTOM TAG></td>
</tr>
<tr>
<td><REPLACED WITH SECOND CUSTOM TAG></td>
</tr>
Thanks in advance,
Dimitris
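One possible approach, as a minimal sketch only: the question does not name a language, so Python is an assumption here, and the names `GREEK_RUN` and `replace_greek_runs` are made up for illustration. The idea is to treat a "sentence" as a run that starts and ends with a Greek character (Unicode blocks U+0370–U+03FF and U+1F00–U+1FFF) and may contain spaces, digits and basic punctuation in between. Because `<` and `>` are not part of the run, each match stops at the surrounding HTML tags. Each run is then swapped for the next replacement string in a list.

```python
import re

# Greek and Coptic block plus Greek Extended (polytonic) block.
GREEK = r"\u0370-\u03FF\u1F00-\u1FFF"

# A run starts with a Greek character, may continue over Greek characters,
# whitespace, digits and basic punctuation, and must end on a Greek
# character; trailing sentence punctuation is swallowed as well.
GREEK_RUN = re.compile(
    rf"[{GREEK}](?:[{GREEK}\s\d.,;:!?()-]*[{GREEK}])?[.;!?]*"
)


def replace_greek_runs(text, replacements):
    """Replace each Greek run in text with the next item of replacements.

    If there are more runs than replacements, the remaining runs are
    left unchanged (an assumption; adjust to taste).
    """
    remaining = iter(replacements)

    def substitute(match):
        # Fall back to the original text when replacements run out.
        return next(remaining, match.group(0))

    return GREEK_RUN.sub(substitute, text)


if __name__ == "__main__":
    sample = (
        '<td bgcolor="446699">Καλημέρα κόσμε</td>\n'
        "<td>Τι κάνεις;</td>"
    )
    print(replace_greek_runs(sample, ["<FIRST CUSTOM TAG>", "<SECOND CUSTOM TAG>"]))
```

For a real file, read it with `open(path, encoding="utf-8")` and pass the contents to `replace_greek_runs`. A regex is enough for flat markup like the rows above; if Greek text can also appear inside attribute values or scripts, an HTML parser that visits only text nodes would be the safer route.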