I am trying to make a match for a table that contains several tags, but always contains the sentence "Hello World"
I have been trying to match this with:
on the below html:
but it matches too much:
The table i want to match is:
What am I doing wrong with the regexp??
Would really appreciate som help from a regexp guru!
/Masali
I have been trying to match this with:
Code:
<table .*>[\w\s\t\r\n\d\D]*?Hello World[\w\s\t\r\n\d\D]*?(?!table)<\/table>
on the below html:
Code:
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">
<title>Example</title>
</head>
<body topmargin="0" leftmargin="0" bgcolor="#CC3399">
<div align="center">
<table id="table3" style="border: 2px solid #000000" cellSpacing="1" width="800" bgColor="#ffffff" border="1">
<tr>
<td><center><img border="0" src="logo.gif" width="800" height="190">
<table id="table4" style="font-size: 16px; font-family: Verdana, Arial, Helvetica, sans-serif" border="0">
<tr>
<td><hr width="190" color="#000000" SIZE="1"></td>
<td></td>
<td><hr width="190" color="#000000" SIZE="1"></td>
</tr>
</table>
<br>
<font size="+2"><a href="a.html">A</a> ~
<a href="b.html">B</a></font><br>
<br>
<table style="border: 2px dotted #7b68ee" width="760" bgColor="#ffffff" border="0" id="table5">
<tr>
<td align="middle"><br>
<font size="+1">Hello World<br>
<a href="test">
<img src="logo.gif" border="0"></a></font>
<p><br>
<br>
</td>
</tr>
</table>
</center></td>
</tr>
</table>
</div>
</body>
</html>
but it matches too much:
Code:
<table id="table3" style="border: 2px solid #000000" cellSpacing="1" width="800" bgColor="#ffffff" border="1">
<tr>
<td><center><img border="0" src="logo.gif" width="800" height="190">
<table id="table4" style="font-size: 16px; font-family: Verdana, Arial, Helvetica, sans-serif" border="0">
<tr>
<td><hr width="190" color="#000000" SIZE="1"></td>
<td></td>
<td><hr width="190" color="#000000" SIZE="1"></td>
</tr>
</table>
<br>
<font size="+2"><a href="a.html">A</a> ~
<a href="b.html">B</a></font><br>
<br>
<table style="border: 2px dotted #7b68ee" width="760" bgColor="#ffffff" border="0" id="table5">
<tr>
<td align="middle"><br>
<font size="+1">Hello World<br>
<a href="test">
<img src="logo.gif" border="0"></a></font>
<p><br>
<br>
</td>
</tr>
</table>
The table i want to match is:
Code:
<table style="border: 2px dotted #7b68ee" width="760" bgColor="#ffffff" border="0" id="table5">
<tr>
<td align="middle"><br>
<font size="+1">Hello World<br>
<a href="test">
<img src="logo.gif" border="0"></a></font>
<p><br>
<br>
</td>
</tr>
</table>
What am I doing wrong with the regexp??
Would really appreciate som help from a regexp guru!
/Masali