regex image source html file 1

safra · Jan 31, 2004

Hi,

Can anyone help me with a regular expression to find the file locations of all images in an HTML file?

Thanks!

duncdude · Jan 31, 2004

this kind of thing:-

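$/ = undef;    # slurp mode: read the whole file in one go
open (INFILE, "< html.txt") or die "Cannot open html.txt: $!";
$_ = <INFILE>;
close INFILE;

# print whatever follows src= in each <img src=...> tag
while (m/<img src=([^ ]+)/ig) {
    print "$1\n";
}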

Kind Regards
Duncan

safra · Feb 1, 2004

Works great, thanks!

ishnid · Feb 2, 2004

You should only use a solution like that if you know that there won't be a line break in the tag and you know the src attribute is always going to be the first one. I.e., neither of the following will work:

Code:

<img
src="blah.png" />

Code:

<img alt="A picture" src="blah.png" />
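
To see both failures concretely, here is a small sketch that runs the pattern from the earlier reply against the two snippets above, plus a plain single-line tag for comparison. The helper name find_src and the sample labels are made up for this illustration.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): collects whatever the
# pattern from the earlier reply captures in a given HTML string.
sub find_src {
    my ($html) = @_;
    my @found;
    while ($html =~ m/<img src=([^ ]+)/ig) {
        push @found, $1;
    }
    return @found;
}

my @samples = (
    [ 'src first, one line' => qq{<img src="blah.png" />} ],
    [ 'line break in tag'   => qq{<img\nsrc="blah.png" />} ],
    [ 'alt before src'      => qq{<img alt="A picture" src="blah.png" />} ],
);

# Report how many matches the pattern finds in each sample.
for my $sample (@samples) {
    my ($label, $html) = @$sample;
    my @found = find_src($html);
    printf "%-20s %d match(es) %s\n", $label, scalar @found, "@found";
}
```

Only the first sample produces a match, and the captured value still carries its surrounding quotes, because [^ ]+ takes everything up to the next space.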

If you're going to be parsing pages written by others, try using a module like HTML::TokeParser::Simple, which makes it extremely easy to extract the src attribute from all <img> tags:

http://search.cpan.org/~ovid/HTML-TokeParser-Simple-2.2/Simple.pm
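
A minimal sketch of how that might look, assuming a reasonably recent release of HTML::TokeParser::Simple is installed from CPAN. The helper name image_sources and the sample HTML are made up for the example, and the get_attr method name is the one used in recent releases; older releases may spell it differently, so check the documentation for the version you have.

```perl
use strict;
use warnings;
use HTML::TokeParser::Simple;

# Hypothetical helper: returns the src value of every <img> tag
# found in an HTML string.
sub image_sources {
    my ($html) = @_;

    # A scalar reference tells the parser to read the string itself
    # rather than treat it as a file name.
    my $parser = HTML::TokeParser::Simple->new(\$html);

    my @sources;
    while (my $token = $parser->get_token) {
        next unless $token->is_start_tag('img');
        my $src = $token->get_attr('src');
        push @sources, $src if defined $src;   # skip <img> tags without src
    }
    return @sources;
}

# Sample input (made up): one tag with a line break, one with alt first.
my $html = <<'HTML';
<img
src="blah.png" />
<img alt="A picture" src="other.png" />
HTML

print "$_\n" for image_sources($html);
```

Because the parser works on tokens rather than on raw text, line breaks inside the tag and the order of the attributes should not matter, and the value comes back without its quotes.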

duncdude · Feb 2, 2004

thanks safra!

Kind Regards
Duncan
