Title |
Pattern Title
|
Expression |
^<a\s+href\s*=\s*"http:\/\/([^"]*)"([^>]*)>(.*?(?=<\/a>))<\/a>$
|
Description |
Regexp to find all external links in an HTML string.
Can easily be modified to handle all links or other protocols (such as file, https, or ftp).
Uses a lookahead assertion and a non-greedy quantifier to find the closing </a> while still allowing HTML tags between the opening and closing A tags.
Takes into account that there could be line breaks and other whitespace characters inside the tag.
I am using it to find all external links in embedded HTML code and then (1) change the target of the link and (2) insert a "Leaving Site" logo to show that you are leaving the site.
|
Matches |
<a href="http://www.mysite.com">my external link</a> | <a href="http:/
|
Non-Matches |
<a href="myinternalpage.html">my internal link</a>
|
Author |
Anders Rask
Rating: Not yet rated.
|
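The use case in the description (retarget each external link and add a "Leaving Site" logo) could be sketched in Python roughly as follows. This is only an illustration, not the author's code. `ORIGINAL` is the expression exactly as listed; because of its `^` and `$` anchors it matches only when the whole input is a single link. `EMBEDDED` is an assumed variant for links embedded in a larger document: it drops the anchors, accepts https and ftp as well as http, captures the full URL in group 1, and adds the `IGNORECASE` and `DOTALL` flags. The logo path, the `_blank` target, and the function name `mark_external_links` are hypothetical.

```python
import re

# The expression exactly as listed in this entry.
ORIGINAL = re.compile(
    r'^<a\s+href\s*=\s*"http:\/\/([^"]*)"([^>]*)>(.*?(?=<\/a>))<\/a>$'
)

# Assumed variant (not from the entry): no ^/$ anchors, extra protocols,
# full URL captured in group 1, case-insensitive, "." spans line breaks.
EMBEDDED = re.compile(
    r'<a\s+href\s*=\s*"((?:https?|ftp):\/\/[^"]*)"([^>]*)>(.*?(?=<\/a>))<\/a>',
    re.IGNORECASE | re.DOTALL,
)

# Hypothetical logo markup; replace with whatever the site actually uses.
LOGO = '<img src="/images/leaving-site.gif" alt="Leaving site">'

# Used to drop an existing target attribute before adding the new one.
TARGET_ATTR = re.compile(r'\s+target\s*=\s*"[^"]*"', re.IGNORECASE)


def mark_external_links(html: str) -> str:
    """Set target="_blank" on external links and append the logo after each."""

    def rewrite(match: re.Match) -> str:
        url, attrs, inner = match.group(1), match.group(2), match.group(3)
        attrs = TARGET_ATTR.sub("", attrs)  # avoid a duplicate target attribute
        return f'<a href="{url}"{attrs} target="_blank">{inner}</a>{LOGO}'

    return EMBEDDED.sub(rewrite, html)


if __name__ == "__main__":
    external = '<a href="http://www.mysite.com">my external link</a>'
    internal = '<a href="myinternalpage.html">my internal link</a>'

    # The listed pattern against the entry's own examples.
    print(ORIGINAL.match(external).groups())
    print(ORIGINAL.match(internal))

    # The assumed variant applied to links embedded in other markup.
    print(mark_external_links(f"<p>See {external} and {internal}.</p>"))
```

Like the original expression, this sketch only recognises double-quoted `href` values that appear as the first attribute of the tag. For arbitrary real-world markup an HTML parser is usually the more robust choice.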