Displaying page
of
pages;
Items to
Title |
Test
Details
HTML Tag Attribute
|
Expression |
[a-zA-Z_:][a-zA-Z0-9_,\.\-]*? |
Description |
Matches valid html tag attribute name (without value). |
Matches |
value | name | data-:_,.-funky |
Non-Matches |
,non-attr | :!attr | -data-not-funky |
Author |
Rating:
Not yet rated.
Matesz Charytoniuk
|
Title |
Test
Details
Strip HTML tags
|
Expression |
<(\s*/?\s*)\w+?(\s*(([\w-]+="[^"]*?")|([\w-]+='[^']*?')|([\w-]+=[^'"<>\s]+)))*(\s*/?\s*)> |
Description |
This RE strips HTML open and close tags from text with any attributes provided, including not properly formatted tags. |
Matches |
<html>, <meta http-equiv=Content-Type content="text/html; charset=windows-1250">, < / html body="<'something'>" body='"any thing"' b=jj123-qqj / > |
Non-Matches |
<html / d>, <html body='something">, <d /html> |
Author |
Rating:
Not yet rated.
István Dani
|
Title |
Test
Details
find html tags in text
|
Expression |
(([\n, ])*((<+)([^<>]+)(>*))+([\n, ])*)+ |
Description |
finds all HTML tags (<___>) in text and leaves normal text alone. Also matches all extra spaces and empty lines next to html tag, if any.Ragex has extra parentheses for orientation.
Usefull to delete all unwanted html from, lets say, wikipedia souce code. |
Matches |
<a id = al> | <blahblah> | <<div>something> |
Non-Matches |
9>3 | normaltext | <<<<<<<< |
Author |
Rating:
Not yet rated.
Rudolf
|
Title |
Test
Details
Remove all html tags
|
Expression |
<!DOCTYPE[\s\S]*?</head>[\s\S]*?|<form[\s\S]*?>[\s\S]*?|<button[\s\S]*?</button>[\s\S]*?|<[a-zA-Z]+(\s+[a-zA-Z]+\s*=\s*("([^"]*)"|'([^']*)'))*\s*/>|</a>|</li>|</ol>|</div><strong>|</strong>|</dd>|</dt>|<header>|</header>|<dd|</form>|<div>|<center>|</center>|</dl>|</article>|<a>|<a name[\s\S]*?>[\s\S]*?|</aside>|</section>|</div>|</ul>|</tr>|<tr>|</span>|</td>|<td>|</nav>|</table>|<td|<dt|----|<dl|<time class[\s\S]*?>[\s\S]*?|<article[\s\S]*?>[\s\S]*?|<td[\s\S]*?</td>[\s\S]*?|<section[\s\S]*?>[\s\S]*?|<div dir[\s\S]*?>[\s\S]*?|<option[\s\S]*?option>[\s\S]*?|<body[\s\S]*?>[\s\S]*?|bgcolor[\s\S]*?>[\s\S]*?|<blackquote|</blackquote>|<small|<small>|</small>|<textarea id[\s\S]*?</html>[\s\S]*?|<td valign[\s\S]*?>[\s\S]*?|<fb[\s\S]*?>[\s\S]*?|<font [\s\S]*?>[\s\S]*?|<footer[\s\S]*?</footer>[\s\S]*?|<h2[\s\S]*?</h2>[\s\S]*?|<a onMouseOver[\s\S]*?</a>[\s\S]*?|<h3[\s\S]*?</h3>[\s\S]*?|<th[\s\S]*?</th>[\s\S]*?|<tbody>[\s\S]*?</tbody>[\s\S]*?|<div class="fb-like"[\s\S]*?>[\s\S]*?|<input[\s\S]*?>[\s\S]*?|<tfoot>[\s\S]*?</tfoot>[\s\S]*?|<script[\s\S]*?/script>[\s\S]*?|<table[\s\S]*?>[\s\S]*?|<cite[\s\S]*?</cite>[\s\S]*?|<style id[\s\S]*?</style>[\s\S]*?|<li id[\s\S]*?>[\s\S]*?|<p id=[\s\S]*?>[\s\S]*?|<ul[\s\S]*?>[\s\S]*?|<a title[\s\S]*?>[\s\S]*?|<li[\s\S]*?>[\s\S]*?|<style type[\s\S]*?/style>[\s\S]*?|<object[\s\S]*?/object>[\s\S]*?|<label[\s\S]*?/label>[\s\S]*?|<select[\s\S]*?</select>[\s\S]*?|(<a href[\s\S]*?>[\s\S]*?)|(\b(http|https):\/\/.*[^ alt]\b)|href=[\s\S]*?>[\s\S]*?|<a id[\s\S]*?>[\s\S]*?|<!--[\s\S]*?-->[\s\S]*?|<div style[\s\S]*?>[\s\S]*?|<span[\s\S]*?>[\s\S]*?|<span lang[\s\S]*?>[\s\S]*?|<img[\s\S]*?>[\s\S]*?|<div id[\s\S]*?>[\s\S]*?|<header[\s\S]*?>[\s\S]*?|<a target[\s\S]*?>[\s\S]*?|<div class[\s\S]*?>[\s\S]*?|</object>|<hr align[\s\S]*?>[\s\S]*?|<embed[\s\S]*?/>[\s\S]*?|<ol[\s\S]*?>[\s\S]*?|<a rel[\s\S]*?>[\s\S]*?|<param[\s\S]*?/>[\s\S]*?|<noscript>[\s\S]*?</noscript>[\s\S]*?|<aside id[\s\S]*?>[\s\S]*?|<xml>[\s\S]*?</xml>[\s\S]*?|<nav[\s\S]*?>[\s\S]*?|<link rel[\s\S]*?>[\s\S]*?|<p style[\s\S]*?>[\s\S]*?|<iframe[\s\S]*?</iframe>[\s\S]*?|<span rel[\s\S]*?>[\s\S]*?|<span id[\s\S]*?>[\s\S]*?|<span property=[\s\S]*?>[\s\S]*?|<span class[\s\S]*?>[\s\S]*?|<div align[\s\S]*?>[\s\S]*?|<form[\s\S]*?>[\s\S]*?|<p class[\s\S]*?>[\s\S]*?|<h3 property[\s\S]*?>[\s\S]*?|<time[\s\S]*?</time>[\s\S]*?|class=[\s\S]*?>[\s\S]*?|<video src[\s\S]*?>[\s\S]*?|<a class[\s\S]*?>[\s\S]*?|→|style=[\s\S]*?;"[\s\S]*?|<h3 id[\s\S]*?">[\s\S]*?|<center>|</center>|</dl>|<span dir="ltr">|<a name[\s\S]*?>[\s\S]*?|<a name='links'>|<abbr[\s\S]*?</abbr>[\s\S]*?|<span itemprop[\s\S]*?>[\s\S]*?|<amp-iframe[\s\S]*?</amp-iframe>[\s\S]*?|<amp-img[\s\S]*?</amp-img>[\s\S]*?|<amp-img src=">|<stylep-custom>[\s\S]*?</style>[\s\S]*?|<stylep-boilerplate>[\s\S]*?</style>[\s\S]*?|src=">|<amp[\s\S]*?pixel>[\s\S]*?|Leave a Reply|<body>|<body|<h3>|</h3>|<div class[\s\S]*?>[\s\S]*?|<aside[\s\S]*?>[\s\S]*?|<p class=[\s\S]*?>[\s\S]*?| |
Description |
A little long REGEX but remove almost all html tags |
Matches |
links, forms, scripts, |
Non-Matches |
text|txt|aps |
Author |
Rating:
Not yet rated.
Melcu Suzana
|
Displaying page
of
pages;
Items to