szov = RegexReplace("<(img|head|nav|form|footer|style|script|noscript|aside|button|label|i\040|input)[^<>]*>((.|\n|\r\n)*?)</\1>",szov," ")
If szov.Contains("<script") Then
	'view-source:http://www.erdekesvilag.hu/a-patkanyok-temploma-indiaban/
	szov = RegexReplace("<script[^<>]*>((.|\n|\r\n)*?)</script>",szov," ")
End If
If szov.Contains("<style") Then
	szov = RegexReplace("<style[^<>]*>((.|\n|\r\n)*?)</style>",szov," ")
End If
Did I make a mistake in the first expression? Why aren't the script and style blocks replaced by it?
Thanks in advance,
Steven
It is probably a mistake in your pattern. The best way to debug it is to extract the shortest possible text that is not parsed as you expect. We can then check it with your pattern.
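As a rough sketch of that approach, the first pattern can be run against a very small hand-written input and the result logged. The sample string and the sub name TestPattern are made up for illustration, and the built-in Regex.Replace is used here only as a stand-in for your own RegexReplace helper, on the assumption that both take the pattern, the text and the replacement in that order.

```
Sub TestPattern
	' Hypothetical minimal input: a single script block between two words.
	Dim sample As String = "before<script type=""text/javascript"">var x = 1;</script>after"
	' The first pattern from the question, unchanged.
	Dim pattern As String = "<(img|head|nav|form|footer|style|script|noscript|aside|button|label|i\040|input)[^<>]*>((.|\n|\r\n)*?)</\1>"
	' Assumption: Regex.Replace(Pattern, Text, Template) behaves like the RegexReplace helper.
	Log(Regex.Replace(pattern, sample, " "))
End Sub
```

If the small sample comes out as expected, add pieces of the real page to it step by step until the result is wrong. The smallest piece that still fails is the text worth posting here.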
Note that it will probably be simpler to use jTidy to parse the page.