Skip to Content

Remove all HTML flags from text field

Hi Expert,

I have a text field, and I need to remove all HTML relevant flags in SAP HANA sql script (such as :<>,<h1>,<h2> etc), as there are a lot of flags , I don't want to remove them one by one, is there any easy way to remove them .

Many Thanks,

Add comment
10|10000 characters needed characters exceeded

  • Get RSS Feed

1 Answer

  • Best Answer
    Nov 01, 2017 at 07:15 AM

    Have you tried it with the REPLACE_REGEXPR function?

    Following a little quick and dirty coding which replaces the HTML tags in a string with the help of that function and a regular expression. Of course you maybe need to invest a little bit more to find a more accurate regular expression to consider "<", "</" and ">" characters which are not part of an HTML tag.

    do
    begin
        declare htmlString nvarchar(255) := '<html><head>Test Head</head><body>Test Body</body></html>';
        
        result = select replace_regexpr('[</?.*>]' in "htmlString" with '' occurrence all) as "cleanedString" 
            from ( select :htmlString as "htmlString" from dummy );
        
        select * from :result;
    end;
    

    Regards,
    Florian

    Add comment
    10|10000 characters needed characters exceeded