Click here to Skip to main content
11,576,063 members (58,032 online)
Click here to Skip to main content

Tagged as

Regular Expression to extract inner text from anchor tags

, 8 Feb 2011 CPOL 16.3K 2
Rate this:
Please Sign up or sign in to vote.
Regular Expression to extract inner text from anchor tags

Several days ago, someone at the forum has asked how to extract the text from a hyperlink and preserve other HTML tags. It sounded interesting, I did some research but could not find a direct solution. So, I decided to put together a simple regular expression to execute the task.

Regular Expression: (<[a|A][^>]*>|</[a|A]>)


<[a|A][^>]*> -- Remove <a ...>

</[a|A]> -- Remove </a> tag

Example 1:

string str1 = "<a href=\"\" class=\"someclass\">Mastering Regular Expressions</a> 
-- <A href=\"\">CNN</a> <div><a href=\"\"></a></div>";
str1 = System.Text.RegularExpressions.Regex.Replace(str1, "(<[a|A][^>]*>|)", "");

Result: Mastering Regular Expressions -- CNN <div> </div>

Example 2:
string str2 = "<div><a href=\"\" class=\"someclass\">ysatech</a></div>";
str2 = System.Text.RegularExpressions.Regex.Replace(str2, "(<[a|A][^>]*>|)", "");

Result: <div>ysatech</div>

Test this regular expression here.


This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


About the Author

Bryian Tan
Software Developer
United States United States
I have over three years of experience working with Microsoft technologies. I have earned my Microsoft Certified Technology Specialist (MCTS) certification. I'm a highly motivated self-starter with an aptitude for learning new skills quickly.

You may also be interested in...

Comments and Discussions

-- There are no messages in this forum --
| Advertise | Privacy | Terms of Use | Mobile
Web03 | 2.8.150603.1 | Last Updated 9 Feb 2011
Article Copyright 2011 by Bryian Tan
Everything else Copyright © CodeProject, 1999-2015
Layout: fixed | fluid