See more: VB.NET
Hello guys,

I am working on a simple web engine.
What I need now is to extract only the text words from web pages using a regular expression.
At the moment I use a For Each loop to do it, like this:
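Public Function ExtractWordsFromHTML(ByVal strHTML As String) As ArrayList
    ' Collects every run of consecutive letters as one word.
    ' Note: markup is not skipped, so tag and attribute names are returned too.
    Dim strTemp As String = ""
    Dim arrRet As New System.Collections.ArrayList()
    For Each ch As Char In strHTML
        If Char.IsLetter(ch) Then
            strTemp &= ch
        ElseIf strTemp <> "" Then
            arrRet.Add(strTemp)
            strTemp = ""
        End If
    Next
    ' Flush the last word when the input ends with a letter.
    If strTemp <> "" Then arrRet.Add(strTemp)
    Return arrRet
End Function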
Posted 17-Feb-13 19:41pm
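
One possible way to do this with regular expressions is sketched below. It is only a rough sketch, not a full HTML parser: it assumes reasonably well-formed markup, and the module name HtmlWords and function name ExtractWordsWithRegex are made up for illustration. It drops script and style blocks, strips the remaining tags, decodes entities such as &amp;amp; with System.Net.WebUtility.HtmlDecode, and then collects every run of letters with the pattern \p{L}+.

```vbnet
Imports System
Imports System.Collections.Generic
Imports System.Net
Imports System.Text.RegularExpressions

Public Module HtmlWords
    ' Hypothetical helper: returns the visible words of an HTML string.
    ' Regex-based stripping is an approximation and can fail on unusual markup.
    Public Function ExtractWordsWithRegex(ByVal html As String) As List(Of String)
        Dim words As New List(Of String)()
        If String.IsNullOrEmpty(html) Then Return words

        ' 1. Remove whole <script> and <style> blocks, including their content.
        Dim text As String = Regex.Replace(html,
            "<(script|style)\b[^>]*>.*?</\1\s*>", " ",
            RegexOptions.IgnoreCase Or RegexOptions.Singleline)

        ' 2. Replace every remaining tag with a space so adjacent words stay apart.
        text = Regex.Replace(text, "<[^>]+>", " ")

        ' 3. Turn entities such as &amp; or &eacute; into real characters.
        text = WebUtility.HtmlDecode(text)

        ' 4. Each run of Unicode letters counts as one word.
        For Each m As Match In Regex.Matches(text, "\p{L}+")
            words.Add(m.Value)
        Next
        Return words
    End Function
End Module
```

For example, calling HtmlWords.ExtractWordsWithRegex("<p>Hello, <b>world</b>!</p>") should give back the two words Hello and world, without the tag names p and b that the character loop would also pick up. If the pages are messy, a real HTML parser is likely to be more reliable than regular expressions.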

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Last Updated 18 Feb 2013