Click here to Skip to main content
15,075,638 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
XML
<script language="javascript">
function acceptNumbersOnly(myForm)
{
    if(event.keyCode<48 || event.keyCode>57) return event.returnValue=false;
}
</script>

<html>
<head>
<title>Gautam Buddh Technical University (GBTU)</title>
<style>
a:link{COLOR: #666666; TEXT-DECORATION: none}
a:visited{COLOR: #666666; TEXT-DECORATION: none}
a:active{COLOR: #666666; TEXT-DECORATION: none}
a:hover{COLOR: #666666; TEXT-DECORATION: underline}
.s1{font-family: 'Courier New'; font-size:12px}
</style>
</head>
<body topmargin="0" leftmargin="0" style="font-family: Verdana; font-size: 8pt; background-image: url('../../images/watermark.jpg'); background-repeat: no-repeat; background-attachment: fixed; background-position: center">
<table align='center' border='0' cellpadding='0' cellspacing='0' style='border-collapse: collapse' width='780' height=100%>
<tr><td height=96% align=center><BR><BR>

<html>
<head>



</head>
<body style="background:url(uptu_logo1.gif); background-repeat:no-repeat; background-position:center">
<p align="center" style="font-size:18px"><span style='font-size:20px'>Gautam Buddh Technical University, Lucknow, India</span><br/><span style='font-size:16px'>(Formerly Uttar Pradesh Technical University)</span><br/><u>B.Tech. First Year Result 2009-10</u><br/><u>(Special Carry Over)</u></p>
<br/>
<table align="center" border="1" cellpadding="0" cellspacing="0" bordercolor="#E3DDD5" width="700" style="border-collapse: collapse; font-size: 11px">
<tr>
<td width="50%"><b>Name:</b></td>
<td width="50%">ABHIMANYU GUPTA               </td>
</tr>
<tr>
<td><b>Father's Name:</b></td>
<td>RAMKUMAR GUPTA                </td>
</tr>
<tr>
<td><b>Roll No:</b></td>
<td>0913310002</td>
</tr>
<tr>
<td><b>Status:</b></td>
<td>REGULAR   </td>
</tr>
<tr>
<td><b>Course/Branch:</b></td>
<td>B. Tech. Computer Science and Engineering&nbsp;(10)</td>
</tr>
<tr>
<td><b>Institute Name</b></td>
<td>NOIDA INSTITUTE OF ENGG. & TECHNOLOGY (ENGINEERING INSTITUTE) ,?GREATER NOIDA - 201306</td>
</tr>

<tr>
<td><b>Date of Declaration of Result</b></td>
<td>21-Jul-2010</td>
</tr>
<tr>
<td><b>Date of Special Carry Over</b></td>
<td>13-Oct-2010</td>
</tr>

</table>
<br/>
<table align="center" border="1" cellpadding="0" cellspacing="0" bordercolor="#E3DDD5" width="700" style="border-collapse: collapse; font-size: 11px">
<tr>
<th width="100%" colspan="2">MARKS DETAIL</th>
</tr>
<tr>
<th width="50%">First Semester</th>
<th width="50%">Second Semester</th>
</tr>
<tr>
<td width="50%" valign="top">
<table border="1" style="border-collapse:collapse;font-size: 11px" width="100%">
<tr style="font-size: 11px">
<td width="20%" align="center"><b>Subject Code</b></td>
<td width="20%" align="center"><b>External Marks</b></td>
<td width="20%" align="center"><b>Sessional Marks</b></td>
<td width="20%" align="center"><b>Carry Over Marks</b></td>
<td width="20%" align="center"><b>Credit</b></td>
</tr>

</tr>

<tr><td><b>EAS101    </b></td><td align='center'>010 </td><td align=center>021</td><td align=center>015*</td><td align=center>3 </td><tr><td><b>EME102    </b></td><td align='center'>010 </td><td align=center>035</td><td align=center>013*</td><td align=center>0 </td><tr><td><b>EAS103    </b></td><td align='center'>030 </td><td align=center>036</td><td align=center>    </td><td align=center>4 </td><tr><td><b>ECS101    </b></td><td align='center'>024 </td><td align=center>039</td><td align=center>047*</td><td align=center>4 </td><tr><td><b>EAS104    </b></td><td align='center'>060 </td><td align=center>039</td><td align=center>    </td><td align=center>4 </td><tr><td><b>EAS105    </b></td><td align='center'>019 </td><td align=center>020</td><td align=center>    </td><td align=center>2 </td><tr><td><b>          </b></td><td align='center'>    </td><td align=center>   </td><td align=center>    </td><td align=center>  </td><tr><td><b>EME152    </b></td><td align='center'>024 </td><td align=center>017</td><td align=center>&nbsp;</td><td align=center>1 </td><tr><td><b>ECS151    </b></td><td align='center'>028 </td><td align=center>018</td><td align=center>&nbsp;</td><td align=center>1 </td><tr><td><b>ECE151    </b></td><td align='center'>027 </td><td align=center>018</td><td align=center>&nbsp;</td><td align=center>2 </td><tr><td><b>EAS154    </b></td><td align='center'>--- </td><td align=center>036</td><td align=center>&nbsp;</td><td align=center>1 </td>
<tr>
<td><b>GP101</b></td>
<td align="center" colspan="4">046</td>
</tr>
</table>
</td>
<td width="50%" valign="top">
<table border="1" style="border-collapse:collapse;font-size: 11px" width="100%">
<tr style="font-size: 11px">
<td width="20%" align="center"><b>Subject Code</b></td>
<td width="20%" align="center"><b>External Marks</b></td>
<td width="20%" align="center"><b>Sessional Marks</b></td>
<td width="20%" align="center"><b>Carry Over Marks</b></td>
<td width="20%" align="center"><b>Credit</b></td>
</tr>

</tr>

<tr><td><b>EAS201    </b></td><td align='center'>029 </td><td align=center>021</td><td align=center>    </td><td align=center>3 </td><tr><td><b>EAS202    </b></td><td align='center'>044 </td><td align=center>030</td><td align=center>    </td><td align=center>4 </td><tr><td><b>EAS203    </b></td><td align='center'>017 </td><td align=center>033</td><td align=center>036*</td><td align=center>4 </td><tr><td><b>EEE201    </b></td><td align='center'>030 </td><td align=center>030</td><td align=center>    </td><td align=center>4 </td><tr><td><b>EEC201    </b></td><td align='center'>030 </td><td align=center>031</td><td align=center>    </td><td align=center>4 </td><tr><td><b>EME201    </b></td><td align='center'>016 </td><td align=center>020</td><td align=center>    </td><td align=center>2 </td><tr><td><b>          </b></td><td align='center'>    </td><td align=center>   </td><td align=center>    </td><td align=center>  </td><tr><td><b>EAS252    </b></td><td align='center'>020 </td><td align=center>018</td><td align=center>&nbsp;</td><td align=center>1 </td><tr><td><b>EEE251    </b></td><td align='center'>024 </td><td align=center>019</td><td align=center>&nbsp;</td><td align=center>1 </td><tr><td><b>EWS251    </b></td><td align='center'>024 </td><td align=center>013</td><td align=center>&nbsp;</td><td align=center>2 </td><tr><td><b>EAS251    </b></td><td align='center'>022 </td><td align=center>015</td><td align=center>&nbsp;</td><td align=center>1 </td><tr><td><b>AUC001    </b></td><td align='center'>018 </td><td align='center'>019</td><td align='center'>    </td><td>&nbsp;</td></tr>
<tr>
<td><b>GP201</b></td>
<td align="center" colspan="3">040</td>
</tr>
</table>
</td>
</tr>
</table>
<br/>
<table align="center" border="1" cellpadding="0" cellspacing="0" bordercolor=#E3DDD5 width="700" style="border-collapse: collapse; font-size: 11px">
<tr>
<td width="50%"><b>Total Credits Earn</b></td>
<td width="50%"><b>Practical :</b>&nbsp;10&nbsp;&nbsp;<b>Theory :</b>&nbsp;38</td>
</tr>

<tr>
<td width="50%"><b>Human Values & Professional Ethics(AUC001)</b></td>

<td width="50%">Cleared</td>

</tr>


<tr>
<td width="50%"><b>Carry Over Paper(s)</b></td>
<td width="50%">EME102,                                                                                             </td>
</tr>
<tr>
<td width="50%"><b>Grace Marks</b></td>
<td width="50%">0 </td>
</tr>

</table>
<br/>
<table align="center" border="1" cellpadding="0" cellspacing="0" bordercolor=#E3DDD5 width="700" style="border-collapse: collapse; font-size: 11px">
<tr>
<th>Marks Obtained</th>
<th>Max. Marks</th>
<th>Result Status</th>
</tr>
<tr>
<td align="center">1133</td>
<td align="center">2000</td>
<td align="center">PCP   </td>
</tr>
</table>

<BR>
<table align="center"><tr><td align="center"><small><small><b>Note:</b> University does not own for the errors or omissions, if any, in this statement.<br><B><font size='2'></font></B></small></small><BR><BR><BR>
</td></tr>
</body>
</html>

</body>
</html>






from this html source i want to extract only text data like
Name = abhimanyu gupta
father's name=ramkumar gupta

like this is i want only text need a code help
Posted

I wouldn't use a regex: while they are very effective, they are not good at dealing with so many variables in a input stream.
Instead, load the HTML into a HTMLDocument MSDN[^] and use the All property to get it as processed HTMLElements.
   
Comments
Espen Harlinn 25-Feb-11 11:03am
   
Good advice - my 5
Sergey Alexandrovich Kryukov 25-Feb-11 22:18pm
   
Absolutely, my 5.
--SA
You can create an XmlDocument out of the HTML code. Then iterate each XmlElement recursively and use InnerText to extract the text. Here you may also apply some filter to sort out required elements only.
Hope this helps !
   
Comments
Sergey Alexandrovich Kryukov 25-Feb-11 22:19pm
   
Much better the Regex, anyway.
--SA

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)




CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900