|
|
|||||||||
|
|||||||||
|
|||||||||
| |
|||
| |||||||||
![]() |
|
|
«
Previous Thread
|
Next Thread
»
|
Thread Tools | Search this Thread | Display Modes |
|
#1
|
|||
|
|||
|
Parse contents in HTML files using VB.NET
I hope you guys can help me out with this VB.NET application. I have a HTML file that contains tables in which I want to extract the data out and populate them in an Access database. Please provide code if you can. Thanks! Here is the HTML file:
<CENTER> <H2>List of Linking Pages</H2></CENTER><BR> <TABLE border=1 width="90%"> <TBODY> <TR> <TH>Page</TH> <TH>Links In</TH> <TH>Links Out</TH> <TH>Links Ratio</TH> <TH>Target Links</TH> <TH>Alexa Rank</TH> <TH>Google Rank</TH> <TH>Page Title</TH> <TR> <TD>URL</TD> <TD>42</TD> <TD>14</TD> <TD>300</TD> <TD>0</TD> <TD>56</TD> <TD>8</TD> <TD>Lycos Online Media Kit</TD></TR> <TR> <TD>URL</TD> <TD>0</TD> <TD>23</TD> <TD>0</TD> <TD>1</TD> <TD>2811505</TD> <TD>3</TD> <TD>A. L. Ayers & Co. Printng - Large Format</TD></TR> <TR> <TD>URL</TD> <TD>0</TD> <TD>88</TD> <TD>0</TD> <TD>0</TD> <TD>556114</TD> <TD>0</TD> <TD>ODP: Business:Arts and Entertainment:Photography:Stock:Royalty Free</TD></TR> ... ... |
|
#2
|
|||
|
|||
|
Regular Expression C# codes
//Change Syntax for your VB Code
using System.Text.RegularExpressions; private string RemoveHTMLTags( string strText) { return Regex.Replace(richTextBox1.Text,"<[^>]*>",""); } |
|
#3
|
|||
|
|||
|
Thanks iamed for replying. Do you have the full code for this application in C# or VB.NET? Other languages are also fine too.
Thanks again, ddev |
![]() |
| Viewing: Dev Articles Community Forums > Programming > .NET Development > Parse contents in HTML files using VB.NET |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |
|
|
|
|