.NET Development
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
 
User Name:
Password:
Remember me
 



Go Back   Dev Articles Community ForumsProgramming.NET Development

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Display Modes
 
Unread Dev Articles Community Forums Sponsor:
  #1  
Old September 23rd, 2003, 06:57 PM
ddev ddev is offline
Registered User
Dev Articles Newbie (0 - 499 posts)
 
Join Date: Sep 2003
Posts: 2 ddev User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
Send a message via AIM to ddev
Parse contents in HTML files using VB.NET

I hope you guys can help me out with this VB.NET application. I have a HTML file that contains tables in which I want to extract the data out and populate them in an Access database. Please provide code if you can. Thanks! Here is the HTML file:

<CENTER>
<H2>List of Linking Pages</H2></CENTER><BR>
<TABLE border=1 width="90%">
<TBODY>
<TR>
<TH>Page</TH>
<TH>Links In</TH>
<TH>Links Out</TH>
<TH>Links Ratio</TH>
<TH>Target Links</TH>
<TH>Alexa Rank</TH>
<TH>Google Rank</TH>
<TH>Page Title</TH>
<TR>
<TD>URL</TD>
<TD>42</TD>
<TD>14</TD>
<TD>300</TD>
<TD>0</TD>
<TD>56</TD>
<TD>8</TD>
<TD>Lycos Online Media Kit</TD></TR>
<TR>
<TD>URL</TD>
<TD>0</TD>
<TD>23</TD>
<TD>0</TD>
<TD>1</TD>
<TD>2811505</TD>
<TD>3</TD>
<TD>A. L. Ayers & Co. Printng - Large Format</TD></TR>
<TR>
<TD>URL</TD>
<TD>0</TD>
<TD>88</TD>
<TD>0</TD>
<TD>0</TD>
<TD>556114</TD>
<TD>0</TD>
<TD>ODP: Business:Arts and Entertainment:Photography:Stock:Royalty
Free</TD></TR>
...
...

Reply With Quote
  #2  
Old September 23rd, 2003, 09:01 PM
iahmed iahmed is offline
Contributing User
Dev Articles Newbie (0 - 499 posts)
 
Join Date: May 2003
Location: USA
Posts: 171 iahmed User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 42 m 58 sec
Reputation Power: 15
Regular Expression C# codes

//Change Syntax for your VB Code

using System.Text.RegularExpressions;

private string RemoveHTMLTags( string strText)
{
return Regex.Replace(richTextBox1.Text,"<[^>]*>","");

}

Reply With Quote
  #3  
Old September 23rd, 2003, 10:28 PM
ddev ddev is offline
Registered User
Dev Articles Newbie (0 - 499 posts)
 
Join Date: Sep 2003
Posts: 2 ddev User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
Send a message via AIM to ddev
Thanks iamed for replying. Do you have the full code for this application in C# or VB.NET? Other languages are also fine too.

Thanks again,
ddev

Reply With Quote
Reply

Viewing: Dev Articles Community ForumsProgramming.NET Development > Parse contents in HTML files using VB.NET


Developer Shed Advertisers and Affiliates


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump

Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 


Powered by: vBulletin Version 3.0.5
Copyright ©2000 - 2018, Jelsoft Enterprises Ltd.

© 2003-2018 by Developer Shed. All rights reserved. DS Cluster - Follow our Sitemap