Can anyone advice which language and which software to use to write a basic web scrapper? Also how involved is this work? Have no prior experience with this. My goal is to copy and paste specific information from some websites every 1/5 minutes to an excel sheet. Thanks.
Thanks never heard of it before Standard edition costs 245$. Will see if excel web query can do the job.
If you use C# .Net it's only 3 lines to download HTML from a URL into a string. Then you can parse it any way you want, and if you know anything about parsing, mostly it will be easy stuff to code. If it's not HTML then it's harder. Another advantage is also you can iterate through many URLs easily, in case not all info is on a single page. Here's code for VB: Code: Dim client As New System.Net.WebClient Dim html As String html = client.DownloadString("http://www.bloomberg.com/quote/SPY:US")
Ive done some work scraping specific data for people in the bond markets. One useful (though quirky) product is djuggler www.djuggler.com After much tinkering around with this and other products - I finally decided its easier (for me) in the long run to simply use the C# APIs and write my own code (fewer things to learn-and relearn)
Thanks, but I haven't used C#. I can do VB though. I also have never done any parsing before. I will see if I can get some resource on net to learn it. But the message I am getting from various posters is that it is not something that will take too much time to learn and implement.