Tutorial 1: Hello World!

Hello World!

This tutorial will walk you step-by-step through the process generally used to scrape information from web pages using screen-scraper. It should take you about 20 to 30 minutes to complete, and will familiarize you with the basic principles you'll need to scrape information from web sites. To get the most from this tutorial you should have at least a basic knowledge of HTML and HTTP (really just the way web browsers interact with web servers). This tutorial also assumes that you've successfully downloaded and installed screen-scraper.

If you don't have a lot of experience working with web technologies, or if you'd just like a refresher, you might find these sites helpful:

This is intended to be a very basic tutorial, and, as such, we'll be extracting the words "Hello World" from a web page and writing them to a file. While this is a simple example of pulling a single snippet of text off of a page, you would use a very similar approach for something like a stock quote or product price.

We'll try to keep the pace of the tutorial such that (hopefully) you won't get bored or frustrated. Along the way if you'd like more information on a topic try the links at the bottom of each screen.

The scraping session you are about to create (choose Interpreted Java or VB Script):

AttachmentSize
Hello World (Scraping Session--Interpreted Java).xml3.78 KB
Hello World (Scraping Session--VBScript).xml4.28 KB