next up previous
Next: Overview of Search Up: Overview of Collector Previous: Overview of Collector

GetIndex

GetIndex is a C program that uses the W3C Reference Library version 2.7 (wwwlib2.7) [10] to get documents from WWW given the URL. The reason for using wwwlib2.7 is because it provides a good way of handling the WWW communication structure known as hypertext transfer protocol (HTTP) [4]. When the HTTP format changes, the wwwlib will try to accommodate these changes, keeping the interface standard between the library and the application. The wwwlib also allows the application to handle error messages according to its needs, thus GetIndex is capable of suppressing errors during loading.

The following file should be included in the Rumour package :

Some of the function used in get_index.c include:

Although GetIndex could be installed at different directories from the Rumour directory, a link is needed from the GetIndex that is executable to the same directory as the Collector.



Tommy Wing Yiu Tsui
Tue Nov 7 10:21:32 EST 1995