Lindex is designed to store all of the hypertext pages that are accessible by the related Web server.
Rather than creating its own format, Lindex follows a format based on an Internet Draft of the Internet
Engineering Task Force (IEFT) [2]. However some components of the format are omitted to keep the
index as small as possible.
For example, the organisation name, and author names of the document are omitted.
Each file is a record in Lindex, and a Lindex can contain as many records as needed.
Each of these records is separated by a CR.
The format of each record is as follows:
Template-Type: Document <CR> URL: <full Local of the document> <CR> Title: <Title of the document, mentioned in <title> tag in HTML><CR> [ keywords: <key>, <key>, . . <CR> ] [ description: <description of the document> <CR> ] <CR>
The following is an example of an entry in Lindex :
Template-Type: Document URL: http://ip4.cs.monash.edu.au:8000/\~twyt/index.html Title: Tommy Tsui - Home page keywords: Monash University, Home Page, Star Wars Links, Star Trek Links, X-Files Links, Hong Kong Starting Point Links, Distributed Indexing WWW System, Internet Radio, Radio Television Hong Kong, RTHK
See Appendix II for an example. Keywords and description are optional, but recommended. Currently Rumour can only extract user-defined keywords and descriptions by using the "meta" tag in HTML. The user can define these keywords and description by including the meta tags like so:
<HTML> <head> : <meta name="keywords" value="<keywords"> <meta name="description" value="<description"> : </head> <body> : : </body> </html>
Lindex can be placed at any location in the directory,
as long as "Jack" and the Search Program
know where this is.