next up previous
Next: Collectible Index (Cindex) Up: Indexing in Rumour Previous: Indexing in Rumour

Local Index (Lindex)

Lindex is designed to store all of the hypertext pages that are accessible by the related Web server. Rather than creating its own format, Lindex follows a format based on an Internet Draft of the Internet Engineering Task Force (IEFT) [2]. However some components of the format are omitted to keep the index as small as possible.

For example, the organisation name, and author names of the document are omitted. Each file is a record in Lindex, and a Lindex can contain as many records as needed. Each of these records is separated by a CR.

The format of each record is as follows:

				Template-Type: Document <CR>
				URL: <full Local of the document> <CR>
				Title: <Title of the document, mentioned in <title> tag
						 in HTML><CR>
				[ keywords: <key>, <key>, .  .  <CR>	]
				[ description: <description of the document> <CR>			]
				<CR>

The following is an example of an entry in Lindex :

				Template-Type: Document
				URL: http://ip4.cs.monash.edu.au:8000/\~twyt/index.html
				Title: Tommy Tsui - Home page
				keywords: Monash University, Home Page, Star Wars Links,
	 			Star Trek Links, X-Files Links, Hong Kong Starting
	 			Point Links, Distributed Indexing WWW System, Internet
	 			Radio, Radio Television Hong Kong, RTHK

See Appendix II for an example. Keywords and description are optional, but recommended. Currently Rumour can only extract user-defined keywords and descriptions by using the "meta" tag in HTML. The user can define these keywords and description by including the meta tags like so:

				<HTML>
				<head>
	   				:
				<meta name="keywords" value="<keywords">
				<meta name="description" value="<description">
	   			:
				</head>
				<body>
	   			:
	   			:
				</body>
				</html>

Lindex can be placed at any location in the directory, as long as "Jack" and the Search Program know where this is.



Tommy Wing Yiu Tsui
Tue Nov 7 10:21:32 EST 1995