Examples of Database services on the Web included Yahoo [11], Lycos [9],
and Infoseek [8]. These services provide Web users with a way of finding
information on the Web.
They can be distinguished by the way they collect data for their database.
Problems associated with the methods above include updating databases and access
to these services.
The "robot"s have the biggest problem with updating. Documents are being put on the Web,
taken out from the Web, or modified, every minute, but it can take several weeks, or months for
the "robot"s to come back to a document for updating, and the data might have been
changed long before the "robot" could update its information. As the Web increases in size, these
robots will need to cover more ground, as a consequence the length of time needed to update data in
their databases will increase dramatically.
Another problem with the "robot"s is that it is possible for these "robot"s to request a lot of
documents from one server, overloading the server, although this problem can be solved
by having conditions imposed on robots, so they do not request more than one document at
any one time.
Updating in a user entry based services can also be a problem.
Such a service has
to keep track of the data entered, so that it can update
the database when a document becomes invalid. This
can take a long time.
Another problem with these databases services is gaining access to them.
Because these databases can be used by everybody on the Web, there are times when the number of
people accessing these services reaches numbers beyond what they can handle. As the number
of Web users increases, this problem will grow worse.
Sometimes a database service will "mirror" itself, thus allowing search processes to be done at
other sites. But typically this solution will only last for a month or so before this "mirrored"
database starts to have the same congestion problem.
Another problem with these databases is their location. Most of these databases are
located overseas, so accessing them to look for something is like having to
fly all the way to a far away place to use their database.
This is a waste of Internet resources.