scanning booklets for archive

From: Paul Williams <paul_at_frixxon.co.uk>
Date: Wed Jul 23 03:05:01 2003

Antonio Carlini wrote:

> Good idea, although keeping track of a large number of documents
> in a thread like that will not scale well. You want to scrape
> the data and turn it into a web-accessible database.
>
> Like http://vt100.net/manx, but for all docs and not
> just DEC ones.

Manx is intended to cover more than just DEC documents, but I've been
concentrating on populating those first; there are over 4000 manuals in
the catalogue so far, of which over 1500 are online.

Manx already has a database structure that can accommodate multiple
companies and plenty of information about each manual, including
abstracts, table of contents and full text index.

I suppose I really need a way for third parties to add items to Manx. It
can already import tab-separated-variable files, which makes it very
easy to do a load of searching for new manuals and then add the results.
I'll look into this.

- Paul
Received on Wed Jul 23 2003 - 03:05:01 BST

This archive was generated by hypermail 2.3.0 : Fri Oct 10 2014 - 23:36:06 BST