I struggled for a bit trying to convert paper to HTML, but found it an
awkward task. I'm sure the state of the art has advanced beyond:
1) do a color scan to grab images
2) clean up images
3) resize based on guess at a good size and res for web pages
4) scan again as B/W line art
5) OCR
6) clean up OCR
7) create HTML combining OCR'd text and images
I don't much like PDF for web docs, so an HTML solution would be best. It
looks like the "pro" version of Xerox's OCR software might automate the
task somewhat. Any recommendations?
In any case, here's a picture of Simon, the first personal computer from
~1950:
http://www.yowza.com/classiccmp/berkeley/simon.gif
More info will be made available as I get this scanning stuff down to a
science.
-- Doug
Received on Wed Dec 30 1998 - 01:49:33 GMT