LISTSERV - AXSLIB-L Archives - LISTSERV.ICORS.ORG

AXSLIB-L Archives

Liberation Throough IT Accessibility (an EASI member list)

AXSLIB-L@LISTSERV.ICORS.ORG

	LISTSERV Archives
	AXSLIB-L Home

	Log In
	Register

	Subscribe or Unsubscribe

	Search Archives

Options:	Use Forum View Use Monospaced Font Show Text Part by Default Show All Mail Headers
Message:	[<< First] [< Prev] [Next >] [Last >>]
Topic:	[<< First] [< Prev] [Next >] [Last >>]
Author:	[<< First] [< Prev] [Next >] [Last >>]

Subject:	Re: electronic reserve and image-based pdf files
From:	Kathy Konicek <[log in to unmask]>
Reply To:	Library Access -- http://www.rit.edu/~easi
Date:	Thu, 18 Apr 2002 14:03:44 -0500
Content-Type:	text/plain
Parts/Attachments:	text/plain (102 lines)

We are trying a different approach here at UW Madison. It is currently
underway.  The approach is NOT putting the burden on scanning OCR!  We
don't feel it is there yet. But rather on having clean, correctly aligned
image PDF files and using Kurzweil 1000 to read these files.

-Students who need it check out a licensed copy of Kurzweil 1000 for their
computer or laptop.
-Students locate the pdf file that electronic reserves places out for students.
-This pdf file has some basic quidelines such as clean, correctly aligned
copy. (we encourage faculty to try to locate the digital text file first)
-Students run Kurzweil which on the fly converts the image pdf file to
audio output, text file (OCR)  .
-When student's leave the university we get the license copy back to check
out to the next student.

-We are experimenting now with settings that will allow for the best
accuracy of this audio/text file and documenting accuracy and acceptability
of the translated file.

Kathy Konicek
UW Madison
Instructional Technologist
608-265-6577


At 11:42 AM 4/18/2002 +0000, John Nissen wrote:
>Axel and others,
>
>One of the troubles with OCR is that it doesn't mark up the
>logical structure of a document with headings, emphasis (in bold,
>underline, italics), tables and lists.  Ideally the e-reserve should
>have the structure marked up in a standard and accessible way.
>HTML is suitable for this.  Therefore, I argue that "cleaning up"
>should involve put in these markings, using HTML tags.  That further
>increases the burden on librarians, and lends support to the argument
>that getting the original document source in electronic form is
>preferable.
>
>However if the original is in PDF, there is still a problem of
>accessibility.  Librarians should do PDF to HTML conversion for
>such documents, I would argue.
>
>BTW, the ultimate goal would be everything in XML, so that semantics
>is preserved as well as structure.  (See Ron Stewart's email.)
>
>Cheers,
>
>John
>--
>In message  <[log in to unmask]>
>Axel via [log in to unmask] writes:
>
> >Jeff and others,
> >
> >Thanks for you thoughts, Jeff. The problem with cleaning up OCRed
> >documents is the additional time factor. Folks who do it told me that it
> >takes about 10 times as long to get a document onto e-reserve if they
> >not only scan and ORC but also clean it up. This puts an enormous strain
> >on the libraries' resources.
> >
> >Here's my thinking at this point: If OCR technology by itself, without
> >additional editing and proofreading, does not provide an acceptable
> >product (this would be your position), and since additional cleaning and
> >proofreading of all materials placed on e-reserve puts an enormous
> >strain on a libraries' resources (this is my sentiment), we should look
> >for a different solution: We need to think about ways of establishing
> >e-reserves within the context of a larger system that allows us to get
> >articles in their original text-based electronic format and to place
> >them on reserve (without any optical scanning involved) in either their
> >original format or some converted text-based format. This, of course,
> >touches on legal issues, involving among others, copyright law and
> >interpretations thereof.
> >
> >Greetings,
> >
> >Axel
> >
> >-----Original Message-----
> >From: Senge, Jeff [mailto:[log in to unmask]]
> >Sent: Tuesday, April 16, 2002 10:32 AM
> >To: [log in to unmask]
> >Subject: Re: electronic reserve and image-based pdf files
> >
> >Axel,
> >
> >My personal opinion is that e-reserve materials are a tremendous step
> >forward in terms of accessibility but they need to be in accessible
> >formats.  This would mean scanning, running OCR to convert them to text,
> >and  then editing and proofreading them for format and accuracy.  This
> >process should produce very clean and useable accessible e-reserve
> >documents.
>
>[snip]
>
>--
>Access the word, access the world! -- Try our WordAloud software!!
>
>John Nissen, Cloudworld Ltd., Chiswick, London
>Tel:   +44 (0) 845 458 3944 (local rate in the UK)
>Fax:   +44 (0) 20 8742 8715
>Web:   http://www.cloudworld.co.uk

ATOM RSS1 RSS2

LISTSERV.ICORS.ORG