Fine Print

Unabridged Dictionary - r0k

Based on material Copyrighted by the Gutenberg project

Etext from the Gutenberg project, formatted by r0k

Why (re)do this?  
1 - I needed a portable unabridged dictionary, and so
    did some young readers I know. 
2 - Gutenberg's files are filled with HTML tags.  Almost, 
    but not quite, as bad as export-to-HTML from an M$ product. 
3 - My version is 35 MB (34620 K), theirs is 45 MB (45769 K)!  

How did I do this? 
1 - download Gutenberg's single-file HTML version
2 - view it in lynx
3 - export to plain text
4 - use csplit to break the plain text into 26 files, one per 
    letter, splitting on lines that match "                [A-Z]$"
5 - manually join or split files that came out wrong because letter 
    titles were omitted for 8 or so letters, and extra letter titles 
    were included for 4 or 5 letters
6 - write a simple PHP viewer for the 26 text files, averaging 1.5 MB each
7 - create index and copyright pages
8 - attempt to convert to pdf (STILL takes 26 MB - argh! 
    - but still portable enough for a laptop)
9 - attempt to convert to adobe reader for palm os 
    - takes overnight to run  
    - uses 340 MB of ram while converting to pdb file
    - crashes on tungsten 
    - need another solution
10 - use plucker to view on pda! no more crashy acrobat.  all fit easily on 
    my sd card and plucker is faster than adobe reader too.
11 - June 2005, put plucker pdb files online for those who misplaced 
    their plucker conversion software
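
For anyone who wants to repeat steps 4 and 5 without csplit, here is a rough
Python sketch of the same idea.  It is not what I actually ran.  The function
names and the 16-space indent are assumptions made up for this example; adjust
the indent to whatever your lynx export really produces.

```python
import re
import string


def split_on_letter_headings(text, indent=16):
    """Split plain text into (letter, body) sections.

    A heading is any line ending in `indent` spaces followed by a single
    capital letter, like the pattern in step 4.  The indent width is an
    assumption.  Lines before the first heading are dropped.
    """
    heading = re.compile(r" {%d}([A-Z])$" % indent)
    sections = []      # list of [letter, list_of_lines]
    current = None
    for line in text.splitlines():
        match = heading.search(line)
        if match:
            current = [match.group(1), []]
            sections.append(current)
        elif current is not None:
            current[1].append(line)
    return [(letter, "\n".join(lines)) for letter, lines in sections]


def check_sections(sections):
    """Return (missing, repeated) letters, the cases step 5 fixed by hand."""
    seen = [letter for letter, _ in sections]
    missing = [c for c in string.ascii_uppercase if c not in seen]
    repeated = sorted({c for c in seen if seen.count(c) > 1})
    return missing, repeated
```

Run check_sections() on the result first: any letter it reports as missing or
repeated is a file that still needs joining or splitting by hand.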

Todo:
1 - single php script for all text files - done June 05
2 - multiple letter files viewable at once - maybe someday
3 - parser to build a fast search index - maybe someday
4 - compression for palm os version - done using plucker 

If you find errors or omissions, email me.  
This is copyrighted work and has been heavily reformatted from the freely 
distributable version.  If you want the freely distributable version, 
get it from Project Gutenberg.

**This is a COPYRIGHTED Project Gutenberg Etext, Details Below**

This is the HTML version, which can also be accessed at:

http://humanities.uchicago.edu/forms_unrest/webster.form.html

*The Project Gutenberg Etext of Webster's Unabridged Dictionary*
Copyright (C) 1996 by MICRA, Inc.  Plainfield, N.J.


WARNING:  this is version 0.4 and is NOT up to Project Gutenberg
standards, and is being released so YOU can help us fix errors!!
If you would like to help, you can send us general email for the
correction of errors, or with suggestions; if you are interested
in participating in more detail, please read the file "help.out"
which is included in the xyz zipped portion of the dictionary
along with other files that accompany the dictionary.


The field marks "<...>" in this version are copyrighted,
the actual dictionary entries are in the Public Domain,
and we hope to re-publish these files in Plain Vanilla ASCII.
You are welcome to strip all the markup information to help us
create such a version, or to create one on your own.

*Unzipped files for this Etext take approximately 40 megabytes!*
**Zipped files for this Etext take approximately 12 megabytes!**
*Therefore, you should probably have about 60 meg to load them.*

**The HTML version is one large 45M file, which is 15M zipped.**
This HTML version is a first draft for my search engine version:
http://humanities.uchicago.edu/forms_unrest/webster.form.html
and Caveat emptor!!

You will find MANY errors in both versions.  We would LOVE your
suggestions, corrections, emendations, and new words.  help.out
is the file you should refer to if you are interested.


Preliminary Version 0.4 so named at the request of the provider;
this is going to need some serious proofreading, as much of this
material was typed in by hand by non-native speakers of English.

Our thanks to Patrick Cassidy for organizing and financing these
efforts on behalf of Project Gutenberg.  Please read his note in
file "*.*"

This is a LARGE dictionary, the first edition of perhaps what is
the most famous dictionary in the world and our communication on
the matter of copyright with the publisher has resolved that the
publisher has no copyright interest in this material in the U.S.
but please check your own country, as below.  Also, please watch
for changes in both U.S. Copyright Law AND other countries' laws--
there ARE movements in the U.S. to eliminate this information as
part of the Public Domain, which, if successful, might require a
revision or retraction of these files.



Please take a look at the important information in this header.
We encourage you to keep this file on your own disk, keeping an
electronic path open for the next readers.  Do not remove this.


**Welcome To The World of Free Plain Vanilla Electronic Texts**

**Etexts Readable By Both Humans and By Computers, Since 1971**

*These Etexts Prepared By Hundreds of Volunteers and Donations*

Information on contacting Project Gutenberg to get Etexts, and
further information is included below.  We need your donations.


*The Project Gutenberg Etext of Webster's Unabridged Dictionary*
Copyright (C) 1996 by MICRA, Inc.  Plainfield, N.J.


October, 1996

Etext #673

September, 1996 releases were:
[Etexts #660-670] [11 files]
*The Project Gutenberg Etext of Webster's Unabridged Dictionary*
*****This file should be named pgwxx10.txt or pgwxx10.zip******

Corrected EDITIONS of our etexts get a new NUMBER, pgwxx11.txt.
VERSIONS based on separate sources get new LETTER, pgwxx10a.txt.

Where xx = the letters of the dictionary in each individual file
which are listed below:

We have broken the 40 million characters down into .zip files of
the size that should fit 1.44M floppies, for easy transport.  If
you cannot figure out .zip files, let us know.


Here are the filenames and sizes without this Project Gutenberg
Header, which is approximately 12500 characters.  Please keep at
least one copy of this header file with files you make of this
dictionary.

Zipped File Sizes:

pgwab04.zip  1366502
pgwc04.zip   1128376
pgwde04.zip  1154392
pgwfh04.zip  1325394
pgwil04.zip  1064186
pgwmo04.zip  1024976
pgwpq04.zip  1042626
pgwr04.zip    590065
pgws04.zip   1417870
pgwtw04.zip  1393923
pgwxz04.zip    76621  This is only the dictionary file
                      there are added files in the .zip
                      pgwxz04.txt is the ONLY file with
                      hard cr/lf margination at time of
                      initial release.
========
Total.zip   11.5849M


Unzipped File Sizes:

pgwab04.txt  4610951
pgwc04.txt   3770127
pgwde04.txt  4043810
pgwfh04.txt  4426984
pgwil04.txt  3683257
pgwmo04.txt  3556153
pgwpq04.txt  3645485
pgwr04.txt   1990022
pgws04.txt   4807758
pgwtw04.txt  4752697
pgwxz04.txt   260233
======
Total.txt   39.5475M
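
The two totals above can be rechecked from the per-file byte counts.  The
short Python sketch below is only a sanity check added for illustration; the
dictionary and function names are made up for this example, and the byte
counts are copied from the two lists above.

```python
# Byte counts copied from the "Zipped File Sizes" list above.
ZIPPED = {
    "pgwab04.zip": 1366502, "pgwc04.zip": 1128376, "pgwde04.zip": 1154392,
    "pgwfh04.zip": 1325394, "pgwil04.zip": 1064186, "pgwmo04.zip": 1024976,
    "pgwpq04.zip": 1042626, "pgwr04.zip": 590065, "pgws04.zip": 1417870,
    "pgwtw04.zip": 1393923, "pgwxz04.zip": 76621,
}

# Byte counts copied from the "Unzipped File Sizes" list above.
UNZIPPED = {
    "pgwab04.txt": 4610951, "pgwc04.txt": 3770127, "pgwde04.txt": 4043810,
    "pgwfh04.txt": 4426984, "pgwil04.txt": 3683257, "pgwmo04.txt": 3556153,
    "pgwpq04.txt": 3645485, "pgwr04.txt": 1990022, "pgws04.txt": 4807758,
    "pgwtw04.txt": 4752697, "pgwxz04.txt": 260233,
}


def total_in_millions(sizes):
    """Sum the byte counts and format them as millions of bytes, 4 decimals."""
    return "%.4fM" % (sum(sizes.values()) / 1_000_000)


print("Total.zip", total_in_millions(ZIPPED))
print("Total.txt", total_in_millions(UNZIPPED))
```

Here "M" is read as one million bytes, which is the reading that matches the
totals as printed.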


The official release date of all Project Gutenberg Etexts is at
Midnight, Central Time, of the last day of the stated month.  A
preliminary version may often be posted for suggestion, comment
and editing by those who wish to do so.  To be sure you have an
up to date first edition [xxxxx10x.xxx] please check file sizes
in the first week of the next month.  Since our ftp program has
a bug in it that scrambles the date [tried to fix and failed] a
look at the file size will have to do, but we will try to see a
new copy has at least one byte more or less.


Information about Project Gutenberg (one page)

We produce about two million dollars for each hour we work.  Fifty
hours is one conservative estimate for how long it takes us to get
any etext selected, entered, proofread, edited, copyright searched
and analyzed, the copyright letters written, etc.  The projected
audience is one hundred million readers.  If our value per text is
nominally estimated at one dollar, then we produce 2 million dollars
per hour.  This year we will have to do four text files per month,
thus upping our productivity from one million.
The Goal of Project Gutenberg is to Give Away One Trillion Etext
Files by December 31, 2001.  [10,000 x 100,000,000=Trillion]
This is ten thousand titles each to one hundred million readers,
which is 10% of the expected number of computer users by the end
of the year 2001.
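
The arithmetic behind the two figures in this paragraph can be written out as
a short Python sketch.  The variable names are made up for illustration; the
numbers are the ones stated above.

```python
# Figures stated in the paragraph above.
HOURS_PER_ETEXT = 50              # conservative estimate of work per etext
PROJECTED_READERS = 100_000_000   # one hundred million readers
VALUE_PER_TEXT_DOLLARS = 1        # nominal value of one text to one reader
TITLES_GOAL = 10_000              # ten thousand titles by the end of 2001

# Value produced per hour: (readers x value per text) / hours per etext.
dollars_per_hour = PROJECTED_READERS * VALUE_PER_TEXT_DOLLARS // HOURS_PER_ETEXT

# Files given away: each title delivered to every projected reader.
files_given_away = TITLES_GOAL * PROJECTED_READERS

print(dollars_per_hour)    # two million dollars per hour
print(files_given_away)    # one trillion files
```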

We need your donations more than ever!

All donations should be made to "Project Gutenberg/IBC", and are
tax deductible to the extent allowable by law ("IBC" is Illinois
Benedictine College).  (Subscriptions to our paper newsletter go
to IBC, too)

For these and other matters, please mail to:

Project Gutenberg
P. O. Box  2782
Champaign, IL 61825

Internet:      dircompg@ux1.cso.uiuc.edu
Bitnet:        dircompg@uiucux1
CompuServe:    >internet:dircompg@ux1.cso.uiuc.edu
Attmail:       internet!ux1.cso.uiuc.edu!dircompg

When all other email fails, try Michael S. Hart, our Executive
Director:
hart@vmd.cso.uiuc.edu (internet)   hart@uiucvmd   (bitnet)

We would prefer to send you this information by email
(Internet, Bitnet, Compuserve, ATTMAIL or MCImail).

******
If you have an FTP program (or emulator), please
FTP directly to the Project Gutenberg archives:
[Mac users, do NOT point and click. . .type]

ftp mrcnext.cso.uiuc.edu
login:  anonymous
password:  your@login
cd etext/etext90 through etext/etext95
or cd etext/articles 
dir [to see files]
get or mget [to get files. . .set bin for zip files]
get INDEX?00.GUT
for a list of books
and
get NEW.GUT for general information
and
mget GUT* for newsletters.

**Information prepared by the Project Gutenberg legal advisor**
(Three Pages)

***START** SMALL PRINT! for COPYRIGHT PROTECTED ETEXTS ***
TITLE AND COPYRIGHT NOTICE:

Big Dummy's Guide To The Internet
(C)1993, 1994  by the Electronic Frontier Foundation [EFF]

This etext is distributed by Professor Michael S. Hart through
the Project Gutenberg Association at Illinois Benedictine College
(the "Project") under the Project's "Project Gutenberg" trademark
and with the permission of the etext's copyright owner.

LICENSE
You can (and are encouraged to!) copy and distribute this
Project Gutenberg-tm etext.  Since, unlike many other of the
Project's etexts, it is copyright protected, and since the
materials and methods you use will affect the Project's
reputation, your right to copy and distribute it is limited by
the copyright laws and by the conditions of this "Small Print!"
statement.

  [A]  ALL COPIES: The Project permits you to distribute
copies of this etext electronically or on any machine readable
medium now known or hereafter discovered so long as you:

     (1)  Honor the refund and replacement provisions of this
"Small Print!" statement; and

     (2)  Pay a royalty to the Project of 20% of the net
profits you derive calculated using the method you already use
to calculate your applicable taxes.  If you don't derive
profits, no royalty is due.  Royalties are payable to "Project
Gutenberg Association / Illinois Benedictine College" within
the 60 days following each date you prepare (or were legally
required to prepare) your annual (or equivalent periodic) tax
return.

  [B]  EXACT AND MODIFIED COPIES: The copies you distribute
must either be exact copies of this etext, including this
Small Print statement, or can be in binary, compressed, mark-
up, or proprietary form (including any form resulting from
word processing or hypertext software), so long as *EITHER*:

     (1)  The etext, when displayed, is clearly readable, and
does *not* contain characters other than those intended by the
author of the work, although tilde (~), asterisk (*) and
underline (_) characters may be used to convey punctuation
intended by the author, and additional characters may be used
to indicate hypertext links; OR

     (2)  The etext is readily convertible by the reader at no
expense into plain ASCII, EBCDIC or equivalent form by the
program that displays the etext (as is the case, for instance,
with most word processors); OR

     (3)  You provide or agree to provide on request at no
additional cost, fee or expense, a copy of the etext in plain
ASCII.

LIMITED WARRANTY; DISCLAIMER OF DAMAGES
This etext may contain a "Defect" in the form of incomplete,
inaccurate or corrupt data, transcription errors, a copyright
or other infringement, a defective or damaged disk, computer
virus, or codes that damage or cannot be read by your
equipment.  But for the "Right of Replacement or Refund"
described below, the Project (and any other party you may
receive this etext from as a PROJECT GUTENBERG-tm etext)
disclaims all liability to you for damages, costs and
expenses, including legal fees, and YOU HAVE NO REMEDIES FOR
NEGLIGENCE OR UNDER STRICT LIABILITY, OR FOR BREACH OF
WARRANTY OR CONTRACT, INCLUDING BUT NOT LIMITED TO INDIRECT,
CONSEQUENTIAL, PUNITIVE OR INCIDENTAL DAMAGES, EVEN IF YOU
GIVE NOTICE OF THE POSSIBILITY OF SUCH DAMAGES.

If you discover a Defect in this etext within 90 days of
receiving it, you can receive a refund of the money (if any)
you paid for it by sending an explanatory note within that
time to the person you received it from.  If you received it
on a physical medium, you must return it with your note, and
such person may choose to alternatively give you a replacement
copy.  If you received it electronically, such person may
choose to alternatively give you a second opportunity to
receive it electronically.

THIS ETEXT IS OTHERWISE PROVIDED TO YOU "AS-IS".  NO OTHER
WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED, ARE MADE TO YOU AS
TO THE ETEXT OR ANY MEDIUM IT MAY BE ON, INCLUDING BUT NOT
LIMITED TO WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE.  Some states do not allow disclaimers of
implied warranties or the exclusion or limitation of
consequential damages, so the above disclaimers and exclusions
may not apply to you, and you may have other legal rights.

INDEMNITY
You will indemnify and hold the Project, its directors,
officers, members and agents harmless from all liability, cost
and expense, including legal fees, that arise directly or
indirectly from any of the following that you do or cause:
[1] distribution of this etext, [2] alteration, modification,
or addition to the etext, or [3] any Defect.

WHAT IF YOU *WANT* TO SEND MONEY EVEN IF YOU DON'T HAVE TO?
Project Gutenberg is dedicated to increasing the number of
public domain and licensed works that can be freely distributed
in machine readable form.  The Project gratefully accepts
contributions in money, time, scanning machines, OCR software,
public domain etexts, royalty free copyright licenses,
and whatever else you can think of.  Money should be paid to
"Project Gutenberg Association / Illinois Benedictine College".

*SMALL PRINT! Ver.04.29.93 FOR COPYRIGHT PROTECTED ETEXTS*END*




