®LM¯®RM¯ GNU Aspell 0.50.3 Copyright (c) 2000-2002 Kevin Atkinson kevina@gnu.org -------------------------------------------------------------------------- * Contents * 1. Introduction + 1.1 Comparison to other spell checker engines o 1.1.1 Comparison to Ispell # 1.1.1.1 Features that only Aspell has # 1.1.1.2 Things that, currently, only Ispell have * 2. Getting Started + 2.1 Requirements + 2.2 Obtaining + 2.3 Support + 2.4 Helping Out + 2.5 Compiling & Installing o 2.5.1 Generic Install Instructions o 2.5.2 General Problems o 2.5.3 Curses Notes o 2.5.4 Upgrading from Aspell .33/Pspell .12 o 2.5.5 Upgrading from a pre-release snapshot o 2.5.6 Win32 Notes # 2.5.6.1 Getting # 2.5.6.2 Building # 2.5.6.3 (PD)Curses # 2.5.6.4 Directories # 2.5.6.5 Installer # 2.5.6.6 Win32 consoles + 2.6 Using aspell-import * 3. Basic Usage + 3.1 Spellchecking Individual Files + 3.2 Using Aspell with other Applications o 3.2.1 With Applications that Expect Ispell o 3.2.2 With Emacs and Xemacs o 3.2.3 With LyX o 3.2.4 With VIM o 3.2.5 With Pine * 4. Customizing Aspell + 4.1 Specifying Options o 4.1.1 At the Command Line # 4.1.1.1 Boolean # 4.1.1.2 Value # 4.1.1.3 List o 4.1.2 Via a Configuration File # 4.1.2.1 Boolean # 4.1.2.2 Value # 4.1.2.3 List o 4.1.3 Via an Environmental Variable + 4.2 The Options o 4.2.1 Dictionary Options o 4.2.2 Checker Options o 4.2.3 Filter Options o 4.2.4 Run-together Word Options o 4.2.5 Misc Options o 4.2.6 Aspell Utility Options + 4.3 Dumping Configuration Values + 4.4 Notes on various Options o 4.4.1 Notes on Various Filters and Filter Modes # 4.4.1.1 None Mode # 4.4.1.2 Url Filter/Mode # 4.4.1.3 Email Filter/Mode # 4.4.1.4 SGML Filter/Mode # 4.4.1.5 TEX Filter/Mode o 4.4.2 Notes on the Prefix Option o 4.4.3 Notes on Typo-Analysis and the Keyboard Definition File o 4.4.4 Notes on the Different Suggestion Modes * 5. Working With Dictionaries + 5.1 How Aspell Selects an Appropriate Dictionary + 5.2 Listing Available Dictionaries + 5.3 Dumping the contents of the word list + 5.4 Creating an Individual Word List o 5.4.1 Format of the Replacement Word List + 5.5 Using Multi Dictionaries + 5.6 Dictionary Naming + 5.7 AWLI files * 6. Writing programs to use Aspell + 6.1 Through the C API o 6.1.1 Usage o 6.1.2 API Reference o 6.1.3 Examples o 6.1.4 Notes About Thread Safety + 6.2 Through A Pipe o 6.2.1 Format of the Data Stream + 6.3 Notes of Storing Replacement Pairs * 7. Adding Support For Other Languages + 7.1 The Language Data File + 7.2 Compiling the Word List + 7.3 Phonetic Code o 7.3.1 Syntax of the transformation array o 7.3.2 How do I start finally? # 7.3.2.1 Things that come in handy # 7.3.2.2 What the phonetic code should do + 7.4 Controlling the Behavior of Run-together Words * 8. How Aspell Works * A. Changelog + Changes from 0.50.2 to 0.50.3 (Nov 23, 2002) + Changes from 0.50.1 to 0.50.2 (Sep 28, 2002) + Changes from 0.50 to 0.50.1 (Aug 28, 2002) + Changes from .33.7.1 to 0.50 (Aug 23, 2002) + Changes from .33.7 to .33.7.1 (Aug 20, 2001) + Changes from .33.6.3 to .33.7 (Aug 2, 2001) + Changes from .33.6.2 to .33.6.3 (June 3, 2001) + Changes from .33.6.1 to .33.6.2 (June 3, 2001) + Changes from .33.6 to .33.6.1 (May 29, 2001) + Changes from .33.5 to .33.6 (May 18, 2001) + Changes from .33 to .33.5 (April 5, 2001) + Changes from .32.6 to .33 (January 28, 2001) + Changes from .32.5 to .32.6 (Nov 8, 2000) + Changes from .32.1 to .32.5 (August 18, 2000) + Changes from .32 to .32.1 (August 5, 2000) + Changes from .31.1 to .32 (July 23, 2000) + Changes from .31 to .31.1 (June 18, 2000) + Changes from .30.1 to .31 (June 11, 2000) + Changes from .30 to .30.1 (April 29, 2000) + Changes from .29.1 to .30 (April 2, 2000) + Changes from .29 to .29.1 (Feb 18, 2000) + Changes from .28.3 to .29 (Feb 6, 2000) + Changes from .28.2.1 to .28.3 (Nov 20, 1999) + Changes from .28.2 to .28.2.1 (Aug 25, 1999) + Changes from .28.1 to .28.2 (Aug 25, 1999) + Changes from .28 to .28.1 (July 27, 1999) + Changes from .27.2 to .28 (July 25, 1999) + Changes from .27.1 to .27.2 (Mar 1, 1999) + Changes from .27 to .27.1 (Feb 24, 1999) + Changes from .26.2 to .27 (Feb 22, 1999) + Changed from .26.1 to .26.2 (Jan 3, 1998) + Changed from .26 to .26.1 (Jan 3, 1998) + Changed from .25.1 to .26 (Jan 3, 1999) + Changes from .25 to .25.1 (Dec 10, 1998) + Changes from .24 to .25 (Nov 23, 1998) + Changes from .23 to .24 (Nov 8, 1998) + Changes from .22.1 to .23 (Oct 31, 1998) + Changes from .22 to .22.1 (Oct 27, 1998) + Changes from .21 to .22 (Oct 26, 1998) + Changes from .20 to .21 (Oct 13, 1998) + Changes from .11 to .20 (Oct 10, 1998) + Changes from .10 to .11 (Sep 12, 1998) * B. To Do + B.1 Things that need to be done + B.2 Good ideas that are worth implementing + B.3 Notes and Status of various items o B.3.1 General region skipping o B.3.2 Word skipping by context o B.3.3 Hidden Markov Model o B.3.4 Email the Personal Dictionary o B.3.5 Words With Spaces in Them * C. Credits * D. Glossary and References * E. Copyright + E.1 GNU Free Documentation License o Preamble o E.1.1 Applicability and Definitions o E.1.2 Verbatim Copying o E.1.3 Copying in Quantity o E.1.4 Modifications o E.1.5 Combining Documents o E.1.6 Collections of Documents o E.1.7 Aggregation With Independent Works o E.1.8 Translation o E.1.9 Termination o E.1.10 Future Revisions of This License o ADDENDUM: How to use this License for your documents + E.2 GNU Lesser Public License * 1.1 Comparison to other spell checker engines + 1.1.1 Comparison to Ispell o 1.1.1.1 Features that only Aspell has o 1.1.1.2 Things that, currently, only Ispell have -------------------------------------------------------------------------- 1. Introduction GNU Aspell is a spell checker designed to eventually replace Ispell. It can either be used as a library or as an independent spell checker. Its main feature is that it does a much better job of coming up with possible suggestions than just about any other spell checker out there for the English language, including Ispell and Microsoft Word. It also has many other technical enhancements over Ispell such as using shared memory for dictionaries and intelligently handling personal dictionaries when more than one Aspell process is open at once. 1.1 Comparison to other spell checker engines +------------------------------------------------------------------------+ | | Aspell | Ispell | Netscape | Microsoft Word | | | | | 4.0 | 97 | |------------------------+--------+--------+------------+----------------| | Open Source | x | x | | | |------------------------+--------+--------+------------+----------------| | Suggestion | 88-98 | 54 | 55-70? | 71 | | Intelligence | | | | | |------------------------+--------+--------+------------+----------------| | Personal part of | x | x | x | | | Suggestions | | | | | |------------------------+--------+--------+------------+----------------| | Alternate Dictionaries | x | x | ? | ? | |------------------------+--------+--------+------------+----------------| | International Support | x | x | ? | ? | +------------------------------------------------------------------------+ The suggestion Intelligence is based on a small test kernel of misspelled/ correct word pairs. Go to http://aspell.net/test for more info and how you can help contribute to the test kernel. The current scores for Aspell are 88 in fast mode, 93 in normal mode, and 98 in bad spellers mode see section 4.4.4 for more information about the various suggestion modes. If you have any other information you would like to add to this chart please contact me at kevina@gnu.org. 1.1.1 Comparison to Ispell 1.1.1.1 Features that only Aspell has * Does a much better job with coming up with suggestions than Ispell does or for that matter any other spell checker I have seen. If you know a spell checker that does a better job please let me know. * Can learn from users misspellings. * Is an actual library that others programs can link to instead of having to use it through a pipe. * Is multiprocess intelligent. When a personal dictionary (or replacement list) is saved it will now first update the list against the dictionary on disk in case another process modified it. * Can share the memory used in the main word list between processes. * Support for detachable dictionaries so that more than one aspell class can use the same dictionary. * Support for multiple personal dictionaries as well as support for special auxiliary dictionaries. * Better support for run-together words. * Ability to use multiple dictionaries by simply specifying it on the command line or in the configuration files. * A better, more complete word list for the English language. Word lists are provided for American, British, and Canadian spelling. Special care has been taken to only include one spelling for each word in any particular word list. The word list included in Ispell by contrast only included support for American and British and also tends to included multiple spellings for a word which can mask some spelling errors. 1.1.1.2 Things that, currently, only Ispell have * Support for affix compression (However this should change soon once Kevin Hendricks is integrated) * Lower memory footprint * Perhaps better support for spell checking (La)TEX files. * Support for spell checking Nroff files. * 2.1 Requirements * 2.2 Obtaining * 2.3 Support * 2.4 Helping Out * 2.5 Compiling & Installing + 2.5.1 Generic Install Instructions + 2.5.2 General Problems + 2.5.3 Curses Notes + 2.5.4 Upgrading from Aspell .33/Pspell .12 + 2.5.5 Upgrading from a pre-release snapshot + 2.5.6 Win32 Notes o 2.5.6.1 Getting o 2.5.6.2 Building o 2.5.6.3 (PD)Curses o 2.5.6.4 Directories o 2.5.6.5 Installer o 2.5.6.6 Win32 consoles * 2.6 Using aspell-import -------------------------------------------------------------------------- 2. Getting Started 2.1 Requirements Aspell requires gcc 2.95 (or better) as the C++ compiler. Other C++ compilers should work with some effort. Other C++ compilers for mostly POSIX compliant (Unix, Linux, BeOS, CygWin) systems should work with out any major problems provided that the compile can handle all of the advanced C++ features Aspell uses. C++ compilers for non-Unix systems might work but it will take some work. Aspell at very least requires a Unix-like environment (sh, grep, sed, tr, etc...) in order to build. Aspell also uses a few POSIX functions when necessary. 2.2 Obtaining The latest version can always be found at GNU Aspell's home page at http:/ /aspell.net. 2.3 Support Support for Aspell can be found on the Aspell mailing lists. Instructions for joining the various mailing lists (and an archive of them) can be found off the Aspell home page at http://aspell.net. 2.4 Helping Out I have very little time to work on Aspell so I desperately need other people to help with the development of Aspell. There are a lot of things that need to be done before I consider Aspell complete. See section B.1 for a list of them. I would really appreciate some help with doing them. If you are interested in helping with one of them please let me know. I am also looking for someone to eventually take over the maintenance and development of Aspell. If you are interested please contact me directly. 2.5 Compiling & Installing 2.5.1 Generic Install Instructions ./configure && make For additional configure options type ./configure --help. You can control what C++ compiler is used by setting the environmental variable CXX before running configure and you can control what flags are passed to the C++ compile via the environmental variable CXXFLAGS. Static libraries are disabled by default since static libraries will not work right due to the mixing of C and C++. When a C program links with the static libraries in Aspell it is likely to crash because Aspell's C++ objects are not getting initialized correctly. However, if for some reason you want them, you can enable them via --enable-static. Aspell should then compile with out any additional user intervention. If you run into problems please first check the sections below as that might solve your problem. To install the program simply type make install After Aspell is installed at least one dictionary needs to be installed. You can find them at http://aspell.net/. The "aspell" program must be in your path in order for the dictionaries to install correctly. If you do not have Ispell or the traditional Unix "spell" utility installed on your system than you should also copy the compatibly scripts "ispell" and "spell" located in the scripts/ directory into your binary directory which is usually /usr/local/bin so that programs that expect ispell or spell command will work correctly. 2.5.2 General Problems Aspell does not use a released version of GNU Libtool. If you have a need to modify configure.ac or any of the Makefile.am's you should install the CVS version of libtool. 2.5.3 Curses Notes If you are having problems compiling termios.cc than the most likely reason is due to incompatibilities with the curses implementation on your system. If this is the case than you can explicitly disable the curses library with --disable-curses. By doing this you will lose the nice full screen interface but hopefully you will be able to at least get Aspell to compile correctly. If the curses library is installed in a non-standard location than you can specify the library and include directory with --enable-curses={lib} and --enable-curses-include={dir}. Lib can either be the complete path of the library (for example "/usr/local/curses/libcurses.a"), the name of the library (for example "ncurses") or a combined location and library in the form "-L{lib dir} -l{lib}" (for example "-L/usr/local/ncurses/lib -lncurses"). Dir is the location of the curses header files (for example "/usr/local/ncurses/include"). 2.5.4 Upgrading from Aspell .33/Pspell .12 Aspell has undergone an extremely large number of changes since the previous Aspell/Pspell release. For one thing Pspell has been merged with Aspell so there in no longer two separate libraries you have to worry about. Because of the massive changes between Aspell/Pspell and Aspell 0.50 you may want to clean out the old files before installing the the new Aspell. To do so do a "make uninstall" in the original Aspell and Pspell source directories. The way dictionaries are handled has also changed. This includes a change in the naming conventions of both language names and dictionaries. Due to the language name change, your old personal dictionaries will not be recognized. However, you can import the old dictionaries by running the "aspell-import" script. This also means that dictionaries designed to work with older versions of Aspell are not likely to function correctly. Fortunately new dictionary packages are available for most languages. You can find them off of the Aspell home page at http://aspell.net. The Pspell ABI is now part of Aspell except that the name of everything has changed due to the renaming of Pspell to Aspell. In particular please note the following name changes: pspell -> aspell manager -> speller emulation -> enumeration master_word_list -> main_word_list Please also note that the name of the language-tag option has changed to lang. However, for backward compatible the language-tag option will still work. However, you should also be able to build applications that require Pspell with the new Aspell as a backward compatibility header file is provided. Due to a change in the way dictionaries are handled, scanning for .pwli files in order to get find out which dictionaries are available will no longer work. This means that programs that relied on this technique may have problems finding dictionaries. Fortunately, GNU Aspell now provided a uniform way to list all installed dictionaries via the C API. See the file "list-dicts.c" in the examples/ directory for an example of how to due this. Unfortunately there is isn't any simply way to find out which dictionaries are installed which will work with both the old Aspell/Pspell and the new GNU Aspell. 2.5.5 Upgrading from a pre-release snapshot At the last minute I decided to merge the "speller-util" program into the main "aspell" program. You may right to remove that speller-util program to avoid confusion. This also means that dictionaries designed to work with the snapshot will no longer work with the official release. 2.5.6 Win32 Notes 2.5.6.1 Getting The latest version of the native Aspell/Win32 port can be found at http:// aspell.net/win32. 2.5.6.2 Building There are two basically different ways of building ASpell using GCC for Win32: You can either use the Cygwin compiler, which will produce binaries that depend on the posix layer in cygwin1.dll. The other way is using MinGW GCC, those binaries use the native C runtime from Microsoft (MSVCRT.DLL). If you intend to use or link against the Aspell libraries using a native Win32 compiler (e.g. MS Visual C++), you will need the MinGW built ones to avoid problems caused by the different runtime libraries. Building Aspell using Cygwin: This works exactly like on other posix compatible systems using the "configure && make && make install" cycle. Some versions of Cygwin GCC will fail to link, this is caused by an incorrect libstdc++.la in the /lib directory. After removing or renaming this file, the build progress should work (GCC-2.95 and GCC-3.x should work). Building Aspell using MinGW: To compile Aspell with the MinGW compiler, you will need at least GCC-3.2 (as shipped with MinGW-2.0.3) and some GNU tools like rm and cp. The origin of those tools doesn't matter, it has shown to work with any tools from MinGW/MSys, Cygwin or Linux. To build Aspell, move into the win32 subdirectory and type "make". You can enable some additional build options by either commenting out the definitions at the head of the Makefile or passing those values as environment variables or at the make command line. Following options are supported: DEBUGVERSION If set to "1", the binaries will include debugging information (resulting in a much bigger size). CURSESDIR Enter the path to the pdcurses library here, in order to get a nicer console interface (see below). MSVCLIB Enter the filename of MS lib.exe here, if you want to build libraries that can be imported from MS Visual C++. WIN32_RELOCATABLE If set to "1", Aspell will detect the prefix from the path where the DLL resides (see below for further details). TARGET Sets a prefix to be used for cross compilation (e.g. "/usr/local/bin/ i586-mingw32msvc-" to cross compile from linux). There are also a MinGW compilers available for Cygwin and Linux, both versions are able to compile Aspell using the prebuilt Makefile. While the Cygwin port automatically detects the correct compiler, the Linux version depends on setting the "TARGET" variable in the Makefile (or environment) to the correct compiler prefix. Other compilers may work. There is a patch for MS Visual C++ 6.0 available at ftp://ftp.gnu.org/gnu/aspell, but it needs a lot of changes to the Aspell sources. It has also been reported that the Intel C++ compiler can be used for compilation. 2.5.6.3 (PD)Curses In order to get the nice full screen interface when spell checking files a curses implementation that does not require Cygwin is required. The PDCurses (http://pdcurses.sourceforge.net) implementation is known to work, other implementations may work however they have not been tested. See the previous section for information on specifying the location of the curses library and include file. Curses notes: * PDcurses built with MinGW needs to be compiled with -DPDC_STATIC_BUILD to avoid duplicate declaration of DllMain when compiling aspell.exe. * The curses enabled version can cause trouble in some shells (MSys rxvt, emacs) and will produce errors like "initscr() LINES=1 COLS=1: too small". Use a non-curses version for those purposes. 2.5.6.4 Directories If Aspell is compiled with WIN32_RELOCATABLE=1, it can be run from any directory: It will set {prefix} according to its install location (assuming it resides in {prefix}\bin). Your personal wordlists will be saved in the {prefix} directory with their names changed from ".aspell. .*" to ".*" (you can override the path by setting the HOME environment variable). 2.5.6.5 Installer The installer registers the DLLs as shared libraries, you should increase the reference counter to avoid the libraries being uninstalled if your application still depends on them (and decrease it again when uninstalling your program). The reference counters are located under: HKLM\SOFTWARE\Microsoft\Windows\CurrentVersion\SharedDLLs The install location and version numbers are stored under HKLM\SOFTWARE\Aspell 2.5.6.6 Win32 consoles The console uses a different encoding than GUI applications, changing this to to a Windows encoding (e.g. 1252) is not supported on Win9x/Me. On WinNT (and later) those codepages can be set by first changing the console font to "lucida console", then changing the codepage using "chcp 1252". Some alternative shells (e.g. MSys' rxvt or Cygwin's bash) do a codepage conversion (if correctly set up), so running Aspell inside those shells might be a workaround for Win9x. 2.6 Using aspell-import The aspell-import Perl script will look for old personal dictionaries and will import them into GNU Aspell. It will look for both Ispell and Aspell ones. To use it just run it from the command prompt. If you get an error about /usr/bin/perl not being found than instead try "perl {bindir}/ aspell-import". When running the script if you get a message like: Error: No word lists can be found for the language "de". This means that you have not installed support for the given language, in this case "de" for German. To rectify the situation download and install a dictionary designed to work with GNU Aspell 0.50 or better. * 3.1 Spellchecking Individual Files * 3.2 Using Aspell with other Applications + 3.2.1 With Applications that Expect Ispell + 3.2.2 With Emacs and Xemacs + 3.2.3 With LyX + 3.2.4 With VIM + 3.2.5 With Pine -------------------------------------------------------------------------- 3. Basic Usage For a quick reference on the Aspell utility use the command "aspell --help". 3.1 Spellchecking Individual Files To use Aspell to spellcheck a file use: aspell check [{options}] {filename} at the command line where {filename} is the file you want to check and {options} is any number of optional options. Some of the more useful ones include: --mode={mode} the mode to use when checking files. The available modes are none, url, email, sgml, or tex. See section 4.4.1 for more informations on the various modes. --dont-backup don't create a backup file. --sug-mode={mode} the suggestion mode to use where mode is one of ultra, fast, normal, or bad-spellers. See section 4.4.4 for more information on these modes. --master={name} the main dictionary to use. --keymapping={name} the keymapping to use. Either aspell for the default mapping or ispell to use the same mapping that the ispell utility uses. Please see Chapter 4 for more information on the available options. For example to check the file foo.txt: aspell check foo.txt and to check the file foo.txt using the bad-spellers suggestion mode and the large American English dictionary: aspell check --sug-mode=bad-spellers --master=american-lrg foo.txt If the mode option is not given then aspell will use the extension of the file to determine the current mode. If the extension is .tex, then TEX mode will be uses, if the extension is .html, .htm, .php, or .sgml it will check the file in sgml mode, otherwise it will use url mode. These extensions can be changed via the tex-extension and sgml-extension options respectfully. See chapter 4.4.1 for more information on the various modes that can be used. If Aspell was compiled with curses support and the TERM environmental variable is set to a capable terminal type than Aspell will use a nice full screen interface. Otherwise it will use a simpler "dumb" terminal interface where the misspelled word is surrounded by two '*'. In either case the interface should be self explanatory. 3.2 Using Aspell with other Applications 3.2.1 With Applications that Expect Ispell Aspell can be used as a drop in replacement for Ispell for programs that use Ispell through a pipe such as Emacs and LyX. It can also be used with programs that use simple call the ispell command and expect the original file to be overwritten with the corrected version. It support the basic features of Ispell, however it does not currently have a Nroff mode so there may be situations in which you still wish to use Ispell. Nevertheless, I have been using Aspell for Xemacs and LyX since the middle of September of 1998 with out any problems. If you do not have Ispell installed on your system and have installed the Ispell compatibly script than you should not need to do anything as most applications will that expect Ispell will work as expected with Aspell via the Ispell compatibility script. Otherwise, the recommended way to use Aspell as a replacement for ispell is to change the Ispell command from within the program being used. If the program uses ispell in pipe mode simple change ispell to aspell. If the program calls the ispell command to check the file change "ispell" with "aspell check". If that is impossible and the program uses ispell through a pipe than the run-with-aspell script can be used for programs using ispell in pipe mode. The format of the script is: run-with-aspell {command} where {command} is the name of the program with any optional arguments. The old method of mapping Ispell to Aspell is discouraged because it can create compatibility problems with programs that actually require Ispell such as Ispell's own scripts. 3.2.2 With Emacs and Xemacs The easiest way to use Aspell with Emacs or Xemacs is to add this line: (setq-default ispell-program-name "aspell") to the end of your .emacs file. For some reason version 3.0 of ispell.el (the lisp program that (x)emacs uses) want to reverse the suggestion list. To fix this add this line: (setq-default ispell-extra-args '("--reverse")) after the previous line in your .emacs file and it should solve the problem. Ispell.el, version 3.1 (December 1, 1998) and better, has the list reversing problem fixed. You can find it at http://www.kdstevens.com/ ~stevens/ispell-page.html. 3.2.3 With LyX Version 1.0 of LyX provides support for Aspell learning for users mistake feature. To use aspell with LyX 1.0 either change the spell_command option in the lyxrc file or use the run-with-aspell utility. 3.2.4 With VIM (The following section was written by "R. Marc", rmarc at copacetic net.) To use aspell in vim you simply need to add the following line to your .vimrc file: map ^T :w!:!aspell check %:e! % I use since that's the way you spell check in pico. In order to add a control character to your .vimrc you must type first. In this case . A more useful way to use Aspell, IMHO, is in combination with newsbody ( http://www.image.dk/~byrial/newsbody/) which is how I use it since vim is my editor for my mailer and my news reader. map ^T \1\2:e! % map \1 :w! map \2 :!newsbody -qs -n % -p aspell check \%f 3.2.5 With Pine To use aspell in pine simply change the option speller to aspell --mode=email check To change the speller option go to the main menu. Type S for setup, C for config, then W for where is. Type in speller as the word to find. The speller option should be highlighted now. Hit enter, type in the above line, and hit enter again. Then type E for exit setup and Y to save the change. If you have a strong desire to check other peoples comments change speller to aspell check instead which will avoid switching aspell into email mode. * 4.1 Specifying Options + 4.1.1 At the Command Line o 4.1.1.1 Boolean o 4.1.1.2 Value o 4.1.1.3 List + 4.1.2 Via a Configuration File o 4.1.2.1 Boolean o 4.1.2.2 Value o 4.1.2.3 List + 4.1.3 Via an Environmental Variable * 4.2 The Options + 4.2.1 Dictionary Options + 4.2.2 Checker Options + 4.2.3 Filter Options + 4.2.4 Run-together Word Options + 4.2.5 Misc Options + 4.2.6 Aspell Utility Options * 4.3 Dumping Configuration Values * 4.4 Notes on various Options + 4.4.1 Notes on Various Filters and Filter Modes o 4.4.1.1 None Mode o 4.4.1.2 Url Filter/Mode o 4.4.1.3 Email Filter/Mode o 4.4.1.4 SGML Filter/Mode o 4.4.1.5 TEX Filter/Mode + 4.4.2 Notes on the Prefix Option + 4.4.3 Notes on Typo-Analysis and the Keyboard Definition File + 4.4.4 Notes on the Different Suggestion Modes -------------------------------------------------------------------------- 4. Customizing Aspell The behavior of Aspell can be changed by any number of options which can be specified at either the command line, the environmental variable ASPELL_CONF, a personal configuration file, or a global configuration file. Options specified on the command line override options specified by the environmental variable. Options specified by the environmental variable override options specified by either of the configurations files. Finally options specified by the personal configuration file override options specified in the global configuration file. Options specified in the environmental variable ASPELL_CONF, a personal configuration file, or a global configuration file will take effect no matter how Aspell is used which includes being used by other applications. Aspell has three basic type of options: boolean, value, and list. Boolean options are either enabled or disabled, value options take a specific value, and list options can either have entries added or removed from the list. 4.1 Specifying Options 4.1.1 At the Command Line All options specified at the command line have the following basic format: --{option}[={value}] where the '=' can be replaced by whitespace. However some options also have single letter abbreviations of the form: -{letter}[{optional whitespace}{value}] 4.1.1.1 Boolean To enable a boolean option simply special the option with out any corresponding value. For example to ignore accents when checking words use "--ignore-accents". To disable a boolean option prefix the option name with a "dont-". For example to not ignore accents when checking words use "--dont-ignore-accents". If a boolean option has a single letter abbreviation simply give the letter corresponding to either enabling or disabling the option with out any corresponding value. For example to consider run-together words legal use "-C" or to consider them illegal use "-B" 4.1.1.2 Value To specify a value option simply specify the option with its corresponding value. For example to set the filter mode to Tex use "--mode=tex". If a value option has a single letter shortcut simply specify the single letter short cut with its corresponding value. For example to use a large american dictionary use "-d american-lrg". 4.1.1.3 List To add a value to the list prefix the option name with a "add-" and then specify the value to add. For example to add the URL filter use "--add-filter url". To remove a value from a list option prefix the option name with a "rem-" and then specify the value to remove. For example to remove the URL filter use "--rem-filter url". To remove all items from a list prefix the option name with a "rem-all" without specify any value. For example to remove all filters use "--rem-all-filter". 4.1.2 Via a Configuration File Aspell can also accept options via a personal or global configuration file. The exact files to used are specified by the options per-conf and conf respectfully but the personal configuration file is normally ".aspell.conf" located in the HOME directory and the global one is normally "aspell.conf" which is located in the etc directory which is normally "/usr/etc" or "/usr/local/etc". To find out the particular values for your particular system use "aspell dump config". Each line of the configuration file has the format: {option} [{value}] There may any number of spaces between the option and the value however it can only be spaces, ie there is no '=' between the option name and the value. Comments may also be included by preceding them with a '#' as anything from a "#" to a newline is ignored. Blank lines are also allowed. Values set in the personal configuration file override those in the global file. Options specified at either the command line or via an environmental variable override those specified by either configuration file. 4.1.2.1 Boolean To specify a boolean option simply include the option followed by a "true" to enable it or a "false" to disable it. For example to allow run-together words use "run-together true". 4.1.2.2 Value To specify a value option simply include the option followed by the corresponding option. For example to set the default language to german use "lang german". 4.1.2.3 List To add a value to the list prefix the option name with a "add-" and then specify the value to add. For example to add the URL filter use "add-filter url". To remove a value from a list option prefix the option name with a "rem-" and then specify the value to remove. For example to remove the URL filter use "rem-filter url". To remove all items from a list prefix the option name with a "rem-all" without specify any value. For example to remove all filters use "rem-all-filter". 4.1.3 Via an Environmental Variable The environmental variable ASPELL_CONF may also be used and it overrides any options set in the configuration file. The format of the string is exactly the same as the configuration file except that semicolons ( ; ) are used instead of newlines. 4.2 The Options The following is a list of available options broken down by category. Each entry has the following format: {option}[,{single letter abbreviations}] ({type}) {description} Where single letter options are specified as they would appear at the command line, ie with the preceding dash. Boolean single letter options are specified in the following format: -{abbreviation to enable}|-{abbreviation to disable} {Option} is one of the following: boolean, string, file, dir, integer, or list. String, file, dir, and integer types are all value options which can only take a specific type of value. 4.2.1 Dictionary Options The following options may be used to control which dictionaries to use and how they behave (see section 5 for more information): master,-d (string) base name of the dictionary to use. If this option is specified than Aspell with either use this dictionary or die. dict-dir (dir) location of the main word list lang (string) language to use, it follows the same format of the LANG environmental variable on most systems. It consists of the two letter ISO 639 language code and an optional two letter ISO 3166 country code after a dash or underscore. The default value is based on the value of the LC_MESSAGES locale. size (string) the preferred size of the word list jargon (string) an extra information to distinguish two different words lists that have the same lang and size. word-list-path (list) search path for word list information files module-search-order (list) list of available modules, modules that come first on this list have a higher priority. Currently there is only one speller module. personal,-p (file) personal word list file name repl (file) replacements list file name extra-dicts (list) extra dictionaries to use. strip-accents (boolean) strip accents from all words in the dictionary 4.2.2 Checker Options These options control the behavior of Aspell when checking documents. ignore,-W (integer) ignore words <= n chars ignore-case (boolean) ignore case when checking words ignore-accents (boolean) ignore accents when checking words ignore-repl (boolean) ignore commands to store replacement pairs save-repl (boolean) save the replacement word list on save allkeyboard (file) the base name of the keyboard definition file to use (see section 4.4.3) sug-mode (mode) suggestion mode = ultra | fast | normal | bad-spellers (see section 4.4.4) 4.2.3 Filter Options These options modify the behavior of the various filters (see section 4.4.1 for more information): filter (list) add or removes a filter mode (string) sets the filter mode. Mode is one if none, url, email, sgml, or tex. (The short cut options '-e' may be used for email, '-H' for Html/Sgml, or '-t' for Tex) encoding (string) The encoding the input text is in. Valid values are "utf-8", "iso8859-*", "koi8-r", "viscii", "cp1252", "machine unsigned 16", "machine unsigned 32". However, the aspell utility will currently only function correctly with 8-bit encodings. I hope to provide utf-8 support in the future. The two "machine unsigned" encodings are intended to be used by other programs using the Aspell library and it is unlikely the Aspell utility will ever support these encodings. add|rem-email-quote (list) email quote characters email-margin (integer) num chars that can appear before the quote char sgml-check (list) sgml attributes to always check. sgml-extension (list) sgml file extensions. tex-command (list) TEX commands tex-check-comments (boolean) check TEX comments 4.2.4 Run-together Word Options These may be used to control the behavior of run-together words (see section 7.4 for more information): run-together,-C|-B (boolean) consider run-together words legal run-together-limit (integer) maximum numbers that can be strung together run-together-min (integer) minimal length of interior words 4.2.5 Misc Options Misc. other options that don't fall under any other category conf (file) main configuration file conf-dir (dir) location of main configuration file data-dir (dir) location of language data files local-data-dir (dir) alternative location of language data files. This directory is searched before data-dir. It defaults to the same directory the actual main word list is in (which is not necessarily dict-dir). home-dir (dir) location for personal files per-conf (file) personal configuration file prefix (dir) prefix directory set-prefix (boolean) set the prefix based on executable location (only works on Win32 and when compiled with --enable-win32-relocatable) 4.2.6 Aspell Utility Options backup,-b|-x (boolean) create a backup file by appending ".bak" to the file name. (Only applies when the command is check) time (boolean) time load time and suggest time in pipe mode. reverse (boolean) reverse the order of the suggestions list. keymapping (string) the keymapping to use. Either aspell for the default mapping or ispell to use the same mapping that the ispell utility uses. 4.3 Dumping Configuration Values To find out the current value of all the options use the command "aspell dump config". This will dump the current configuration to standard output. The format of the contents dumped is such that it can be used as either the global or personal configuration file. To find out the current value of a particular option use "aspell config { option}". This will print out the value of {option} to stdout and nothing else. 4.4 Notes on various Options 4.4.1 Notes on Various Filters and Filter Modes Aspell now has rudimentary filter support. You can either select from individual filters or chose a filter mode. To select a filter mode use the mode option. You may chose from none, url, email, sgml, and tex. The default mode is url. Individual filters can be added with the option add-filter and remove with the rem-filter option. The currently available filters are url, email, sgml, tex as well as a bunch of filters which translate the text from one format to another. 4.4.1.1 None Mode This mode is exactly what it says. It turns off all filters. 4.4.1.2 Url Filter/Mode The url filter/mode skips over URL's, host names, and email addresses. Because this filter is almost always useful and rarely does any harm it is enabled in all modes except none. To turn it off either select the none mode or use rem-filter option after the desired mode is selected. 4.4.1.3 Email Filter/Mode The email filter/mode skips over quoted text. It currently does not support skipping over headers however a future version should. In the mean time I suggest you use Aspell with Newsbody which can be found at http:// home.worldonline.dk/~byrial/newsbody/. The option email-skip controls the number of characters that can appear before the email quote char, the default is 10. The option add|rem-email-quote controls the characters that are considered quote characters, the default is ">' and '|'. 4.4.1.4 SGML Filter/Mode The sgml filter/mode will skip over sgml commands. It currently does not handle nested < > unless they are in quotes. It also does it handle the null end tag (net) minimization feature of sgml such as ][-{jargon}][-{size}].multi Where {language} is the two letter language code, {region} is the two letter region code, {jargon} is any extra informations to distinguish the word list from other ones with the same language and spelling, and {size} is the size of the dictionary. If no size is specified that the default size of 60 will be assumed. For example: en.multi en_US.multi en-medical.multi en-medical-85.multi en-85.multi de.multi 5.7 AWLI files In order for Aspell to find dictionaries that are located in odd places or not named according to section 5.6 a AWLI file needs to be created for the dictionary and located in some place where Aspell can find it. Each AWLI has the the following name: {language}[_{region>][-{jargon}][-{size}]-{module}.awli Where the names have the same meaning as in section 5.6 and {module} is the speller module to use, which should be set to "default" for now since there is only one speller module. Each AWLI file for an Aspell word list should then contain exactly one line which contains the full path of the main word list. * 6.1 Through the C API + 6.1.1 Usage + 6.1.2 API Reference + 6.1.3 Examples + 6.1.4 Notes About Thread Safety * 6.2 Through A Pipe + 6.2.1 Format of the Data Stream * 6.3 Notes of Storing Replacement Pairs -------------------------------------------------------------------------- 6. Writing programs to use Aspell There are two main ways to use aspell from within your application. Through the external C API or through a pipe. The internal Aspell API can be used directly but that is not recommended as the actual Aspell API is constantly changing. 6.1 Through the C API The Aspell library contains two main classes and several helper classes. The two main classes are AspellConfig and AspellMaster. The AspellConfig class is used to set initial defaults and to change spell checker specific options. The AspellManager class does most of the real work. It is responsible for managing the dictionaries, checking if a word is in the dictionary, and coming up with suggestions among other things. There are many helper classes the important ones are AspellWordList, AspellMutableWordList, Aspell*Enumeration. The AspellWordList classes is used for accessing the suggestion list, as well as the personal and suggestion word list currently in use. The AspellMutableWordList is used to manage the personal, and perhaps other, word lists. The Aspell* Enumeration classes are used for iterating through a list. 6.1.1 Usage To use Aspell your application should include "aspell.h". In order to insure that all the necessary libraries are linked in libtool should be used to perform the linking. When using libtool simply linking with "-laspell" should be all that is necessary. When using shared libraries you might be able to simply link "-laspell", but this is not recommended. This version of Aspell uses the CVS version of libtool however released versions of libtool should also work. When your application first starts you should get a new configuration class with the command: AspellConfig * spell_config = new_aspell_config(); which will create a new AspellConfig class. It is allocated with new and it is your responsibility to delete it with delete_aspell_config. Once you have the config class you should set some variables. The most important one is the language variable. To do so use the command: aspell_config_replace(spell_config, "lang", "en_US"); which will set the default language to use to American English. The language is expected to be the standard two letter ISO 639 language code, with an optional two letter ISO 3166 country code after an underscore. You can set the preferred size via the size option, any extra info via the jargon option, and the encoding via the encoding option. Other things you might want to set is the preferred spell checker to use, the search path for dictionary's, and the like see section 4.2 for a list of all available options. When ever a new document is created a new AspellManager class should also be created. There should be one speller class per document. To create a new speller class use the new_aspell_speller and then cast it up using to_aspell_speller like so. AspellCanHaveError * possible_err = new_aspell_speller(spell_config); AspellManager * spell_checker = 0; if (aspell_error_number(possible_err) != 0) puts(aspell_error_message(possible_err)); else spell_checker = to_aspell_speller(possible_err); which will create a new AspellManager class using the defaults found in spell_config. To find out which dictionary is selected the lang, size, and jargon options may be examined. To find out the exact name of the dictionary the master option way be examined as well as the master-flags options to see if any special flags that were passed on to the module. The module option way also be examined to figure out which speller module was selected, but since there is only one this option will always be the same. If for some reason you want to use different defaults simply clone spell_config and change the setting like so: AspellConfig * spell_config2 = aspell_config_clone(spell_config); aspell_config_replace(spell_config2, "lang","nl"); possible_err = new_aspell_speller(spell_config2); delete_aspell_config(spell_config2); Once the speller class is created you can use the check method to see if a word in the document is correct like so: int correct = aspell_speller_check(spell_checker, , ); can is expected to a const char * character string. If the encoding is set to be "machine unsigned 16" or "machine unsigned 32". is expected to be a cast from either const u16int * or const u32int* respectfully. U16int and u32int are generally unsigned short and unsigned int respectfully. is the length of the string or -1 if the sting is null terminated. If the string is a cast from const u16int * or const u32int * then size is the amount of space in bytes the string takes up after being casted to const char * and not the true size of the string. Aspell_speller_check will return 0 is it is not found and non-zero otherwise. If the word is not correct than the suggest method can be used to come up with likely replacements. AspellWordList * suggestions = aspell_speller_suggest(spell_checker, , ); AspellStringEnumeration * elements = aspell_word_list_elements (suggestions); const char * word; while ( (word = aspell_string_enumeration_next(aspell_elements) != NULL ) { // add to suggestion list } delete_aspell_string_manag(elements); Notice how elements is deleted but suggestions is not. The value returned by suggestions is only valid to the next call to suggest. Once a replacement is made the store_repl method should be used to communicate the replacement pair back to the spell checker (see section 6.3 for why). It usage is as follows: aspell_speller_store_repl(spell_checker, , , , ); If the user decided to add the word to the session or personal dictionary the the word can be be added using the add_to_session or add_to_personal methods respectfully like so: aspell_speller_add_to_session|personal(spell_checker, , ); It is better to let the spell checker manage these words rather than doing it your self so that the words have a change of appearing in the suggestion list. Finally, when the document is closed the AspellManager class should be deleted like so. delete_aspell_speller(spell_checker); 6.1.2 API Reference Methods that return a boolean result generally return false on error and true other wise. To find out what went wrong use the error_number and error_message methods. Unless otherwise stated methods that return a "const char *" will return null on error. In general, the charter string returned is only valid until the next method which returns a "const char *" is called. For the details of the various classes please see the header files. In the future I will generate class references using some automated tool. 6.1.3 Examples Two simple examples are included in the examples directory. The "example-c" program demonstrates most of the Aspell library functionary and the "list-dicts" lists the available dictionaries. 6.1.4 Notes About Thread Safety Read-only Aspell methods and functions should be thread safe as long as exceptions, new, delete, delete[], and STL allocators are thread safe. To the best of my knowledge gcc and egcs meet these requirements. It is up to the programmer to make sure multiple threads do not do thing such as change the dictionaries and add or delete items from the personal or session dictionaries. 6.2 Through A Pipe When given the pipe or -a command aspell goes into a pipe mode that is compatible with "ispell -a". Aspell also defines its own set of extensions to ispell pipe mode. 6.2.1 Format of the Data Stream In this mode, Aspell prints a one-line version identification message, and then begins reading lines of input. For each input line, a single line is written to the standard output for each word checked for spelling on the line. If the word was found in the main dictionary, or your personal dictionary, then the line contains only a '*'. If the word is not in the dictionary, but there are suggestions, then the line contains an '&', a space, the misspelled word, a space, the number of near misses, the number of characters between the beginning of the line and the beginning of the misspelled word, a colon, another space, and a list of the suggestions separated by commas and spaces. Finally, if the word does not appear in the dictionary, and there are no suggestions, then the line contains a '#', a space, the misspelled word, a space, and the character offset from the beginning of the line. Each sentence of text input is terminated with an additional blank line, indicating that ispell has completed processing the input line. These output lines can be summarized as follows: OK: * Suggestions: & {original} {count} {offset}: {miss}, {miss}, ... None: # {original} {offset} When in the -a mode, Aspell will also accept lines of single words prefixed with any of '*', '&', '@', '+', '-', '~', '#', '!', '%', or '^'. A line starting with '*' tells ispell to insert the word into the user's dictionary. A line starting with '&' tells ispell to insert an all-lowercase version of the word into the user's dictionary. A line starting with '@' causes ispell to accept this word in the future. A line starting with '+', followed immediately by a valid mode will cause aspell to parse future input according the syntax of that formatter. A line consisting solely of a '+' will place ispell in TEX/LATEX mode (similar to the -t option) and '-' returns aspell to its default mode (but these commands are obsolete). A line '~', is ignored for ispell compatibility. A line prefixed with '#' will cause the personal dictionaries to be saved. A line prefixed with '!' will turn on terse mode (see below), and a line prefixed with '%' will return ispell to normal (non-terse) mode. Any input following the prefix characters '+', '-', '#', '!', '~', or '%' is ignored, as is any input following. To allow spell-checking of lines beginning with these characters, a line starting with '^' has that character removed before it is passed to the spell-checking code. It is recommended that programmatic interfaces prefix every data line with an uparrow to protect themselves against future changes in Aspell. To summarize these: *{word} Add a word to the personal dictionary &{word} Insert the all-lowercase version of the word in the personal dictionary @{word} Accept the word, but leave it out of the dictionary # Save the current personal dictionary ~ Ignored for ispell compatibility. + Enter TEX mode. +{mode} Enter the mode specified by {mode}. - Enter the default mode. ! Enter terse mode % Exit terse mode ^ Spell-check the rest of the line In terse mode, Aspell will not print lines beginning with '*', which indicate correct words. This significantly improves running speed when the driving program is going to ignore correct words anyway. In addition to the above commands which are designed for Ispell compatibility Aspell also supports its own extension. All Aspell extensions follow the following format. $${command} [data] Where data may or may not be required depending on the particular command. Aspell currently supports the following command. cs {option},{value} Change a configuration option. cr {option} Prints the value of a configuration option. pp Returns a list of all words in the current personal wordlist. ps Returns a list of all words in the current session dictionary. l Returns the current language name. ra {mis},{cor} Add the word pair to the replacement dictionary for latter use. Returns nothing. Anything returned is returned on its own line line. All lists returned have the following format {num of items}: {item1}, {item2}, {etc} (Part of the preceding section was directly copied out of the Ispell manual) 6.3 Notes of Storing Replacement Pairs The store_repl method and the $$ra should be used because Aspell is able to learn from users misspellings. For example on the first pass a user misspells beginning as beging so aspell suggests: begging, begin, being, Beijing, bagging, .... However the user then tries "begning" and aspell suggests beginning, beaning, begging, ... so the user selects beginning. However than, latter on in the document the user misspells it as begng (NOT beging). Normally aspell will suggest. began, begging, begin, begun, .... However because it knows the user misspelled beginning as beging it will instead suggest: beginning, began, begging, begin, begun ... I myself often misspelled beginning (and still do) as something close to begging and two many times wind up writing sentences such as "begging with ....". Please also note that replacements commands has a memory. Which means if you first store the replacement pair: sicolagest -> psycolagest then store the replacement pair psycolagest -> psychologist The replacement pair sicolagest -> psychologist will also get stored so that you don't have to worry about it. * 7.1 The Language Data File * 7.2 Compiling the Word List * 7.3 Phonetic Code + 7.3.1 Syntax of the transformation array + 7.3.2 How do I start finally? o 7.3.2.1 Things that come in handy o 7.3.2.2 What the phonetic code should do * 7.4 Controlling the Behavior of Run-together Words -------------------------------------------------------------------------- 7. Adding Support For Other Languages Before you consider adding support for Aspell first make sure that someone else has not already done it. A good number of dictionaries off the Aspell home page at http://aspell.net. If your language is not listed above please send me a note and I will work with you on adding support. This chapter describes the old manual way of adding support for a new language in Aspell. The recommended way to add support is through the aspell-dicts package. The scripts used, as well as some documentation on what to do, is available in the aspell-gen package which is available in the same location the dictionary packages are at. Adding a language to aspell is fairly straightforward. You need to create the language data file, and compile a new word list. 7.1 The Language Data File The basic format of the language data data is the same as it for aspell configuration file. It is named {lang}.dat and is located in the architecture independent data dir for aspell (option data-dir) which is usually {prefix}/share/aspell. Use "aspell config" to find out where it is in your installation. By convention the language name should be the two letter ISO 639 language code. The language data file has several mandatory fields, and several optional ones. All fields are case sensitive and should be in all lower case. The two mandatory fields are name and charset. Name is the name of the language and should be the same as the file name (without the .dat). Charset is the charset aspell will expect the word lists to be formatted in. You may chose from any of the iso-8859-* character sets as well as, koi8-f, koi8-r, and viscii. If your language can fit in the plain old ASCII character set use iso8859-1. If you use some other character set for your language other than the ones listed here drop me a note and I will look into adding support for it. The optional fields are special, soundslike, keyboard and a bunch of options to specify how run-together words are handles. Special is for non letter characters that can appear in your language such as the ' and -. The format for the value is a list separated by spaces. Each item of the list has the following format {char} {begin}{middle}{end} {char} is the non letter character in question. {begin},{middle},{end} are either a '-' or a '*'. A star for {begin} means that the character at the beginning of the word, a '-' means it can't. The same is true for {middle} and {end}. For example the entry for the ' in english is: ' -*- To include more than one middle character just list them one after another on the same line. for example to make both the ' and the - a middle character use the following line in the language data file: special ' -*- - -*- The soundslike option, if present, should be the name of the soundslike data for the language. The data is expected to be in the file {name} _phonetic.dat. If the name is generic a really generic soundslike algorithm will be used which consists of striping all the vowels and removing all accents. I recommend first using the generic algorithm and then, after aspell is working with the new language, work on the transformation array. If the soundslike name is none then no soundslike lookup table will be used. This will reduce the size of the compiled word list by around 50% but at the sacrifice of suggestion quality. If the soundslike is none than the soundslike for the word will simply be the word itself in lowercase, will all accents stripped. For languages with phonetic spelling the difference will not be very noticeable. However, for languages with non-phonetic spelling there will be a noticeable difference. The difference you notice will depend on the quality of the soundslike data file. If you do not notice much of a difference for a language with non-phonetic spelling that is a good indication that the soundslike data is not rough enough--or the words you are trying are not that badly misspelled. The keyboard option specifies the base name of the keyboard definition file to use. See section 4.4.3 for more information. The options to control how run-together words are handled are the same as the are in the normal configurations files. Please see section 7.4 for more information. 7.2 Compiling the Word List Once you have a working language data file installed in the right place you are ready to compile the main word list. See section 5 to find out what to do. This section also includes instructions for creating the awli file. 7.3 Phonetic Code (The following section was written by Björn Jacke, bjoern.jacke at gmx de) Aspell is in fact the spell checker that comes up with the best suggestions if it finds an unknown word. One reason is that it does not just compare the word with other words in the dictionary (like Ispell does) but also uses phonetic comparisons with other words. The new table driven phonetic code is very flexible and setting up phonetic transformation rules for other languages is not difficult but there can be a number of stumbling stones -- that's why I wrote this section. The main phonetic code is free of any language specific code and should be powerful enough to allow setting up rules for any language. Anything which is language specific is kept in a plain text file and can easily be edited. So it's even possible to write phonetic transformation rules if you don't have any programming skills. All you need to know is how words of the language are written and how they are pronounced. 7.3.1 Syntax of the transformation array In the translation array there are two strings on each line; the first one is the search string (or switch name) and the second one is the replacement string (or switch parameter). The line version {version} is also required to appear somewhere in the translation array. The version string can be anything but it should be changed when ever the a new version of the translation array is released. This is important because it will keep Aspell from using a compiled dictionary with the wrong set of rules. For example if when coming up with suggestion for "hallo" aspell will use the new rules to come up with the soundslike say "H*L*" but if hello is stored in the dictionary using the old rules as "HL" instead of "H*L*" aspell will never be able to come up with hello. So to solve this problem Aspell checks if the version strings match and abort with an error if they don't. Thus it is important to update it when ever a new version of the translation array is releases. This is only a problem with the main word list as the personal word lists are now stored as simple word lists with a single header line (ie, no soundslike data). Each non switch line represents one replacement (transformation) rule. Words beginning with the same letter must be grouped together; the order inside this group does not depend on alphabetical issues but it gives priorities; the higher the rule the higher the priority. That's why the first rule that matches is applied. In the following example: GH _ G K "GH to _" has higher priority than "G to K". "_" represents the empty string "". If "GH to _" would stand after "G to K", the second rule would never match because the algorithm would stop searching for more rules after the first match. The above rules transform any "GH" to an empty string (delete them) and transform any other "G" to "K". At the end of the first string of a line (the search string) there may optionally stand a number of characters in brackets. One (only one!) of these characters must fit. It's comparable with the [ ] brackets in regular expressions. The rule "DG(EIY) to J" for example would match any "DGE", "DGI" and "DGY" and replace them with "J". This way you can reduce several rules to one. Behind the search string there can stand one or more dashes (-). Those search strings will be matched totally but only the beginning of the string will be replaced. Furthermore for these rules no follow-up rule will be searched (what this is will be explained later). The rule "TCH-- to _" will match any word containing "TCH" (like "match") but will only replace the first character "T" with an empty string. The number of dashes determines how many characters from the end will not be replaced. After the replacement the search for transformation rules continues with the not replaced "CH"! If a "<" is appended to the search string, the search for replacement rules will continue with the replacement string and not with the next character of the word. The rule "PH< to F" for example would replace "PH" with "F" and then again start to search for a replacement rule for "F...". If there would also be rules like "FO to O" and "F to _" then words like "PHOXYZ" would be transformed to "OXYZ" and any occurrences of "PH" that are not followed by an "O" will be deleted like "PHIXYZ to IXYZ". The second replacement however is not applied if the priority of this rule is lower than the priority of the first rule. Priorities are added to a rule by putting a number between 0 and 9 at the end of the search string, for example "ING6 to N". The higher the number the higher is the priority. Priorities are especially important for the previously mentioned follow-up rules. Follow-up rules are searched beginning from the last string of the first search string. This is a bit complicated but I hope this example will make it more clear: CHS X CH G HAU--1 H SCH SH In this example "CHS' in the word "FUCHS" would be transformed to "X". If we take the word "DURCHSCHNITT" the things look a bit different. Here "CH" belongs together and "SCH" belongs together and both are spoken separately. The algorithm however first finds the string "CHS" which may not be transformed like in the previous word "FUCHS". At this point the algorithm can find a follow up rule. It takes the last character of the first matching rule ("CHS") which is "S" and looks for the next match, beginning from this character. What it finds is clear: It finds "SCH to SH", which has the same priority (no priority means standard priority, which is 5). If the priority is the same or higher the follow-up rule will be applied. Let's take a look at the word "SCHAUKEL". In this word "SCH" belongs together and may not be torn apart. After the algorithm has found "SCH to SH" it searches for a follow-up rule for "H"+"AUKEL". It finds "HAU--1 to H", but does not apply it because its priority is lower than the one of the first rule. You see that this is a very powerful feature but it also can easily lead to mistakes. If you really don't need this feature you can turn it off by putting the line followup 0 at the beginning of the phonetic table file. As mentioned, for rules containing a `-' no follow-up rules are searched but giving such rules a priority is not totally senseless because they self can be follow-up rules and in that case the priority makes sense again. Follow-up rules of follow-up rules are not searched because this is in fact not needed very often. The control character "^" says that the search string only matches at the beginning of words so that the rule "RH^to R" will only apply to words like "RHESUS" but not "PERHAPS". You can append another "^" to the search string. In that case the algorithm treats the rest of the word totally separately from first matched string in at beginning. This is useful for prefixes whose pronunciation does not depend on the rest of the word and vice versa like "OVER^^" in English for example. The same way as "^" works does "$" only apply on words that end with the search string. "GN$ to N" only matches on words like "SIGN" but not "SIGNUM". If you use "^" and "$" together, both of them must fit "ENOUGH^$ to NF" will only match the word "ENOUGH" and nothing else. Of course you can combine all of the mentioned control characters but they must occur in this order: < - priority ^ $. All characters must be written in CAPITAL letters. If absolutely no rule can be found -- might happen if you use strange characters for which you don't have any replacement rule -- the next character will simply be skipped and the search for replacement rules will continue with the rest of the word. If you want double letters to be reduced to one you must set up a rule like "LL- to L". If double letters in the resulting phonetic word should be allowed, you must place the line collapse_result 0 at the beginning of your transformation table file; otherwise set the value to `1'. The English rules for example strip all vowels from words and so the word "GOGO" would be transformed to "K" and not to "KK" (as desired) if collapse_result is set to 1. That's why the English rules have collapse_result set to 0. 7.3.2 How do I start finally? Before you start to write an array of transformation rules, you should be aware that you have to do some work to make sure that things you do will result in correct transformation rules. 7.3.2.1 Things that come in handy First of all you need to have a large word list of the language you want to make phonetics for. It should contain about as many words as the dictionary of the spell checker. If you don't have such a list, you will probably find an Ispell dictionary at http://fmg-www.cs.ucla.edu/geoff/ ispell-dictionaries.htmlwhich will help you. You can then make affix expansion via ispell -e and then pipe it trough \tr " " "\n" to put one word on each line. After that you eventually have to convert special characters like `é' from Ispell's internal representation to latin1 encoding. sed s/e'/é/g for example would replace all e' with é. The second is that you know how to use regular expressions and know how to use grep. You should for example know that grep ^[^aeiou]qu[io] wordlist | less will show you all words that begin with any character but a, e, i, o or u and then continue with `qui' or `quo'. This stuff is important for example to find out if a phonetic replacement rule you want to set up is valid for all words which match the expression you want to replace. Taking a look at the regex(7) man page is a good idea. 7.3.2.2 What the phonetic code should do Normal text comparison works well as long as the typer misspells a word because he pressed one key he didn't really want to press. In this cases mostly one character differs from the original word. In cases where the writer didn't know about the correct spelling of the word however the word may have several characters that differ from the original word but usually the word would still sound like the original word. Someone might think for example that `tough' is spelled `taff'. No spell checker without phonetic code will come to the idea that this might be `tough' but a spell checker who knows that `taff' would be pronounced like `tough' will make good suggestions to the user. Another example could be `funetik' and `phonetic'. From this examples you can see that the phonetic transformation should not be too fussy and too precise. If you implement a whole phonetic dictionary as you can find it in books this will not be very useful because then there could still be many characters differing from the misspelled and the desired word. What you should do if you implement the phonetic transformation table is to reduce the number of used letters to the only really necessary ones. Characters that sound similar should be reduced to one. In English language for example `Z' sounds like `S' and that's why the transformation rule "Z to S" is present in the replacement table. `PH' is spoken like `F' and so we have a "PH to F" rule. If you take a closer look you will even see that vowels sound very similar in English language: `contradiction', `cuntradiction', `cantradiction' or `centradiction' in fact sound nearly the same, don't they? Therefore the English phonetic replacement rules not only reduce all vowels to one but even remove them all (removing is done by just setting up no rule for those letters). The phonetic code of `contradiction' is `KNTRTKXN' and if you try to read this letter-monster loud you will hear that it still sound a bit like `contradiction'. You also see that `D' is transformed to `T' because they nearly sound the same. If you think you have found a regularity you should always take your word list and grep for the corresponding regular expression you want to make a transformation rule for. An example: If you come to the idea that all English words ending on `ough' sound like `AF' at the end because you think of `enough' and `tough'. If you then grep for the corresponding regular expression by "grep -i ough$ wordlist" you will see that the rule you wanted to set up is not correct because the rule doesn't fit to words like `although' or `bough'. So you have to define your rule more precisely or you have to set up exceptions if the number of words that differ from the desired rule is not so big. Don't forget about follow-up rules which can help in many cases but which also can lead to many confusions and side effects. It's also important to write exceptions in front of the more general rules ("GH" before "G" etc.). If you think you have set up a number of rules that may produce some good results try them out! If you run Aspell as "aspell --lang={your language} pipe" you get a prompt at which you can type in words. If you just type words Aspell checks them and eventually makes suggestions if they are misspelled. If you type in "$$Sw {word}" you will see the phonetic transformation and you can test out if your work does what you want. Another good way to control if changes you apply to your rules don't have any evil side effects is to create another list from your word list which contains not only the word of the word list but also the corresponding phonetic version of this word on the same line. If you do this one time before the change and one time after the change you can make a diff (see man diff) to see what really changed. To do this use the command "aspell --lang={your language} soundslike". In this mode aspell will output the the original word and then its soundslike separated by a tab character for each word you give it. If you are interested in seeing how the algorithm works you can download a set of useful programs from http:// members.xoom.com/maccy/spell/phonet-utils.tar.gz. This includes a program that produces a list as mentioned above and another program which illustrates how the algorithm works. It uses the same transformation table as Aspell and so it helps a lot during the process of creating a phonetic transformation table for Aspell. During your work you should write down your basic ideas so that other people are able to understand what you did (and you still know about it after a few weeks). The English table has a huge documentation appended for example. Now you can start experimenting with all the things you just read and perhaps set up a nice phonetic transformation table for your language to help Aspell to come up with the best correction suggestions ever seen also for your language. Take a look at the Aspell homepage to see if there is already a transformation table for your language. If there is one you might also take a look at it to see if it could be improved. If you think that this section helped you or if you think that this is just a waste of time you can send any feedback to bjoern.jacke@gmx.de. 7.4 Controlling the Behavior of Run-together Words Aspell has support for either unconditionally accepting run-together words or only accepting certain words in compound formation. Support for unconditionally accepting run-together words can either be turned on in the language data file or as a normal option via the run-together option. The run-together-limit options controls the maximum number of words that can be strung together, the default is normally 255. The run-together-min options controls the minimal length the individual components of the run together word can be, the default is normally 3. Both the run-together-limit and run-together-min option may be specified in both the language data file or as a normal. The run-together-mid option, which may only be specified in the language data file, may be used to specify up to three optional characters that may appear between individual words. In order for aspell to conditionally only accept certain words in compounds those words must be flagged when the compiled word list is being created. The format for each entry is {word}:C[1][2][3]{middle char} The 1, 2, and 3 control if the word is allowed to appear in the begging, middle, or end of the compound, respectfully. More than one position flag may be specified. If none of them are specified it as assumed that the word may appear anywhere. The C is optional if 1, 2, or 3 is specified. The {middle char} represents an optional character that may appear after the word in the formation of the compound if the word is not at the end of the compound. If the letter is lowercase than the character may appear after the word, if it is in uppercase then that letter must appear after the compound. Only one letter may be specified and it must also be in the list of middle letters specified via the run-together-mid option. The run-together-limit option may also be used to specify the maximum number of words to string together. For example the word list: beg:1 mid:2 end:3 any:C never must:CM maybe:Cm Means that the word "beg" may only appear at the begging of a word, the word "mid" at the middle, the word "end" at the end, and the word "any" any place. The word "never" is never accepted in a compound unless the run-together option is set. The word "must" may appear anywhere however it must be followed by an "m", while the word maybe may be followed by an "m". Given the above word list the following compounds or legal: begmidend begany mustmend maybeend maybemend are all legal, but the following are not: begmid mustend neverany Individual words such as "beg" are always accepted. When the run-together option is not set Aspell will only accept words that have been flagged in a run-together word. When the run-together option is set aspell will accept words which are as least as long as the value specified in the run-together-min option. If the words length is less than run-together-min then it will only accept the word if it has been flagged. When the run-together option is not set the run-together-min option is ignored all together. Currently Aspell only supports run-together words when checking if a word is in the dictionary. When coming up with suggestions Aspell treats the word as a normal word and does not do anything special. This means that the suggestions will be virtually meaningless when the actual word is a run-together. I plan on more intelligently supporting run-together words when coming up with suggestions in a future version of Aspell. 8. How Aspell Works The magic behind my spell checker comes from merging Lawrence Philips excellent metaphone algorithm and Ispell's near miss strategy which is inserting a space or hyphen, interchanging two adjacent letters, changing one letter, deleting a letter, or adding a letter. The process goes something like this. 1. Convert the misspelled word to its soundslike equivalent (its metaphone for English words). 2. Find all words that have a soundslike within one or two edit distances from the original words soundslike. The edit distance is the total number of deletions, insertions, exchanges, or adjacent swaps needed to make one string equivalent to the other. When set to only look for soundslikes within one edit distance it tries all possible soundslike combinations and check if each one is in the dictionary. When set to find all soundslike within two edit distance it scans through the entire dictionary and quickly scores each soundslike. The scoring is quick because it will give up if the two soundslikes are more than two edit distances apart. 3. Find misspelled words that have a correctly spelled replacement by the same criteria of step number 2 and 3. That is the misspelled word in the word pair (such as teh -> the) would appear in the suggestions list as if it was a correct spelling. 4. Score the result list and return the words with the lowest score. The score is roughly the weighed average of the weighed edit distance of the word to the misspelled word and the soundslike equivalent of the two words. The weighted edit distance is like the edit distance except that the various edits have weights attached to them. 5. Replace the misspelled words that have correctly spelled replacements with their replacements and remove any duplicates that might arise because of this. Please note that the soundslike equivalent is a rough approximation of how the words sounds. It is not the phoneme of the word by any means. For more details about exactly how each step is performed please see the file suggest.cc. For more information on the metaphone algorithm please see the data file english_phonet.dat. * Changes from 0.50.2 to 0.50.3 (Nov 23, 2002) * Changes from 0.50.1 to 0.50.2 (Sep 28, 2002) * Changes from 0.50 to 0.50.1 (Aug 28, 2002) * Changes from .33.7.1 to 0.50 (Aug 23, 2002) * Changes from .33.7 to .33.7.1 (Aug 20, 2001) * Changes from .33.6.3 to .33.7 (Aug 2, 2001) * Changes from .33.6.2 to .33.6.3 (June 3, 2001) * Changes from .33.6.1 to .33.6.2 (June 3, 2001) * Changes from .33.6 to .33.6.1 (May 29, 2001) * Changes from .33.5 to .33.6 (May 18, 2001) * Changes from .33 to .33.5 (April 5, 2001) * Changes from .32.6 to .33 (January 28, 2001) * Changes from .32.5 to .32.6 (Nov 8, 2000) * Changes from .32.1 to .32.5 (August 18, 2000) * Changes from .32 to .32.1 (August 5, 2000) * Changes from .31.1 to .32 (July 23, 2000) * Changes from .31 to .31.1 (June 18, 2000) * Changes from .30.1 to .31 (June 11, 2000) * Changes from .30 to .30.1 (April 29, 2000) * Changes from .29.1 to .30 (April 2, 2000) * Changes from .29 to .29.1 (Feb 18, 2000) * Changes from .28.3 to .29 (Feb 6, 2000) * Changes from .28.2.1 to .28.3 (Nov 20, 1999) * Changes from .28.2 to .28.2.1 (Aug 25, 1999) * Changes from .28.1 to .28.2 (Aug 25, 1999) * Changes from .28 to .28.1 (July 27, 1999) * Changes from .27.2 to .28 (July 25, 1999) * Changes from .27.1 to .27.2 (Mar 1, 1999) * Changes from .27 to .27.1 (Feb 24, 1999) * Changes from .26.2 to .27 (Feb 22, 1999) * Changed from .26.1 to .26.2 (Jan 3, 1998) * Changed from .26 to .26.1 (Jan 3, 1998) * Changed from .25.1 to .26 (Jan 3, 1999) * Changes from .25 to .25.1 (Dec 10, 1998) * Changes from .24 to .25 (Nov 23, 1998) * Changes from .23 to .24 (Nov 8, 1998) * Changes from .22.1 to .23 (Oct 31, 1998) * Changes from .22 to .22.1 (Oct 27, 1998) * Changes from .21 to .22 (Oct 26, 1998) * Changes from .20 to .21 (Oct 13, 1998) * Changes from .11 to .20 (Oct 10, 1998) * Changes from .10 to .11 (Sep 12, 1998) -------------------------------------------------------------------------- A. Changelog Changes from 0.50.2 to 0.50.3 (Nov 23, 2002) * Hopefully fixed the ispell alignment error problem when Aspell is used with ispell.el. * Fixed a problem with personal dictionaries on NFS mounted home directories. * Compiled libaspell-common directory into libaspell for now to avoid forcing applications to relink when ever a new Aspell version is out which was due to the use of the libtool '-release' flag. * Fixed Makefiles so that aspell can be built outside the source tree (i.e. with VPATH). * Updated the section on compiling with Win32. * Updated to Autoconf 2.56. Changes from 0.50.1 to 0.50.2 (Sep 28, 2002) * Fixed a number of bugs in ispell compatibility mode * Fixed a number of bugs with the handling of replacement pairs * Other miscellaneous bug fixes * Additional Win32 portability fixes * Added the Ukrainian KOI8-U charset. Changes from 0.50 to 0.50.1 (Aug 28, 2002) * A rather large number of portability fixes for non GNU/Linux platforms. * Fixed pkglibdir and pkgdatadir in configure. * Reintroduced some configure options from Aspell .33.7 included dict-dir, data-dir, curses, curses-include, win32-relocatable. * Fixed Aspell so it will now compile with -O3 when using gcc. * Updated note on Win32 support. * Other minor manual improvements. * Portability fixes in dictionary files * Official dictionary package for the Slovak language. Changes from .33.7.1 to 0.50 (Aug 23, 2002) * A complete overhaul of the source code which included merging Pspell into Aspell. * Changed the way dictionaries and languages are handled. * Added Dvorak keymap. * Added the ability to list the available dictionaries * Improved the spell checking interface a bit. * Added support for using the Ispell keymapping when checking files. * Complete rewrite of the filter interface. It should now be fairly easy to add new filters to Aspell. * Added some preliminary developer documentation. * Lots of other changes due to the massive overhaul of the source code. Changes from .33.7 to .33.7.1 (Aug 20, 2001) * Minor manual fixes. * Compile fix for Gcc 3.0 and Solaris. Changes from .33.6.3 to .33.7 (Aug 2, 2001) * Updates to Autoconf 2.50 and switched to the HEAD branch of libtools. * Fixed a bug which caused Aspell to crash when typo-analysis is not used such as when sug-mode is fast or bad spellers. * Added support for typo-analysis even when a soundslike is not used. * Fixed a bug which causes extended charters to display incorrectly on some platforms * Compile fixes so that it will compile with Gcc 3.0. * Compile fixed which should allow Aspell to compile with Egcs 1.1. I have not been able to actually test it though. Please let me know at kevina@users.sourceforge.net if you have have tried with Egcs 1.1. * Compile and configuration script fixes so that USE_FILE_INO will properly be defined and Aspell will compile correctly when it is defined. * More ANSI C++ compliance fixes. Changes from .33.6.2 to .33.6.3 (June 3, 2001) * Fixed a build problem in the manual/ directory by including manual-text and manual-html in the distribution. Changes from .33.6.1 to .33.6.2 (June 3, 2001) * Compile fix so that Aspell will work correctly when not installed in / usr/local. * Avoided regenerating the manual unless configured with enable-maintainer-mode. * Added the missing documentation files in the scowl directory. Changes from .33.6 to .33.6.1 (May 29, 2001) * Fixed a formating problem with the manual involving {. * Added a note about creating pwli files. * Removed the space after between the -L and the directory name in the pspell-module/ Makefile which caused problems on some platforms. * Added the configure option AM_MAINTAINER_MODE to avoid enabling rules which often causes generated build files to be rebuild with the wrong version of Libtool by default. I don't know why I didn't think to do this a long time ago. Changes from .33.5 to .33.6 (May 18, 2001) * Fixed a minor bug where some words would have random compound tags attached to them. * Fixed a compile problem on many platforms where fileno is defined as a macro. * Updated the description for a few of Aspell's options. * Removed the note of Aspell not being able to run when compiled with the upcoming Gcc 3.0 compiler as things seam to work now. * Added a note about Aspell not being able to compile with Egcs 1.1. * Added hack to deal with Libtool's interdependencies problem. See bug # 416981 for Pspell for more info. Changes from .33 to .33.5 (April 5, 2001) * "dump master" correctly detects which dictionary and language to use based on the LANG environmental variable. * Fixed a problem on Win32 which involves path names that begin with { Drive Letter}:. * Bug fixes and enhancements so that Aspell can once again run under MinGW. You can even use the new full screen interface if Aspell is compiled with PDCurses. * Some major modifications to make Aspell more C++ compliant in order to get Aspell to compile under the upcoming Gcc 3.0 compiler. This included only using STL features found in the standard version of C++. (Which means Aspell will no longer require using the SGI version of the STL) This should also make compiling C++ under non-gcc compilers a lot simpler. Please not that Aspell still has some problems with the upcoming Gcc 3.0 compiler (see section [*] for more info). * Minor changes to remove some -Wall warnings. * Added a hack to that Aspell will properly compile as a shared library under Solaris. * Added a few import missing words to the English word list. Changes from .32.6 to .33 (January 28, 2001) * Added a new new curses based interface to replace the dumb terminal interface everyone has been bitching about. * Added the ability to give higher priority to words such as "the" instead of "teh" which are likely to be due to typos. * Reorganized the manual so that it is hopefully easier to follow. * Ability to automatically select the best dictionary to used based on the setting of the LANG environmental variable. * Expanded the medium dictionary size to include more words which included the original words found in ispell and eliminated the large size for now. * Added three special variant add-on dictionaries. * Switched to the multi-language branch of the CVS version of libtool. Changes from .32.5 to .32.6 (Nov 8, 2000) * Fixed a bug that where Aspell will crash when reading in accented characters on some platforms. This fixes bug # 112435. * Fixed some other bugs so that it will run under Win32 under CygWin. Unfortunately it still won't run properly under Mingw. * Fixed the mmap test in configure so that it won't fail on some platforms that use munmap(char *, int) instead of munmap(void *, int). * Upgraded to the latest CVS version of libtool which fixed the problem with using GNU Make under Solaris. * Added an option to copy files instead of using symbolic links for the special "multi" dictionary files. Changes from .32.1 to .32.5 (August 18, 2000) * Changed my email from kevinatk at home com to kevina at users sourceforge net please make a note of the new email address. * Added an option to control if the personal replacement dictionary is saved when the save_all_wls method is called. * Brought back the ability to dump the master word list even in the case of the special "multi" lists. * Added a large number of hacker related words and some other slang terms to the medium size word list. * Added an "ispell" and "spell" compatibility script for systems which don't have ispell installed. They are located in the scripts/ directory and are not installed by default. * Manual fixes. * Added a note on not using GNU Make on Solaris. Changes from .32 to .32.1 (August 5, 2000) * Minor compile fixes for recent gcc snapshot. * Fixed naming of pwli files. * Fixed a bug when aspell will crash when used with certain single letter flags. This bug was most noticeable when used with Emacs. * Word list changes, see SCOWL Readme. * Other miscellaneous changes. Changes from .31.1 to .32 (July 23, 2000) * Added support for optionally doing without the soundslike data. * Greatly reduced the amount of memory used when creating word lists. * Added support for ignoring accents when coming up with suggestions. * Added support for local-data-dir which is searched before data-dir. * Added support for specifying which words may be used in compounds and where they may be used. * Added support for having more than one main word list as well as a special "multi" word list files which will allow multiple word lists to be treated as one. * Aspell now uses a completely new word list. * The apostrophe (') is no longer considered part of the word when it as at the end of the word such as in " dogs' ". Changes from .31 to .31.1 (June 18, 2000) * Fixed a bug where Aspell would not create a complete dictionary file on some platforms when the data is 8-bit. * Added a workaround so Aspell will work with ispell.el 3.3. * Minor compile fixes so it would compile better with the very latest gcc (CVS Version). * Removed note about compiling in Win32 as I was now able to get it to work. Changes from .30.1 to .31 (June 11, 2000) * Added support for spell checking run together words. * Added an option to produce a list of misspelled words from standard input. * More robust error reporting when reading in language data files. * Fixed a bug when that will cause Aspell to crash if the "special" line was not defined in the language data file. * Update Pspell Module. * Minor bug fixes. * Added cross references in "The Aspell utility Chapter" for easier use. Changes from .30 to .30.1 (April 29, 2000) * Ported Aspell to Win32 platforms. * Portability fixes which may help aspell compile on other platforms. * Aspell will no longer fail if for some reason the mmap fails, instead it will just read the file in as normal and free the memory when done. * Minor changes in the format of the main word list as a result of the changes, the old format should still work in most cases. * Fixed a bug when aspell was ignoring the extension of file names such as .html or .tex when checking files. * Fixed a bug when aspell will go into an infinite loop when creating the main word list from a word list which has duplicates in it. * Minor changes to the manual for better clarity. Changes from .29.1 to .30 (April 2, 2000) * Fixed many of the capitalization bugs found in previous versions of Aspell. * Changed the format of the main word list yet again. * Fixed a bug so that "aspell check" will work on the PowerPC. * Added ability to change configuration options in the middle of a session. * Added words from /usr/dict/words found on most Linux systems as well as a bunch of commonly used abbreviation to the word list. * Fixed a bug when aspell will dump core after reporting certain errors when compiled with gcc 2.95 or higher. This involved reworked the Exception heritage to get around a bug in gcc 2.95. * Added a few more commands to the list of default commands the TEX filter knows about. * Aspell will now check if a word only contains valid characters before adding it to any dictionaries. This might mean that you have to manually delete a few words from your personal word list. * Added option to ignore case when checking a document. * Adjusted the parameters of the "normal" suggest mode to so that significantly less far fetched results are returned in cases such as tomatoe, which went from 100 suggestions down to 32, at the expense of getting slightly lower results (less than 1%), * Improved the edit distance algorithm for slightly faster results. * Removed the $$m command in pipe mode, you should now use "$$cs mode,{ mode}" to set the mode and "$$cr mode" to find out the current mode. * Reworked parts of Aspell to use Pspell services to avoid duplicating code. * Added a module for the newly released Pspell. It will get installed with the rest of aspell. * Miscellaneous other bug fixes. Changes from .29 to .29.1 (Feb 18, 2000) * Improved the TEX filter so that it will accept '@' at the begging of a command name and ignore trailing '*'s. It also now has better defaults for which parameters to skip. * Reworked the main dictionary so that it can be memory mapped in. This decreases startup time and allows multiple aspell processes to use the same memory for the main word list. This also also made Aspell 64 bit clean so that it should work on an alpha now. * Fix so that aspell can compile on platforms that gnu as is not available for. * Fixed issue with flock so it would compile on FreeBSD. * Minor changes in the code to make it more C++ compliant although I am sure there will still be problems when using some other compiler other than gcc or egcs. * Added some comments to the header files to better document a few of the classes. Changes from .28.3 to .29 (Feb 6, 2000) * Fixed a bug in the pipe mode with lines that start with "^$$". * Added support for ignoring all words less than or equal to a specified length * New soundslike code based thanks to the contribution of Björn Jacke. It now gets all of its data from a table making it easier for other people to add soundslike code for their native language. He also converted the metaphone algorithm to table form, eliminating the need for the old metaphone code. * Major redesign of the suggestion code for better results. * Changed the format of the personal word lists. In most cases it should be converted automatically. * Changed the format of the main word list. * Name space cleanup for more consistent naming. I now use name spaces which means that gcc 2.8.* and egcs 1.0.* will no longer cut it. * Used file locks when reading and saving the personal dictionaries so that it truly multiprocess safe. * Added rudimentary filter support. * Reworked the configuration system once again. However, the changes to the end user who does not directly use my library should be minimal. * Rewrote my code that handles parsing command line parameters so that it no longer uses popt as it was causing to many problems and didn't integrate well with my new configuration system. * Fixed pipe mode so that it will properly ignore lines starting with '~' for better ispell compatibility. * Aspell now has a new home page at http://aspell.sourceforge.net/. Please make note of the new URL. * Miscellaneous manual fixes and clarifications. Changes from .28.2.1 to .28.3 (Nov 20, 1999) * Fixed a bug that caused aspell to crash when spell checking words over 60 characters long. * Reworked "aspell check" so that 1. You no longer have to hit enter when making a choice. 2. It will now overwrite the original file instead of creating a new file. An optional backup can be made by using the -b option. * Fixed a few bugs in data.cc. Changes from .28.2 to .28.2.1 (Aug 25, 1999) * Fixed the version number for the shared library. * Fixed a problem with undefined references when linking to the shared library. Changes from .28.1 to .28.2 (Aug 25, 1999) * Fixed a bunch of bugs in the language and configuration classes. * Minor changed in the code so that it could compile with the new gcc 2.95(.1). * Changed the output of "dump config" so that default values are given the value "". This means that the output can be used to created a configuration file. * Added notes on using aspell with VIM. Changes from .28 to .28.1 (July 27, 1999) * Removed some debug output * Changed notes on compiling with gcc 2.8.* as I managed to get it to compile on my school account * Avoided included stdexcept in const_string.hh so that I could get to compile on my schools account with gcc 2.8.1. Changes from .27.2 to .28 (July 25, 1999) * Provided an iterator for the replacement classes. * Added support for dumping and creating and merging the personal and replacement word lists. * Changed the aspell utility command line a bit, it now used popt. * Totally reworked aspell configuration system. Now aspell can get configuration from any of 5 sources: the command line, the environmental variable ASPELL_CONF, the personal configuration file, the global configuration file, and finally the compiled in defaults. * Totally reworked the language class in preparation for my new language code. See http://aspell.sourceforge.net/international/ for more information of what I have in store. * Added some options to the configure script: --enable-dict-dir=DIR, --enable-doc-dir=DIR, --enable-debug, and --enable-opt * Removed some old header files. * Reorganized the directory structure a bit * Made the text version of the manual pages slightly easier to read * Used the \url command for urls for better formating of the printed version. Changes from .27.1 to .27.2 (Mar 1, 1999) * Fixed a major bug that caused aspell to dump core when used without any arguments * Fixed another major bug that caused aspell to do nothing when used in interactive mode. * Added an option to exit in Aspell's interactive mode. * Removed some old documentation files from the distribution. * Minor changes on to the section on using Aspell with egcs. * Minor changes to remove -Wall warnings. Changes from .27 to .27.1 (Feb 24, 1999) * Fixed a minor compile problem. * Updated the section on using Aspell with egcs to it it more clear why the patch is necessary. Changes from .26.2 to .27 (Feb 22, 1999) * Totally reworked the C++ library which means you may need to change some things in your code. * Added support for detachable and multiple personal dictionaries in the C++ class library. * The C++ class library now throws exceptions. * Reworked aspell ability to learn from users misspellings a bit so that it now has a memory. See section 6.3 for more information. * Upgraded autoconf to version 2.13 and automake to version 1.4 for better portability. * Fixed the configuration so the "make dist" will work. For now on aspell will be distributed with "make dist". * Added support to skip over URL's, email addresses and host names. * Added support for dumping the master and personal word list. You can now also merge a personal word list. Type aspell -help for help on using this feature. * Reorganized the source code. * Started using proper version numbers for the shared library. * Fixed a bug that caused a spell to crash when adding certain replacement pairs. * Fixed the problem with duplicate lines when exiting pipe mode for good. Changed from .26.1 to .26.2 (Jan 3, 1998) * Fixed another compile problem. Hopefully this time it will really compile OK on other peoples machines. Changed from .26 to .26.1 (Jan 3, 1998) * Fixed a small compile problem in as_data.cc. Changed from .25.1 to .26 (Jan 3, 1999) * Fixed a bug that causes duplicates items to be displayed in the suggestion list for good. (If it still does it please send be email.) * Added the ability for aspell to learn form the users misspellings. * Library Interface changes. Still more to come.... * Is now multiprocess safe. When a personal dictionary (or replacement list) is saved it will now first update he list against the dictionary on disk in case another process modified it. * Fixed the bug that caused duplicate output when used non interactively in pipe mode. * Dropped support for gcc 2.7.2 as the C++ compiler. * Updated the How Aspell Works (8) Chapter. * Added support for the ASPELL_DATA_DIR environmental variable. Changes from .25 to .25.1 (Dec 10, 1998) * Fixed the version number so that Aspell reports the correct version number. * Changed the note on gcc 2.7.2 compilers to make it clear that only the C++ compiler can not be gcc 2.7.2, it is ok if the C compiler is gcc 2.7.2. * Updated the TODO list and reorganized it a bit. * Fixed the directory so that all the documentation will get installed in ${prefix}/doc/aspell instead of half of it in ${prefix}/doc/aspell and half of it in ${prefix}/doc/kspell. Changes from .24 to .25 (Nov 23, 1998) * Total rework of how the main word list is stored. Start up time decreased to about 1/3 of what it was in .24 and memory usage decreased to about 2/3. (When used with the provided word list on a Linux system). Also the format and default locations of the main word list data files changed in the process and the data is now machine dependent. The personal word list format, however, stayed the same. * Changed the scoring method to produce slightly better results with words like the vs. teh. And other simpler misspellings where two letters are swapped. * Fixed the very unpredictable behavior of the '*', '&', '@' commands in the pipe mode. * Added documentations for Aspell pipe mode (also known as ispell -a compatibility mode) * Added a bunch of Aspell specific extensions to the pipe mode and documented them. * Documented the to_soundslike and soundslike methods for the aspell class. * Changed the scoring method to produce better results for words like "fone" vs "phone" and other words that have a spelling that doesn't directly relate to how the word sounds by using the phoneme equivalent of the word in the scoring of it. * Added the to_phoneme and have_phoneme methods to the SC_Language class. * Added the to_phoneme method to the aspell class. * Added the framework for being able to learn from the users misspelling. Right now it just keep a log of replacements. * Re did stl_rope-30.diff. For some reason the version of patch on my system refused it. * Rewrite of the "Using as a replacement for Ispell" section and added the run-with-aspell utility as a replacement of the old method of mapping Ispell to Aspell. * Fixed a bug that caused duplicate words to appear in the suggestion list. Changes from .23 to .24 (Nov 8, 1998) * Fixed my code so that it can once again compile with g++ 2.7.2. * Rewrote the How It Works chapter. * Rewrote the Requirement section and added noted on compiling with g++ 2.7.2. * Added a To Do chapter. * Added a Glossary and References chapter. * Other minor documentation improvements. * Internal code documentation improvements. Changes from .22.1 to .23 (Oct 31, 1998) * Minor documentation fixes. * Changed the scoring strategy for words with 3 or less letters. This cut the number of words returned for these roughly in half. * Expanded the word list to also include american.0 and american.1 from the Ispell distribution. It now includes english.0, english.1, american.0 and american.1 from the directory languages/english provided with Ispell 3.1.20. * Added a link to the location of the latest Ispell.el in the documentation. * Started a C interface and added some rough documentation for it. Changes from .22 to .22.1 (Oct 27, 1998) * Minor bug fixes. I was deleting arrays with delete rather than delete []. I was suppressed that this had not created a problem. * Added a simple test program to test for a memory leak present on some systems. (Only debian slink at the moment.) See the file memleak-test.cc for more info. Changes from .21 to .22 (Oct 26, 1998) * Major redesign or the scoring method. It now uses absolute distances rather than relative scores for more consistent results. See suggest.cc for more info. * Suggest code rewritten is several places however the core process stayed the same. * The suggest_ultra method temporally does nothing. It should be working again by the next release. Changes from .20 to .21 (Oct 13, 1998) * Added documentation for aspell::Error * Changed the library name from libspell to libaspell. It should never have been libspell in the first place. Sorry for the incompatibility. * Added as_error.hh to the list of files copied to the include directory so that you can actually use the library outside of the source dir. * Fixed bug that caused a segmentation fault with words where the only suggestions was inserting a space or hyphen such as in "ledgerline". * Added the score method to aspell. * Changed the scoring method to deal with word when the user uses "f" in place of "ph" a lot better. Changes from .11 to .20 (Oct 10, 1998) * Name change. Everything that was Kspell is now Aspell. Sorry, the name Kspell was already used by KDE and I didn't want to cause any confusion. * Fixed a bug that causes a segmentation fault when the HOME environmental variable doesn't exist. Changes from .10 to .11 (Sep 12, 1998) * Overhaul of the SC_Language class * Added documentation for international support * Added documentation for the C++ library * Other minor bug fixes. * B.1 Things that need to be done * B.2 Good ideas that are worth implementing * B.3 Notes and Status of various items + B.3.1 General region skipping + B.3.2 Word skipping by context + B.3.3 Hidden Markov Model + B.3.4 Email the Personal Dictionary + B.3.5 Words With Spaces in Them -------------------------------------------------------------------------- B. To Do Words in bold indicate how you should refer to the item when discussing it with me or others. B.1 Things that need to be done These items need to be done before I consider Aspell finished. Unfortunately, I do not have the time to do all so if you are interested in helping me with one of these tasks please email me. Good C++ skills are needed for most of these tasks involving coding. * Convert manual from LyX/LATEX to Texinfo. * Clean up copyright notices and bring the Aspell package up to GNU Standards. * Add gettext support to Aspell. * Allow Aspell to check documents which are in UTF-8. The main thing to be done is to get Aspell to display UTF-8 correctly. This will involve being able to get the length of UTF-8 strings and display UTF-8 using the curses library. * Make Aspell Thread safe. Even though Aspell itself is not multi-threaded I would like it to be thread safe so that it can be used by multi-threaded programs. There are several areas of Aspell that that are potently thread unsafe (such as accessing a global pool) and several several classes which have the potential of being used by more than one thread (such as the personal dictionary). * Allow filters to be loaded at run-time. At the moment all filters must be complied in. * Integrate Kevin Hendricks affix compression code into Aspell. His code is already in use in OpenOffice as part of the lingucomponent component. More information can be found at http:// whiteboard.openoffice.org/lingucomponent/dictionary.html. The latest version of his code is also available there. * Create a C++ interface for Aspell, possibly on top of the C one. * Enhance ispell.el so that it will work better with the new Aspell. B.2 Good ideas that are worth implementing These items all sound like good ideas however I am not sure when I will get to implementing then if ever. * Use Lawrence Philips' new Double Metaphone algorithm. See http:// aspell.sourceforge.net/metaphone/. * Come up with a plug-in for gEdit the gnome text editor. * Change languages (and thus dictionaries) based on the information in the actual document. * Come up with a nroff mode for spell checking. * Come up with a mode that will skip words based on the symbols that (almost) always surround the word. (Word skipping by context) * Create two server modes for Aspell. One that uses the DICT protocol and one that uses ispell -a method of communication of some arbitrary port. * Come up with thread safe personal dictionaries. * Use the Hidden Markov Model to base the suggestions on not only the word itself but on the context around the word. * Having a way to email the personal dictionary and/or replacement list to a particular address either periodical or when it grows to a certain size. * Be able to accept words with spaces in them as many languages have words such as as a word in a foreign phrases which only make sense when followed by other words. The following good ideas where found in the ispell WISHES file so I thought I would pass them on. * Ispell should be smart enough to ignore hyphenation signs, such as the TEX \- hyphenation indicator. * (Jeff Edmonds) The personal dictionary should be able to remove certain words from the master dictionary, so that obscure words like "wether" wouldn't mask favorite typos. * (Jeff Edmonds) It would be wonderful if ispell could correct inserted spaces such as "th e" for "the" or even "can not" for "cannot". * Since ispell has dictionaries available to it, it is conceivable that it could automatically determine the language of a particular file by choosing the dictionary that produced the fewest spelling errors on the first few lines. B.3 Notes and Status of various items B.3.1 General region skipping I want to implement this give other people an idea of how it should be done and because I am really sick of having to spell check through url and email address. B.3.2 Word skipping by context This was posted on the Aspell mailing list on January 1, 1999: I had an idea on a great general way to determine if a word should be skipped. Determine the words to skip based on the symbols that (almost) always surround the word. For example when asked to check the following C++ code: cout { "My age is: " { num { endl; cout { "Next year I will be " { num + 1 { endl; cout, num, and endl will all be skipped. "cout" will be skipped because it is always preceded by a {. "num" will be skipped because it is always preceded by a {. And "endl" will be skipped because it is always between a { and a ;. Given the following html code.
OneTwoThree
123
table, width cellspacing, cellpadding, tr, td will all be skipped because they are always enclosed in "<>". Now of course table and width would be marked as correct anyway however there is no harm in skipping them. So I was wondering if anyone on this list has any experience in writing this sort of context recognition code or could give me some pointers in the right direction. This sort of word skipping will be very powerful if done right. I imagine that it could replace specific spell checker modes for Tex, Nroff, SGML etc because it will automatically be able to figure out where it should skip words. It could also probably do a very good job on programming languages code. If you are interested in helping be out with this or just have general comments about the idea please let me know. B.3.3 Hidden Markov Model Knud Haugaard Sørensen suggested this one. From his email on the Aspell mailing list: consider this examples. a fone number. -> a phone number. a fone dress. -> a fine dress. the example illustrates that the right correction might depend on the context of the word. So I suggest that you take a look on HMM to solve this problem. This might also provide a good base to include grammar correction in aspell. see this link http://www.cse.ogi.edu/CSLU/HLTsurvey/ch1node7.html I think it is a great idea. However unfortunately it will probably be very complicated to implement. Perhaps in the far future. B.3.4 Email the Personal Dictionary Some one suggest in a personal email: Have you thought of adding a function to aspell, that - when the personal dictionary has grown significantly - sends the user's personal dictionary to the maintainer of the corresponding aspell dictionary? (if the user allows it) It would be a very useful service to the dictionary maintainers, and I think most users can see their benefit in it too. And I replied: Yes I have considered something like that but not for the personal dictionaries but rather the replacement word list in order to get better test data for http://aspell.sourceforge.net/test/. The problem is I don't know of a good way to do this sense Aspell can also be used as a library. It also is not a real high priority, especially sense I would first need to learn how to send email within a C++ program. B.3.5 Words With Spaces in Them While this is something I would like to do it is not a simple task. The basic problem is that when tokenizing a string there is no good way to keep phrases together. So the solution is to some how add special conditions to certain words which will dictate which words can come before /after it. Then there is also a problem of how to come up with intelligent suggestions. What further complicates things is that many applications send words to Aspell a word at a time. So even if Aspell did support such a thing many applications that would use Aspell will not. So, in order for this to work applications will need to send text to Aspell a document or at least a sentence at a time. Unfortunately the framework for doing this is not there yet. It will be once I finish the filter interface. Another possible is to provide call back functions in which Aspell will be able to request the previous or next word on request. Yet again the framework for doing this is not there. Perhaps sometime in the near future. C. Credits * To the FSF for adopting the project. * To the many authors of Ispell (including R. E. Gorin, Pace Willisson, and Geoff Kuenning) for providing me with a good word list as well as giving me a few good ideas. * Alan Beale for going well beyond the call of duty with helping me create a better word list. * Lawrence Philips for coming up with the original metaphone algorithm and Michael Kuhn for writing C code for the algorithm. * Björn Jacke for coming up with a generic soundslike algorithm which gets all of its data from a file, thus eliminating almost all need for language specific code from aspell. * To the authors of SGI STL version 3.0 and up for proving a great set of generic container classes with cut the development time of this program in half at least. * To the LyX development team for giving me a great tool for the development of this manual. D. Glossary and References affix in grammar, a word element that, when added to a word, modifies its meaning or function; prefix, infix, or suffix. Debian A 100% Open Source Linux distribution http://www.debian.org DICT protocol A TCP transaction based query/response protocol that allows a client to access dictionary definitions from a set of natural language dictionary databases. http://www.dict.org/ Gnome A project to build a complete, user-friendly desktop based entirely on free software. http://www.gnome.org/ GTK+ A library for creating graphical user interfaces for the X Window System. It is designed to be small, efficient, and flexible. http:// www.gtk.org/ GUI Graphics User Interface Ispell.el Emacs interface for ispell. http://www.kdstevens.com/~stevens/ ispell-page.html Ispell An international spell checker which is just about the only decent Open Source spell checker out there. (except Aspell of course). http:/ /fmg-www.cs.ucla.edu/geoff/ispell.html Ispell -a An Ispell mode that is designed to be used by other applications though a pipe. KDE A powerful graphical desktop environment for Unix workstations. http:/ /www.kde.org Linux A Open Source version of Unix which runs on many platforms. http:// www.linux.org/ LyX An What You See is What You Mean document editor for the X environment, http://www.lyx.org Open Source Software where the source code is available for anyone to extend or modify. http://www.opensource.org/ Red Hat A commercial Linux distribution. http://www.redhat.com RPM Red Hat's packing format also used by other Linux distributions. STL Standard Temple Library. A C++ library of container classes, algorithms, and iterators. SGI STL Silicon Graphics implantation of the STL. http://www.sgi.com/ Technology/STL/ STLPort A port of SGI STL designed to run on compilers that don't support all the latest features of C++. http://corp.metabyte.com/~fbp/stl/ * E.1 GNU Free Documentation License + Preamble + E.1.1 Applicability and Definitions + E.1.2 Verbatim Copying + E.1.3 Copying in Quantity + E.1.4 Modifications + E.1.5 Combining Documents + E.1.6 Collections of Documents + E.1.7 Aggregation With Independent Works + E.1.8 Translation + E.1.9 Termination + E.1.10 Future Revisions of This License + ADDENDUM: How to use this License for your documents * E.2 GNU Lesser Public License -------------------------------------------------------------------------- E. Copyright This manual is Copyright (c) 2002 Kevin Atkinson. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts. and no Back-Cover Texts. A copy of the license is included in the section . The library and utility program is Copyrighted (c) 2000-2002 by Kevin Atkinson. You can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as (LGPL) published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version. Certain parts of the library, as indicated at the top of the source file, are under a weaker license. However, all parts of the library or LGPL Compatible. The two licenses follow: E.1 GNU Free Documentation License Version 1.1, March 2000 Copyright © 2000 Free Software Foundation, Inc. 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Preamble The purpose of this License is to make a manual, textbook, or other written document "free" in the sense of freedom: to assure everyone the effective freedom to copy and redistribute it, with or without modifying it, either commercially or noncommercially. Secondarily, this License preserves for the author and publisher a way to get credit for their work, while not being considered responsible for modifications made by others. This License is a kind of "copyleft", which means that derivative works of the document must themselves be free in the same sense. It complements the GNU General Public License, which is a copyleft license designed for free software. We have designed this License in order to use it for manuals for free software, because free software needs free documentation: a free program should come with manuals providing the same freedoms that the software does. But this License is not limited to software manuals; it can be used for any textual work, regardless of subject matter or whether it is published as a printed book. We recommend this License principally for works whose purpose is instruction or reference. E.1.1 Applicability and Definitions This License applies to any manual or other work that contains a notice placed by the copyright holder saying it can be distributed under the terms of this License. The "Document", below, refers to any such manual or work. Any member of the public is a licensee, and is addressed as "you". A "Modified Version" of the Document means any work containing the Document or a portion of it, either copied verbatim, or with modifications and/or translated into another language. A "Secondary Section" is a named appendix or a front-matter section of the Document that deals exclusively with the relationship of the publishers or authors of the Document to the Document's overall subject (or to related matters) and contains nothing that could fall directly within that overall subject. (For example, if the Document is in part a textbook of mathematics, a Secondary Section may not explain any mathematics.) The relationship could be a matter of historical connection with the subject or with related matters, or of legal, commercial, philosophical, ethical or political position regarding them. The "Invariant Sections" are certain Secondary Sections whose titles are designated, as being those of Invariant Sections, in the notice that says that the Document is released under this License. The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts or Back-Cover Texts, in the notice that says that the Document is released under this License. A "Transparent" copy of the Document means a machine-readable copy, represented in a format whose specification is available to the general public, whose contents can be viewed and edited directly and straightforwardly with generic text editors or (for images composed of pixels) generic paint programs or (for drawings) some widely available drawing editor, and that is suitable for input to text formatters or for automatic translation to a variety of formats suitable for input to text formatters. A copy made in an otherwise Transparent file format whose markup has been designed to thwart or discourage subsequent modification by readers is not Transparent. A copy that is not "Transparent" is called "Opaque". Examples of suitable formats for Transparent copies include plain ASCII without markup, Texinfo input format, LATEX input format, SGML or XML using a publicly available DTD, and standard-conforming simple HTML designed for human modification. Opaque formats include PostScript, PDF, proprietary formats that can be read and edited only by proprietary word processors, SGML or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML produced by some word processors for output purposes only. The "Title Page" means, for a printed book, the title page itself, plus such following pages as are needed to hold, legibly, the material this License requires to appear in the title page. For works in formats which do not have any title page as such, "Title Page" means the text near the most prominent appearance of the work's title, preceding the beginning of the body of the text. E.1.2 Verbatim Copying You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to those of this License. You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute. However, you may accept compensation in exchange for copies. If you distribute a large enough number of copies you must also follow the conditions in section 3. You may also lend copies, under the same conditions stated above, and you may publicly display copies. E.1.3 Copying in Quantity If you publish printed copies of the Document numbering more than 100, and the Document's license notice requires Cover Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on the back cover. Both covers must also clearly and legibly identify you as the publisher of these copies. The front cover must present the full title with all words of the title equally prominent and visible. You may add other material on the covers in addition. Copying with changes limited to the covers, as long as they preserve the title of the Document and satisfy these conditions, can be treated as verbatim copying in other respects. If the required texts for either cover are too voluminous to fit legibly, you should put the first ones listed (as many as fit reasonably) on the actual cover, and continue the rest onto adjacent pages. If you publish or distribute Opaque copies of the Document numbering more than 100, you must either include a machine-readable Transparent copy along with each Opaque copy, or state in or with each Opaque copy a publicly-accessible computer-network location containing a complete Transparent copy of the Document, free of added material, which the general network-using public has access to download anonymously at no charge using public-standard network protocols. If you use the latter option, you must take reasonably prudent steps, when you begin distribution of Opaque copies in quantity, to ensure that this Transparent copy will remain thus accessible at the stated location until at least one year after the last time you distribute an Opaque copy (directly or through your agents or retailers) of that edition to the public. It is requested, but not required, that you contact the authors of the Document well before redistributing any large number of copies, to give them a chance to provide you with an updated version of the Document. E.1.4 Modifications You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version: * Use in the Title Page (and on the covers, if any) a title distinct from that of the Document, and from those of previous versions (which should, if there were any, be listed in the History section of the Document). You may use the same title as a previous version if the original publisher of that version gives permission. * List on the Title Page, as authors, one or more persons or entities responsible for authorship of the modifications in the Modified Version, together with at least five of the principal authors of the Document (all of its principal authors, if it has less than five). * State on the Title page the name of the publisher of the Modified Version, as the publisher. * Preserve all the copyright notices of the Document. * Add an appropriate copyright notice for your modifications adjacent to the other copyright notices. * Include, immediately after the copyright notices, a license notice giving the public permission to use the Modified Version under the terms of this License, in the form shown in the Addendum below. * Preserve in that license notice the full lists of Invariant Sections and required Cover Texts given in the Document's license notice. * Include an unaltered copy of this License. * Preserve the section entitled "History", and its title, and add to it an item stating at least the title, year, new authors, and publisher of the Modified Version as given on the Title Page. If there is no section entitled "History" in the Document, create one stating the title, year, authors, and publisher of the Document as given on its Title Page, then add an item describing the Modified Version as stated in the previous sentence. * Preserve the network location, if any, given in the Document for public access to a Transparent copy of the Document, and likewise the network locations given in the Document for previous versions it was based on. These may be placed in the "History" section. You may omit a network location for a work that was published at least four years before the Document itself, or if the original publisher of the version it refers to gives permission. * In any section entitled "Acknowledgments" or "Dedications", preserve the section's title, and preserve in the section all the substance and tone of each of the contributor acknowledgments and/or dedications given therein. * Preserve all the Invariant Sections of the Document, unaltered in their text and in their titles. Section numbers or the equivalent are not considered part of the section titles. * Delete any section entitled "Endorsements". Such a section may not be included in the Modified Version. * Do not retitle any existing section as "Endorsements" or to conflict in title with any Invariant Section. If the Modified Version includes new front-matter sections or appendices that qualify as Secondary Sections and contain no material copied from the Document, you may at your option designate some or all of these sections as invariant. To do this, add their titles to the list of Invariant Sections in the Modified Version's license notice. These titles must be distinct from any other section titles. You may add a section entitled "Endorsements", provided it contains nothing but endorsements of your Modified Version by various parties - for example, statements of peer review or that the text has been approved by an organization as the authoritative definition of a standard. You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover Texts in the Modified Version. Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or through arrangements made by) any one entity. If the Document already includes a cover text for the same cover, previously added by you or by arrangement made by the same entity you are acting on behalf of, you may not add another; but you may replace the old one, on explicit permission from the previous publisher that added the old one. The author(s) and publisher(s) of the Document do not by this License give permission to use their names for publicity for or to assert or imply endorsement of any Modified Version. E.1.5 Combining Documents You may combine the Document with other documents released under this License, under the terms defined in section 4 above for modified versions, provided that you include in the combination all of the Invariant Sections of all of the original documents, unmodified, and list them all as Invariant Sections of your combined work in its license notice. The combined work need only contain one copy of this License, and multiple identical Invariant Sections may be replaced with a single copy. If there are multiple Invariant Sections with the same name but different contents, make the title of each such section unique by adding at the end of it, in parentheses, the name of the original author or publisher of that section if known, or else a unique number. Make the same adjustment to the section titles in the list of Invariant Sections in the license notice of the combined work. In the combination, you must combine any sections entitled "History" in the various original documents, forming one section entitled "History"; likewise combine any sections entitled "Acknowledgments", and any sections entitled "Dedications". You must delete all sections entitled "Endorsements." E.1.6 Collections of Documents You may make a collection consisting of the Document and other documents released under this License, and replace the individual copies of this License in the various documents with a single copy that is included in the collection, provided that you follow the rules of this License for verbatim copying of each of the documents in all other respects. You may extract a single document from such a collection, and distribute it individually under this License, provided you insert a copy of this License into the extracted document, and follow this License in all other respects regarding verbatim copying of that document. E.1.7 Aggregation With Independent Works A compilation of the Document or its derivatives with other separate and independent documents or works, in or on a volume of a storage or distribution medium, does not as a whole count as a Modified Version of the Document, provided no compilation copyright is claimed for the compilation. Such a compilation is called an "aggregate", and this License does not apply to the other self-contained works thus compiled with the Document, on account of their being thus compiled, if they are not themselves derivative works of the Document. If the Cover Text requirement of section 3 is applicable to these copies of the Document, then if the Document is less than one quarter of the entire aggregate, the Document's Cover Texts may be placed on covers that surround only the Document within the aggregate. Otherwise they must appear on covers around the whole aggregate. E.1.8 Translation Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in addition to the original versions of these Invariant Sections. You may include a translation of this License provided that you also include the original English version of this License. In case of a disagreement between the translation and the original English version of this License, the original English version will prevail. E.1.9 Termination You may not copy, modify, sublicense, or distribute the Document except as expressly provided for under this License. Any other attempt to copy, modify, sublicense or distribute the Document is void, and will automatically terminate your rights under this License. However, parties who have received copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance. E.1.10 Future Revisions of This License The Free Software Foundation may publish new, revised versions of the GNU Free Documentation License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. See http://www.gnu.org/copyleft/. Each version of the License is given a distinguishing version number. If the Document specifies that a particular numbered version of this License "or any later version" applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any version ever published (not as a draft) by the Free Software Foundation. ADDENDUM: How to use this License for your documents To use this License in a document you have written, include a copy of the License in the document and put the following copyright and license notices just after the title page: Copyright © YEAR YOUR NAME. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with the Invariant Sections being LIST THEIR TITLES, with the Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST. A copy of the license is included in the section entitled "GNU Free Documentation License". If you have no Invariant Sections, write "with no Invariant Sections" instead of saying which ones are invariant. If you have no Front-Cover Texts, write "no Front-Cover Texts" instead of "Front-Cover Texts being LIST"; likewise for Back-Cover Texts. If your document contains nontrivial examples of program code, we recommend releasing these examples in parallel under your choice of free software license, such as the GNU General Public License, to permit their use in free software. E.2 GNU Lesser Public License GNU LESSER GENERAL PUBLIC LICENSE Version 2.1, February 1999 Copyright (C) 1991, 1999 Free Software Foundation, Inc. 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. [This is the first released version of the Lesser GPL. It also counts as the successor of the GNU Library Public License, version 2, hence the version number 2.1.] Preamble The licenses for most software are designed to take away your freedom to share and change it. By contrast, the GNU General Public Licenses are intended to guarantee your freedom to share and change free software--to make sure the software is free for all its users. This license, the Lesser General Public License, applies to some specially designated software packages--typically libraries--of the Free Software Foundation and other authors who decide to use it. You can use it too, but we suggest you first think carefully about whether this license or the ordinary General Public License is the better strategy to use in any particular case, based on the explanations below. When we speak of free software, we are referring to freedom of use, not price. Our General Public Licenses are designed to make sure that you have the freedom to distribute copies of free software (and charge for this service if you wish); that you receive source code or can get it if you want it; that you can change the software and use pieces of it in new free programs; and that you are informed that you can do these things. To protect your rights, we need to make restrictions that forbid distributors to deny you these rights or to ask you to surrender these rights. These restrictions translate to certain responsibilities for you if you distribute copies of the library or if you modify it. For example, if you distribute copies of the library, whether gratis or for a fee, you must give the recipients all the rights that we gave you. You must make sure that they, too, receive or can get the source code. If you link other code with the library, you must provide complete object files to the recipients, so that they can relink them with the library after making changes to the library and recompiling it. And you must show them these terms so they know their rights. We protect your rights with a two-step method: (1) we copyright the library, and (2) we offer you this license, which gives you legal permission to copy, distribute and/or modify the library. To protect each distributor, we want to make it very clear that there is no warranty for the free library. Also, if the library is modified by someone else and passed on, the recipients should know that what they have is not the original version, so that the original author's reputation will not be affected by problems that might be introduced by others. Finally, software patents pose a constant threat to the existence of any free program. We wish to make sure that a company cannot effectively restrict the users of a free program by obtaining a restrictive license from a patent holder. Therefore, we insist that any patent license obtained for a version of the library must be consistent with the full freedom of use specified in this license. Most GNU software, including some libraries, is covered by the ordinary GNU General Public License. This license, the GNU Lesser General Public License, applies to certain designated libraries, and is quite different from the ordinary General Public License. We use this license for certain libraries in order to permit linking those libraries into non-free programs. When a program is linked with a library, whether statically or using a shared library, the combination of the two is legally speaking a combined work, a derivative of the original library. The ordinary General Public License therefore permits such linking only if the entire combination fits its criteria of freedom. The Lesser General Public License permits more lax criteria for linking other code with the library. We call this license the "Lesser" General Public License because it does Less to protect the user's freedom than the ordinary General Public License. It also provides other free software developers Less of an advantage over competing non-free programs. These disadvantages are the reason we use the ordinary General Public License for many libraries. However, the Lesser license provides advantages in certain special circumstances. For example, on rare occasions, there may be a special need to encourage the widest possible use of a certain library, so that it becomes a de-facto standard. To achieve this, non-free programs must be allowed to use the library. A more frequent case is that a free library does the same job as widely used non-free libraries. In this case, there is little to gain by limiting the free library to free software only, so we use the Lesser General Public License. In other cases, permission to use a particular library in non-free programs enables a greater number of people to use a large body of free software. For example, permission to use the GNU C Library in non-free programs enables many more people to use the whole GNU operating system, as well as its variant, the GNU/Linux operating system. Although the Lesser General Public License is Less protective of the users' freedom, it does ensure that the user of a program that is linked with the Library has the freedom and the wherewithal to run that program using a modified version of the Library. The precise terms and conditions for copying, distribution and modification follow. Pay close attention to the difference between a "work based on the library" and a "work that uses the library". The former contains code derived from the library, whereas the latter must be combined with the library in order to run. GNU LESSER GENERAL PUBLIC LICENSE TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION 0. This License Agreement applies to any software library or other program which contains a notice placed by the copyright holder or other authorized party saying it may be distributed under the terms of this Lesser General Public License (also called "this License"). Each licensee is addressed as "you". A "library" means a collection of software functions and/or data prepared so as to be conveniently linked with application programs (which use some of those functions and data) to form executables. The "Library", below, refers to any such software library or work which has been distributed under these terms. A "work based on the Library" means either the Library or any derivative work under copyright law: that is to say, a work containing the Library or a portion of it, either verbatim or with modifications and/or translated straightforwardly into another language. (Hereinafter, translation is included without limitation in the term "modification".) "Source code" for a work means the preferred form of the work for making modifications to it. For a library, complete source code means all the source code for all modules it contains, plus any associated interface definition files, plus the scripts used to control compilation and installation of the library. Activities other than copying, distribution and modification are not covered by this License; they are outside its scope. The act of running a program using the Library is not restricted, and output from such a program is covered only if its contents constitute a work based on the Library (independent of the use of the Library in a tool for writing it). Whether that is true depends on what the Library does and what the program that uses the Library does. 1. You may copy and distribute verbatim copies of the Library's complete source code as you receive it, in any medium, provided that you conspicuously and appropriately publish on each copy an appropriate copyright notice and disclaimer of warranty; keep intact all the notices that refer to this License and to the absence of any warranty; and distribute a copy of this License along with the Library. You may charge a fee for the physical act of transferring a copy, and you may at your option offer warranty protection in exchange for a fee. 2. You may modify your copy or copies of the Library or any portion of it, thus forming a work based on the Library, and copy and distribute such modifications or work under the terms of Section 1 above, provided that you also meet all of these conditions: a) The modified work must itself be a software library. b) You must cause the files modified to carry prominent notices stating that you changed the files and the date of any change. c) You must cause the whole of the work to be licensed at no charge to all third parties under the terms of this License. d) If a facility in the modified Library refers to a function or a table of data to be supplied by an application program that uses the facility, other than as an argument passed when the facility is invoked, then you must make a good faith effort to ensure that, in the event an application does not supply such function or table, the facility still operates, and performs whatever part of its purpose remains meaningful. (For example, a function in a library to compute square roots has a purpose that is entirely well-defined independent of the application. Therefore, Subsection 2d requires that any application-supplied function or table used by this function must be optional: if the application does not supply it, the square root function must still compute square roots.) These requirements apply to the modified work as a whole. If identifiable sections of that work are not derived from the Library, and can be reasonably considered independent and separate works in themselves, then this License, and its terms, do not apply to those sections when you distribute them as separate works. But when you distribute the same sections as part of a whole which is a work based on the Library, the distribution of the whole must be on the terms of this License, whose permissions for other licensees extend to the entire whole, and thus to each and every part regardless of who wrote it. Thus, it is not the intent of this section to claim rights or contest your rights to work written entirely by you; rather, the intent is to exercise the right to control the distribution of derivative or collective works based on the Library. In addition, mere aggregation of another work not based on the Library with the Library (or with a work based on the Library) on a volume of a storage or distribution medium does not bring the other work under the scope of this License. 3. You may opt to apply the terms of the ordinary GNU General Public License instead of this License to a given copy of the Library. To do this, you must alter all the notices that refer to this License, so that they refer to the ordinary GNU General Public License, version 2, instead of to this License. (If a newer version than version 2 of the ordinary GNU General Public License has appeared, then you can specify that version instead if you wish.) Do not make any other change in these notices. Once this change is made in a given copy, it is irreversible for that copy, so the ordinary GNU General Public License applies to all subsequent copies and derivative works made from that copy. This option is useful when you wish to copy part of the code of the Library into a program that is not a library. 4. You may copy and distribute the Library (or a portion or derivative of it, under Section 2) in object code or executable form under the terms of Sections 1 and 2 above provided that you accompany it with the complete corresponding machine-readable source code, which must be distributed under the terms of Sections 1 and 2 above on a medium customarily used for software interchange. If distribution of object code is made by offering access to copy from a designated place, then offering equivalent access to copy the source code from the same place satisfies the requirement to distribute the source code, even though third parties are not compelled to copy the source along with the object code. 5. A program that contains no derivative of any portion of the Library, but is designed to work with the Library by being compiled or linked with it, is called a "work that uses the Library". Such a work, in isolation, is not a derivative work of the Library, and therefore falls outside the scope of this License. However, linking a "work that uses the Library" with the Library creates an executable that is a derivative of the Library (because it contains portions of the Library), rather than a "work that uses the library". The executable is therefore covered by this License. Section 6 states terms for distribution of such executables. When a "work that uses the Library" uses material from a header file that is part of the Library, the object code for the work may be a derivative work of the Library even though the source code is not. Whether this is true is especially significant if the work can be linked without the Library, or if the work is itself a library. The threshold for this to be true is not precisely defined by law. If such an object file uses only numerical parameters, data structure layouts and accessors, and small macros and small inline functions (ten lines or less in length), then the use of the object file is unrestricted, regardless of whether it is legally a derivative work. (Executables containing this object code plus portions of the Library will still fall under Section 6.) Otherwise, if the work is a derivative of the Library, you may distribute the object code for the work under the terms of Section 6. Any executables containing that work also fall under Section 6, whether or not they are linked directly with the Library itself. 6. As an exception to the Sections above, you may also combine or link a "work that uses the Library" with the Library to produce a work containing portions of the Library, and distribute that work under terms of your choice, provided that the terms permit modification of the work for the customer's own use and reverse engineering for debugging such modifications. You must give prominent notice with each copy of the work that the Library is used in it and that the Library and its use are covered by this License. You must supply a copy of this License. If the work during execution displays copyright notices, you must include the copyright notice for the Library among them, as well as a reference directing the user to the copy of this License. Also, you must do one of these things: a) Accompany the work with the complete corresponding machine-readable source code for the Library including whatever changes were used in the work (which must be distributed under Sections 1 and 2 above); and, if the work is an executable linked with the Library, with the complete machine-readable "work that uses the Library", as object code and/or source code, so that the user can modify the Library and then relink to produce a modified executable containing the modified Library. (It is understood that the user who changes the contents of definitions files in the Library will not necessarily be able to recompile the application to use the modified definitions.) b) Use a suitable shared library mechanism for linking with the Library. A suitable mechanism is one that (1) uses at run time a copy of the library already present on the user's computer system, rather than copying library functions into the executable, and (2) will operate properly with a modified version of the library, if the user installs one, as long as the modified version is interface-compatible with the version that the work was made with. c) Accompany the work with a written offer, valid for at least three years, to give the same user the materials specified in Subsection 6a, above, for a charge no more than the cost of performing this distribution. d) If distribution of the work is made by offering access to copy from a designated place, offer equivalent access to copy the above specified materials from the same place. e) Verify that the user has already received a copy of these materials or that you have already sent this user a copy. For an executable, the required form of the "work that uses the Library" must include any data and utility programs needed for reproducing the executable from it. However, as a special exception, the materials to be distributed need not include anything that is normally distributed (in either source or binary form) with the major components (compiler, kernel, and so on) of the operating system on which the executable runs, unless that component itself accompanies the executable. It may happen that this requirement contradicts the license restrictions of other proprietary libraries that do not normally accompany the operating system. Such a contradiction means you cannot use both them and the Library together in an executable that you distribute. 7. You may place library facilities that are a work based on the Library side-by-side in a single library together with other library facilities not covered by this License, and distribute such a combined library, provided that the separate distribution of the work based on the Library and of the other library facilities is otherwise permitted, and provided that you do these two things: a) Accompany the combined library with a copy of the same work based on the Library, uncombined with any other library facilities. This must be distributed under the terms of the Sections above. b) Give prominent notice with the combined library of the fact that part of it is a work based on the Library, and explaining where to find the accompanying uncombined form of the same work. 8. You may not copy, modify, sublicense, link with, or distribute the Library except as expressly provided under this License. Any attempt otherwise to copy, modify, sublicense, link with, or distribute the Library is void, and will automatically terminate your rights under this License. However, parties who have received copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance. 9. You are not required to accept this License, since you have not signed it. However, nothing else grants you permission to modify or distribute the Library or its derivative works. These actions are prohibited by law if you do not accept this License. Therefore, by modifying or distributing the Library (or any work based on the Library), you indicate your acceptance of this License to do so, and all its terms and conditions for copying, distributing or modifying the Library or works based on it. 10. Each time you redistribute the Library (or any work based on the Library), the recipient automatically receives a license from the original licensor to copy, distribute, link with or modify the Library subject to these terms and conditions. You may not impose any further restrictions on the recipients' exercise of the rights granted herein. You are not responsible for enforcing compliance by third parties with this License. 11. If, as a consequence of a court judgment or allegation of patent infringement or for any other reason (not limited to patent issues), conditions are imposed on you (whether by court order, agreement or otherwise) that contradict the conditions of this License, they do not excuse you from the conditions of this License. If you cannot distribute so as to satisfy simultaneously your obligations under this License and any other pertinent obligations, then as a consequence you may not distribute the Library at all. For example, if a patent license would not permit royalty-free redistribution of the Library by all those who receive copies directly or indirectly through you, then the only way you could satisfy both it and this License would be to refrain entirely from distribution of the Library. If any portion of this section is held invalid or unenforceable under any particular circumstance, the balance of the section is intended to apply, and the section as a whole is intended to apply in other circumstances. It is not the purpose of this section to induce you to infringe any patents or other property right claims or to contest validity of any such claims; this section has the sole purpose of protecting the integrity of the free software distribution system which is implemented by public license practices. Many people have made generous contributions to the wide range of software distributed through that system in reliance on consistent application of that system; it is up to the author/donor to decide if he or she is willing to distribute software through any other system and a licensee cannot impose that choice. This section is intended to make thoroughly clear what is believed to be a consequence of the rest of this License. 12. If the distribution and/or use of the Library is restricted in certain countries either by patents or by copyrighted interfaces, the original copyright holder who places the Library under this License may add an explicit geographical distribution limitation excluding those countries, so that distribution is permitted only in or among countries not thus excluded. In such case, this License incorporates the limitation as if written in the body of this License. 13. The Free Software Foundation may publish revised and/or new versions of the Lesser General Public License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. Each version is given a distinguishing version number. If the Library specifies a version number of this License which applies to it and "any later version", you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Library does not specify a license version number, you may choose any version ever published by the Free Software Foundation. 14. If you wish to incorporate parts of the Library into other free programs whose distribution conditions are incompatible with these, write to the author to ask for permission. For software which is copyrighted by the Free Software Foundation, write to the Free Software Foundation; we sometimes make exceptions for this. Our decision will be guided by the two goals of preserving the free status of all derivatives of our free software and of promoting the sharing and reuse of software generally. NO WARRANTY 15. BECAUSE THE LIBRARY IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY FOR THE LIBRARY, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES PROVIDE THE LIBRARY "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE LIBRARY IS WITH YOU. SHOULD THE LIBRARY PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 16. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR REDISTRIBUTE THE LIBRARY AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE LIBRARY (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE LIBRARY TO OPERATE WITH ANY OTHER SOFTWARE), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. END OF TERMS AND CONDITIONS How to Apply These Terms to Your New Libraries If you develop a new library, and you want it to be of the greatest possible use to the public, we recommend making it free software that everyone can redistribute and change. You can do so by permitting redistribution under these terms (or, alternatively, under the terms of the ordinary General Public License). To apply these terms, attach the following notices to the library. It is safest to attach them to the start of each source file to most effectively convey the exclusion of warranty; and each file should have at least the "copyright" line and a pointer to where the full notice is found. Copyright (C) This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version. This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details. You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA Also add information on how to contact you by electronic and paper mail. You should also get your employer (if you work as a programmer) or your school, if any, to sign a "copyright disclaimer" for the library, if necessary. Here is a sample; alter the names: Yoyodyne, Inc., hereby disclaims all copyright interest in the library `Frob' (a library for tweaking knobs) written by James Random Hacker. , 1 April 1990 Ty Coon, President of Vice That's all there is to it! -------------------------------------------------------------------------- About this document ... GNU Aspell 0.50.3 This document was generated using the LaTeX2HTML translator Version 99.2beta8 (1.43) Copyright © 1993, 1994, 1995, 1996, Nikos Drakos, Computer Based Learning Unit, University of Leeds. Copyright © 1997, 1998, 1999, Ross Moore, Mathematics Department, Macquarie University, Sydney. The command line arguments were: latex2html -split 3 -toc_depth 3 -dir man-html -long_titles 2 -local_icons -show_section_numbers manual.tex The translation was initiated by Thorsten Maerz on 2002-12-18 --------------------------------------------------------------------------