(recode.info)Top


`recode'
********

   This recoding library converts files between various coded character
sets and surface encodings.  When this cannot be achieved exactly, it
may get rid of the offending characters or fall back on approximations.
The library recognises or produces more than 300 different character
sets and is able to convert files between almost any pair.  Most
RFC 1345 character sets, and all `libiconv' character sets, are
supported.  The `recode' program is a handy front-end to the library.

   The current `recode' release is 3.6.

Tutorial: Quick Tutorial
Introduction: Terminology and purpose
Invoking recode: How to use this program
Library: A recoding library
Universal: The universal charset
libiconv: The `iconv' library
Tabular: Tabular sources (RFC 1345)
ASCII misc: ASCII and some derivatives
IBM and MS: Some IBM or Microsoft charsets
CDC: Charsets for CDC machines
Micros: Other micro-computer charsets
Miscellaneous: Various other charsets
Surfaces: All about surfaces
Internals: Internal aspects
Concept Index: Concept Index
Option Index: Option Index
Library Index: Library Index
Charset and Surface Index: Charset and Surface Index

--- The Detailed Node Listing --- Terminology and purpose

Charset overview: Overview of charsets
Surface overview: Overview of surfaces
Contributing: Contributions and bug reports

How to use this program

Synopsis: Synopsis of `recode' call
Requests: The REQUEST parameter
Listings: Asking for various lists
Recoding: Controlling how files are recoded
Reversibility: Reversibility issues
Sequencing: Selecting sequencing methods
Mixed: Using mixed charset input
Emacs: Using `recode' within Emacs
Debugging: Debugging considerations

A recoding library

Outer level: Outer level functions
Request level: Request level functions
Task level: Task level functions
Charset level: Charset level functions
Errors: Handling errors

The universal charset

UCS-2: Universal Character Set, 2 bytes
UCS-4: Universal Character Set, 4 bytes
UTF-7: Universal Transformation Format, 7 bits
UTF-8: Universal Transformation Format, 8 bits
UTF-16: Universal Transformation Format, 16 bits
count-characters: Frequency count of characters
dump-with-names: Fully interpreted UCS dump

ASCII and some derivatives

ASCII: Usual ASCII
ISO 8859: ASCII extended by Latin Alphabets
ASCII-BS: ASCII 7-bits, BS to overstrike
flat: ASCII without diacritics nor underline

Some IBM or Microsoft charsets

EBCDIC: EBCDIC codes
IBM-PC: IBM's PC code
Icon-QNX: Unisys' Icon code

Charsets for CDC machines

Display Code: Control Data's Display Code
CDC-NOS: ASCII 6/12 from NOS
Bang-Bang: ASCII ``bang bang''

Other micro-computer charsets

Apple-Mac: Apple's Macintosh code
AtariST: Atari ST code

Various other charsets

HTML: World Wide Web representations
LaTeX: LaTeX macro calls
Texinfo: GNU project documentation files
Vietnamese: Vietnamese charsets
African: African charsets
Others: Cyrillic and other charsets
Texte: Easy French conventions
Mule: Mule as a multiplexed charset

All about surfaces

Permutations: Permuting groups of bytes
End lines: Representation for end of lines
MIME: MIME contents encodings
Dump: Interpreted character dumps
Test: Artificial data for testing

Internal aspects

Main flow: Overall organisation
New charsets: Adding new charsets
New surfaces: Adding new surfaces
Design: Comments on the library design

automatically generated by info2www version 1.2.2.9