(recode.info)Top


Next: Tutorial Prev: (dir) Up: (dir)
Enter node , (file) or (file)node

`recode'
********

   This recoding library converts files between various coded character
sets and surface encodings.  When this cannot be achieved exactly, it
may get rid of the offending characters or fall back on approximations.
The library recognises or produces more than 300 different character
sets and is able to convert files between almost any pair.  Most
RFC 1345 character sets, and all `libiconv' character sets, are
supported.  The `recode' program is a handy front-end to the library.

   The current `recode' release is 3.6.

Tutorial
Quick Tutorial
Introduction
Terminology and purpose
Invoking recode
How to use this program
Library
A recoding library
Universal
The universal charset
libiconv
The `iconv' library
Tabular
Tabular sources (RFC 1345)
ASCII misc
ASCII and some derivatives
IBM and MS
Some IBM or Microsoft charsets
CDC
Charsets for CDC machines
Micros
Other micro-computer charsets
Miscellaneous
Various other charsets
Surfaces
All about surfaces
Internals
Internal aspects
Concept Index
Concept Index
Option Index
Option Index
Library Index
Library Index
Charset and Surface Index
Charset and Surface Index
 --- The Detailed Node Listing ---

Terminology and purpose

Charset overview
Overview of charsets
Surface overview
Overview of surfaces
Contributing
Contributions and bug reports
How to use this program

Synopsis
Synopsis of `recode' call
Requests
The REQUEST parameter
Listings
Asking for various lists
Recoding
Controlling how files are recoded
Reversibility
Reversibility issues
Sequencing
Selecting sequencing methods
Mixed
Using mixed charset input
Emacs
Using `recode' within Emacs
Debugging
Debugging considerations
A recoding library

Outer level
Outer level functions
Request level
Request level functions
Task level
Task level functions
Charset level
Charset level functions
Errors
Handling errors
The universal charset

UCS-2
Universal Character Set, 2 bytes
UCS-4
Universal Character Set, 4 bytes
UTF-7
Universal Transformation Format, 7 bits
UTF-8
Universal Transformation Format, 8 bits
UTF-16
Universal Transformation Format, 16 bits
count-characters
Frequency count of characters
dump-with-names
Fully interpreted UCS dump
ASCII and some derivatives

ASCII
Usual ASCII
ISO 8859
ASCII extended by Latin Alphabets
ASCII-BS
ASCII 7-bits, BS to overstrike
flat
ASCII without diacritics nor underline
Some IBM or Microsoft charsets

EBCDIC
EBCDIC codes
IBM-PC
IBM's PC code
Icon-QNX
Unisys' Icon code
Charsets for CDC machines

Display Code
Control Data's Display Code
CDC-NOS
ASCII 6/12 from NOS
Bang-Bang
ASCII ``bang bang''
Other micro-computer charsets

Apple-Mac
Apple's Macintosh code
AtariST
Atari ST code
Various other charsets

HTML
World Wide Web representations
LaTeX
LaTeX macro calls
Texinfo
GNU project documentation files
Vietnamese
Vietnamese charsets
African
African charsets
Others
Cyrillic and other charsets
Texte
Easy French conventions
Mule
Mule as a multiplexed charset
All about surfaces

Permutations
Permuting groups of bytes
End lines
Representation for end of lines
MIME
MIME contents encodings
Dump
Interpreted character dumps
Test
Artificial data for testing
Internal aspects

Main flow
Overall organisation
New charsets
Adding new charsets
New surfaces
Adding new surfaces
Design
Comments on the library design

automatically generated by info2www version 1.2.2.9