(recode.info)Top
`recode'
********
This recoding library converts files between various coded character
sets and surface encodings. When this cannot be achieved exactly, it
may get rid of the offending characters or fall back on approximations.
The library recognises or produces more than 300 different character
sets and is able to convert files between almost any pair. Most
RFC 1345 character sets, and all `libiconv' character sets, are
supported. The `recode' program is a handy front-end to the library.
The current `recode' release is 3.6.
- Tutorial
- Quick Tutorial
- Introduction
- Terminology and purpose
- Invoking recode
- How to use this program
- Library
- A recoding library
- Universal
- The universal charset
- libiconv
- The `iconv' library
- Tabular
- Tabular sources (RFC 1345)
- ASCII misc
- ASCII and some derivatives
- IBM and MS
- Some IBM or Microsoft charsets
- CDC
- Charsets for CDC machines
- Micros
- Other micro-computer charsets
- Miscellaneous
- Various other charsets
- Surfaces
- All about surfaces
- Internals
- Internal aspects
- Concept Index
- Concept Index
- Option Index
- Option Index
- Library Index
- Library Index
- Charset and Surface Index
- Charset and Surface Index
--- The Detailed Node Listing ---
Terminology and purpose
- Charset overview
- Overview of charsets
- Surface overview
- Overview of surfaces
- Contributing
- Contributions and bug reports
How to use this program
- Synopsis
- Synopsis of `recode' call
- Requests
- The REQUEST parameter
- Listings
- Asking for various lists
- Recoding
- Controlling how files are recoded
- Reversibility
- Reversibility issues
- Sequencing
- Selecting sequencing methods
- Mixed
- Using mixed charset input
- Emacs
- Using `recode' within Emacs
- Debugging
- Debugging considerations
A recoding library
- Outer level
- Outer level functions
- Request level
- Request level functions
- Task level
- Task level functions
- Charset level
- Charset level functions
- Errors
- Handling errors
The universal charset
- UCS-2
- Universal Character Set, 2 bytes
- UCS-4
- Universal Character Set, 4 bytes
- UTF-7
- Universal Transformation Format, 7 bits
- UTF-8
- Universal Transformation Format, 8 bits
- UTF-16
- Universal Transformation Format, 16 bits
- count-characters
- Frequency count of characters
- dump-with-names
- Fully interpreted UCS dump
ASCII and some derivatives
- ASCII
- Usual ASCII
- ISO 8859
- ASCII extended by Latin Alphabets
- ASCII-BS
- ASCII 7-bits, BS to overstrike
- flat
- ASCII without diacritics nor underline
Some IBM or Microsoft charsets
- EBCDIC
- EBCDIC codes
- IBM-PC
- IBM's PC code
- Icon-QNX
- Unisys' Icon code
Charsets for CDC machines
- Display Code
- Control Data's Display Code
- CDC-NOS
- ASCII 6/12 from NOS
- Bang-Bang
- ASCII ``bang bang''
Other micro-computer charsets
- Apple-Mac
- Apple's Macintosh code
- AtariST
- Atari ST code
Various other charsets
- HTML
- World Wide Web representations
- LaTeX
- LaTeX macro calls
- Texinfo
- GNU project documentation files
- Vietnamese
- Vietnamese charsets
- African
- African charsets
- Others
- Cyrillic and other charsets
- Texte
- Easy French conventions
- Mule
- Mule as a multiplexed charset
All about surfaces
- Permutations
- Permuting groups of bytes
- End lines
- Representation for end of lines
- MIME
- MIME contents encodings
- Dump
- Interpreted character dumps
- Test
- Artificial data for testing
Internal aspects
- Main flow
- Overall organisation
- New charsets
- Adding new charsets
- New surfaces
- Adding new surfaces
- Design
- Comments on the library design
automatically generated by info2www version 1.2.2.9