diff options
author | Ulrich Drepper <drepper@redhat.com> | 1999-01-15 16:24:11 +0000 |
---|---|---|
committer | Ulrich Drepper <drepper@redhat.com> | 1999-01-15 16:24:11 +0000 |
commit | 37a87f834a459f6136ff44c7887b013d1ae0f6be (patch) | |
tree | b9a8be38cd639638c196f42b278371a9b952aff4 /manual | |
parent | d0ab77802ac748c07d7302aa728ad063ce98309f (diff) | |
download | glibc-37a87f834a459f6136ff44c7887b013d1ae0f6be.tar glibc-37a87f834a459f6136ff44c7887b013d1ae0f6be.tar.gz glibc-37a87f834a459f6136ff44c7887b013d1ae0f6be.tar.bz2 glibc-37a87f834a459f6136ff44c7887b013d1ae0f6be.zip |
Update.
Improve mbsinit example.
Diffstat (limited to 'manual')
-rw-r--r-- | manual/charset.texi | 17 |
1 files changed, 15 insertions, 2 deletions
diff --git a/manual/charset.texi b/manual/charset.texi index 1242cc06ac..268cce1a15 100644 --- a/manual/charset.texi +++ b/manual/charset.texi @@ -71,7 +71,7 @@ As shown in some other part of this manual, there exists a completely new family of functions which can handle texts of this kind in memory. The most commonly used character set for such internal wide character representations are Unicode and @w{ISO 10646}. -The former is a subset of the later and used when wide characters are +The former is a subset of the latter and used when wide characters are chosen to by 2 bytes (@math{= 16} bits) wide. The standard names of the @cindex UCS2 @cindex UCS4 @@ -501,6 +501,8 @@ is declared in @file{wchar.h}. Code using this function often looks similar to this: +@c Fix the example to explicitly say how to generate the escape sequence +@c to restore the initial state. @smallexample @{ mbstate_t state; @@ -510,12 +512,23 @@ Code using this function often looks similar to this: if (! mbsinit (&state)) @{ /* @r{Emit code to return to initial state.} */ - fputs ("@r{whatever needed}", fp); + const char empty[] = ""; + const char **srcp = ∅ + wcsrtombs (outbuf, &srcp, outbuflen, &state); @} ... @} @end smallexample +The code to emit the escape sequence to get back to the initial state is +interesting. The @code{wcsrtombs} function can be used to determine the +necessary output code (@pxref{Converting Strings}). Please note that on +GNU systems it is not necessary to perform this extra action for the +conversion from multibyte text ot wide character text since the wide +character encoding is not stateful. But there is nothing mentioned in +any standard which prohibits making @code{wchar_t} using a stateful +encoding. + @node Converting a Character @subsection Converting Single Characters |