stringprep_utf8_nfkc_normalize

Section: libidn (3)
Updated: 1.9
Index Return to Main Contents
 

NAME

stringprep_utf8_nfkc_normalize - normalize Unicode string  

SYNOPSIS

#include <stringprep.h>

char * stringprep_utf8_nfkc_normalize(const char * str, ssize_t len);  

ARGUMENTS

const char * str
a UTF-8 encoded string.
ssize_t len
length of str, in bytes, or -1 if str is nul-terminated.
 

DESCRIPTION

Converts a string into canonical form, standardizing such issues as whether a character with an accent is represented as a base character and combining accent or as a single precomposed character.

The normalization mode is NFKC (ALL COMPOSE). It standardizes differences that do not affect the text content, such as the above-mentioned accent representation. It standardizes the "compatibility" characters in Unicode, such as SUPERSCRIPT THREE to the standard forms (in this case DIGIT THREE). Formatting information may be lost but for most text operations such characters should be considered the same. It returns a result with composed forms rather than a maximally decomposed form.  

RETURN VALUE

a newly allocated string, that is the NFKC normalized form of str.  

REPORTING BUGS

Report bugs to <bug-libidn@gnu.org>.  

COPYRIGHT

Copyright © 2002, 2003, 2004, 2005, 2006, 2007, 2008 Simon Josefsson.
Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies.  

SEE ALSO

The full documentation for libidn is maintained as a Texinfo manual. If the info and libidn programs are properly installed at your site, the command
info libidn

should give you access to the complete manual.