From: Jarkko Hietaniemi Date: Mon, 7 Jul 2003 09:56:48 +0000 (+0000) Subject: Be more explicit than just 'characters'. X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?a=commitdiff_plain;h=1d714267e012b8a8fa8797e9541b2a022c7d4f12;p=p5sagit%2Fp5-mst-13.2.git Be more explicit than just 'characters'. p4raw-id: //depot/perl@20051 --- diff --git a/pod/perlfunc.pod b/pod/perlfunc.pod index 41c44b0..25bf159 100644 --- a/pod/perlfunc.pod +++ b/pod/perlfunc.pod @@ -3822,7 +3822,9 @@ Note the I: depending on the status of the filehandle, either (8-bit) bytes or characters are read. By default all filehandles operate on bytes, but for example if the filehandle has been opened with the C<:utf8> I/O layer (see L, and the C -pragma, L), the I/O will operate on characters, not bytes. +pragma, L), the I/O will operate on UTF-8 encoded Unicode +characters, not bytes. Similarly for the C<:encoding> pragma: +in that case pretty much any characters can be read. =item readdir DIRHANDLE @@ -3909,7 +3911,9 @@ Note the I: depending on the status of the socket, either (8-bit) bytes or characters are received. By default all sockets operate on bytes, but for example if the socket has been changed using binmode() to operate with the C<:utf8> I/O layer (see the C -pragma, L), the I/O will operate on characters, not bytes. +pragma, L), the I/O will operate on UTF-8 encoded Unicode +characters, not bytes. Similarly for the C<:encoding> pragma: +in that case pretty much any characters can be read. =item redo LABEL @@ -4437,9 +4441,10 @@ L for examples. Note the I: depending on the status of the socket, either (8-bit) bytes or characters are sent. By default all sockets operate on bytes, but for example if the socket has been changed using -binmode() to operate with the C<:utf8> I/O layer (see L, or -the C pragma, L), the I/O will operate on characters, not -bytes. +binmode() to operate with the C<:utf8> I/O layer (see L, or the +C pragma, L), the I/O will operate on UTF-8 encoded +Unicode characters, not bytes. Similarly for the C<:encoding> pragma: +in that case pretty much any characters can be sent. =item setpgrp PID,PGRP @@ -5621,7 +5626,7 @@ for a return value for 0 to decide whether you're done. Note that if the filehandle has been marked as C<:utf8> Unicode characters are read instead of bytes (the LENGTH, OFFSET, and the -return value of sysread() are in Unicode characters). +return value of sysread() are in UTF-8 encoded Unicode characters). The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. See L, L, and the C pragma, L. @@ -5734,9 +5739,9 @@ string other than the beginning. A negative OFFSET specifies writing that many characters counting backwards from the end of the string. In the case the SCALAR is empty you can use OFFSET but only zero offset. -Note that if the filehandle has been marked as C<:utf8>, -Unicode characters are written instead of bytes (the LENGTH, OFFSET, -and the return value of syswrite() are in Unicode characters). +Note that if the filehandle has been marked as C<:utf8>, Unicode +characters are written instead of bytes (the LENGTH, OFFSET, and the +return value of syswrite() are in UTF-8 encoded Unicode characters). The C<:encoding(...)> layer implicitly introduces the C<:utf8> layer. See L, L, and the C pragma, L.