X-Git-Url: http://git.shadowcat.co.uk/gitweb/gitweb.cgi?p=catagits%2FCatalyst-Runtime.git;a=blobdiff_plain;f=lib%2FCatalyst%2FUTF8.pod;h=044a573ec5ab6232aeb2e52ac520d0f0ab051fd8;hp=0f20099d18ae26b32e2856e98f79cd17781c80be;hb=d47640840f28d5fd03cd9606946f1b35cde805c6;hpb=80ba671f681862620befead95d299c8dfc91cbaf diff --git a/lib/Catalyst/UTF8.pod b/lib/Catalyst/UTF8.pod index 0f20099..044a573 100644 --- a/lib/Catalyst/UTF8.pod +++ b/lib/Catalyst/UTF8.pod @@ -205,7 +205,7 @@ precedence: C If true, then do not try to character decode any wide characters in your -request URL query or keywords. You will need gto handle this manually in your action code +request URL query or keywords. You will need to handle this manually in your action code (although if you choose this setting, chances are you already do this). C @@ -307,10 +307,16 @@ In this case we've created a POST request but each part specifies its own conten character set (and setting a content encoding would also be possible). Generally one would not run into this situation in a web browser context but for completeness sake Catalyst will notice if a multipart POST contains parts with complex or extended -header information and in those cases it will not attempt to apply decoding to the -form values. Instead the part will be represented as an instance of an object -L which will contain all the header information needed -for you to perform custom parser of the data. +header information. In these cases we will try to inspect the meta data and do the +right thing (in the above case we'd use SHIFT_JIS to decode, not UTF-8). However if +after inspecting the headers we cannot figure out how to decode the data, in those cases it +will not attempt to apply decoding to the form values. Instead the part will be represented as +an instance of an object L which will contain all the header +information needed for you to perform custom parser of the data. + +Ideally we'd fix L to be smarter about decoding so please submit your cases of +this so we can add intelligence to the parser and find a way to extract a valid value out +of it. =head1 UTF8 Encoding in Body Response @@ -490,7 +496,7 @@ L, L. The main difference this year is that previously calling ->write_fh would return the actual -L writer object that was supplied by your plack application handler, whereas now we wrap +L writer object that was supplied by your Plack application handler, whereas now we wrap that object in a lightweight decorator object that proxies the C and C methods and supplies an additional C method. C does the exact same thing as C except that it will first encode the string when necessary. In general if you are @@ -631,7 +637,7 @@ with a mix of content character sets. If you believe you have discovered a bug in UTF8 body encoding, I strongly encourage you to report it (and not try to hack a workaround in your local code). We also recommend that you regard such a workaround as a temporary solution. It is ideal if L extension -authors can start to count on L doing the write thing for encoding. +authors can start to count on L doing the right thing for encoding. =head1 Conclusion