0001182: CX behavior wasn't changed appropriately with TC2

Notes
(0003914) geoffclare (manager) 2018-01-30 09:31	The statement "In the POSIX locale, btowc() shall not return WEOF if c has a value in the range 0 to 255 inclusive" is precisely what was intended by the changes made in TC2. If there is any inconsistency between this statement and other parts of the standard post-TC2, then it is those other parts that need to change. See in particular the interpretation rationale in 0000663: "The intention was always that the POSIX locale should have an 8-bit-clean single-byte encoding. The omission of an explicit statement to that effect was an oversight."

(0003925) shware_systems (reporter) 2018-02-21 02:23	Re: 3914 That makes things look cleaner for btowc(), but we established back in 2009 that the POSIX locale is required to function "dirty", as a multi-byte encoding, and more recently that this is due to factors that predate the standard and are outside its control. Bug 663 was approved because we hadn't yet established that those additional factors had to be considered binding. In other words, the standard can't change that much, so that rationale has to. As to this report, according to the Oct 2009 "multibyte C locale" email thread, you were the one who pointed out where XBD7 conflicts with XBD6 in trying to make that assertion of intent, and who listed most of the code points of the portable charset that are not permitted to have wchar_t encodings because they function as shift codes of one type or another. You were right; earlier in that thread I was guilty of focusing too narrowly on what XBD6 alone was saying and just muddling the debate. The code points that POSIX doesn't require to be assigned to specific functions have to be treated the same way, imo now (and then, but I forgot), because encodings like UTF-8 or 8859-1 do make use of some of them in that manner. TC2 saying the intent was that char, CHAR_MIN and CHAR_MAX be unsigned for the POSIX locale is fine; however, this does not mean all values between MIN and MAX magically become valid for btowc() to convert successfully. All it means is that the interface shouldn't reject 128 because an implementation wants to say char is the same as signed char as an extension. This is different from what that change requires of implementations.

Issue History
Date Modified	Username	Field	Change
2018-01-30 00:01	shware_systems	New Issue
2018-01-30 00:01	shware_systems	Name	=> Mark Ziegast
2018-01-30 00:01	shware_systems	Organization	=> SHware Systems Dev.
2018-01-30 00:01	shware_systems	Section	=> btowc()
2018-01-30 00:01	shware_systems	Page Number	=> C165 632
2018-01-30 00:01	shware_systems	Line Number	=> 21871-2
2018-01-30 09:31	geoffclare	Note Added: 0003914
2018-02-21 02:23	shware_systems	Note Added: 0003925
2019-02-21 16:01	nick	Relationship added	related to 0000663
2019-02-21 16:15	geoffclare	Interp Status	=> ---
2019-02-21 16:15	geoffclare	Status	New => Closed
2019-02-21 16:15	geoffclare	Resolution	Open => Rejected
