2149 Commits

Author SHA1 Message Date
Rich Felker
c2d3fd3aad add workaround header for wait.h remapping to sys/wait.h 2013-09-01 01:01:31 -04:00
Rich Felker
316d6741b6 fix missing return value warning in faccessat, minor cleanup
clone will pass the return value of the start function to SYS_exit
anyway; there's no need to call the syscall directly.
2013-08-31 23:15:48 -04:00
Rich Felker
f0328a5656 fix invalid %m format crash in wide scanf variants
the wide variant was missed in the previous commit.
2013-08-31 22:52:41 -04:00
Rich Felker
cbb8a69790 avoid crash in scanf when invalid %m format is encountered
invalid format strings invoke undefined behavior, so this is not a
conformance issue, but it's nicer for scanf to report the error safely
instead of calling free on a potentially-uninitialized pointer or a
pointer to memory belonging to the caller.
2013-08-31 22:47:44 -04:00
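
For contrast with the invalid case, a minimal sketch of valid %m usage, assuming a scanf that implements the POSIX.1-2008 allocation modifier. The helper name read_word is hypothetical. The pointer starts out null, so nothing stale can ever be freed.

```c
#define _POSIX_C_SOURCE 200809L
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper: returns a malloc'd copy of the first
 * whitespace-delimited word in input, or NULL if none matched. */
char *read_word(const char *input)
{
    char *word = NULL;              /* never left uninitialized */

    if (sscanf(input, "%ms", &word) != 1)
        return NULL;                /* no conversion, nothing to free */
    return word;                    /* caller must free() */
}
```

A caller would use it as `char *w = read_word("hello world");` and release the result with `free(w)`.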
Rich Felker
35e8621a28 remove incorrect cancellation points from realpath 2013-08-31 16:01:01 -04:00
Rich Felker
dfddd43256 debloat realpath's allocation strategy
rather than allocating a PATH_MAX-sized buffer when the caller does
not provide an output buffer, work first with a PATH_MAX-sized temp
buffer with automatic storage, and either copy it to the caller's
buffer or strdup it on success. this not only avoids massive memory
waste, but also avoids pulling in free (and thus the full malloc
implementation) unnecessarily in static programs.
2013-08-31 15:50:23 -04:00
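
The allocation strategy described above can be sketched roughly as follows. This is not the musl source: realpath_sketch and resolve_into are hypothetical names, and resolve_into merely copies its input in place of real path resolution.

```c
#define _POSIX_C_SOURCE 200809L
#include <errno.h>
#include <limits.h>
#include <string.h>

/* Hypothetical stand-in for the real resolution step: it only copies
 * the input into tmp, failing if it does not fit. */
static int resolve_into(const char *path, char tmp[PATH_MAX])
{
    size_t len = strlen(path);

    if (len >= PATH_MAX) {
        errno = ENAMETOOLONG;
        return -1;
    }
    memcpy(tmp, path, len + 1);
    return 0;
}

char *realpath_sketch(const char *path, char *resolved)
{
    char tmp[PATH_MAX];             /* automatic storage, no malloc yet */

    if (resolve_into(path, tmp) < 0)
        return NULL;                /* failure path never allocates */

    if (resolved)                   /* caller's buffer: plain copy */
        return strcpy(resolved, tmp);
    return strdup(tmp);             /* exact-size allocation on success */
}
```

Since strdup is the only allocating call and nothing is ever freed, a static program using this shape would not need to link free on its account.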
Rich Felker
27b4923ba0 make realpath use O_PATH when opening the file
this avoids failure if the file is not readable and avoids odd
behavior for device nodes, etc. on old kernels that lack O_PATH, the
old behavior (O_RDONLY) will naturally happen as the fallback.
2013-08-31 15:44:58 -04:00
Rich Felker
9ff8ed463c fix regression in creation of ldso symlink
DESTDIR was wrongly included in the symlink contents.
2013-08-31 11:36:56 -04:00
Rich Felker
7cc49f98a2 fix breakage in synccall due to incorrect signal restoration in sigqueue
commit 07827d1a82 seems to have
introduced this issue. sigqueue is called from the synccall core, at
which time, even implementation-internal signals are blocked. however,
pthread_sigmask removes the implementation-internal signals from the
old mask before returning, so that a process which began life with
them blocked will not be able to save a signal mask that has them
blocked, possibly causing them to become re-blocked later. however,
this was causing sigqueue to unblock the implementation-internal
signals during synccall, leading to deadlock.
2013-08-31 01:12:00 -04:00
Rich Felker
d0f0fa484c fix typo in release notes 2013-08-30 17:30:27 -04:00
Rich Felker
57565bee00 release notes for 0.9.13 2013-08-30 17:24:25 -04:00
Rich Felker
da19f88659 only expose struct tcphdr under _GNU_SOURCE
the BSD and GNU versions of this structure differ, so exposing it in
the default _BSD_SOURCE profile is possibly problematic. both versions
could be simultaneously supported with anonymous unions if needed in
the future, but for now, just omitting it except under _GNU_SOURCE
should be safe.
2013-08-30 17:06:17 -04:00
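
A rough illustration of the anonymous-union idea mentioned above, assuming a C11 (or GNU C) compiler. The struct is a simplified, hypothetical layout covering only the first four header fields; it is not the definition in netinet/tcp.h.

```c
#include <stdint.h>

/* Simplified, hypothetical layout: each field is reachable under both
 * its BSD name (th_*) and its GNU name. */
struct tcphdr_sketch {
    union { uint16_t th_sport; uint16_t source;  };
    union { uint16_t th_dport; uint16_t dest;    };
    union { uint32_t th_seq;   uint32_t seq;     };
    union { uint32_t th_ack;   uint32_t ack_seq; };
};
```

Both spellings refer to the same storage, so code written against either naming convention would compile against one definition.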
Rich Felker
09b2995bcb add struct tcphdr in netinet/tcp.h 2013-08-30 16:50:00 -04:00
Rich Felker
f7bc29ed22 remove -Wcast-align from --enable-warnings
I originally added this warning option based on a misunderstanding of
how it works. it does not warn whenever the destination of the cast
has stricter alignment; it only warns in cases where misaligned
dereference could lead to a fault. thus, it's essentially a no-op for
i386, which had me wrongly believing the code was clean for this
warning level. on other archs, numerous diagnostic messages are
produced, and all of them are false-positives, so it's better just not
to use it.
2013-08-28 05:08:16 -04:00
Rich Felker
90edf1cc15 optimized C memcpy
unlike the old C memcpy, this version handles word-at-a-time reads and
writes even for misaligned copies. it does not require that the cpu
support misaligned accesses; instead, it performs bit shifts to
realign the bytes for the destination.

essentially, this is the C version of the ARM assembly language
memcpy. the ideas are all the same, and it should perform well on any
arch with a decent number of general-purpose registers that has a
barrel shift operation. since the barrel shifter is an optional cpu
feature on microblaze, it may be desirable to provide an alternate asm
implementation on microblaze, but otherwise the C code provides a
competitive implementation for "generic risc-y" cpu archs that should
alleviate the urgent need for arch-specific memcpy asm.
2013-08-28 03:34:57 -04:00
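
A condensed sketch of the shift-and-merge technique described above, not the code from this commit. shift_copy, DROP and FILL are hypothetical names; the sketch assumes a GNU C compiler (for may_alias and the predefined byte-order macros) and treats the target as little-endian when those macros are absent.

```c
#include <stddef.h>
#include <stdint.h>

/* GNU C extension: u32 accesses may legally alias other types. */
typedef uint32_t __attribute__((__may_alias__)) u32;

#if defined(__BYTE_ORDER__) && __BYTE_ORDER__ == __ORDER_BIG_ENDIAN__
#define DROP <<    /* shifts already-copied leading bytes out of a word */
#define FILL >>    /* moves the next word's leading bytes into the gap */
#else
#define DROP >>
#define FILL <<
#endif

void *shift_copy(void *restrict dest, const void *restrict src, size_t n)
{
    unsigned char *d = dest;
    const unsigned char *s = src;

    /* Copy single bytes until the source is word-aligned. */
    for (; (uintptr_t)s % 4 && n; n--) *d++ = *s++;

    /* k = bytes needed to bring the destination to alignment too. */
    unsigned k = (4 - (uintptr_t)d % 4) % 4;

    if (k == 0) {
        /* Both aligned: plain word copies. */
        for (; n >= 4; s += 4, d += 4, n -= 4)
            *(u32 *)d = *(const u32 *)s;
    } else if (n >= 8) {
        u32 w = *(const u32 *)s;            /* aligned read */
        for (unsigned i = 0; i < k; i++) *d++ = *s++;
        n -= k;
        /* d is now aligned; s sits k bytes into the word held in w. */
        for (; n >= 8; s += 4, d += 4, n -= 4) {
            u32 x = *(const u32 *)(s + (4 - k));  /* next aligned word */
            *(u32 *)d = (w DROP (8 * k)) | (x FILL (32 - 8 * k));
            w = x;
        }
    }

    /* Remaining tail, byte by byte. */
    while (n--) *d++ = *s++;
    return dest;
}
```

Every word-sized load and store here is aligned; only the byte loops touch unaligned addresses, which is why the approach does not depend on hardware support for misaligned access.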
Rich Felker
38e6acbf89 stdbool.h should define __bool_true_false_are_defined even for C++
while the incorporation of this requirement from C99 into C++11 was
likely an accident, some software expects it to be defined, and it
doesn't hurt. if the requirement is removed, then presumably
__bool_true_false_are_defined would just be in the implementation
namespace and thus defining it would still be legal.
2013-08-28 00:41:00 -04:00
Rich Felker
ebc87349fe fix invalid instruction mnemonics in powerpc fenv asm
there is no non-dot version of the andis instruction, but there's no
harm in updating the flags anyway, so just use the dot version.
2013-08-27 18:54:46 -04:00
Rich Felker
a543369e3b optimized C memset
this version of memset is optimized both for small and large values of
n, and makes no misaligned writes, so it is usable (and near-optimal)
on all archs. it is capable of filling up to 52 or 56 bytes without
entering a loop and with at most 7 branches, all of which can be fully
predicted if memset is called multiple times with the same size.

it also uses the __attribute__ extension to inform the compiler that it is
violating the aliasing rules, unlike the previous code which simply
assumed it was safe to violate the aliasing rules since translation
unit boundaries hide the violations from the compiler. for non-GNUC
compilers, 100% portable fallback code in the form of a naive loop is
provided. I intend to eventually apply this approach to all of the
string/memory functions which are doing word-at-a-time accesses.
2013-08-27 18:08:29 -04:00
Rich Felker
06ceee8ca3 add __attribute__((__may_alias__)) checking in configure
this will be needed for upcoming commits to the string/mem functions
to correct their unannounced use of aliasing violations for
word-at-a-time search, fill, and copy operations.
2013-08-27 17:33:47 -04:00
Rich Felker
242a4bb440 add the %s (seconds since the epoch) format to strftime
this is a nonstandard extension but will be required in the next
version of POSIX, and it's widely used/useful in shell scripts
utilizing the date utility.
2013-08-25 02:02:15 -04:00
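
A small usage sketch for the new format, assuming a libc whose strftime accepts %s (it is an extension, so this is not portable ISO C). The helper name epoch_seconds is hypothetical.

```c
#include <stddef.h>
#include <time.h>

/* Hypothetical helper: formats t as decimal seconds since the epoch.
 * Returns the number of characters written, or 0 on failure. */
size_t epoch_seconds(time_t t, char *buf, size_t size)
{
    struct tm *tm = localtime(&t);

    if (!tm)
        return 0;
    return strftime(buf, size, "%s", tm);   /* "%s" is an extension */
}
```

This mirrors what `date +%s` prints in a shell script.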
Rich Felker
2828a130b1 fix strftime regression in %e format
%e pads with spaces instead of zeros.
2013-08-24 14:35:17 -04:00
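
To make the difference concrete, a minimal sketch comparing %d and %e for a single-digit day. The helper name day_formats is hypothetical.

```c
#include <stddef.h>
#include <string.h>
#include <time.h>

/* Hypothetical helper: writes "<%d>|<%e>" for the given day of month. */
size_t day_formats(int mday, char *buf, size_t size)
{
    struct tm tm;

    memset(&tm, 0, sizeof tm);
    tm.tm_mday = mday;
    return strftime(buf, size, "%d|%e", &tm);
}
```

For day 5 the buffer holds "05| 5": %d pads with a zero, %e with a space.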
Rich Felker
190bbb9923 properly fill in tzname[] for old (pre-64-bit-format) zoneinfo files
in this case, the first standard-time and first daylight-time rules
should be taken as the "default" ones to expose.
2013-08-24 13:11:18 -04:00
Rich Felker
32985d4f63 minor fix to tz name checking
if a zoneinfo file is not (or is no longer) in use, don't check the
abbrevs pointers, which may be invalid.
2013-08-24 13:10:01 -04:00
Rich Felker
d78be392e1 fix strftime handling of time zone data
this may need further revision in the future, since POSIX is rather
unclear on the requirements, and is designed around the assumption of
POSIX TZ specifiers which are not sufficiently powerful to represent
real-world timezones (this is why zoneinfo support was added).

the basic issue is that strftime gets the string and numeric offset
for the timezone from the extra fields in struct tm, which are
initialized when calling localtime/gmtime/etc. however, a conforming
application might have created its own struct tm without initializing
these fields, in which case using __tm_zone (a pointer) could crash.
other zoneinfo-based implementations simply check for a null pointer,
but otherwise can still crash if the field contains junk.

simply ignoring __tm_zone and using tzname[] would "work" but would
give incorrect results in time zones with more complex rules. I feel
like this would lower the quality of implementation.

instead, simply validate __tm_zone: unless it points to one of the
zone name strings managed by the timezone system, assume it's invalid.

this commit also fixes several other minor bugs with formatting:
tm_isdst being negative is required to suppress printing of the zone
formats, and %z was using the wrong format specifiers since the type
of val was changed, resulting in bogus output.
2013-08-24 12:59:02 -04:00
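
The validation idea can be sketched as a pointer identity check. This is a deliberate simplification: the hypothetical checked_zone_name below only recognizes tzname[0] and tzname[1], whereas the commit describes accepting any zone name string managed by the timezone system.

```c
#define _POSIX_C_SOURCE 200809L
#include <time.h>

/* Hypothetical simplification: the only strings treated as managed by
 * the timezone system here are tzname[0] and tzname[1]. */
const char *checked_zone_name(const char *zone, int isdst)
{
    if (isdst < 0)
        return "";                  /* negative tm_isdst: print nothing */
    if (zone == tzname[0] || zone == tzname[1])
        return zone;                /* identity check, no dereference */
    return "";                      /* unknown pointer: treat as invalid */
}
```

Because the candidate pointer is only compared and never dereferenced, junk in an uninitialized struct tm cannot cause a crash on this path.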
Rich Felker
0f9b1f672b make dlopen honor the rpath of the main program
this seems to match what other systems do, and seems useful for
programs that have their libraries and plugins stored relative to the
executable.
2013-08-23 23:13:25 -04:00
Rich Felker
bd5ed22c58 fix mishandling of empty or blank TZ environment variable
the empty TZ string was matching equal to the initial value of the
cached TZ name, thus causing do_tzset never to run and never to
initialize the time zone data.
2013-08-23 23:07:09 -04:00
Rich Felker
7211551e9f fix regression in dn_expand/reverse dns
off-by-one error copying the name components was yielding junk at the
beginning and truncating one character at the end (of every
component).
2013-08-23 21:25:01 -04:00
Rich Felker
d2c42ed25f fix bugs in $ORIGIN handling
1. an occurrence of ${ORIGIN} before $ORIGIN would be ignored due to
the strstr logic. (note that rpath contains multiple :-delimited paths
to be searched.)

2. data read by readlink was not null-terminated.
2013-08-23 15:51:59 -04:00
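
Both fixes can be illustrated with a short sketch. It is an assumption-laden illustration rather than the dynamic linker's code: find_origin and read_link_str are hypothetical helpers.

```c
#define _POSIX_C_SOURCE 200809L
#include <stddef.h>
#include <string.h>
#include <sys/types.h>
#include <unistd.h>

/* Returns the earliest $ORIGIN or ${ORIGIN} token in s (NULL if none)
 * and stores its length in *len when one is found. */
const char *find_origin(const char *s, size_t *len)
{
    const char *plain  = strstr(s, "$ORIGIN");
    const char *braced = strstr(s, "${ORIGIN}");

    if (braced && (!plain || braced < plain)) {
        *len = sizeof "${ORIGIN}" - 1;
        return braced;
    }
    if (plain)
        *len = sizeof "$ORIGIN" - 1;
    return plain;
}

/* readlink() does not terminate its output, so reserve one byte and
 * add the terminator by hand. Returns the length, or -1 on error or
 * possible truncation. */
ssize_t read_link_str(const char *path, char *buf, size_t size)
{
    ssize_t n;

    if (size == 0)
        return -1;
    n = readlink(path, buf, size - 1);
    if (n < 0 || (size_t)n >= size - 1)
        return -1;
    buf[n] = '\0';
    return n;
}
```

Searching for both spellings and taking whichever comes first avoids skipping an earlier ${ORIGIN} in a colon-delimited rpath.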
Rich Felker
cc51505a32 use AT_EXECFN, if available, for dynamic linker to identify main program
fall back to argv[0] as before. unlike argv[0], AT_EXECFN was a valid
(but possibly relative) pathname for the new program image at the time
the execve syscall was made.

as a special case, ignore AT_EXECFN if it begins with "/proc/", in
order not to give bogus (and possibly harmful) results when fexecve
was used.
2013-08-23 14:14:47 -04:00
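
The selection rule can be sketched from application code as below. This assumes a Linux libc that provides getauxval in sys/auxv.h; the dynamic linker itself reads the auxiliary vector directly. pick_main_name and main_program_name are hypothetical names.

```c
#include <stdint.h>
#include <string.h>
#include <sys/auxv.h>

/* Pure selection logic: prefer execfn unless it is missing or points
 * into /proc/, in which case fall back to argv0. */
const char *pick_main_name(const char *execfn, const char *argv0)
{
    if (execfn && strncmp(execfn, "/proc/", 6) != 0)
        return execfn;
    return argv0;
}

/* Assumption: getauxval() exists and returns 0 for a missing entry. */
const char *main_program_name(const char *argv0)
{
    const char *execfn = (const char *)(uintptr_t)getauxval(AT_EXECFN);

    return pick_main_name(execfn, argv0);
}
```

Keeping the decision in a function of two strings makes the /proc/ special case easy to exercise without involving fexecve.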
Rich Felker
a897a20a57 add rpath $ORIGIN processing to dynamic linker 2013-08-23 13:56:30 -04:00
Rich Felker
709355e1f6 add recursive rpath support to dynamic linker
previously, rpath was only honored for direct dependencies. in other
words, if A depends on B and B depends on C, only B's rpath (if any),
not A's rpath, was being searched for C. this limitation made
rpath-based deployment difficult in the presence of multiple levels of
library dependency.

at present, $ORIGIN processing in rpath is still unsupported.
2013-08-23 11:15:40 -04:00
Rich Felker
8b491f1499 fix missing string.h in strftime.c (needed by new strftime code)
this bug was masked by local experimental CFLAGS in my config.mak.
2013-08-23 08:11:43 -04:00
Rich Felker
6fdaaf251d fix some documentation typos 2013-08-22 22:40:30 -04:00
Rich Felker
fc48ceee77 add strftime and wcsftime field widths
at present, since POSIX requires %F to behave as %+4Y-%m-%d and ISO C
requires %F to behave as %Y-%m-%d, the default behavior for %Y has
been changed to match %+4Y. this seems to be the only way to conform
to the requirements of both standards, and it does not affect years
prior to the year 10000. depending on the outcome of interpretations
from the standards bodies, this may be adjusted at some point.
2013-08-22 22:36:19 -04:00
Rich Felker
33413cdd25 simplify strftime and fix integer overflows
use a long long value so that even with offsets, values cannot
overflow. instead of using different format strings for different
numeric formats, simply use a per-format width and %0*lld for all of
them.

this width specifier is not for use with strftime field widths; that
will be a separate step in the caller.
2013-08-22 19:44:02 -04:00
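
A minimal sketch of the single-format approach, using a hypothetical helper format_number. The width is passed as an argument via `*`, and the value is widened to long long so that adding an offset such as 1900 to a large int cannot overflow.

```c
#include <stdio.h>

/* Hypothetical helper: zero-pads val to at least width characters.
 * Returns snprintf's result (length needed, or negative on error). */
int format_number(char *buf, size_t size, int width, long long val)
{
    return snprintf(buf, size, "%0*lld", width, val);
}
```

For example, width 2 with value 7 yields "07", and width 4 with value 2147483647LL + 1900 yields "2147485547".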
Rich Felker
87e133b352 strftime cleanup: avoid recomputing strlen when it's known 2013-08-22 19:36:30 -04:00
Rich Felker
45849d3dcb more strftime refactoring
make __strftime_fmt_1 return a string (possibly in the caller-provided
temp buffer) rather than writing into the output buffer. this approach
makes more sense when padding to a minimum field width might be
required, and it's also closer to what wcsftime wants.
2013-08-22 19:27:36 -04:00
Rich Felker
f5e4efc4bd begin refactoring strftime to make adding field widths easier 2013-08-22 19:02:52 -04:00
Rich Felker
ecf4e24d81 add SUN_LEN macro to sys/un.h under appropriate feature tests
this is ugly and useless, but it seems to be the least-ugly way to
provide it...
2013-08-21 21:46:57 -04:00
Rich Felker
580b8d8c52 unbreak vwarn: print ": " before errno message
patch by Strake. this seems to be a regression caused by fixing the
behavior of perror("") to match perror(0) at some point in the past.
2013-08-21 00:49:46 -04:00
Rich Felker
e449974dc5 fix two bugs in sed code configure uses to save command line
one place where semicolon (non-portable) was still used in place of
separate -e options (copied over from an old version of this code),
and use of a literal slash in the bracket expression for the final
command, despite slash being used as the delimiter for the s command.
2013-08-20 13:51:46 -04:00
Rich Felker
4681aae23a re-add logic for ignoring failure of ld.so symlink installation
this was inadvertently removed when switching to the new install.sh.
2013-08-18 20:20:08 -04:00
Szabolcs Nagy
a6b0170a7f fix fenv exception functions to mask their argument
fesetround.c is a wrapper to do the arch independent argument
check (on archs where rounding mode is not stored in 2 bits
__fesetround still has to check its arguments)

on powerpc fe*except functions do not accept the extra invalid
flags of its fpscr register

the useless FENV_ACCESS pragma was removed from feupdateenv
2013-08-18 20:08:18 +00:00
Szabolcs Nagy
d8764bf840 optimize x86 feclearexcept: only use save/restore x87 fenv if needed
the x87 exception summary (ES) and stack fault (SF) flags may be
spuriously cleared by feclearexcept using the fnclex instruction,
but these flags are not observable through libc hence maintaining
their state is not critical.
2013-08-18 16:27:21 +00:00
Szabolcs Nagy
baba2630c9 remove the __mxcsr member from fenv_t on i386 to follow the glibc abi
in the previous commit sse fenv support was added, but there is no
need to save mxcsr (sse fenv register) so fix the abi incompatibility
with glibc.
2013-08-18 16:27:20 +00:00
Szabolcs Nagy
ebc10fa176 add sse fenv support on i386 through hwcap
the sse and x87 rounding modes should always be the same,
the visible exception flags are the bitwise or of the two
fenv states (so it's enough to query the rounding mode or
raise exceptions on one fenv)
2013-08-18 16:27:20 +00:00
Szabolcs Nagy
d684149910 fix i386 fesetenv: FE_DFL_ENV is (fenv_t*)-1 not 0 2013-08-18 16:27:20 +00:00
Rich Felker
37c25065be remove spurious tmp file present since initial git check-in 2013-08-17 22:28:50 -04:00
Rich Felker
e678fc6f32 replace system's install command with a shell script
the historical (non-standardized) install command is really
inappropriate for installing binaries/libraries on a system that
utilizes memory-mapped executable files. rather than replacing an
existing file atomically, it overwrites the existing file. this can
cause running programs to see a partially-modified version of the
file, resulting in unpredictable behavior, or SIGBUS. a MAP_COPY mode
for mmap would get around this problem, but Linux lacks MAP_COPY.

the shell script added with this commit works around the problem by
writing temporary files and moving them into place. unlike the
historical install utility, it also supports a -l option for installing
a symbolic link atomically, via the same method.
2013-08-17 22:21:11 -04:00
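
The write-then-move idea, shown here in C rather than shell purely for illustration. install_atomic is a hypothetical helper, and the fixed ".tmp" suffix is an assumption of this sketch, not how the install script names its temporary files.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: writes len bytes to "<dest>.tmp", then renames
 * that file over dest. Returns 0 on success, -1 on failure. */
int install_atomic(const char *dest, const void *data, size_t len)
{
    size_t n = strlen(dest) + sizeof ".tmp";
    char *tmp = malloc(n);
    int ret = -1;
    FILE *f;

    if (!tmp)
        return -1;
    snprintf(tmp, n, "%s.tmp", dest);

    f = fopen(tmp, "wb");
    if (f) {
        int ok = fwrite(data, 1, len, f) == len;
        if (fclose(f) != 0)
            ok = 0;
        /* rename() replaces dest in one step on POSIX systems */
        if (ok && rename(tmp, dest) == 0)
            ret = 0;
        else
            remove(tmp);
    }
    free(tmp);
    return ret;
}
```

A process that already has the old file mapped keeps seeing the old contents, because the old inode is unlinked rather than overwritten.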
Rich Felker
109bd65acf add hkscs/big5-2003/eten extensions to iconv big5
with these changes, the character set implemented as "big5" in musl is
a pure superset of cp950, the canonical "big5", and agrees with the
normative parts of Unicode. this means it has minor differences from
both hkscs and big5-2003:

- the range A2CC-A2CE maps to CJK ideographs rather than numerals,
  contrary to changes made in big5-2003.

- C6CD maps to a CJK ideograph rather than its corresponding Kangxi
  radical character, contrary to changes made in hkscs.

- F9FE maps to U+2593 rather than U+FFED.

of these differences, none but the last are visually distinct, and the
last is a character used purely for text-based graphics, not to convey
linguistic content.

should there be future demand for strict conformance to big5-2003 or
hkscs mappings, the present charset aliases can be replaced with
distinct variants.

reportedly there are other non-standard big5 extensions in common use
in Taiwan and perhaps elsewhere, which could also be added as layers
on top of the existing big5 support.

there may be additional characters which should be added to the hkscs
table: the whatwg standard for big5 defines what appears to be a
superset of hkscs.
2013-08-17 16:23:22 -04:00