RepoMirrors/musl

mirror of git://git.musl-libc.org/musl synced 2025-01-11 09:09:45 +00:00

Author	SHA1	Message	Date
Rich Felker	759bf785a8	arpa/nameser.h: update RR types list our RR type list in arpa/nameser.h was badly outdated, and missing important types for DNSSEC and DANE use, among other things.	2022-09-22 18:44:44 -04:00
Rich Felker	51d4669fb9	dns: implement tcp fallback in __res_msend query core tcp fallback was originally deemed unwanted and unnecessary, since we aim to return a bounded-size result from getaddrinfo anyway and normally plenty of address records fit in the 512-byte udp dns limit. however, this turned out to have several problems: - some recursive nameservers truncate by omitting all the answers, rather than sending as many as can fit. - a pathological worst-case CNAME for a worst-case name can fill the entire 512-byte space with just the two names, leaving no room for any addresses. - the res_* family of interfaces allow querying of non-address records such as TLSA (DANE), TXT, etc. which can be very large. for many of these, it's critical that the caller see the whole RRset. also, res_send/res_query are specified to return the complete, untruncated length so that the caller can retry with an appropriately-sized buffer. determining this is not possible without tcp. so, it's time to add tcp fallback. the fallback strategy implemented here uses one tcp socket per question (1 or 2 questions), initiated via tcp fastopen when possible. the connection is made to the nameserver that issued the truncated answer. right now, fallback happens unconditionally when truncation is seen. this can, and may later be, relaxed for queries made by the getaddrinfo system, since it will only use a bounded number of results anyway. retry is not attempted again after failure over tcp. the logic could easily be adapted to do that, but it's of questionable value, since the tcp stack automatically handles retransmission and the successs answer with TC=1 over udp strongly suggests that the nameserver has the full answer ready to give. further retry is likely just "take longer to fail".	2022-09-22 14:17:05 -04:00
Rich Felker	e2e9517607	res_send: use a temp buffer if caller's buffer is under 512 bytes for extremely small buffer sizes, the DNS query core in __res_msend may malfunction completely, being unable to get even the headers to determine the response code. but there is also a problem for reasonable sizes under 512 bytes: __res_msend is unable to determine if the udp answer was truncated at the recv layer, in which case it may be incomplete, and res_send is then unable to honor its contract to return the length of the full, non-truncated answer. at present, res_send does not honor that contract anyway when the full answer would exceed 512 bytes, since there is no tcp fallback, but this change at least makes it consistent in a context where this is the only "full answer" to be had.	2022-09-22 12:41:23 -04:00
Rich Felker	c87d75f2aa	adapt res_msend DNS query core for working with multiple sockets this is groundwork for TCP fallback support, but does not itself change behavior in any way.	2022-09-21 18:49:53 -04:00
Rich Felker	85050ac5a2	getaddrinfo: add EAI_NODATA error code to distinguish NODATA vs NxDomain this was apparently omitted long ago out of a lack of understanding of its importance and the fact that POSIX doesn't specify it. despite not being officially standardized, however, it turns out that at least AIX, glibc, NetBSD, OpenBSD, QNX, and Solaris document and support it. in certain usage cases, such as implementing a DNS gateway on top of the stub resolver interfaces, it's necessary to distinguish the case where a name does not exit (NxDomain) from one where it exists but has no addresses (or other records) of the requested type (NODATA). in fact, even the legacy gethostbyname API had this distinction, which we were previously unable to support correctly because the backend lacked it. apart from fixing an important functionality gap, adding this distinction helps clarify to users how search domain fallback works (falling back in cases corresponding to EAI_NONAME, not in ones corresponding to EAI_NODATA), a topic that has been a source of ongoing confusion and frustration. as a result of this change, EAI_NONAME is no longer a valid universal error code for getaddrinfo in the case where AI_ADDRCONFIG has suppressed use of all address families. in order to return an accurate result in this case, getaddrinfo is modified to still perform at least one lookup. this will almost surely fail (with a network error, since there is no v4 or v6 network to query DNS over) unless a result comes from the hosts file or from ip literal parsing, but in case it does succeed, the result is replaced by EAI_NODATA. glibc has a related error code, EAI_ADDRFAMILY, that could be used for the AI_ADDRCONFIG case and certain NODATA cases, but distinguishing them properly in full generality seems to require additional DNS queries that are otherwise not useful. on glibc, it is only used for ip literals with mismatching family, not for DNS or hosts file results where the name has addresses only in the opposite family. since this seems misleading and inconsistent, and since EAI_NODATA already covers the semantic case where the "name" exists but doesn't have any addresses in the requested family, we do not adopt EAI_ADDRFAMILY at this time. this could be changed at some point if desired, but the logic for getting all the corner cases with AI_ADDRCONFIG right is slightly nontrivial.	2022-09-20 18:09:42 -04:00
Rich Felker	dc9285ad1d	fix error cases in gethostbyaddr_r EAI_MEMORY is not possible (but would not provide errno if it were) and EAI_FAIL does not provide errno. treat the latter as EBADMSG to match how it's handled in gethostbyname2_r (it indicates erroneous or failure response from the nameserver).	2022-09-19 19:12:09 -04:00
Rich Felker	f9827fc7da	remove impossible error case from gethostbyname2_r EAI_MEMORY is not possible because the resolver backend does not allocate. if it did, it would be necessary for us to explicitly return ENOMEM as the error, since errno is not guaranteed to reflect the error cause except in the case of EAI_SYSTEM, so the existing code was not correct anyway.	2022-09-19 19:09:02 -04:00
Rich Felker	f081d5336a	fix return value of gethostnbyname[2]_r on result not found these functions are horribly underspecified, inconsistent between historical systems, and should never have been included. however, the signatures we have match the glibc ones, and the glibc behavior is to treat NxDomain and NODATA results as a success condition, not an ENOENT error.	2022-09-19 19:02:40 -04:00
Rich Felker	1e7fb12f77	dns: treat names rejected by res_mkquery as nonexistent rather than error this distinction only affects search, but allows search to continue when concatenating one of the search domains onto the requested name produces a result that's not valid. this can happen when the concatenation is too long, or one of the search list entries is itself not valid. as a consequence of this change, having "." in the search domains list will now be ignored/skipped rather than making the lookup abort with no results (due to producing a concatenation ending in ".."). this behavior could be changed later if needed.	2022-09-19 15:51:04 -04:00
Rich Felker	001c1afb0a	res_mkquery: error out on consecutive final dots in name the main loop already errors out on zero-length labels within the name, but terminates before having a chance to check for an erroneous final zero-length label, instead producing a malformed query packet with a '.' byte instead of the terminating zero. rather than poke at the look logic, simply detect this condition early and error out without doing anything. this also fixes behavior of getaddrinfo when "." appears in the search domain list, which produces a name ending in ".." after concatenation, at least in the sense of no longer emitting malformed packets on the network. however, due to other issues, the lookup will still fail.	2022-09-19 15:38:00 -04:00
Alexey Izbyshev	3ad3fa962e	fix thread leak on timer_create(SIGEV_THREAD) failure After commit `5b74eed3b3` the timer thread doesn't check whether timer_create() actually created the timer, proceeding to wait for a signal that might never arrive. We can't fix this by simply checking for a negative timer_id after pthread_barrier_wait() because we have no way to distinguish a timer creation failure and a request to delete a timer with INT_MAX id if it happens to arrive quickly (a variation of this bug existed before `5b74eed3b3`, where the timer would be leaked in this case). So (ab)use cancel field of pthread_t instead.	2022-09-19 13:24:05 -04:00
Rich Felker	bf14ef193b	re-enable vdso clock_gettime on arm (32-bit) with workaround commit `4486c579cb` disabled vdso clock_gettime on arm due to a Linux kernel bug that was not understood at the time, whereby the vdso function silently produced catastrophically wrong results on some systems. since then, the bug was tracked down to the way the arm kernel disabled use of vdso clock_gettime on kernels where the necessary timer was not available or was disabled. it simply patched out the symbols, but it only did this for the legacy time32 functions, and left the time64 function in place but non-operational. kernel commit 4405bdf3c57ec28d606bdf5325f1167505bfdcd4 (first present in 5.8) provided the fix. if this were a bug that impacted all users of the broken kernel versions, we could probably ignore it and assume it had been patched or replaced. however, it's very possible that these kernels appear in the wild in devices running time32 userspace (glibc, musl 1.1.x, or some other environment) where they appear to work fine, but where our new binaries would fail catastrophically if we used the time64 vdso function. since the kernel has not (yet?) given us a way to probe for the working time64 vdso function semantically, we work around the problem by refusing to use the time64 one unless the time32 one is also present. this will revert to not using vdso at all if the time32 one is ever removed, but at least that's safe against wrong results and is just a missed optimization.	2022-09-19 13:21:54 -04:00
Rich Felker	6f3ead0ae1	process DT_RELR relocations in ldso-startup/static-pie commit `d32dadd60e` added DT_RELR processing for programs and shared libraries processed by the dynamic linker, but left them unsupported in the dynamic linker itseld and in static pie binaries, which self-relocate via code in dlstart.c. add the equivalent processing to this code path so that there are not arbitrary restrictions on where the new packed relative relocation form can be used.	2022-09-12 08:30:36 -04:00
Rich Felker	25085c85a0	fix fwprintf missing output to open_wmemstream FILEs open_wmemstream's write method was written assuming no buffering, since it sets the FILE up with buf_len of zero in order to avoid issues with position/seeking. however, as a consequence of commit `bd57e2b43a`, a FILE being written to by the printf core has a temporary local buffer for the duration of the operation if it was unbuffered to begin with. since this was disregarded by the wide memstream's write method, output produced through this code path, particularly numeric fields, was missing from the output wchar buffer. copy the equivalent logic for using the buffered data from the byte-oriented open_memstream.	2022-09-07 19:38:32 -04:00
Rich Felker	a636fd630f	dns: fail if ipv6 is disabled and resolv.conf has only v6 nameserves if resolv.conf lists no nameservers at all, the default of 127.0.0.1 is used. however, another "no nameservers" case arises where the system has ipv6 support disabled/configured-out and resolv.conf only contains v6 nameservers. this caused the resolver to repeat socket operations that will necessarily fail (sending to one or more wrong-family addresses) while waiting for a timeout. it would be contrary to configured intent to query 127.0.0.1 in this case, but the current behavior is not conducive to diagnosing the configuration problem. instead, fail immediately with EAI_SYSTEM and errno==EAFNOSUPPORT so that the configuration error is reportable.	2022-08-26 14:57:52 -04:00
Rich Felker	996b6154b2	use kernel-provided AT_MINSIGSTKSZ for sysconf(_SC_[MIN]SIGSTKSZ) use the legacy constant values if the kernel does not provide AT_MINSIGSTKSZ (__getauxval will return 0 in this case) and as a safety check if something is wrong and the provided value is less than the legacy constant. sysconf(_SC_SIGSTKSZ) returns SIGSTKSZ adjusted for the difference between the legacy constant MINSIGSTKSZ and the runtime value, so that the working space the application has on top of the minimum remains invariant under changes to the minimum.	2022-08-26 11:34:46 -04:00
Rich Felker	25340a9337	add sysconf keys/values for signal stack size as a result of ISA extensions exploding register file sizes on some archs, using a constant for minimum signal stack size no longer seems viably future-proof. add sysconf keys allowing the kernel to provide a machine-dependent minimum applications can query to ensure they allocate sufficient space for stacks. the key names and indices align with the same functionality in glibc. see commit `d5a5045382` for previous action on this subject. ultimately, the macros MINSIGSTKSZ and SIGSTKSZ probably need to be deprecated, but that is standards-amendment work outside the scope of a single implementation.	2022-08-26 10:20:46 -04:00
Rich Felker	d8fddb9641	fix fallback when ipv6 is disabled but resolv.conf has v6 nameserves apparently this code path was never tested, as it's not usual to have v6 nameservers listed on a system without v6 networking support. but it was always intended to work. when reverting to binding a v4 address, also revert the family in the sockaddr structure and the socklen for it. otherwise bind will just fail due to mismatched family/sockaddr size. fix dns resolver fallback when v6 nameservers are listed by	2022-08-24 20:48:47 -04:00
Kristina Martsenko	d4f987e4ac	epoll_create: fail with EINVAL if size is non-positive This is a part of the interface contract defined in the Linux man page (official for a Linux-specific interface) and asserted by test cases in the Linux Test Project (LTP).	2022-08-24 20:35:47 -04:00
Rich Felker	2e5fff43dd	use alt signal stack when present for implementation-internal signals a request for this behavior has been open for a long time. the motivation is that application code, particularly under some language runtimes designed around very-low-footprint coroutine type constructs, may be operating with extremely small stack sizes unsuitable for receiving signals, using a separate signal stack for any signals it might handle. progress on this was blocked at one point trying to determine whether the implementation is actually entitled to clobber the alt stack, but the phrasing "available to the implementation" in the POSIX spec for sigaltstack seems to make it clear that the application cannot rely on the contents of this memory to be preserved in the absence of signal delivery (on the abstract machine, excluding implementation-internal signals) and that we can therefore use it for delivery of signals that "don't exist" on the abstract machine. no change is made for SIGTIMER since it is always blocked when used, and accepted via sigwaitinfo rather than execution of the signal handler.	2022-08-20 12:24:49 -04:00
Érico Nogueira	379b18218d	ldso: make exit condition clearer in fixup_rpath breaking out of the switch-case when l==-1 means the conditional below will necessarily be true (-1 >= buf_size, a size_t variable) and the function will return 0. it is, however, somewhat unclear that that's what's happening. simply returning there is simpler	2022-08-17 19:49:54 -04:00
Rich Felker	37e18b7bf3	freopen: reset stream orientation (byte/wide) and encoding rule this is a requirement of the C language (orientation) and POSIX (encoding rule) that was somehow overlooked. we rely on the fact that the buffer pointers have been reset by fflush, so that any future stdio operations on the stream will go through the same code paths they would on a newly-opened file without an orientation set, thereby setting the orientation as they should.	2022-08-17 18:34:07 -04:00
Rich Felker	bf99258564	ldso: process RELR only for non-FDPIC archs the way RELR is applied is not a meaningful operation for FDPIC (there is no single "base" address). it seems unlikely RELR would ever be added for FDPIC, but if it ever is, the behavior and possibly data format will need to be different, so guard against calling the non-FDPIC code.	2022-08-02 17:29:01 -04:00
Fangrui Song	d32dadd60e	ldso: support DT_RELR relative relocation format this resolves DT_RELR relocations in non-ldso, dynamic-linked objects.	2022-08-02 17:27:45 -04:00
Alex Xu (Hello71)	2404d9d643	use syscall_arg_t and __scc macro for arguments to __alt_socketcall otherwise, pointer arguments are sign-extended on x32, resulting in EFAULT.	2022-08-02 14:12:18 -04:00
Michael Pratt	46d1c7801b	fix strings.h feature test macro usage due to missing features.h	2022-08-01 13:57:11 -04:00
Eugene Yudin	baaf257f05	fix ESRCH error handling for clock_getcpuclockid the syscall used to probe availability of the clock fails with EINVAL when the requested pid does not exist, but clock_getcpuclockid is specified to use ESRCH for this purpose.	2022-08-01 13:53:22 -04:00
Szabolcs Nagy	4f48da008d	aarch64: add vfork The generic vfork implementation uses clone(SIGCHLD) which has fork semantics. Implement vfork as clone(SIGCHLD\|CLONE_VM\|CLONE_VFORK, 0) instead which has vfork semantics. (stack == 0 means sp is unchanged in the child.) Some users rely on vfork semantics when memory overcommit is disabled or when the vfork child runs code that synchronizes with the parent process (non-conforming).	2022-08-01 13:37:39 -04:00
Rich Felker	7d568410b4	fix mishandling of errno in getaddrinfo AI_ADDRCONFIG logic this code attempts to use the value of errno from failure of socket or connect to infer availability of the requested address family (v4 or v6). however, in the case where connect failed, there is an intervening call to close between connect and the use of errno. close is not required to preserve errno on success, and in fact the __aio_close code, which is called whenever aio is linked and thus always called in dynamic-linked programs, unconditionally clobbers errno. as a result, getaddrinfo fails with EAI_SYSTEM and errno=ENOENT rather than correctly determining that the address family was unavailable. this fix is based on report/patch by Jussi Nieminen, but simplified slightly to avoid breaking the case where socket, not connect, failed.	2022-08-01 12:54:23 -04:00
Rich Felker	d16d7b1099	early stage ldso: remove symbolic references via error handling function while the error handling function should not be reached in stage 2 (assuming ldso itself was linked correctly), this was not statically determinate from the compiler's perspective, and in theory a compiler performing LTO could lift the TLS references (errno and other things) out of the printf-family functions called in a stage where TLS is not yet initialized. instead, perform the call via a static-storage, internal-linkage function pointer which will be set to a no-op function until the stage where the real error handling function should be reachable. inspired by commit `63c67053a3`.	2022-07-19 19:00:53 -04:00
Alex Xu (Hello71)	63c67053a3	in early stage ldso before __dls2b, call mprotect with __syscall if LTO is enabled, gcc hoists the call to ___errno_location outside the loop even though the access to errno is gated behind head != &ldso because ___errno_location is marked __attribute__((const)). this causes the program to crash because TLS is not yet initialized when called from __dls2. this is also possible if LTO is not enabled; even though gcc 11 doesn't do it, it is still wrong to use errno here. since the start and end are already aligned, we can simply call __syscall instead of using global errno. Fixes: `e13a2b8953` ("implement PT_GNU_RELRO support")	2022-07-02 16:12:04 -04:00
Rich Felker	a23a3da29b	avoid limited space of random temp file names if clock resolution is low this is not an issue that was actually hit, but I noticed it during previous changes to __randname: if the resolution of tv_nsec is too low, the space of temp file names obtainable by a thread could plausibly be exhausted. mixing in tv_sec avoids this.	2022-06-23 11:53:28 -04:00
Rich Felker	4100279825	remove random filename obfuscation that leaks ASLR information the __randname function is used by various temp file creation interfaces as a backend to produce a name to attempt using. it does not have to produce results that are safe against guessing, and only aims to avoid unintentional collisions. mixing the address of an object on the stack in a reversible manner leaked ASLR information, potentially allowing an attacker who can observe the temp files created and their creation timestamps to narrow down the possible ASLR state of the process that created them. there is no actual value in mixing these addresses in; it was just obfuscation. so don't do it. instead, mix the tid, just to avoid collisions if multiple processes/threads stampede to create temp files at the same moment. even without this measure, they should not collide unless the clock source is very low resolution, but it's a cheap improvement. if/when we have a guaranteed-available userspace csprng, it could be used here instead. even though there is no need for cryptographic entropy here, it would avoid having to reason about clock resolution and such to determine whether the behavior is nice.	2022-06-03 18:54:41 -04:00
Rich Felker	6c858d6fd4	ensure distinct query id for parallel A and AAAA queries in resolver assuming a reasonable realtime clock, res_mkquery is highly unlikely to generate the same query id twice in a row, but it's possible with a very low-resolution system clock or under extreme delay of forward progress. when it happens, res_msend fails to wait for both answers, and instead stops listening after getting two answers to the same query (A or AAAA). to avoid this, increment one byte of the second query's id if it matches the first query's. don't bother checking if the second byte is also equal, since it doesn't matter; we just need to ensure that at least one byte is distinct.	2022-06-03 11:03:00 -04:00
Rich Felker	8974ef2124	mntent: fix potential mishandling of extremely long lines commit `05973dc3bb` made it so that lines longer than INT_MAX can in theory be read, but did not use a suitable type for the positions determined by sscanf. we could change to using size_t, but since the signature for getmntent_r does not admit lines longer than INT_MAX, it does not make sense to support them in the legacy thread-unsafe form either -- the principle here is that there should not be an incentive to use the unsafe function to get added functionality.	2022-05-15 19:26:22 -04:00
Alyssa Ross	751bee0ee7	mntent: fix parsing lines with optional fields According to fstab(5), the last two fields are optional, but this wasn't accepted. After this change, only the first field is required, which matches glibc's behaviour. Using sscanf as before, it would have been impossible to differentiate between 0 fields and 4 fields, because sscanf would have returned 0 in both cases due to the use of assignment suppression and %n for the string fields (which is important to avoid copying any strings). So instead, before calling sscanf, initialize every string to the empty string, and then we can check which strings are empty afterwards to know how many fields were matched.	2022-05-15 19:19:42 -04:00
Rich Felker	dcb31f6b45	fix constraint violation in qsort wrapper around qsort_r function pointer types do not implicitly convert to void *. a cast is required here.	2022-05-06 19:34:48 -04:00
Rich Felker	6e9d2370c7	use __fstat instead of __fstatat with AT_EMPTY_PATH in __map_file this isolates knowledge of the nonstandard AT_EMPTY_PATH extension to one place and returns __map_file to its prior simplicity.	2022-05-04 10:53:01 -04:00
Rich Felker	05a55868ff	provide an internal namespace-safe __fstat this avoids the need for implementation-internal callers to depend on the nonstandard AT_EMPTY_PATH extension to use __fstatat and isolates knowledge of that extension to the implementation of __fstat.	2022-05-04 10:51:00 -04:00
Rich Felker	fb10dc288d	only use fstatat and others legacy stat syscalls if they exist riscv32 and future architectures only provide statx.	2022-05-01 23:25:21 -04:00
Rich Felker	2b754a5424	drop direct use of stat syscalls in internal __map_file this function is used to implement some baseline ISO C interfaces, so it cannot call any of the stat functions by their public names. use the namespace-safe __fstatat instead.	2022-05-01 23:25:21 -04:00
Rich Felker	c9ba0769a7	provide an internal namespace-safe __fstatat this makes it so we can drop direct stat syscall use in interfaces that can't use the POSIX namespace.	2022-05-01 23:25:21 -04:00
Rich Felker	7edbcbeb76	drop direct use of stat syscalls in fchmodat instead, use the fstatat/stat functions, so that the logic for which syscalls are present and usable is all in fstatat. this results in a slight increase in cost for old kernels on 32-bit archs: now statx will be attempted first rather than just using the legacy time32 syscalls, despite us not caring about timestamps. however, it's not even clear that the legacy syscalls should succeed if the timestamps are out of range; arguably they should fail with EOVERFLOW. as such, paying a small cost here on old kernels seems well-motivated. with this change, fchmodat itself is no longer blocking ports to new archs that lack the legacy syscalls.	2022-05-01 23:25:21 -04:00
Rich Felker	9a93749555	drop use of stat operation in temporary file name generation this change serves two purposes: 1. it eliminates one of the few remaining uses of the kernel stat structure which will not be present in future archs, avoiding the need for growing ifdef logic here. 2. it potentially makes the operations less expensive when the candidate exists as a non-symlink by avoiding the need to read the inode (assuming the directory tables suffice to distinguish symlinks). this uses the idiom I discovered while rewriting realpath for commit `29ff7599a4` of being able to use the readlink operation as an inexpensive probe for file existence that doesn't following symlinks.	2022-05-01 23:25:21 -04:00
Stefan O'Rear	12a757b321	only fallback to gettimeofday/settimeofday syscalls if they exist riscv32 and future architectures only provide the clock_ functions.	2022-05-01 23:25:21 -04:00
Stefan O'Rear	41149ea8c7	only use getrlimit/setrlimit syscalls if they exist riscv32 and future architectures only provide prlimit64.	2022-05-01 23:25:21 -04:00
Stefan O'Rear	8910efd0e4	don't remap internal-use syscall macros to nonexistent time32 syscalls riscv32 and future architectures lack the _time32 variants entirely, so don't try to use their numbers. instead, reflect that they're not present.	2022-04-27 08:45:33 -04:00
Stefan O'Rear	03f71251e6	remove ARMSUBARCH relic from configure commit `0f814a4e57` removed its use.	2022-04-27 08:45:18 -04:00
Rich Felker	55b727d7ad	add missing POSIX confstr keys for pthread CFLAGS/LDFLAGS _CS_POSIX_V7_THREADS_CFLAGS and _CS_POSIX_V7_THREADS_LDFLAGS have been missing for a long time, which is a conformance defect. we were waiting on glibc to add them or at least agree on the numeric values they will have so as to keep the numbering aligned. it looks like they will be added to glibc with these numbers, and in any case, this list does not have any significant churn that would result in the numbers getting taken.	2022-04-20 09:06:54 -04:00
Ondrej Jirman	8cf87b3027	fix incorrect parameter name in internal netlink.h RTA_OK macro the wrong name works only by accident.	2022-04-10 02:25:02 -04:00

... 2 3 4 5 6 ...

4911 Commits