freebsd-skq

Author	SHA1	Message	Date
dfr	79d2dfdaa6	Add the new kernel-mode NFS Lock Manager. To use it instead of the user-mode lock manager, build a kernel with the NFSLOCKD option and add '-k' to 'rpc_lockd_flags' in rc.conf. Highlights include: * Thread-safe kernel RPC client - many threads can use the same RPC client handle safely with replies being de-multiplexed at the socket upcall (typically driven directly by the NIC interrupt) and handed off to whichever thread matches the reply. For UDP sockets, many RPC clients can share the same socket. This allows the use of a single privileged UDP port number to talk to an arbitrary number of remote hosts. * Single-threaded kernel RPC server. Adding support for multi-threaded server would be relatively straightforward and would follow approximately the Solaris KPI. A single thread should be sufficient for the NLM since it should rarely block in normal operation. * Kernel mode NLM server supporting cancel requests and granted callbacks. I've tested the NLM server reasonably extensively - it passes both my own tests and the NFS Connectathon locking tests running on Solaris, Mac OS X and Ubuntu Linux. * Userland NLM client supported. While the NLM server doesn't have support for the local NFS client's locking needs, it does have to field async replies and granted callbacks from remote NLMs that the local client has contacted. We relay these replies to the userland rpc.lockd over a local domain RPC socket. * Robust deadlock detection for the local lock manager. In particular it will detect deadlocks caused by a lock request that covers more than one blocking request. As required by the NLM protocol, all deadlock detection happens synchronously - a user is guaranteed that if a lock request isn't rejected immediately, the lock will eventually be granted. The old system allowed for a 'deferred deadlock' condition where a blocked lock request could wake up and find that some other deadlock-causing lock owner had beaten them to the lock. * Since both local and remote locks are managed by the same kernel locking code, local and remote processes can safely use file locks for mutual exclusion. Local processes have no fairness advantage compared to remote processes when contending to lock a region that has just been unlocked - the local lock manager enforces a strict first-come first-served model for both local and remote lockers. Sponsored by: Isilon Systems PR: 95247 107555 115524 116679 MFC after: 2 weeks	2008-03-26 15:23:12 +00:00
brueffer	b64d211df2	Fix some "in in" typos in comments. PR: 121490 Submitted by: Anatoly Borodin <anatoly.borodin@gmail.com> Approved by: rwatson (mentor), jkoshy MFC after: 3 days	2008-03-26 07:32:08 +00:00
ru	5398c833ef	Compile libthr with warnings. (Somehow this file sneaked from initial commit.)	2008-03-25 15:33:00 +00:00
ru	e8df07e5aa	Compile libthr with warnings.	2008-03-25 13:28:12 +00:00
antoine	73a86dc575	Don't allocate the constant array "props" on the stack in wctype. PR: 74743 Submitted by: knut st. osmundsen Approved by: rwatson (mentor) MFC after: 1 month	2008-03-17 18:22:23 +00:00
das	6f407f2920	scandir(3) previously used st_size to obtain an initial estimate of the array length needed to store all the directory entries. Although BSD has historically guaranteed that st_size is the size of the directory file, POSIX does not, and more to the point, some recent filesystems such as ZFS use st_size to mean something else. The fix is to not stat the directory at all, set the initial array size to 32 entries, and realloc it in powers of 2 if that proves insufficient. PR: 113668	2008-03-16 19:08:53 +00:00
ru	5fad0ab914	Fix bugs in previous revision (missing comma, misspelled syscall name).	2008-03-13 10:33:24 +00:00
ru	346fbfb32e	Remove trailing whitespace.	2008-03-13 10:26:17 +00:00
ru	0fbb165835	Add missing section number.	2008-03-13 10:25:30 +00:00
davidxu	f4d90c5978	In file sem_timewait.3, remove reference to SYSV semphore in SEE ALSO section, sync it with sem_wait.3.	2008-03-13 01:53:28 +00:00
jeff	9d33d28fb7	- Remove kse syscall symbols and man pages.	2008-03-12 10:12:22 +00:00
davidxu	f2039f468f	Add missing comma.	2008-03-12 02:37:31 +00:00
davidxu	20682c9e8d	Add manual for function sem_timedwait(). Reviewed by: ru, deischen	2008-03-12 02:33:17 +00:00
rwatson	72cc21ea73	Add reference to kldunloadf system call, which was previously not mentioned in the kldunload(2) man page. MFC after: 3 days Spotted by: rink	2008-03-10 09:54:13 +00:00
antoine	514f31f40e	Introduce a new F_DUP2FD command to fcntl(2), for compatibility with Solaris and AIX. fcntl(fd, F_DUP2FD, arg) and dup2(fd, arg) are functionnaly equivalent. Document it. Add some regression tests (identical to the dup2(2) regression tests). PR: 120233 Submitted by: Jukka Ukkonen Approved by: rwaston (mentor) MFC after: 1 month	2008-03-08 22:02:21 +00:00
jasone	423cb10cb4	Remove stale #include <machine/atomic.h>, which as needed by lazy deallocation.	2008-03-07 16:54:03 +00:00
rwatson	360d527360	Add __FBSDID() tags. MFC after: 3 days	2008-03-07 15:25:56 +00:00
jeff	694203dedd	Add cpuset, an api for thread to cpu binding and cpu resource grouping and assignment. - Add a reference to a struct cpuset in each thread that is inherited from the thread that created it. - Release the reference when the thread is destroyed. - Add prototypes for syscalls and macros for manipulating cpusets in sys/cpuset.h - Add syscalls to create, get, and set new numbered cpusets: cpuset(), cpuset_{get,set}id() - Add syscalls for getting and setting affinity masks for cpusets or individual threads: cpuid_{get,set}affinity() - Add types for the 'level' and 'which' parameters for the cpuset. This will permit expansion of the api to cover cpu masks for other objects identifiable with an id_t integer. For example, IRQs and Jails may be coming soon. - The root set 0 contains all valid cpus. All thread initially belong to cpuset 1. This permits migrating all threads off of certain cpus to reserve them for special applications. Sponsored by: Nokia Discussed with: arch, rwatson, brooks, davidxu, deischen Reviewed by: antoine	2008-03-02 07:39:22 +00:00
philip	a72a71deeb	Use the easily-greppable copyright notice template from src/share/examples/mdoc/POSIX-copyright. Requested by: ru	2008-02-29 17:48:25 +00:00
scf	7ee4756ce9	Replace the use of warnx() with direct output to stderr using _write(). This reduces the size of a statically-linked binary by approximately 100KB in a trivial "return (0)" test application. readelf -S was used to verify that the .text section was reduced and that using strlen() saved a few more bytes over using sizeof(). Since the section of code is only called when environ is corrupt (program bug), I went with fewer bytes over fewer cycles. I made minor edits to the submitted patch to make the output resemble warnx(). Submitted by: kib bz Approved by: wes (mentor) MFC after: 5 days	2008-02-28 04:09:08 +00:00
jhb	8ee71003bd	Add <limits.h> for SHRT_MAX. Pointy hat to: jhb	2008-02-27 21:25:19 +00:00
jhb	4c65fa8afd	File descriptors are an int, but our stdio FILE object uses a short to hold them. Thus, any fd whose value is greater than SHRT_MAX is handled incorrectly (the short value is sign-extended when converted to an int). An unpleasant side effect is that if fopen() opens a file and gets a backing fd that is greater than SHRT_MAX, fclose() will fail and the file descriptor will be leaked. Better handle this by fixing fopen(), fdopen(), and freopen() to fail attempts to use a fd greater than SHRT_MAX with EMFILE. At some point in the future we should look at expanding the file descriptor in FILE to an int, but that is a bit complicated due to ABI issues. MFC after: 1 week Discussed on: arch Reviewed by: wollman	2008-02-27 19:02:02 +00:00
wollman	e043fbfcde	stdio is currently limited to file descriptors not greater than {SHRT_MAX}, so {STREAM_MAX} should be no greater than that. (This does not exactly meet the letter of POSIX but comes reasonably close to it in spirit.) MFC after: 14 days	2008-02-27 05:56:57 +00:00
ru	f12be23c59	Added the "restrict" type-qualifier to the readlink() prototype.	2008-02-26 20:33:52 +00:00
brueffer	bcb6adff03	Add missing words. MFC after: 3 days	2008-02-25 13:03:18 +00:00
raj	69575dab52	Let PowerPC world optionally build with -msoft-float. For FPU-less PowerPC variations (e500 currently), this provides a gcc-level FPU emulation and is an alternative approach to the recently introduced kernel-level emulation (FPU_EMU). Approved by: cognet (mentor) MFp4: e500	2008-02-24 19:22:53 +00:00
philip	9044373a13	Note, as required by our agreement with IEEE/The Open Group, that the message queue manual pages excerpt the POSIX standard. Spotted by: Mindaugas Rasiukevicius <rmind -at- NetBSD.org> Reviewed by: imp MFC after: 1 day	2008-02-21 19:16:57 +00:00
kevlo	c74ac9adc1	getopt(3) returns -1, not EOF.	2008-02-18 03:19:25 +00:00
jasone	2bc29a1530	Fix a race condition in arena_ralloc() for shrinking in-place large reallocation, when junk filling is enabled. Junk filling must occur prior to shrinking, since any deallocated trailing pages are immediately available for use by other threads. Reported by: Mats Palmgren <mats.palmgren@bredband.net>	2008-02-17 18:34:17 +00:00
jasone	b08b976e68	Remove support for lazy deallocation. Benchmarks across a wide range of allocation patterns, number of CPUs, and MALLOC_OPTIONS settings indicate that lazy deallocation has the potential to worsen throughput dramatically. Performance degradation occurs when multiple threads try to clear the lazy free cache simultaneously. Various experiments to avoid this bottleneck failed to completely solve this problem, while adding yet more complexity.	2008-02-17 17:09:24 +00:00
delphij	653069d327	Allow underscore in domain names while resolving. While having underscore is a violation of RFC 1034 [STD 13], it is accepted by certain name servers as well as other popular operating systems' resolver library. Bugs are mine. Obtained from: ume MFC after: 2 weeks	2008-02-16 00:16:49 +00:00
ru	56aa644e2a	Change readlink(2)'s return type and type of the last argument to match POSIX. Prodded by: Alexey Lyashkov	2008-02-12 20:09:04 +00:00
remko	1f85c46223	After issueing a ntpdate [1] I noticed it's already 2008, reflect that in the last modified date. Noticed by: brueffer [1]	2008-02-11 07:43:23 +00:00
remko	3ce98657bd	Fix typo (s/existance/existence/) Noticed by: ceri	2008-02-11 07:15:52 +00:00
jasone	f6ce9fe601	Fix a bug in lazy deallocation that was introduced when arena_dalloc_lazy_hard() was split out of arena_dalloc_lazy() in revision 1.162. Reduce thundering herd problems in lazy deallocation by randomly varying how many probes a thread does before taking the slow path.	2008-02-08 08:02:34 +00:00
jasone	c614695539	Clean up manipulation of chunk page map elements to remove some tenuous assumptions about whether bits are set at various times. This makes adding other flags safe. Reorganize functions in order to inline i{m,c,p,s,re}alloc(). This allows the entire fast-path call chains for malloc() and free() to be inlined. [1] Suggested by: [1] Stuart Parmenter <stuart@mozilla.com>	2008-02-08 00:35:56 +00:00
des	0cd1685caf	Add pthread_mutex_isowned_np() so there is no need for an additional prototype next to the implementation. MFC after: 2 weeks	2008-02-06 20:42:35 +00:00
jasone	44c343f8fa	Track dirty unused pages so that they can be purged if they exceed a threshold, according to the 'F' MALLOC_OPTIONS flag. This obsoletes the 'H' flag. Try to realloc() large objects in place. This substantially speeds up incremental large reallocations in the common case. Fix a bug in arena_ralloc() that caused relocation of sub-page objects even if the old and new sizes were in the same size class. Maintain trees of runs and simplify the per-chunk page map. This allows logarithmic-time searching for sufficiently large runs in arena_run_alloc(), whereas the previous algorithm required linear time in the worst case. Break various large functions into smaller sub-functions, and inline only the functions that are in the fast path for small object allocation/deallocation. Remove an unnecessary check in base_pages_alloc_mmap(). Avoid integer division in choose_arena() for the NO_TLS case on single-CPU systems.	2008-02-06 02:59:54 +00:00
ume	97fd4b42a1	Remove incomplete support of AI_ALL and AI_V4MAPPED. Reported by: "Heiko Wundram (Beenic)" <wundram__at__beenic.net>	2008-02-03 19:07:55 +00:00
phk	13132840a1	Give sendfile(2) a SF_SYNC flag which makes it wait until all mbufs referencing the files VM pages are returned from the network stack, making changes to the file safe. This flag does not guarantee that the data has been transmitted to the other end.	2008-02-03 15:54:41 +00:00
trhodes	46c986723b	Update this manual page to describe the extattr_list_file() and the extattr_list_fd() functions. PR: 108142 Submitted by: Richard Dawe <rich@phekda.gotadsl.co.uk> Reviewed by: kientzle	2008-01-29 18:15:38 +00:00
yar	ac1e4103b9	Our fts(3) API, as inherited from 4.4BSD, suffers from integer fields in FTS and FTSENT structs being too narrow. In addition, the narrow types creep from there into fts.c. As a result, fts(3) consumers, e.g., find(1) or rm(1), can't handle file trees an ordinary user can create, which can have security implications. To fix the historic implementation of fts(3), OpenBSD and NetBSD have already changed <fts.h> in somewhat incompatible ways, so we are free to do so, too. This change is a superset of changes from the other BSDs with a few more improvements. It doesn't touch fts(3) functionality; it just extends integer types used by it to match modern reality and the C standard. Here are its points: o For C object sizes, use size_t unless it's 100% certain that the object will be really small. (Note that fts(3) can construct pathnames _much_ longer than PATH_MAX for its consumers.) o Avoid the short types because on modern platforms using them results in larger and slower code. Change shorts to ints as follows: - For variables than count simple, limited things like states, use plain vanilla `int' as it's the type of choice in C. - For a limited number of bit flags use `unsigned' because signed bit-wise operations are implementation-defined, i.e., unportable, in C. o For things that should be at least 64 bits wide, use long long and not int64_t, as the latter is an optional type. See FTSENT.fts_number aka FTS.fts_bignum. Extending fts_number `to satisfy future needs' is pointless because there is fts_pointer, which can be used to link to arbitrary data from an FTSENT. However, there already are fts(3) consumers that require fts_number, or fts_bignum, have at least 64 bits in it, so we must allow for them. o For the tree depth, use `long'. This is a trade-off between making this field too wide and allowing for 64-bit inode numbers and/or chain-mounted filesystems. On the one hand, `long' is almost enough for 32-bit filesystems on a 32-bit platform (our ino_t is uint32_t now). On the other hand, platforms with a 64-bit (or wider) `long' will be ready for 64-bit inode numbers, as well as for several 32-bit filesystems mounted one under another. Note that fts_level has to be signed because -1 is a magic value for it, FTS_ROOTPARENTLEVEL. o For the `nlinks' local var in fts_build(), use `long'. The logic in fts_build() requires that `nlinks' be signed, but our nlink_t currently is uint16_t. Therefore let's make the signed var wide enough to be able to represent 2^16-1 in pure C99, and even 2^32-1 on a 64-bit platform. Perhaps the logic should be changed just to use nlink_t, but it can be done later w/o breaking fts(3) ABI any more because `nlinks' is just a local var. This commit also inludes supporting stuff for the fts change: o Preserve the old versions of fts(3) functions through libc symbol versioning because the old versions appeared in all our former releases. o Bump __FreeBSD_version just in case. There is a small chance that some ill-written 3-rd party apps may fail to build or work correctly if compiled after this change. o Update the fts(3) manpage accordingly. In particular, remove references to fts_bignum, which was a FreeBSD-specific hack to work around the too narrow types of FTSENT members. Now fts_number is at least 64 bits wide (long long) and fts_bignum is an undocumented alias for fts_number kept around for compatibility reasons. According to Google Code Search, the only big consumers of fts_bignum are in our own source tree, so they can be fixed easily to use fts_number. o Mention the change in src/UPDATING. PR: bin/104458 Approved by: re (quite a while ago) Discussed with: deischen (the symbol versioning part) Reviewed by: -arch (mostly silence); das (generally OK, but we didn't agree on some types used; assuming that no objections on -arch let me to stick to my opinion)	2008-01-26 17:09:40 +00:00
ache	061b803830	Fix longstanding mb/wc functions segfault if error occurse inside _<encoding>_init(). Currently _EUC_init() only was affected.	2008-01-23 03:05:35 +00:00
ache	28095b28d0	Better fix for longstanding segfault. Don't touch current locale at all on unknown encoding. Previous fix resets it to POSIX.	2008-01-23 02:17:27 +00:00
ache	76c6a978cc	1) Add (void) cast to _none_init() (while I am here) 2) Fix longstanding segfault in mb/wc code when unknown encoding is specified in the locale file (mb/wc functions becomes NULL in that case).	2008-01-23 01:57:26 +00:00
trhodes	3c543fe5ae	Xref flopen.3 which references this manual page. PR: 112650	2008-01-22 15:56:48 +00:00
ache	c52b8566b4	Introduce new encoding: "ASCII" It differs from default C/POSIX "NONE" mainly by stricter 8bit check for mbtowc/wctomb family, returning EILSEQ	2008-01-21 23:48:12 +00:00
das	764b848f5d	Add a new union member to access the exponent and sign of a long double in a single op. Idea from bde.	2008-01-18 21:25:51 +00:00
bde	c553ad248f	Add an alternative view of the bits in an 80-bit long double (64+16 instead of 32+32+15+1) on all arches that have such long doubles (amd64, ia64 and i386). Large objects should be be accessed in large units, and the 32+32+15+1[+padding] decomposition asks for almost the opposite of that, sometimes resulting in very slow accesses depending on how well the compiler ignores what we ask for and converts to the best units for the given machine. E.g., on Athlons, there is a 10-20 cycle penalty for accessing the middle 32-bit word immediately after an 80-bit store. Whether actually using the alternative view is better is very machine- dependent. A 32+32+16 view is probably best with old 32-bit systems and gcc through 4.2.1. The compiler should mostly avoid the view and generate best accesses, but gcc-4.2.1 is far from doing that. I think 64+16 is best for now. Similarly for doubles -- they should be using 64+0 especially on 64-bit machines, but fdlibm uses 32+32 extensively for them. Fortunately, in 64-bit mode for doubles, gcc already ignores the 32+32-bit view and generates best accesses in many cases.	2008-01-17 16:39:07 +00:00
remko	af0f4dc1e0	Fix some style nits. Prodded by: brueffer MFC After: 3 days	2008-01-16 19:36:21 +00:00

1 2 3 4 5 ...

5719 Commits