freebsd-dev

Author	SHA1	Message	Date
Bjoern A. Zeeb	97fc027722	Try to unbreak the build after r285390 removing the obsolete static declaration.	2015-07-12 00:26:22 +00:00
Luiz Otavio O Souza	a8921b867f	Return the FDT node of the GPIO controller to gpiobus. It is used by the children of gpiobus.	2015-07-11 21:09:43 +00:00
Ed Schouten	4f1905177a	Implement normal and abnormal process termination. CloudABI does not provide an explicit kill() system call, for the reason that there is no access to the global process namespace. Instead, it offers a raise() system call that can at least be used to terminate the process abnormally. CloudABI does not support installing signal handlers. CloudABI's raise() system call should behave as if the default policy is set up. Call into kern_sigaction(SIG_DFL) before calling sys_kill() to force this. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-11 19:41:31 +00:00
Ed Schouten	a4001f4cb9	Use FDDUP_NORMAL instead of hardcoding value 0. Proposed by: mjg	2015-07-11 18:53:30 +00:00
Ed Schouten	329d1bca7f	Add missing function parameter. A function parameter got added in r285356, meaning that the call to kern_dup() needs to be patched up.	2015-07-11 18:39:16 +00:00
Justin Hibbits	20b6ee617f	cpu_number and cpu_swapout are never used, and only defined in powerpc.	2015-07-11 17:33:50 +00:00
Mateusz Guzik	b34be824a0	linprocfs: vref the vnode passed to vn_fullpath	2015-07-11 16:44:28 +00:00
Mateusz Guzik	c634b75204	vfs: always clear VI_OWEINACT in consumers bumping v_usecount Previously vputx would detect the condition and clear the flag. With this change it is invalid to have both v_usecount > 0 and the flag set. Assert the condition is met in all revlevant places. Reviewed by: kib	2015-07-11 16:28:55 +00:00
Mateusz Guzik	2d1ca3cdff	vfs: move si_usecount manipulation to dedicated functions Reviewed by: kib	2015-07-11 16:28:12 +00:00
Mateusz Guzik	8a08cec166	Create a dedicated function for ensuring that cdir and rdir are populated. Previously several places were doing it on its own, partially incorrectly (e.g. without the filedesc locked) or even actively harmful by populating jdir or assigning rootvnode without vrefing it. Reviewed by: kib	2015-07-11 16:22:48 +00:00
Mateusz Guzik	f0725a8e1e	Move chdir/chroot-related fdp manipulation to kern_descrip.c Prefix exported functions with pwd_. Deduplicate some code by adding a helper for setting fd_cdir. Reviewed by: kib	2015-07-11 16:19:11 +00:00
Andrew Turner	70915d1289	Always send a SIGSEGV on a map failure. Use the code to tell the reason for the signal. Sponsored by: ABT Systems Ltd	2015-07-11 16:02:06 +00:00
Adrian Chadd	871ef8b0d8	Regenerate syscalls.	2015-07-11 15:22:11 +00:00
Adrian Chadd	6520495abc	Add an initial NUMA affinity/policy configuration for threads and processes. This is based on work done by jeff@ and jhb@, as well as the numa.diff patch that has been circulating when someone asks for first-touch NUMA on -10 or -11. * Introduce a simple set of VM policy and iterator types. * tie the policy types into the vm_phys path for now, mirroring how the initial first-touch allocation work was enabled. * add syscalls to control changing thread and process defaults. * add a global NUMA VM domain policy. * implement a simple cascade policy order - if a thread policy exists, use it; if a process policy exists, use it; use the default policy. * processes inherit policies from their parent processes, threads inherit policies from their parent threads. * add a simple tool (numactl) to query and modify default thread/process policities. * add documentation for the new syscalls, for numa and for numactl. * re-enable first touch NUMA again by default, as now policies can be set in a variety of methods. This is only relevant for very specific workloads. This doesn't pretend to be a final NUMA solution. The previous defaults in -HEAD (with MAXMEMDOM set) can be achieved by 'sysctl vm.default_policy=rr'. This is only relevant if MAXMEMDOM is set to something other than 1. Ie, if you're using GENERIC or a modified kernel with non-NUMA, then this is a glorified no-op for you. Thank you to Norse Corp for giving me access to rather large (for FreeBSD!) NUMA machines in order to develop and verify this. Thank you to Dell for providing me with dual socket sandybridge and westmere v3 hardware to do NUMA development with. Thank you to Scott Long at Netflix for providing me with access to the two-socket, four-domain haswell v3 hardware. Thank you to Peter Holm for running the stress testing suite against the NUMA branch during various stages of development! Tested: * MIPS (regression testing; non-NUMA) * i386 (regression testing; non-NUMA GENERIC) * amd64 (regression testing; non-NUMA GENERIC) * westmere, 2 socket (thankyou norse!) * sandy bridge, 2 socket (thankyou dell!) * ivy bridge, 2 socket (thankyou norse!) * westmere-EX, 4 socket / 1TB RAM (thankyou norse!) * haswell, 2 socket (thankyou norse!) * haswell v3, 2 socket (thankyou dell) * haswell v3, 2x18 core (thankyou scott long / netflix!) * Peter Holm ran a stress test suite on this work and found one issue, but has not been able to verify it (it doesn't look NUMA related, and he only saw it once over many testing runs.) * I've tested bhyve instances running in fixed NUMA domains and cpusets; all seems to work correctly. Verified: * intel-pcm - pcm-numa.x and pcm-memory.x, whilst selecting different NUMA policies for processes under test. Review: This was reviewed through phabricator (https://reviews.freebsd.org/D2559) as well as privately and via emails to freebsd-arch@. The git history with specific attributes is available at https://github.com/erikarn/freebsd/ in the NUMA branch (https://github.com/erikarn/freebsd/compare/local/adrian_numa_policy). This has been reviewed by a number of people (stas, rpaulo, kib, ngie, wblock) but not achieved a clear consensus. My hope is that with further exposure and testing more functionality can be implemented and evaluated. Notes: * The VM doesn't handle unbalanced domains very well, and if you have an overly unbalanced memory setup whilst under high memory pressure, VM page allocation may fail leading to a kernel panic. This was a problem in the past, but it's much more easily triggered now with these tools. * This work only controls the path through vm_phys; it doesn't yet strongly/predictably affect contigmalloc, KVA placement, UMA, etc. So, driver placement of memory isn't really guaranteed in any way. That's next on my plate. Sponsored by: Norse Corp, Inc.; Dell	2015-07-11 15:21:37 +00:00
Konstantin Belousov	cf88021ab1	Do not allow creation of the dirty buffers for the dead buffer objects, i.e. for buffer objects which vnode was reclaimed. Buffer cache cannot write such buffers. Return the error and discard the buffer immediately on write attempt. BO_DIRTY now always set during vnode reclamation, since it is used not only for the INVARIANTS checks. Do allow placement of the clean buffers on dead bufobj list, otherwise filesystems cannot use bufcache at all after the devvp reclaim. Reported and tested by: trasz Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-11 11:21:56 +00:00
John-Mark Gurney	2ff9c4f915	Complete the move that was started w/ r263218.. For some reason I didn't delete the files, so that means we need to bring the changes in r282726 to the correct files.. make tinderbox completed with this patch...	2015-07-11 03:12:34 +00:00
Pawel Jakub Dawidek	4273d41299	Spoil even can happen for some time now even on providers opened exclusively (on the media change event). Update GELI to handle that situation. PR: 201185 Submitted by: Matthew D. Fuller	2015-07-10 19:27:19 +00:00
Luigi Rizzo	4af7aed7c6	assorted algorithmic fixes from Paolo Valente (one of my qfq coauthors): - use 1ULL to avoid shift truncations - recompute the sum of weight dynamically to provide better fairness - fix an erroneous constant in the computation of the slot - preserve timestamp correctness when the old timestamp is stale.	2015-07-10 19:24:36 +00:00
Luigi Rizzo	e38e277fc4	one more warning suppression when compiling the test code in userspace.	2015-07-10 19:18:49 +00:00
Luigi Rizzo	e25716b7cc	add code to compute fairness indexes; cleanups to remove compile warnings.	2015-07-10 18:10:40 +00:00
Luigi Rizzo	8fd44c9395	staticize functions only used in netmap.c (detected by jenkins run with gcc 4.9) Update documentation on the use of netmap_priv_d, rename the refcount and use the same structure in FreeBSD and linux No functional changes.	2015-07-10 16:05:24 +00:00
Ed Schouten	ea566832d7	Add missing const keyword to kern_sigaction()'s 'act' parameter. This structure is not modified by the function. Also add const to sigact_flag_test(), as it is called by kern_sigaction().	2015-07-10 14:39:46 +00:00
Mateusz Guzik	9a1ad66fb5	fd: further cleanup of kern_dup - make mode enum start from 0 so that the assertion covers all cases [1] - rename prefix _CLOEXEC flag with _FLAG - postpone fhold on the old file descriptor, which eliminates the need to fdrop in error cases. - fixup FDDUP_FCNTL check missed in the previous commit This removes 'fp == oldfde->fde_file' assertion which had little value. kern_dup only calls fd-related functions which cannot drop the lock or a whole lot of races would be introduced. Noted by: kib [1]	2015-07-10 13:54:03 +00:00
Mateusz Guzik	5fe97c20dc	fd: split kern_dup flags argument into actual flags and a mode Tidy up the code inside to switch on the mode.	2015-07-10 11:01:30 +00:00
Konstantin Belousov	85237e335d	Convert between abridged (from FXSAVE) and unabridged (from FSAVE) versions of the x87 tags. The conversion is naive, used abridged tag is converted to valid unabridged, without additional checks for zero and special values. Noted by: bde Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-10 09:20:13 +00:00
Konstantin Belousov	bcfc2be186	Duplicate the copyright from the i386/i386/machdep.c into i386/include/frame.h after a code was moved from machdep.c to frame.h in r284925. Use include guards style similar to other guards. Noted by: bde Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-10 09:15:06 +00:00
Konstantin Belousov	e8677f3885	Change the mb() use in the sched_ult tdq_notify() and sched_idletd() to more C11-ish atomic_thread_fence_seq_cst(). Note that on PowerPC, which currently uses lwsync for mb(), the change actually fixes the missed store/load barrier, intended by r271604 []. Reviewed by: alc Noted by: alc [] Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-10 08:54:12 +00:00
Luigi Rizzo	0fdeab7bc5	add netmap dependency when compiled as a module	2015-07-10 07:13:14 +00:00
Ed Schouten	47a84387ad	Let listen() return EDESTADDRREQ when not bound. We currently return EINVAL when calling listen() on a UNIX socket that has not been bound to a pathname. If my interpretation of POSIX is correct, we should return EDESTADDRREQ: "The socket is not bound to a local address, and the protocol does not support listening on an unbound socket." Return EDESTADDRREQ instead when not bound and not connected. Differential Revision: https://reviews.freebsd.org/D3038 Reviewed by: gnn, network	2015-07-10 06:47:14 +00:00
Luigi Rizzo	847bf38369	Sync netmap sources with the version in our private tree. This commit contains large contributions from Giuseppe Lettieri and Stefano Garzarella, is partly supported by grants from Verisign and Cisco, and brings in the following: - fix zerocopy monitor ports and introduce copying monitor ports (the latter are lower performance but give access to all traffic in parallel with the application) - exclusive open mode, useful to implement solutions that recover from crashes of the main netmap client (suggested by Patrick Kelsey) - revised memory allocator in preparation for the 'passthrough mode' (ptnetmap) recently presented at bsdcan. ptnetmap is described in S. Garzarella, G. Lettieri, L. Rizzo; Virtual device passthrough for high speed VM networking, ACM/IEEE ANCS 2015, Oakland (CA) May 2015 http://info.iet.unipi.it/~luigi/research.html - fix rx CRC handing on ixl - add module dependencies for netmap when building drivers as modules - minor simplifications to device-specific routines (txsync, rxsync) - general code cleanup (remove unused variables, introduce macros to access rings and remove duplicate code, Applications do not need to be recompiled, unless of course they want to use the new features (monitors and exclusive open). Those willing to try this code on stable/10 can just update the sys/dev/netmap/, sys/net/netmap with the version in HEAD and apply the small patches to individual device drivers. MFC after: 1 month Sponsored by: (partly) Verisign, Cisco	2015-07-10 05:51:36 +00:00
George V. Neville-Neil	0b75d21e18	Summary: Fix LINT build. The names of the new AES modes were not correctly used under the REGRESSION kernel option.	2015-07-10 02:23:50 +00:00
Dimitry Andric	4afafe0c86	Fix swapped copyin(9) arguments in cxgb's iwch_arm_cq() function. Detected by clang 3.7.0 with the warning: sys/dev/cxgb/ulp/iw_cxgb/iw_cxgb_provider.c:309:18: error: variable 'rptr' is uninitialized when used here [-Werror,-Wuninitialized] chp->cq.rptr = rptr; ^~~~ MFC after: 1 week	2015-07-09 22:13:23 +00:00
Mariusz Zaborski	306a82f8f4	Rename zfs nvpair files to not colidate with our nvlist. PR: 201356 Approved by: pjd (mentor)	2015-07-09 21:53:40 +00:00
Andrew Turner	3303004f1a	Remove checks for __ARM_EABI__, we only build for EABI now. Sponsored by: ABT Systems Ltd	2015-07-09 21:02:40 +00:00
Andrew Turner	6c50960be6	Add support for __aeabi_memclr4, clang 3.7 calls it. Sponsored by: ABT Systems Ltd	2015-07-09 20:54:38 +00:00
George V. Neville-Neil	16de9ac1b5	Add support for AES modes to IPSec. These modes work both in software only mode and with hardware support on systems that have AESNI instructions. Differential Revision: D2936 Reviewed by: jmg, eri, cognet Sponsored by: Rubicon Communications (Netgate)	2015-07-09 18:16:35 +00:00
Andrew Turner	bf1717e566	Clear the carry bit on the saved program state register when asked to clear the return value, it's used to indicate an error. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 17:26:56 +00:00
Mateusz Guzik	318b946321	vfs: cosmetic changes to namei and namei_handle_root - don't initialize cnp during declaration - don't test error/!error, compare to 0 instead	2015-07-09 17:17:26 +00:00
Mateusz Guzik	d177f49f6f	vfs: simplify error handling in namei The logic is reorganised so that there is one exit point prior to the lookup loop. This is an intermediate step to making audit logging functions use found vnode instead of translating ni_dirfd on their own. ni_startdir validation is removed. The only in-tree consumer is nfs which already makes sure it is a directory. Reviewed by: kib	2015-07-09 16:32:58 +00:00
Ermal Luçi	56844a6203	Correct issue presented in r285051, apparently neither clang nor gcc complain about this. But clang intis the var to NULL correctly while gcc on at least mips does not. Correct the undefined behavior by initializing the variable properly. PR: 201371 Differential Revision: https://reviews.freebsd.org/D3036 Reviewed by: gnn Approved by: gnn(mentor)	2015-07-09 16:28:36 +00:00
Ed Schouten	2491302a04	Add implementations for some of the CloudABI file descriptor system calls. All of the CloudABI system calls that operate on file descriptors of an arbitrary type are prefixed with fd_. This change adds wrappers for most of these system calls around their FreeBSD equivalents. The dup2() system call present on CloudABI deviates from POSIX, in the sense that it can only be used to replace existing file descriptor. It cannot be used to create new ones. The reason for this is that this is inherently thread-unsafe. Furthermore, there is no need on CloudABI to use fixed file descriptor numbers. File descriptors 0, 1 and 2 have no special meaning. This change exposes the kern_dup() through <sys/syscallsubr.h> and puts the FDDUP_* flags in <sys/filedesc.h>. It then adds a new flag, FDDUP_MUSTREPLACE to force that file descriptors are replaced -- not allocated. Differential Revision: https://reviews.freebsd.org/D3035 Reviewed by: mjg	2015-07-09 16:07:01 +00:00
Mateusz Guzik	efdc25304c	fd: prepare do_dup for being exported - rename it to kern_dup. - prefix flags with FD - assert that correct flags were passed	2015-07-09 15:19:45 +00:00
Mateusz Guzik	d19ba50e12	vfs: avoid spurious vref/vrele for absolute lookups namei used to vref fd_cdir, which was immediatley vrele'd on entry to the loop. Check for absolute lookup and vref the right vnode the first time. Reviewed by: kib	2015-07-09 15:06:58 +00:00
Mateusz Guzik	a03f1b2970	vfs: plug a use-after-free of fd_rdir in namei fd_rdir vnode was stored in ni_rootdir without refing it in any way, after which the filedsc lock was being dropped. The vnode could have been freed by mountcheckdirs or another thread doing chroot. VREF the vnode while the lock is held. Reviewed by: kib MFC after: 1 week	2015-07-09 15:06:24 +00:00
Andrew Turner	b2b5507779	Add support for SMP. This uses the FDT data to find the CPUs to start on, and psci to start them. I expect ACPI support to be added later. This has been tested on qemu with 2 cpus as that is the current value of MAXCPUS. This is expected to be increased in the future as FreeBSD has been tested on 48 cores on the Cavium ThunderX hardware. Partially based on a patch from Robin Randhawa from ARM. Approved by: ABT Systems Ltd Relnotes: yes Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3024	2015-07-09 13:23:29 +00:00
Andrew Turner	3ad7e84ef5	Add logging of synchronous exceptions. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 13:07:12 +00:00
Andrew Turner	7df38eabdc	Add the definition of the shareable bits in the pagetables Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 12:56:09 +00:00
Andrew Turner	144aa0b7f5	Clean up the types used in <machine/ucontext.h> on arm64. As some ports include this file without first including the headers needed for uint32_t and the like use the __foo type. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 12:51:50 +00:00
Ed Schouten	3a41ec6af7	Don't clobber td->td_retval[0] in proc_reap(). While writing tests for CloudABI, I noticed that close() on process descriptors returns the process ID of the child process. This is interesting, as close() is only allowed to return 0 or -1. It turns out that we clobber td->td_retval[0] in proc_reap(), so that wait*() properly returns the process ID. Change proc_reap() to leave td->td_retval[0] alone. Set the return value in kern_wait6() instead, by keeping track of the PID before we (potentially) reap the process. Differential Revision: https://reviews.freebsd.org/D3032 Reviewed by: kib	2015-07-09 12:04:45 +00:00
Zbigniew Bodek	6c03ba71f8	Rework CPU identification on ARM64 This commit reworks the code responsible for identification of the CPUs during runtime. It is necessary to provide a way for workarounds and erratums to be applied only for certain HW versions. The copy of MIDR is now stored in pcpu to provide a fast and convenient way for assambly code to read it (pcpu is used quite often so there is a chance it's inside the cache). The MIDR is also better way of identification than using user-friendly cpu_desc structure, because it can be compiled into comparision of single u32 with only one access to the memory - this is crucial for some erratums which are called from performance-critical places. Changes in cpu_identify makes this function safe to be called on non-boot CPUs. New function CPU_MATCH was implemented which returns boolean value based on mathing masked MIDR with chip identification. Example of usage: printf("is thunder: %d\n", CPU_MATCH(CPU_IMPL_MASK \| CPU_PART_MASK, CPU_IMPL_CAVIUM, CPU_PART_THUNDER, 0, 0)); printf("is generic: %d\n", CPU_MATCH(CPU_IMPL_MASK \| CPU_PART_MASK, CPU_IMPL_ARM, CPU_PART_FOUNDATION, 0, 0)); Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3030	2015-07-09 11:32:29 +00:00
Konstantin Belousov	fcb5b3a419	Cover a race between doselwakeup() and selfdfree(). If doselwakeup() loop finds the selfd entry and clears its sf_si pointer, which is handled by selfdfree() in parallel, NULL sf_si makes selfdfree() free the memory. The result is the race and accesses to the freed memory. Refcount the selfd ownership. One reference is for the sf_link linkage, which is unconditionally dereferenced by selfdfree(). Another reference is for sf_threads, both selfdfree() and doselwakeup() race to deref it, the winner unlinks and than frees the selfd entry. Reported by: Larry Rosenman <ler@lerctr.org> Tested by: Larry Rosenman <ler@lerctr.org>, pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-09 09:22:21 +00:00
Ed Schouten	39b160b3e2	Add forward declaration of struct thread. This structure is used in some of the functions in this header, but we don't depend on any header that pulls it i.	2015-07-09 07:31:40 +00:00
Ed Schouten	f355e810cf	Generate CloudABI system call table with proper $FreeBSD$ tags.	2015-07-09 07:21:33 +00:00
Ed Schouten	6d338f9a81	Import the CloudABI datatypes and create a system call table. CloudABI is a pure capability-based runtime environment for UNIX. It works similar to Capsicum, except that processes already run in capabilities mode on startup. All functionality that conflicts with this model has been omitted, making it a compact binary interface that can be supported by other operating systems without too much effort. CloudABI is 'secure by default'; the idea is that it should be safe to run arbitrary third-party binaries without requiring any explicit hardware virtualization (Bhyve) or namespace virtualization (Jails). The rights of an application are purely determined by the set of file descriptors that you grant it on startup. The datatypes and constants used by CloudABI's C library (cloudlibc) are defined in separate files called syscalldefs_mi.h (pointer size independent) and syscalldefs_md.h (pointer size dependent). We import these files in sys/contrib/cloudabi and wrap around them in cloudabi*_syscalldefs.h. We then add stubs for all of the system calls in sys/compat/cloudabi or sys/compat/cloudabi64, depending on whether the system call depends on the pointer size. We only have nine system calls that depend on the pointer size. If we ever want to support 32-bit binaries, we can simply add sys/compat/cloudabi32 and implement these nine system calls again. The next step is to send in code reviews for the individual system call implementations, but also add a sysentvec, to allow CloudABI executabled to be started through execve(). More information about CloudABI: - GitHub: https://github.com/NuxiNL/cloudlibc - Talk at BSDCan: https://www.youtube.com/watch?v=SVdF84x1EdA Differential Revision: https://reviews.freebsd.org/D2848 Reviewed by: emaste, brooks Obtained from: https://github.com/NuxiNL/freebsd	2015-07-09 07:20:15 +00:00
John-Mark Gurney	275a0a97ed	upon further examination, it turns out that _unregister_all already provides the guarantee that no threads will be in the _newsession code.. This is provided by the CRYPTODRIVER lock... This makes the pause unneeded...	2015-07-08 22:48:41 +00:00
Mateusz Guzik	06d1ada870	seq: use seq_consistent_nomb in seq_consistent Constify seqp argument for seq_consistent_nomb. No functional changes.	2015-07-08 22:21:25 +00:00
Zbigniew Bodek	0a3f65a107	Style cleanups after r285270 There should be no semicolons in added macro definitions. Define empty macro as "do {} while (0)". Pointed out by: jmg	2015-07-08 22:09:47 +00:00
John-Mark Gurney	e808e13b8b	Now that aesni won't reuse fpu contexts (D3016), add seatbelts to the fpu code to prevent other reuse of the contexts in the future... Differential Revision: https://reviews.freebsd.org/D3015 Reviewed by: kib, gnn	2015-07-08 19:26:36 +00:00
John-Mark Gurney	9d38fd076e	address an issue where consumers, like IPsec, can reuse the same session in multiple threads w/o locking.. There was a single fpu context shared per session, if multiple threads were using the session, and both migrated away, they could corrupt each other's fpu context... This patch adds a per cpu context and a lock to protect it... It also tries to better address unloading of the aesni module... The pause will be removed once the OpenCrypto Framework provides a better method for draining callers into _newsession... I first discovered the fpu context sharing issue w/ a flood ping over an IPsec tunnel between two bhyve machines... The patch in D3015 was used to verify that this fix does fix the issue... Reviewed by: gnn, kib (both earlier versions) Differential Revision: https://reviews.freebsd.org/D3016	2015-07-08 19:15:29 +00:00
Konstantin Belousov	f4b5a9725a	Reimplement the ordering requirements for the timehands updates, and for timehands consumers, by using fences. Ensure that the timehands->th_generation reset to zero is visible before the data update is visible []. tc_setget() allowed data update writes to become visible before generation (but not on TSO architectures). Remove tc_setgen(), tc_getgen() helpers, use atomics inline []. Noted by: alc [] Requested by: bde [**] Reviewed by: alc, bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:42:08 +00:00
Konstantin Belousov	261fda00cd	Use atomic_fence_fence_rel() to ensure ordering in the seq_write_begin(), instead of the load_rmb/rbm_load functions. The update does not need to be atomic due to the write lock owned. Similarly, in seq_write_end(), update of seqp needs not be atomic. Only store must be atomic with release. For seq_read(), the natural operation is the load acquire of the sequence value, express this directly with atomic_load_acq_int() instead of using custom partial fence implementation atomic_load_rmb_int(). In seq_consistent, use atomic_thread_fence_acq() which provides the desired semantic of ordering reads before fence before the re-reading of seqp, instead of custom atomic_rmb_load_int(). Reviewed by: alc, bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:37:08 +00:00
Konstantin Belousov	8954a9a4e6	Add the atomic_thread_fence() family of functions with intent to provide a semantic defined by the C11 fences with corresponding memory_order. atomic_thread_fence_acq() gives r \| r, w, where r and w are read and write accesses, and \| denotes the fence itself. atomic_thread_fence_rel() is r, w \| w. atomic_thread_fence_acq_rel() is the combination of the acquire and release in single operation. Note that reads after the acq+rel fence could be made visible before writes preceeding the fence. atomic_thread_fence_seq_cst() orders all accesses before/after the fence, and the fence itself is globally ordered against other sequentially consistent atomic operations. Reviewed by: alc Discussed with: bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:12:24 +00:00
Alan Cox	22cf98d1f3	The intention of r254304 was to scan the active queue continuously. However, I've observed the active queue scan stopping when there are frequent free page shortages and the inactive queue is steadily refilled by other mechanisms, such as the sequential access heuristic in vm_fault() or madvise(2). To remedy this problem, record the time of the last active queue scan, and always scan a number of pages proportional to the time since the last scan, regardless of whether that last scan was a timeout-triggered ("pass == 0") or free-page-shortage-triggered ("pass > 0") scan. Also, on a timeout-triggered scan, allow a full scan of the active queue when the system is short of inactive pages. Reviewed by: kib MFC after: 6 weeks Sponsored by: EMC / Isilon Storage Division	2015-07-08 17:45:59 +00:00
Andrew Turner	6bae05d951	Correctly set __WCHAR_MIN, there is no __UINT_MIN, it's 0. Sponsored by: ABT Systems Ltd	2015-07-08 16:18:28 +00:00
Andrew Turner	ded32d88f1	Add support for ipi_all_but_self on arm64. Obtained from: ABT Systems Ltd Sponsored by: The freeBSD Foundation	2015-07-08 15:32:59 +00:00
Andrew Turner	80ad08a3e9	Add an implementation of savectx that doesn't just call panic. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-08 14:07:06 +00:00
Zbigniew Bodek	4981c60e07	Add memory barrier to bus_dmamap_sync() On platforms which are fully IO-coherent, the map might be null. We need to guarantee that all data is observable after the sync operation is called. Add a memory barrier to ensure that on ARM. Reviewed by: andrew, kib Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3012	2015-07-08 13:52:59 +00:00
Konstantin Belousov	69d11def74	Handle copyout for the fcntl(F_OGETLK) using oflock structure. Otherwise, kernel overwrites a word past the destination. Submitted by: walter@pelissero.de PR: 196718 MFC after: 1 week	2015-07-08 13:19:13 +00:00
Andrew Turner	cb02f6b942	Send the correct signal when vm_fault fails. While here also set the code and address fields. Sponsored by: ABT Systems Ltd	2015-07-08 12:42:44 +00:00
John-Mark Gurney	a13589bc47	unroll the loop slightly... This improves performance enough to justify, especially for CBC performance where we can't pipeline.. I don't happen to have my measurements handy though... Sponsored by: Netflix, Inc.	2015-07-07 20:31:09 +00:00
Mark Johnston	620711e033	Fix an incorrect assertion in witness. The number of available lock list entries for a thread is LOCK_CHILDCOUNT, and each entry can record up to LOCK_NCHILDREN locks. When iterating over the locks held by a thread, a bound on the loop index is therefore given by LOCK_CHILDCOUNT * LOCK_NCHILDREN; WITNESS_COUNT is an unrelated constant. Reviewed by: jhb MFC after: 1 week Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2974	2015-07-07 19:29:18 +00:00
Luiz Otavio O Souza	f935da6fee	Add the Banana Pi DTS. The Banana Pi support is in progress and this is intended to help the early adopters.	2015-07-07 19:01:54 +00:00
John-Mark Gurney	748a12e2c3	we may get here w/ non-sleepable locks held, so switch to _NOWAIT when doing this memory allocation... Reviewed by: ae	2015-07-07 18:45:32 +00:00
Ed Maste	906451276a	Avoid creating invalid UEFI device path The UEFI loader on the 10.1 release install disk (disc1) modifies an existing EFI_DEVICE_PATH_PROTOCOL instance in an apparent attempt to truncate the device path. In doing so it creates an invalid device path. Perform the equivalent action without modification of structures allocated by firmware. PR: 197641 MFC After: 1 week Submitted by: Chris Ruffin <chris.ruffin@intel.com>	2015-07-07 18:44:27 +00:00
Luiz Otavio O Souza	f4b61a34e5	Add the GMAC entries to sun7i (A20) DTS. While here make EMAC disabled unless explicitly enabled.	2015-07-07 18:32:23 +00:00
Takanori Watanabe	99043514c6	Fix rfcomm_sppd regression I could reproduced. To reproduce it, Two machine running FreeBSD and run rfcomm_sppd -c 3 -S rfcomm_sppd -a ${PEER} -c 3 on each side.	2015-07-07 15:56:51 +00:00
Pedro F. Giffuni	9129dd59be	Relocate sched_random() within the SMP section. Place sched_random nearer to where it's first used: moving the code nearer to where it is used makes the code easier to read and we can reduce the initial "#ifdef SMP" island. Reword a little the comment and clean some whitespaces while here.	2015-07-07 15:22:29 +00:00
Achim Leubner	4e1bc9a039	Driver 'pmspcv' added. Supports PMC-Sierra PM8001/8081/8088/8089/8074/8076/8077 SAS/SATA HBA Controllers.	2015-07-07 13:17:02 +00:00
Michael Tuexen	29b9533b43	Export the ssthresh value per SCTP path via the sysctl interface. MFC after: 1 month	2015-07-07 06:34:28 +00:00
Adrian Chadd	54991f37c9	Attempt to make 5GHz HT/40 work on the 6xxx series NICs. The 6205 (Taylor Peak) in the Lenovo X230 works fine in 5GHz 11a and 11n HT20, but not 11n HT40. The NIC goes RX deaf the moment HT40 is configured. It's so RX deaf that it doesn't even hear beacons and the firmware sends "BEACON MISS" events. That's pretty deaf. I tried configuring up the HT40 flags in monitor mode and it worked - so I assumed that doing the transition from 20 -> 40MHz channel configuration when going auth->assoc (ie, after the NIC has been partially configured) is a problem. So for now, let's just always set them if they're available. Tested: * Intel 5300, STA mode, 5GHz HT/40 AP; 2GHz HT/20 AP * Intel 6205, STA mode, 5GHz HT/40, HT20, 11a AP; 2GHz HT/20 AP This was pointed out to me by coworkers trying to use FreeBSD-HEAD in the office on their Thinkpad T420p laptops. TODO: * I don't like how the HT40 flags are configured - the whole interop/ protection config should be re-checked. Notably, I think curhtprotmode is 0 in a lot of cases, which means "no interoperability" and i think that's busted. Sponsored by: Norse Corp, Inc.	2015-07-07 03:51:29 +00:00
Justin Hibbits	7f78865ba0	Enable the wireless on attach. This comes from the archives of "forgotten in the original commit, and probably pointless now because nobody uses it, and the driver's broken anyway."	2015-07-07 02:42:48 +00:00
Justin Hibbits	44027a8321	style(9) cleanups. Don't use PRIxPTR, these registers are 32-bits, cast to u_long instead. Pointed out by: bde	2015-07-07 02:37:29 +00:00
Navdeep Parhar	9af71ab3bc	cxgbe(4): Add a new knob that controls the congestion response of netmap rx queues. The default is to drop rather than backpressure. This decouples the congestion settings of NIC and netmap rx queues. MFC after: 3 days	2015-07-06 20:56:59 +00:00
Navdeep Parhar	41f7622b64	cxgbe(4): Do not override the the global defaults for congestion drops. The hw.cxgbe.cong_drop knob is not affected by this change because the driver sets up congestion drop on a per-queue basis. MFC after: 3 days	2015-07-06 20:28:42 +00:00
Neel Natu	5e4f29c037	Move the 'devmem' device nodes from /dev/vmm to /dev/vmm.io Some external tools just do a 'ls /dev/vmm' to figure out the bhyve virtual machines on the host. These tools break if the devmem device nodes also appear in /dev/vmm. Requested by: grehan	2015-07-06 19:41:43 +00:00
John-Mark Gurney	5a550cca9a	Fix for non-random IV's when CRD_F_IV_PRESENT and CRD_F_IV_EXPLICIT flags are not specified... This bug was introduced in r275732... This only affects IPsec ESP only policies w/ the aesni module loaded, other subsystems specify one or both of the flags... Reviewed by: gnn, delphij, eri	2015-07-06 19:30:29 +00:00
John-Mark Gurney	bcc0b68477	remove _NORMAL flag which isn't suppose to be used w/ _alloc_ctx... Reviewed by: kib (a while ago)	2015-07-06 19:17:56 +00:00
Mateusz Guzik	aa0e2887f4	tty: replace several curthread->td_proc with stored curproc No functional changes.	2015-07-06 18:53:56 +00:00
Zbigniew Bodek	1ae9c994c8	Introduce ITS support for ARM64 Add ARM ITS (Interrupt Translation Services) support required to bring-up message signalled interrupts on some ARM64 platforms. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation	2015-07-06 18:27:41 +00:00
Andrew Turner	b67d1aad6f	Add more tlb invalidations. We currently invalidate when we may not need to, but with this I can boot on a simulator that models the tlb. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-06 18:27:18 +00:00
Luiz Otavio O Souza	1d7a730974	When initializing the (unused) TX descriptors it is not necessary set the chain bit. Obtained from: NetBSD	2015-07-06 17:13:17 +00:00
Luiz Otavio O Souza	ff0752c870	Use uint32_t consistently to store registers values. Always use unsigned numbers to avoid undefined behavior on (1 << 31). Remove unused variables and some stray semicolons. No functional changes.	2015-07-06 16:45:48 +00:00
Patrick Kelsey	6f99ea0520	Don't acquire sysctlmemlock in userland_sysctl() when the old value pointer is NULL, as in that case there are no userland pages that could potentially be wired. It is common for old to be NULL and oldlenp to be non-NULL in calls to userland_sysctl(), as this is used to probe for the length of a variable-length sysctl entry before retrieving a value. Note that it is typical for such calls to be made with an uninitialized value in oldlenp, so sysctlmemlock was essentially being acquired at random (depending on the uninitialized value in oldlenp being > PAGE_SIZE or not) for these calls prior to this patch. Differential Revision: https://reviews.freebsd.org/D2987 Reviewed by: mjg, kib Approved by: jmallett (mentor) MFC after: 1 month	2015-07-06 16:07:21 +00:00
Konstantin Belousov	9889bbac23	Mutex memory is not zeroed, add MTX_NEW. Reported and tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-06 14:09:00 +00:00
Andrey V. Elsukov	280d77a3bb	Fill the port and protocol information in the SADB_ACQUIRE message in case when security policy has it as required by RFC 2367. PR: 192774 Differential Revision: https://reviews.freebsd.org/D2972 MFC after: 1 week	2015-07-06 12:40:31 +00:00
Steven Hartland	64ce6b09a3	Correct bit offsets for ahci quirks Fix bit offsets causing incorrect quirks being reported on boot for ahci introduced by r280184. MFC after: 3 days Sponsored by: Multiplay	2015-07-06 09:44:07 +00:00
Justin Hibbits	3f3cffedce	Merge booke and aim interrupt.c files. Summary: Both booke and AIM interrupt.c files contain nearly identical code. This merges the two files, to reduce duplication. Reviewers: #powerpc, marcel Reviewed By: marcel Subscribers: imp Differential Revision: https://reviews.freebsd.org/D2991	2015-07-06 05:08:57 +00:00
Luiz Otavio O Souza	a5221d68dc	Fix the sent packets statistics for if_dwc.	2015-07-06 03:06:13 +00:00
Patrick Kelsey	e9617c305c	Fix if_loop so bpfwrite() can use it regardless of the state of bd_hdrcmplt. As if_loop does not use link-level headers, its behavior when used by bpfwrite() should be the same regardless of the state of bd_hdrcmplt. Without this change, libpcap (and other BPF users that work like it) fail when writing to loopback interfaces. Differential Revision: https://reviews.freebsd.org/D2989 Reviewed by: gnn, melifaro Approved by: jmallett (mentor) MFC after: 3 days	2015-07-06 02:12:49 +00:00
Mark Johnston	947401dd50	Move the comment describing namei(9) back to namei()'s definition. MFC after: 3 days	2015-07-05 22:56:41 +00:00
Mark Johnston	8bbd1f25b1	Remove a stale descriptive comment for gbincore(). The splay trees referenced in the comment were converted to path-compressed tries in r250551. MFC after: 3 days	2015-07-05 22:44:41 +00:00
Mark Johnston	5f34e93c58	Check suspendability on the mountpoint returned by VOP_GETWRITEMOUNT. This obviates the need for a MNTK_SUSPENDABLE flag, since passthrough filesystems like nullfs and unionfs no longer need to inherit this information from their lower layer(s). This change also restores the pre-r273336 behaviour of using the presence of a susp_clean VFS method to request suspension support. Reviewed by: kib, mjg Differential Revision: https://reviews.freebsd.org/D2937	2015-07-05 22:37:33 +00:00
Mark Johnston	010ba3842c	Add a local variable initialization needed in the OBJT_DEFAULT case. Reviewed by: kib Differential Revision: https://reviews.freebsd.org/D2992	2015-07-05 22:26:19 +00:00
Mateusz Guzik	f131759f54	fd: make 'rights' a manadatory argument to fget* functions	2015-07-05 19:05:16 +00:00
Andrew Turner	5f8583891f	Add the kernel functions needed to enable threading. Sponsored by: ABT Systems Ltd	2015-07-05 18:16:06 +00:00
Bjoern A. Zeeb	31c98473c1	Fix GENERIC64 and LINT64 powerpc builds after r285144.	2015-07-05 15:30:16 +00:00
Ian Lepore	1a6570fb1f	Enable ipsec by default on all armv6 platforms.	2015-07-05 14:16:31 +00:00
Ian Lepore	b108902461	Ensure all the required files get built when you include the IPSEC option.	2015-07-05 14:15:58 +00:00
Alexander Motin	d1f4058735	Make first step toward supporting target and initiator roles same time. To avoid conflicts between target and initiator devices in CAM, make CTL use target ID reported by HBA as its initiator_id in XPT_PATH_INQ. That target ID is known to never be used for initiator role, so it won't conflict. For Fibre Channel and FireWire HBAs this specific ID choice is irrelevant since all target IDs there are virtual. Same time for SPI HBAs it seems could be even requirement to use same target ID for both initiator and target roles. While there are some more things to polish in isp(4) driver, first tests of using both roles same time on the same port appeared successfull: # camcontrol devlist -v scbus0 on isp0 bus 0: <FREEBSD CTLDISK 0001> at scbus0 target 1 lun 0 (da20,pass21) <> at scbus0 target 256 lun 0 (ctl0) <> at scbus0 target -1 lun ffffffff (ctl1)	2015-07-05 03:38:58 +00:00
Alexander Motin	766a65a50d	Remove extra level of target ID indirection (isp_dev_map). FreeBSD never had limitation on number of target IDs, and there is no any other requirement to allocate them densely. Since slots of port database already populated just sequentially, there is no much need for another indirection to allocate sequentially too.	2015-07-05 02:09:46 +00:00
George V. Neville-Neil	aaa7cbfe1a	Summary: Add missing files necessary to build with IPSEC and crypto	2015-07-04 21:32:44 +00:00
George V. Neville-Neil	0661a7c224	Fix up tabs vs. spaces	2015-07-04 20:31:06 +00:00
Justin Hibbits	0936003e3d	Use the correct type for physical addresses. On Book-E, physical addresses are actually 36-bits, not 32-bits. This is currently worked around by ignoring the top bits. However, in some cases, the boot loader configures CCSR to something above the 32-bit mark. This is stage 1 in updating the pmap to handle 36-bit physaddr.	2015-07-04 19:00:38 +00:00
Alexander Motin	8656f200dc	Change comment added in r284540. This appeared to be not card's issue, but driver's, though solution is the same so far.	2015-07-04 18:51:54 +00:00
Alexander Motin	6bef0aa0c6	Drop discovered targets when initiator role is disabled.	2015-07-04 18:38:46 +00:00
Justin Hibbits	398973f809	Add machine check register printing This will print out the Memory Subsystem Status Register on MPC745x (G4+ class), and the Machine Check Status Register on Book-E class CPUs, to aid in debugging machine checks. Other relevant registers, for other CPUs, can be added in the future.	2015-07-04 18:16:41 +00:00
George V. Neville-Neil	3839369c03	Enable IPSEC in all GENERIC kernels. Universe and kernel build tests passed 4 July 2015 PR: 128030 Sponsored by: Rubicon Communications (Netgate)	2015-07-04 17:37:00 +00:00
Mariusz Zaborski	54f98da930	Move the nvlist source and private includes from sys/kern to seperate directory sys/contrib/libnv. The goal of this operation is to NOT install header files which shouldn't be used outside the nvlist library. Approved by: pjd (mentor)	2015-07-04 16:33:37 +00:00
Luiz Otavio O Souza	06844c0f7e	Install loader.rc with ARM u-boot loader (ubldr). loader.rc is the responsible to read and process loader.conf variables. This fix the issue of loader.conf being silently ignored. MFC after: 3 days	2015-07-04 16:19:38 +00:00
Mateusz Guzik	9ca30b0e06	vfs: use shared vnode locking when looking up ".." in vop_stdvptocnp Briefly discussed with: kib	2015-07-04 15:46:39 +00:00
Mateusz Guzik	dba0bec2bb	fd: de-k&r-ify functions + some whitespace fixes No functional changes.	2015-07-04 15:42:03 +00:00
Mateusz Guzik	ee5f66f820	sysctl: get rid of sysctl_lock/unlock Inline their contents into the only consumer.	2015-07-04 14:44:39 +00:00
Mariusz Zaborski	07929e4e77	Remove non-existent dnvlist functions. Approved by: pjd (mentor)	2015-07-04 10:33:33 +00:00
John-Mark Gurney	64ff224dd5	improve dependencies for this module a bit... not great, but at least gives some basics... I would add them to DPSRC, but due to the intrinsics headers, they can't be added...	2015-07-04 08:16:32 +00:00
Mateusz Guzik	d5fc115a1a	sysctl: remove a debugging printf which crept in with r285125	2015-07-04 07:01:43 +00:00
Mateusz Guzik	b8633775a8	sysctl: switch sysctllock to a sleepable rmlock The lock is almost never taken for writing.	2015-07-04 06:54:15 +00:00
Warner Losh	8cad626fbb	Cache _MPATH and pass it down into the modules build. Some NFS setups make the find it does extremely expensive, so compute it only once. Also make sure the 'traditional' module building method works at the expense of a bit of duplicated code.	2015-07-04 05:43:45 +00:00
Adrian Chadd	3e307100f6	Quieten the scorpion SoC/WMAC reset path. Stuff the non-error stuff under HALDEBUG().	2015-07-04 03:15:42 +00:00
Adrian Chadd	35aae619d7	Call the WMAC DDR flush before handling an interrupt for the Atheros AHB (internally) connected MAC. TODO: * verify the interrupt was for us before doing the DDR flush.	2015-07-04 03:07:28 +00:00
Adrian Chadd	c37904e8ba	Reshuffle all of the DDR flush operations into a single switch/mux, and start teaching subsystems about it. The Atheros MIPS platforms don't guarantee any kind of FIFO consistency with interrupts in hardware. So software needs to do a flush when it receives an interrupt and before it calls the interrupt handler. There are new ones for the QCA934x and QCA955x, so do a few things: * Get rid of the individual ones (for ethernet and IP2); * Create a mux and enum listing all the variations on DDR flushes; * replace the uses of IP2 with the relevant one (which will typically be "PCI" here); * call the USB DDR flush before calling the real USB interrupt handlers; * call the ethernet one upon receiving an interrupt that's for us, rather than never calling it during operation. Tested: * QCA9558 (TP-Link archer c7 v2) * AR9331 (Carambola 2) TODO: * PCI, USB, ethernet, etc need to do a double-check to see if the interrupt was truely for them before doing the DDR. For now I prefer "correct" over "fast".	2015-07-04 03:05:57 +00:00
Adrian Chadd	52f5515397	Wake up the hardware before doing anything in sysctl. This stops the panics that occur on MIPS platforms when doing say, 'sysctl dev.ath.0' whilst the MAC is asleep. The MIPS platform is rather unforgiving in getting power-save register access wrong and you will get all kinds of odd failures if you don't have things woken up at the right times. Tested: * QCA9558 (TP-Link Archer C7 v2) * AR9331 (Carambola 2) .. with no VAPs configured and ath0 down (thus the MAC is definitely asleep.) PR: kern/201117	2015-07-04 02:59:30 +00:00
Rick Macklem	2a3508eb48	If a "principal" argument isn't provided for a Kerberized NFS mount, the kernel would generate a bogus one with a ":/<path>" suffix. This would only occur for the case where there was no explicit "principal" argument and the getaddrinfo() call in mount_nfs.c failed to a return a cannonical name for the server. This patch fixes this unusual case. PR: 201073 Submitted by: masato@itc.naist.jp MFC after: 2 weeks	2015-07-03 22:11:07 +00:00
George V. Neville-Neil	987de84445	New AES modes for IPSec, user space components. Update setkey and libipsec to understand aes-gcm-16 as an encryption method. A partial commit of the work in review D2936. Submitted by: eri Reviewed by: jmg MFC after: 2 weeks Sponsored by: Rubicon Communications (Netgate)	2015-07-03 20:09:14 +00:00
Andrey V. Elsukov	cb207f93ca	Keep IPv6 address specified by IPV6_PKTINFO socket option in kernel internal form to be able handle link-local IPv6 addresses. Reported by: kp Tested by: kp	2015-07-03 19:01:38 +00:00
Luiz Otavio O Souza	50ad20b383	Add the routines to activate the GMAC clock and setup the GMAC mode. Tested on Cubieboard 2 and Banana pi.	2015-07-03 18:39:25 +00:00
Luiz Otavio O Souza	9639a6c7c9	Rename a10_emac_gpio_config() to a10_gpio_ethernet_activate() to make the change to GMAC easier on A20 SoCs. On A10 only the EMAC controller is available (fast ethernet), but on A20 there is also GMAC a high (or better) performant controller (gigabit ethernet). On A20 the both controllers uses the same pins to talk to the ethernet PHY (MII or RGMII) and they can be selected by the GPIO pin mux. There is work in progress to bring in GMAC support.	2015-07-03 17:54:41 +00:00
Luiz Otavio O Souza	2e33d3583d	Remove duplicate and unnecessary includes. While here remove an unused and wrong define.	2015-07-03 17:09:27 +00:00
Marcel Moolenaar	194709a520	Remove commented-out and non-existent cbus(4) attachment for uart(4).	2015-07-03 16:02:06 +00:00
Marcel Moolenaar	472aa69925	Allow proto(4) to be compiled into the kernel.	2015-07-03 15:56:00 +00:00
Ermal Luçi	c1fc5e9601	Reduce overhead of IPSEC for traffic generated from host When IPSEC is enabled on the kernel the forwarding path has an optimization to not enter the code paths for checking security policies but first checks if there is any security policy active at all. The patch introduces the same optimization but for traffic generated from the host itself. This reduces the overhead by 50% on my tests for generated host traffic without and SP active. Differential Revision: https://reviews.freebsd.org/D2980 Reviewed by: ae, gnn Approved by: gnn(mentor)	2015-07-03 15:31:56 +00:00
Ruslan Bukin	4ebd95ae06	o Add a description for virtio block device implemented in PISM (Bluespec C-interface device) o Add a kernel config Sponsored by: HEIF5	2015-07-03 14:46:57 +00:00
Ruslan Bukin	4934834d6b	Allow BERI virtio-platform code to operate with no PIO devices specified. We will use it with Bluespec simulator of CHERI processor for invalidating caches only.	2015-07-03 14:27:28 +00:00
Ruslan Bukin	116b8d2b0b	Add 'prewrite' method allowing us to run some platform-specific code before each write happens, e.g. write-back caches. This will help booting in Bluespec simulator of CHERI processor.	2015-07-03 14:13:16 +00:00
Luiz Otavio O Souza	7ec8c789c3	Add AHCI attachment code for Allwinner A10/A20 SoCs. The Allwinner SoC has an AHCI device on its internal main bus rather than the PCI bus. This SoC is somewhat underdocumented, and its SATA controller is no exception. The methods to support this chip were harvested from the Linux Allwinner SDK, and then constants invented to describe what's going on based on low-level constants contained in the SATA standard and guess work. This SoC requires a specific AHCI channel setup in order to start the operations on the channel properly. Clock setup and AHCI channel setup idea came from NetBSD. Tested on Cubieboard 2 and Banana pi (and attachment on Cubieboard by Pratik Singhal). Differential Revision: https://reviews.freebsd.org/D737 Submitted by: imp Reviewed by: imp, ganbold, mav, andrew	2015-07-03 14:11:01 +00:00
Roger Pau Monné	6a8e9695ba	netfront: preserve configuration across migrations Try to preserve the xn configuration when migrating. This is not always possible since the backend might not have the same set of options available, in which case we will try to preserve as many as possible. MFC after: 2 weeks PR: 183139 Reported by: mcdouga9@egr.msu.edu Sponsored by: Citrix Systems R&D	2015-07-03 12:09:05 +00:00
Hans Petter Selasky	49557d2481	Fix broken implementation of "kvasprintf()" function by adding missing kmalloc() call. Make function global instead of static inline to fix compiler warnings about passing variable argument lists to inline functions. MFC after: 1 week Sponsored by: Mellanox Technologies	2015-07-03 11:16:20 +00:00
Bjoern A. Zeeb	bfbc08b848	Move comment to the right position. PR: 152791 Submitted by: vangyzen (as part of the functional change) MFC after: 3 days	2015-07-03 09:53:56 +00:00
Adrian Chadd	ef19855701	Oops - fix typo.	2015-07-03 07:00:24 +00:00
Simon J. Gerraty	96a11afdff	Updated depends	2015-07-03 06:11:54 +00:00
Adrian Chadd	b14a705362	Add initial support for the TP-Link Archer C7 v2. The SoC, the flash, the ethernet ports and ethernet switch all work. The USB works. The 11ac PCIe NIC internally is at least seen by the PCIE RC, but I haven't tried using it yet. There's no driver and I haven't yet swapped it out for a non-11ac chip. The on-chip 2GHz wifi works, but there are some data errors that get thrown up in STA mode when scanning. I have a feeling I have to finish the DDR flush code out and have it run correctly on the shared interrupts; that'll take a bit of time to get right. But if you're after an updated piece of hardware, the Archer C7 v2 is certainly there, and you can replace the 11ac NIC with a 3x3 Atheros PCIe device (eg AR9380, AR9390, AR9580, etc) and it'll "just work". Tested: * TP-Link archer c7v2.	2015-07-03 06:09:56 +00:00
Adrian Chadd	212faba17d	Add pcb1 to the QCA955x. The Tp-link Archer-C7v2 unit has a QCA9558 internally but hangs the QCA988x 11ac PCIe NIC off of PCI RC #1, not #0. So I actually finally /do/ have a board to verify whether PCIe is working. Grr. Tested: * TP-Link Archer-C7v2.	2015-07-03 06:06:44 +00:00
Marcel Moolenaar	42d3ab5d1b	Implement unload and sync operations.	2015-07-03 05:44:58 +00:00
Adrian Chadd	9d0e5a1718	Enable setting the QCA955x GPIO output mux configuration. It's not used by any boards yet, but it's going to creep up soon as more boards show up.	2015-07-03 03:34:21 +00:00
Adrian Chadd	3facd56c71	Add register defines for the QCA955x DDR flush and GPIO control.	2015-07-03 03:32:54 +00:00
Marcel Moolenaar	89abdea8f0	Add create, destroy and load of memory descriptors.	2015-07-03 01:52:22 +00:00
Warner Losh	12f05b8446	Kill MFILES and find things automatically. It turned out to be only lightly used. Find the proper .m file when we depend on *_if.[ch] in the srcs line, with seat-belts for false positive matches. This uses make's path mechanism. A further refinement would be to calculate this once, and then pass the resulting _MPATH to modules submakes. Differential Revision: https://reviews.freebsd.org/D2327	2015-07-03 01:50:26 +00:00
Rick Macklem	d189dcb6e2	Alex Burlyga reported a POLA violation for the new NFS client as compared to the old NFS client via email to the freebsd-fs@ mailing list. For the new client, when multiple clients attempted to create a symbolic link concurrently, more that one client would report success instead of EEXIST. This was caused by code in the new client that mapped EEXIST to OK assuming it was caused by a retried RPC request. Since the old client did not do this, the patch defaults to the old behaviour and permits the new behaviour to be enabled via a sysctl. Reported by: alex.burlyga.ietf@gmail.com Tested by: alex.burlyga.ietf@gmail.com MFC after: 2 weeks	2015-07-03 01:15:21 +00:00
Mariusz Zaborski	dc619c2f57	Add stddef.h for size_t typedef. Approved by: pjd (mentor)	2015-07-02 21:46:07 +00:00
Marcel Moolenaar	3a232946f7	Add an ISA/ACPI bus attachment to proto(4).	2015-07-02 19:21:29 +00:00
Mateusz Guzik	e2f5418e73	sysvshm: fix up some whitespace issues and spurious initialisation	2015-07-02 19:14:30 +00:00
Mateusz Guzik	77a26248a3	sysvshm: don't lock proc when calculating attach_va vm_daddr is constant and RLIMIT_DATA can be obtained from thread's copy of rlimits.	2015-07-02 19:03:44 +00:00
Mateusz Guzik	0be3a191a4	sysvshm: fix shmrealloc The code was supposed to initialize new segs in newsegs array, but used the old pointer.	2015-07-02 19:00:22 +00:00
Mateusz Guzik	cd336bad26	vm: don't lock proc around accesses to vm_{t,d}addr and RLIMIT_DATA in sys_mmap vm_{t,d}addr are constant and we can use thread's copy of resource limits	2015-07-02 18:30:12 +00:00
Ermal Luçi	d14122b078	Avoid doing multiple route lookups for the same destination IP during forwarding ip_forward() does a route lookup for testing this packet can be sent to a known destination, it also can do another route lookup if it detects that an ICMP redirect is needed, it forgets all of this and handovers to ip_output() to do the same lookup yet again. This optimisation just does one route lookup during the forwarding path and handovers that to be considered by ip_output(). Differential Revision: https://reviews.freebsd.org/D2964 Approved by: ae, gnn(mentor) MFC after: 1 week	2015-07-02 18:10:41 +00:00
Andrew Turner	d2676f552e	Remove an unneeded define and old comment referencing amd64.	2015-07-02 16:13:29 +00:00
Andrew Turner	b9b3574474	Remove an old comment, the cache is enabled. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-02 15:26:40 +00:00
Konstantin Belousov	be930a2021	Account for the main process stack being one page below the highest user address when ABI uses shared page. Note that the change is no-op for correctness, since shared page does not fault. The mapping for the shared page is installed at the address space creation, the page is unmanaged and its pte/pv entry cannot be reclaimed. Submitted by: Oliver Pinter Review: https://reviews.freebsd.org/D2954 MFC after: 1 week	2015-07-02 15:22:13 +00:00
Andrew Turner	40fc1dffc3	Use pmap_load to load table entries. This simplifies finding places where we access the tables. Obtained from: ABT Systems Ltd Sponsored by: The fReeBSD Foundation	2015-07-02 15:17:30 +00:00
Konstantin Belousov	6fdfd88220	Use single instance of the identical INKERNEL() and PMC_IN_KERNEL() macros on amd64 and i386. Move the definition to machine/param.h. kgdb defines INKERNEL() too, the conflict is resolved by renaming kgdb version to PINKERNEL(). On i386, correct the lowest kernel address. After the shared page was introduced, USRSTACK no longer points to the last user address + 1 [] Submitted by: Oliver Pinter [] Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-02 14:37:21 +00:00
Andrew Turner	a380ef6a02	Enable kernel debugging on arm64, other than GDB as it fails to build. Sponsored by: ABT Systems Ltd	2015-07-02 14:35:30 +00:00
Konstantin Belousov	1965f86c72	Vnode is not referenced by the vfs_domount() at the point where asserts are made. Remove them, since we might dereference freed memory. Leaked locks are asserted by the syscall return code anyway. Reported and tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-02 14:31:47 +00:00
Alexander Motin	b9b4269c1d	Fix couple panics on forced unmount of backing file. MFC after: 1 week Sponsored by: iXsystems, Inc.	2015-07-02 12:53:22 +00:00
Pawel Jakub Dawidek	fefb6a143a	Properly propagate errors in metadata reading. PR: 198860 Submitted by: Matthew D. Fuller	2015-07-02 10:57:34 +00:00
Pawel Jakub Dawidek	edaa9008ff	Allow to omit keyfile number for the first keyfile.	2015-07-02 10:55:32 +00:00
Andriy Gapon	74f75cb1bd	zfs_mount(MS_REMOUNT): protect zfs_(un)register_callbacks calls We now take z_teardown_lock as a writer to ensure that there is no I/O while the filesystem state is in a flux. Also, zfs_suspend_fs() -> zfsvfs_teardown() call zfs_unregister_callbacks() and zfs_resume_fs() -> zfsvfs_setup() call zfs_unregister_callbacks(). Previously there was no synchronization between those calls and the calls in the re-mounting case. That could lead to concurrent execution and a crash. PR: 180060 Differential Revision: https://reviews.freebsd.org/D2865 Suggested by: mahrens Reviewed by: delphij, pho, mahrens, will MFC after: 13 days Sponsored by: ClusterHQ	2015-07-02 08:32:02 +00:00
Alexander Motin	556e4c9a83	Disable port multiplier support on Marvell 88SE61xx chips. According to report, some recent unrelated changes in the driver triggered timeouts when testing for absent port multiplier. Cause of this behavior channge is unclear, but since these chips are old, rare and buggy, it is easier to just disable port multiplier support, same as done in Linux. Reported by: bar MFC after: 3 days	2015-07-02 08:25:45 +00:00
Luiz Otavio O Souza	5ee00411e4	Add DMA support for Allwinner MMC controller. DMA handles all data transfers up to 128K or 16 segments and fallback to pio mode when DMA requirements are not met. The read performance has improved greatly while the write performance also showed some improvement but seems limited by the card type and quality. Submitted by: Pratik Singhal <pratiksinghal@freebsd.org> Sponsored by: Google Summer of Code 2015 Tested on: A10 (cubieboard) and A20 (cubieboard 2 and banana pi)	2015-07-01 23:27:01 +00:00
Andrew Turner	c950fb6b67	Fix the logic for when to restore the VFP registers. It should restore them when a different thread last used them, or when the thread was last run on a different cpu. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-01 17:27:44 +00:00
Konstantin Belousov	3ce8c94f29	Disallow a debugger on 64bit system to set fs/gs bases of the 32bit process beyond the end of the process address space. Such setting is not dangerous to the kernel integrity, but it causes confusing application misbehaviour. Sponsored by: The FreeBSD Foundation MFC after: 12 days	2015-07-01 16:37:03 +00:00
Ruslan Bukin	b78ee15e9f	First cut of DTrace for AArch64. Reviewed by: andrew, emaste Sponsored by: ARM Limited Differential Revision: https://reviews.freebsd.org/D2738	2015-07-01 15:51:11 +00:00
Christian Brueffer	8b47cda2f7	Use the correct le*dec function to decode a 16bit type. PR: 194228 Submitted by: David Horwitt MFC after: 2 weeks	2015-07-01 14:54:13 +00:00
Ruslan Bukin	0ff41755cd	Add a central location for exclusion checks. We check here if function is excluded from FBT instrumentation. Reviewed by: andrew, emaste, markj Differential Revision: https://reviews.freebsd.org/D2899	2015-07-01 14:09:59 +00:00
Navdeep Parhar	fd215e45eb	cxgbe(4): request an automatic tx update when a netmap tx queue idles. The NIC tx queues already do this. MFC after: 1 week Differential Revision:	2015-07-01 00:34:14 +00:00
Li-Wen Hsu	4889014996	- Fix `make depend` in sys/modules Differential Revision: https://reviews.freebsd.org/D2951 Approved by: delphij	2015-06-30 19:35:14 +00:00
Navdeep Parhar	9523d1bfc3	Fix leak in tcp_lro_rx. Simply clearing M_PKTHDR isn't enough, any tags hanging off the header need to be freed too. Differential Revision: https://reviews.freebsd.org/D2708 Reviewed by: ae@, hiren@	2015-06-30 17:19:58 +00:00
Mark Murray	c4f9c760c9	Updated random(4) boot/shutdown scripting. Fix the man pages as well. Differential Revision: https://reviews.freebsd.org/D2924 Approved by: so (delphij)	2015-06-30 17:09:41 +00:00
Mark Murray	d1b06863fb	Huge cleanup of random(4) code. * GENERAL - Update copyright. - Make kernel options for RANDOM_YARROW and RANDOM_DUMMY. Set neither to ON, which means we want Fortuna - If there is no 'device random' in the kernel, there will be NO random(4) device in the kernel, and the KERN_ARND sysctl will return nothing. With RANDOM_DUMMY there will be a random(4) that always blocks. - Repair kern.arandom (KERN_ARND sysctl). The old version went through arc4random(9) and was a bit weird. - Adjust arc4random stirring a bit - the existing code looks a little suspect. - Fix the nasty pre- and post-read overloading by providing explictit functions to do these tasks. - Redo read_random(9) so as to duplicate random(4)'s read internals. This makes it a first-class citizen rather than a hack. - Move stuff out of locked regions when it does not need to be there. - Trim RANDOM_DEBUG printfs. Some are excess to requirement, some behind boot verbose. - Use SYSINIT to sequence the startup. - Fix init/deinit sysctl stuff. - Make relevant sysctls also tunables. - Add different harvesting "styles" to allow for different requirements (direct, queue, fast). - Add harvesting of FFS atime events. This needs to be checked for weighing down the FS code. - Add harvesting of slab allocator events. This needs to be checked for weighing down the allocator code. - Fix the random(9) manpage. - Loadable modules are not present for now. These will be re-engineered when the dust settles. - Use macros for locks. - Fix comments. * src/share/man/... - Update the man pages. * src/etc/... - The startup/shutdown work is done in D2924. * src/UPDATING - Add UPDATING announcement. * src/sys/dev/random/build.sh - Add copyright. - Add libz for unit tests. * src/sys/dev/random/dummy.c - Remove; no longer needed. Functionality incorporated into randomdev.. live_entropy_sources.c live_entropy_sources.h - Remove; content moved. - move content to randomdev.[ch] and optimise. * src/sys/dev/random/random_adaptors.c src/sys/dev/random/random_adaptors.h - Remove; plugability is no longer used. Compile-time algorithm selection is the way to go. * src/sys/dev/random/random_harvestq.c src/sys/dev/random/random_harvestq.h - Add early (re)boot-time randomness caching. * src/sys/dev/random/randomdev_soft.c src/sys/dev/random/randomdev_soft.h - Remove; no longer needed. * src/sys/dev/random/uint128.h - Provide a fake uint128_t; if a real one ever arrived, we can use that instead. All that is needed here is N=0, N++, N==0, and some localised trickery is used to manufacture a 128-bit 0ULLL. * src/sys/dev/random/unit_test.c src/sys/dev/random/unit_test.h - Improve unit tests; previously the testing human needed clairvoyance; now the test will do a basic check of compressibility. Clairvoyant talent is still a good idea. - This is still a long way off a proper unit test. * src/sys/dev/random/fortuna.c src/sys/dev/random/fortuna.h - Improve messy union to just uint128_t. - Remove unneeded 'static struct fortuna_start_cache'. - Tighten up up arithmetic. - Provide a method to allow eternal junk to be introduced; harden it against blatant by compress/hashing. - Assert that locks are held correctly. - Fix the nasty pre- and post-read overloading by providing explictit functions to do these tasks. - Turn into self-sufficient module (no longer requires randomdev_soft.[ch]) * src/sys/dev/random/yarrow.c src/sys/dev/random/yarrow.h - Improve messy union to just uint128_t. - Remove unneeded 'staic struct start_cache'. - Tighten up up arithmetic. - Provide a method to allow eternal junk to be introduced; harden it against blatant by compress/hashing. - Assert that locks are held correctly. - Fix the nasty pre- and post-read overloading by providing explictit functions to do these tasks. - Turn into self-sufficient module (no longer requires randomdev_soft.[ch]) - Fix some magic numbers elsewhere used as FAST and SLOW. Differential Revision: https://reviews.freebsd.org/D2025 Reviewed by: vsevolod,delphij,rwatson,trasz,jmg Approved by: so (delphij)	2015-06-30 17:00:45 +00:00
Konstantin Belousov	6ef120027f	Do not calculate the stack's bottom address twice. Submitted by: Olivц╘r Pintц╘r Review: https://reviews.freebsd.org/D2953 MFC after: 1 week	2015-06-30 15:22:47 +00:00
Hiren Panchasara	f85680793b	Avoid a situation where we do not set persist timer after a zero window condition. If you send a 0-length packet, but there is data is the socket buffer, and neither the rexmt or persist timer is already set, then activate the persist timer. PR: 192599 Differential Revision: D2946 Submitted by: jlott at averesystems dot com Reviewed by: jhb, jch, gnn, hiren Tested by: jlott at averesystems dot com, jch MFC after: 2 weeks	2015-06-29 21:23:54 +00:00
Christian Brueffer	9f026a420b	Set the initial system time to a sane (as in: not end of 21st century) value when booting on a PC with CMOS clock set to a year before 2000. This uses 1980 (instead of 1970 as in the initial patch) as pivot year as suggested by imp in the PR followup. PR: 195703 Submitted by: cs@soi.spb.ru Reviewed by: imp MFC after: 1 weeks	2015-06-29 17:02:09 +00:00
Konstantin Belousov	521987c3e5	Simplify code, no need to test the flag before clearing it. Submitted by: ed MFC after: 12 days	2015-06-29 13:06:24 +00:00
Konstantin Belousov	b24de3ac40	Provide npx_get_fsave(9) and npx_set_fsave(9) functions to obtain and restore the FPU state from the format of machine FSAVE area. The intended use is for ABI emulators to provide FSAVE-formatted FPU state to usermode requiring it, while kernel could use FXSAVE due to XMM/XSAVE. The core functionality to convert from/to FXSAVE format is shared with the fill_fpregs_xmm() and set_fpregs_xmm(). Move the later functions to npx.c and rename them to npx_fill_fpregs_xmm() and npx_set_fpregs_xmm(). They differ from nptx_get/set_fsave(9) since our mcontext contains padding to be zeroed or ignored. fill_fpregs() and set_fpregs() could be converted to use the new interface, but there are small differences to handle. Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-06-29 12:06:36 +00:00
Konstantin Belousov	f9343dacbd	Move CS_SECURE() and EFL_SECURE() macros to the machine/frame.h. They are useful for most implementations of sendsig(). Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-06-29 10:35:00 +00:00
Konstantin Belousov	2a4734651c	svr4 emulator has custom sendsig() implementation, it does not use sv_sigtbl. Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-06-29 10:33:04 +00:00
Konstantin Belousov	773554f79e	Remove sv_sigtbl handling from the arm64 sendsig(). There is no ABI emulators on arm64. Reviewed by: andrew Review: https://reviews.freebsd.org/D2889 Sponsored by: The FreeBSD Foundation	2015-06-29 10:31:12 +00:00
Konstantin Belousov	3ac3c0f269	Add a comment about too strong semantic of atomic_load_acq() on x86. Submitted by: bde MFC after: 2 weeks	2015-06-29 09:58:40 +00:00
Konstantin Belousov	d9008978c8	pcb_gs32sd is unused for long time, remove it. Keep the padding in pcb. Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-06-29 07:53:44 +00:00
Konstantin Belousov	1817023775	Add x86 PT_GETFSBASE, PT_GETGSBASE machine-depended ptrace requests to obtain the thread %fs and %gs bases. Add x86 PT_SETFSBASE and PT_SETGSBASE requests to set the bases from debuggers. The set requests, similarly to the sysarch({I386,AMD64}_SET_FSBASE), override the corresponding segment registers. The main purpose of the operations is to retrieve and modify the tcb address for debuggee. Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-06-29 07:07:24 +00:00
Konstantin Belousov	06d058bd23	Reduce code duplication. Add helper fill_based_sd(9) which creates a based user data descriptor covering whole VA. Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-06-29 06:59:08 +00:00
Pedro F. Giffuni	1374252397	Add a new __sentinel attribute. The sentinel attribute was originally implemented in OpenBSD's gcc and later adopted by upstream GCC 4.0 (and clang). From the OpenBSD's gcc-local manpage: - gcc recognizes the extra attribute __sentinel__, which can be used to mark varargs function that need a NULL pointer to mark argument termination, like execl(3). This exposes latent bugs for 64-bit architectures, where a terminating 0 will expand to a 32-bit int, and not a full-fledged 64-bits pointer. While here sort the visibility attributes. Hinted-by: OpenBSD	2015-06-29 00:30:30 +00:00

... 2 3 4 5 6 ...

104990 Commits