freebsd-nq

Author	SHA1	Message	Date
Konstantin Belousov	6a875bf929	Do not pretend that vm_fault(9) supports unwiring the address. Rename the VM_FAULT_CHANGE_WIRING flag to VM_FAULT_WIRE. Assert that the flag is only passed when faulting on the wired map entry. Remove the vm_page_unwire() call, which should be never reachable. Since VM_FAULT_WIRE flag implies wired map entry, the TRYPAGER() macro is reduced to the testing of the fs.object having a default pager. Inline the check. Suggested and reviewed by: alc Tested by: pho (previous version) MFC after: 1 week	2015-07-30 18:28:34 +00:00
Andrew Turner	8df0053b7a	Add enough of pmap_page_set_memattr to run gstat. It still needs to split the DMAP 1G pages so we set the attributes only on the specified page. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-30 16:17:44 +00:00
Konstantin Belousov	0b6476ec5b	Improve comments. Submitted by: bde MFC after: 2 weeks	2015-07-30 15:47:53 +00:00
Roger Pau Monné	c023d8234b	vfs: fix fallout from r286076 The right operator is >= not =>. Reported by: cem	2015-07-30 15:43:26 +00:00
Roger Pau Monné	8f89a299e2	vfs: fix off-by-one error in vfs_buf_check_mapped The check added in r285872 can trigger for valid buffers if the buffer space used happens to be just after unmapped_buf in KVA space. Discussed with: kib Sponsored by: Citrix Systems R&D	2015-07-30 15:28:06 +00:00
Ed Maste	c547d650eb	Add ARM64TODO markers to unimplemented functionality Reviewed by: andrew Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D2389	2015-07-30 14:20:36 +00:00
Zbigniew Bodek	9028b18f75	Enable IRQ during syscalls on ARM64 FreeBSD provides a feature called Adaptive Mutexes, which allows a thread to spin for a while when the mutex is taken instead of immediately going to sleep. This causes issues when called from syscall handler if interrupts are masked. If every other core also attempts to access the same mutex there is a chance that all of them are spinning on the same lock at the same time. If interrupts are disabled, no kernel preemption can occur and the system becomes unresponsive. This patch enables interrupts when syscall is being executed and masks them as soon as it is completed. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3246	2015-07-30 13:59:38 +00:00
Zbigniew Bodek	4d3523c2f7	Remove obsolete vendor code from Alpine platform support This is a clean-up patch from a series delivering support for Annapurna Labs Alpine PoC. The HAL files have already been added to sys/contrib/alpine-hal so there is no need for them in the platform directory. This patch removes obsolete files. Reviewed by: andrew Obtained from: Semihalf Sponsored by: Annapurna Labs Differential Revision: https://reviews.freebsd.org/D3248	2015-07-30 13:45:34 +00:00
Andrey V. Elsukov	a5965d1513	Build if_stf(4) module only when both INET and INET6 support are enabled.	2015-07-30 10:26:43 +00:00
Colin Percival	aaebf69062	Add support for Xen blkif indirect segment I/Os. This makes it possible for the blkfront driver to perform I/Os of up to 2 MB, subject to support from the blkback to which it is connected and the initiation of such large I/Os by the rest of the kernel. In practice, the I/O size is increased from 40 kB to 128 kB. The changes to xen/interface/io/blkif.h consist merely of merging updates from the upstream Xen repository. In dev/xen/blkfront/block.h we add some convenience macros and structure fields used for indirect-page I/Os: The device records its negotiated limit on the number of indirect pages used, while each I/O command structure gains permanently allocated page(s) for indirect page references and the Xen grant references for those pages. In dev/xen/blkfront/blkfront.c we now check in xbd_queue_cb whether a request is small enough to handle without an indirection page, and either follow the previous behaviour or use new code for issuing an indirect segment I/O. In xbd_connect we read the size of indirect segment I/Os supported by the backend and select the maximum size we will use; then allocate the pages and Xen grant references for each I/O command structure. In xbd_free those grants and pages are released. A new loader tunable, hw.xbd.xbd_enable_indirect, can be set to 0 in order to disable this functionality; it works by pretending that the backend does not support this feature. Some backends exhibit a loss of performance with large I/Os, so users may wish to test with and without this functionality enabled. Reviewed by: royger MFC after: 3 days Relnotes: yes	2015-07-30 03:50:01 +00:00
Luiz Otavio O Souza	8b15f615e0	Follow r256586 and rename the kernel version of the Free() macro to R_Free(). This matches the other macros and reduces the chances to clash with other headers. This also fixes the build of radix.c outside of the kernel environment. Reviewed by: glebius	2015-07-30 02:09:03 +00:00
Konstantin Belousov	48cae112b5	Use private cache line for the locked nop in *mb() on i386. Suggested by: alc Reviewed by: alc, bde Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-30 00:13:20 +00:00
Konstantin Belousov	dd5b64258f	MFamd64 r285934: Remove store/load (= full) barrier from the i386 atomic_load_acq_*(). Noted by: alc (long time ago) Reviewed by: alc, bde Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-29 23:59:17 +00:00
John-Mark Gurney	e381fd293d	const'ify an arg that we don't update...	2015-07-29 23:37:15 +00:00
Rick Macklem	25f37276e5	This patch fixes a problem where, if the NFSv4 server has a previous unconfirmed clientid structure for the same client on the last hash list, this old entry would not be removed/deleted. I do not think this bug would have caused serious problems, since the new entry would have been before the old one on the list. This old entry would have eventually been scavenged/removed. Detected while reading the code looking for another bug. MFC after: 3 days	2015-07-29 23:06:30 +00:00
Jim Harris	0e1fd2dda3	nvme: do not notify a consumer about failures that occur during initialization MFC after: 3 days Sponsored by: Intel	2015-07-29 21:29:50 +00:00
Sean Bruno	e0fe6b4835	Add support for BCM5466 PHY Differential Revision: D3232 Submitted by: kevin.bowling@kev009.com	2015-07-29 20:50:48 +00:00
Sean Bruno	79855a57e2	Remove dead functions pmap_pvdump and pads. Differential Revision: D3206 Submitted by: kevin.bowling@kev009.com Reviewed by: alc	2015-07-29 20:47:27 +00:00
Ermal Luçi	3c40232395	Avoid double reference decrement when firewalls force relooping of packets When firewalls force a reloop of packets and the caller supplied a route the reference to the route might be reduced twice creating issues. This is especially the scenario when a packet is looped because of operation in the firewall but the new route lookup gives a down route. Differential Revision: https://reviews.freebsd.org/D3037 Reviewed by: gnn Approved by: gnn(mentor)	2015-07-29 20:10:36 +00:00
Ermal Luçi	d9f2a78249	ip_output normalization and fixes ip_output has a big chunk of code used to handle special cases with pfil consumers which also forces a reloop on it. Gather all this code together to make it readable and properly handle the reloop cases. Some of the issues identified: M_IP_NEXTHOP is not handled properly in existing code. route reference leaking is possible with in FIB number change route flags checking is not consistent in the function Differential Revision: https://reviews.freebsd.org/D3022 Reviewed by: gnn Approved by: gnn(mentor) MFC after: 4 weeks	2015-07-29 18:04:01 +00:00
Patrick Kelsey	4741bfcb57	Revert r265338, r271089 and r271123 as those changes do not handle non-inline urgent data and introduce an mbuf exhaustion attack vector similar to FreeBSD-SA-15:15.tcp, but not requiring VNETs. Address the issue described in FreeBSD-SA-15:15.tcp. Reviewed by: glebius Approved by: so Approved by: jmallett (mentor) Security: FreeBSD-SA-15:15.tcp Sponsored by: Norse Corp, Inc.	2015-07-29 17:59:13 +00:00
Ed Schouten	8328babdd0	Make pipes in CloudABI work. Summary: Pipes in CloudABI are unidirectional. The reason for this is that CloudABI attempts to provide a uniform runtime environment across different flavours of UNIX. Instead of implementing a custom pipe that is unidirectional, we can simply reuse Capsicum permission bits to support this. This is nice, because CloudABI already attempts to restrict permission bits to correspond with the operations that apply to a certain file descriptor. Replace kern_pipe() and kern_pipe2() by a single kern_pipe() that takes a pair of filecaps. These filecaps are passed to the newly introduced falloc_caps() function that creates the descriptors with rights in place. Test Plan: CloudABI pipes seem to be created with proper rights in place: https://github.com/NuxiNL/cloudlibc/blob/master/src/libc/unistd/pipe_test.c#L44 Reviewers: jilles, mjg Reviewed By: mjg Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3236	2015-07-29 17:18:27 +00:00
Ed Schouten	e555b4309c	Introduce falloc_caps() to create descriptors with capabilities in place. falloc_noinstall() followed by finstall() allows you to create and install file descriptors with custom capabilities. Add falloc_caps() that can do both of these actions in one go. This will be used by CloudABI to create pipes with custom capabilities. Reviewed by: mjg	2015-07-29 17:16:53 +00:00
Sean Bruno	1f6aae90ad	Make Broadcom XLR use shared ds1374 RTC driver. Remove its identical and redundant ds1374u version. Differential Revision: D3225 Submitted by: kevin.bowling@kev009.com	2015-07-29 15:32:59 +00:00
Andrey V. Elsukov	10a0e0bf0a	Eliminate the use of m_copydata() in gif_encapcheck(). ip_encap already has inspected mbuf's data, at least an IP header. And it is safe to use mtod() and do direct access to needed fields. Add M_ASSERTPKTHDR() to gif_encapcheck(), since the code expects that mbuf has a packet header. Move the code from gif_validate[46] into in[6]_gif_encapcheck(), also remove "martian filters" checks. According to RFC 4213 it is enough to verify that the source address is the address of the encapsulator, as configured on the decapsulator. Reviewed by: melifaro Obtained from: Yandex LLC Sponsored by: Yandex LLC	2015-07-29 14:07:43 +00:00
Ed Schouten	9d2332c9ee	Split up Capsicum to CloudABI rights conversion into two separate routines. CloudABI's openat() ensures that files are opened with the smallest set of relevant rights. For example, when opening a FIFO, unrelated rights like CAP_RECV are automatically removed. To remove unrelated rights, we can just reuse the code for this that was already present in the rights conversion function.	2015-07-29 12:42:45 +00:00
Zbigniew Bodek	cf89e8c919	Add quirk for ThunderX ITS device table size Limit the number of supported device IDs to 0x100000 in order to decrease the size of the ITS device table so that it matches with the HW capabilities. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3131	2015-07-29 11:22:19 +00:00
Andrey V. Elsukov	b13653baf9	Reduce overhead of ipfw's me6 opcode. Skip checks for IPv6 multicast addresses. Use in6_localip() for global unicast. And for IPv6 link-local addresses do search in the IPv6 addresses list. Since LLA are stored in the kernel internal form, use IN6_ARE_MASKED_ADDR_EQUAL() macro with lla_mask for addresses comparison. lla_mask has zero bits in the second word, where we keep sin6_scope_id. Obtained from: Yandex LLC Sponsored by: Yandex LLC	2015-07-29 10:53:42 +00:00
Konstantin Belousov	6cebf7e2be	Move bufshutdown() out of the #ifdef INVARIANTS block.	2015-07-29 09:57:34 +00:00
Andrey V. Elsukov	cc0a3c8ca4	Convert in_ifaddr_lock and in6_ifaddr_lock to rmlock. Both are used to protect access to IP addresses lists and they can be acquired for reading several times per packet. To reduce lock contention it is better to use rmlock here. Reviewed by: gnn (previous version) Obtained from: Yandex LLC Sponsored by: Yandex LLC Differential Revision: https://reviews.freebsd.org/D3149	2015-07-29 08:12:05 +00:00
John-Mark Gurney	a09a7146a7	RFC4868 section 2.3 requires that the output be half... This fixes problems that were introduced in r285336... I have verified that HMAC-SHA2-256 both ah only and w/ AES-CBC interoperate w/ a NetBSD 6.1.5 vm... Reviewed by: gnn	2015-07-29 07:15:16 +00:00
Kristof Provost	48c29b118e	pf: Always initialise pf_fragment.fr_flags When we allocate the struct pf_fragment in pf_fillup_fragment() we forgot to initialise the fr_flags field. As a result we sometimes mistakenly thought the fragment to not be a buffered fragment. This resulted in panics because we'd end up freeing the pf_fragment but not removing it from V_pf_fragqueue (believing it to be part of V_pf_cachequeue). The next time we iterated V_pf_fragqueue we'd use a freed object and panic. While here also fix a pf_fragment use after free in pf_normalize_ip(). pf_reassemble() frees the pf_fragment, so we can't use it any more. PR: 201879, 201932 MFC after: 5 days	2015-07-29 06:35:36 +00:00
Ed Schouten	3720b82fa8	Implement CloudABI's readdir(). Summary: CloudABI's readdir() system call could be thought of as a mixture between FreeBSD's getdents(2) and pread(). Instead of using the file descriptor offset, userspace provides a 64-bit cloudabi_dircookie_t to continue reading at a given point. CLOUDABI_DIRCOOKIE_START, having value 0, can be used to return entries at the start of the directory. The file descriptor offset is not used to store the cookie for the reason that in a file descriptor centric environment, it would make sense to allow concurrent use of a single file descriptor. The remaining space returned by the system call should be filled with a partially truncated copy of the next entry. The advantage of doing this is that it gracefully deals with long filenames. If the C library provides a buffer that is too small to hold a single entry, it can still extract the directory entry header, meaning that it can retry the read with a larger buffer or skip it using the cookie. Test Plan: This implementation passes the cloudlibc unit tests at: https://github.com/NuxiNL/cloudlibc/tree/master/src/libc/dirent Reviewers: marcel, kib Reviewed By: kib Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3226	2015-07-29 06:31:44 +00:00
Jeff Roberson	7d07bfd8a3	- Remove some dead code copied from ffs.	2015-07-29 03:06:08 +00:00
Jeff Roberson	98082691bb	- Make 'struct buf *buf' private to vfs_bio.c. Having a global variable 'buf' is inconvenient and has led me to some irritating-to-discover bugs over the years. It also makes it more challenging to refactor the buf allocation system. - Move swbuf and declare it as an extern in vfs_bio.c. This is still not perfect but better than it was before. - Eliminate the unused ffs function that relied on knowledge of the buf array. - Move the shutdown code that iterates over the buf array into vfs_bio.c. Reviewed by: kib Sponsored by: EMC / Isilon Storage Division	2015-07-29 02:26:57 +00:00
Jean-Sébastien Pédron	133362912c	drm/i915: Sort functions in i915_gem.c to match Linux 3.8's ordering While here, reduce the style diff with Linux. There is no functional change. The goal is to ease the future update to Linux 3.8's i915 driver. MFC after: 2 months	2015-07-28 21:47:37 +00:00
Jeff Roberson	38750ada8f	- Eliminate the EMPTYKVA queue. It served as a cache of KVA allocations attached to bufs to avoid the overhead of the vm. This purpose is now better served by vmem. Freeing the kva immediately when a buf is destroyed leads to lower fragmentation and a much simpler scan algorithm. Reviewed by: kib Sponsored by: EMC / Isilon Storage Division	2015-07-28 20:24:09 +00:00
David C Somayajulu	ab97207add	- Avoid lock contention in the if_transmit callback by using trylock and enqueueing the frames when it fails. This way there is some latency removed from the transmitting path. - If IFF_DRV_OACTIVE is set (and also if IFF_DRV_RUNNING is not) just enqueue the desired frames and return successful transmit. This way we avoid returning errors on the transmit side, which could result in out-of-order frames. Please note that IFF_DRV_OACTIVE is set every time we get the threshold ring hit, so this can be happening quite often. Submitted by: Attilio.Rao@isilon.com MFC after: 5 days	2015-07-28 19:15:44 +00:00
Renato Botelho	299c819a75	Simplify logic added in r285945 as suggested by glebius Approved by: glebius MFC after: 3 days Sponsored by: Netgate	2015-07-28 14:59:29 +00:00
Zbigniew Bodek	f4b37ed0f8	Import Annapurna Labs Alpine HAL to sys/contrib/ Import from vendor-sys/alpine-hal/2.7 SVN rev.: 285432 HAL version: 2.7 Obtained from: Semihalf Sponsored by: Annapurna Labs	2015-07-28 14:20:33 +00:00
Zbigniew Bodek	8b21d6ae5a	Limit ofw_cpu_early_foreach() to CPUs only On some platforms, the /cpus node contains cpu-to-cluster map which definitely is not a CPU node. Its presence was causing incrementing of "id" variable and reporting more CPUs available than it should. To make "id" valid, increment it only when an entry really is a CPU device. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3216	2015-07-28 13:16:08 +00:00
Ed Schouten	1d96fd8d9f	Implement file attribute modification system calls for CloudABI. CloudABI uses a system call interface to modify file attributes that is more similar to KPI's/FUSE, namely where a stat structure is passed back to the kernel, together with a bitmask of attributes that should be changed. This would allow us to update any set of attributes atomically. That said, I'd rather not go as far as to actually implement it that way, as it would require us to duplicate more code than strictly needed. Let's just stick to the combinations that are actually used by cloudlibc. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-28 12:57:19 +00:00
Steven Hartland	1635369e99	Add warning about low KSTACK_PAGES for ZFS use As ZFS requires more kernel stack pages than is the default on some architectures e.g. i386, warn if KSTACK_PAGES is less than ZFS_MIN_KSTACK_PAGES (which is 4 at the time of writing). MFC after: 3 days Sponsored by: Multiplay	2015-07-28 11:19:38 +00:00
Renato Botelho	b1b98a2db7	Respect pf rule log option before logging dropped packets with IP options or dangerous v6 headers Reviewed by: gnn, eri Approved by: gnn Obtained from: pfSense MFC after: 3 days Sponsored by: Netgate Differential Revision: https://reviews.freebsd.org/D3222	2015-07-28 10:31:34 +00:00
Gleb Smirnoff	3e437fd2c6	Fix a typo in r280169. Of course we are interested in deleting nsn only if we have just created it and we were the last reference. Submitted by: dhartmei	2015-07-28 09:36:26 +00:00
Michael Tuexen	9ae56375af	Fix a typo reported by Erik Cederstrand. MFC after: 1 week	2015-07-28 08:50:13 +00:00
Hans Petter Selasky	ed0ed9b424	Optimise the DWC OTG host mode driver's receive path: Remove NAKing limit and pause IN and OUT transactions for 125us in case of NAK response for BULK and CONTROL endpoints. This gets the receive latency down and improves USB network throughput at the cost of some CPU usage. MFC after: 1 month	2015-07-28 07:30:07 +00:00
Konstantin Belousov	1d1ec02c44	Remove full barrier from the amd64 atomic_load_acq_*(). Strong ordering semantic of x86 CPUs makes only the compiler barrier necessary to give the acquire behaviour. Existing implementation ensured sequentially consistent semantic for load_acq, making much stronger guarantee than required by standard's definition of the load acquire. Consumers which depend on the barrier are believed to be identified and already fixed to use proper operations. Noted by: alc (long time ago) Reviewed by: alc, bde Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-28 07:04:51 +00:00
Konstantin Belousov	b4c0214605	Remove useless acquire semantic from the atomic_add operation before sosend(). The only release on the xp_snt_cnt is done after sosend(), with an intent to synchronize with load_acq in svc_vc_ack(). Reviewed by: alc Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-28 06:58:10 +00:00
Konstantin Belousov	90a2db45eb	Add bit names for the IA32_MISC_ENABLE msr. Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-28 06:55:08 +00:00
Ed Schouten	29515a68a5	Implement directory and FIFO creation. The file_create() system call can be used to create files of a given type. Right now it can only be used to create directories and FIFOs. As CloudABI does not expose filesystem permissions, this system call lacks a mode argument. Simply use 0777 or 0666 depending on the file type.	2015-07-28 06:50:47 +00:00
Ed Schouten	cec575201a	Make fstat() and friends work. Summary: CloudABI provides access to two different stat structures: - fdstat, containing file descriptor level status: oflags, file descriptor type and Capsicum rights, used by cap_rights_get(), fcntl(F_GETFL), getsockopt(SO_TYPE). - filestat, containing your regular file status: timestamps, inode number, used by fstat(). Unlike FreeBSD's stat::st_mode, CloudABI file descriptor types don't have overloaded meanings (e.g., returning S_ISCHR() for kqueues). Add a utility function to extract the type of a file descriptor accurately. CloudABI does not work with O_ACCMODEs. File descriptors have two sets of Capsicum-style rights: rights that apply to the file descriptor itself ('base') and rights that apply to any new file descriptors yielded through openat() ('inheriting'). Though not perfect, we can pretty safely decompose Capsicum rights to such a pair. This is done in convert_capabilities(). Test Plan: Tests for these system calls are fairly extensive in cloudlibc. Reviewers: jonathan, mjg, #manpages Reviewed By: mjg Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3171	2015-07-28 06:36:49 +00:00
Marcel Moolenaar	f40c76d8de	Check the sync operation.	2015-07-28 04:54:05 +00:00
Michael Tuexen	267dbe63a1	Provide consistent error causes whenever an ABORT chunk is sent. MFC after: 1 week	2015-07-27 22:35:54 +00:00
Marius Strobl	43bc87c459	- Move the remainder of host controller capability registers reading from xhci_start_controller() to xhci_init(). These values don't change at run- time so there's no point of acquiring them on every USB_HW_POWER_RESUME instead of only once during initialization. In r276717, reading the first couple of registers in question already had been moved as a prerequisite for the changes in that revision. - Identify ASMedia ASM1042A controllers. - Use NULL instead of 0 for pointers. MFC after: 3 days	2015-07-27 15:26:50 +00:00
Marius Strobl	891c57d8a9	- Fix compilation after r285909 with USB_DEBUG defined. - Regenerate usb.conf.	2015-07-27 14:43:14 +00:00
Marius Strobl	d75accb539	- Use __FBSDID(). - Const'ify cons_to_vga_colors. - Fix line wrapping. MFC after: 3 days	2015-07-27 14:34:32 +00:00
Marius Strobl	0309276c28	- Nuke dupe $FreeBSD$. - Fix whitespace. MFC after: 3 days	2015-07-27 14:03:34 +00:00
Ed Schouten	b114aa7959	Make shutdown() return ENOTCONN as required by POSIX, part deux. Summary: Back in 2005, maxim@ attempted to fix shutdown() to return ENOTCONN in case the socket was not connected (r150152). This had to be rolled back (r150155), as it broke some of the existing programs that depend on this behavior. I reapplied this change on my system and indeed, syslogd failed to start up. I fixed this back in February (279016) and MFC'ed it to the supported stable branches. Apart from that, things seem to work out all right. Since at least Linux and Mac OS X do the right thing, I'd like to go ahead and give this another try. To keep old copies of syslogd working, only start returning ENOTCONN for recent binaries. I took a look at the XNU sources and they seem to test against both SS_ISCONNECTED, SS_ISCONNECTING and SS_ISDISCONNECTING, instead of just SS_ISCONNECTED. That seems reasonable, so let's do the same. Test Plan: This issue was uncovered while writing tests for shutdown() in CloudABI: https://github.com/NuxiNL/cloudlibc/blob/master/src/libc/sys/socket/shutdown_test.c#L26 Reviewers: glebius, rwatson, #manpages, gnn, #network Reviewed By: gnn, #network Subscribers: bms, mjg, imp Differential Revision: https://reviews.freebsd.org/D3039	2015-07-27 13:17:57 +00:00
Marius Strobl	fecf9642ba	- Probe UICLASS_CDC/UISUBCLASS_ABSTRACT_CONTROL_MODEL/0xff again. This variant of Microsoft RNDIS, i. e. their unofficial version of CDC ACM, has been disabled in r261544 for resolving a conflict with umodem(4). Eventually, in r275790 that problem was dealt with in the right way. However, r275790 failed to put probing of RNDIS devices in question back. - Initialize the device prior to querying it, as required by the RNDIS specification. Otherwise already determining the MAC address may fail rightfully. - On detach, halt the device again. - Use UCDC_SEND_ENCAPSULATED_{COMMAND,RESPONSE}. While these macros are resolving to the same values as UR_{CLEAR_FEATURE,GET_STATUS}, the former set is way more appropriate in this context. - Report unknown - rather: unimplemented - events unconditionally and not just in debug mode. This ensures that we'll get some hint of what is going wrong instead of the driver silently failing. - Deal with the Microsoft ActiveSync requirement of using an input buffer the size of the expected reply or larger - except for variably sized replies - when querying a device. - Fix some pointless NULL checks, style bugs etc. These changes allow urndis(4) to communicate with a Microsoft-certified USB RNDIS test token. MFC after: 3 days Sponsored by: genua mbh	2015-07-27 12:14:14 +00:00
Ed Schouten	af7e75f59d	Add a futex implementation for CloudABI. Summary: CloudABI provides two different types of futex objects: read-write locks and condition variables. There is no need to provide separate support for once objects and thread joining, as these are efficiently simulated by blocking on a read-write lock. Mutexes simply use read-write locks. Condition variables always have a lock object associated to them. They always know to which lock a thread needs to be migrated if woken up. This allows us to implement requeueing. A broadcast on a condition variable will never cause multiple threads to be woken up at once. They will be woken up iteratively. This implementation still has lots of room for improvement. Locking is coarse and right now we use linked lists to store all of the locks and condition variables, instead of using a hash table. The primary goal of this implementation was to behave correctly. Performance will be improved as we go. Test Plan: This futex implementation has been in use for the last couple of months and seems to work pretty well. All of the cloudlibc and libc++ unit tests seem to pass. Reviewers: dchagin, kib, vangyzen Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3148	2015-07-27 10:07:29 +00:00
Ed Schouten	533c8a29da	Regenerate system call table.	2015-07-27 10:04:28 +00:00
Ed Schouten	f4c06d124f	Sync in latest upstream system call definitions. Futex object scopes have been renamed from using their own constants to simply reusing the existing CLOUDABI_MAP_{PRIVATE,SHARED} flags, as they are more accurate in this context.	2015-07-27 10:04:06 +00:00
Marcel Moolenaar	b2ce196ca1	o make sure the boundary is a power of 2, when not zero. o don't convert 0 to ~0 just so that we can use MIN. ~0 is not a valid boundary. Introduce BNDRY_MIN that deals with 0 values that mean no boundary.	2015-07-26 16:39:37 +00:00
Andrey V. Elsukov	da6c24e123	Report the scheme and provider names in warning message about unaligned partition. PR: 201873 MFC after: 1 week	2015-07-26 11:16:48 +00:00
Andrey V. Elsukov	41f5f69f96	Build debug version of rmlock's methods only when LOCK_DEBUG > 0. Currently LOCK_DEBUG is always defined in sys/lock.h (0 or 1). This means that debugging code is always built. In addition the kernel modules have always defined LOCK_DEBUG as 1. So, debugging rmlock code is always used by kernel modules. MFC after: 1 week	2015-07-26 10:53:32 +00:00
Michael Tuexen	cf9e47b2f0	Improve locking on Mac OS X. This does not change the functionality on FreeBSD. Reviewed by: rrs MFC after: 1 week	2015-07-26 10:37:40 +00:00
Michael Tuexen	6247db3541	Fix and improve a debug message. The SID was reported as an SSN. MFC after: 1 week	2015-07-26 10:17:17 +00:00
Christian Brueffer	382353e2e8	In tmpfs_chtimes(), remove checks on the nanosecond level when determining whether a node changed. Other filesystems, e.g., UFS, only check on seconds, when determining whether something changed. This also corrects the birthtime case, where we checked tv_nsec twice, instead of tv_sec and tv_nsec (PR). PR: 201284 Submitted by: David Binderman Patch suggested by: kib Reviewed by: kib MFC after: 2 weeks Committed from: Essen FreeBSD Hackathon	2015-07-26 08:33:46 +00:00
Michael Gmelin	ca2e4ecd73	isl(4), driver for Intersil I2C ISL29018 Digital Ambient Light Sensor Differential Revision: https://reviews.freebsd.org/D2811 Reviewed by: adrian, wblock Approved by: adrian, wblock Relnotes: yes	2015-07-25 20:17:19 +00:00
Edward Tomasz Napierala	46a8ca51e3	Use consistent spacing. MFC after: 1 month Sponsored by: The FreeBSD Foundation	2015-07-25 20:17:19 +00:00
Edward Tomasz Napierala	caf9bbecdc	Add md_root example to defaults/loader.conf. Note that this doesn't quite work yet - the preloaded image gets loaded twice for some reason. MFC after: 1 month Sponsored by: The FreeBSD Foundation	2015-07-25 20:15:29 +00:00
Sean Bruno	a82cd51680	Remove unused txd_saved. Initialize txd_upper, txd_lower and txd_used at declaration. Differential Revision: D3174 Reviewed by: erj hiren MFC after: 2 weeks Sponsored by: Limelight Networks	2015-07-25 19:24:33 +00:00
Konstantin Belousov	6195b24a79	Revert r173708's modifications to vm_object_page_remove(). Assume that a vnode is mapped shared and mlocked(), and then the vnode is truncated, or truncated and then again extended past the mapping point EOF. Truncation removes the pages past the truncation point, and if pages are later created at this range, they are not properly mapped into the mlocked region, and their wiring count is wrong. The revert leaves the invalidated but wired pages on the object queue, which means that the pages are found by vm_object_unwire() when the mapped range is munlock()ed, and reused by the buffer cache when the vnode is extended again. The changes in r173708 were required since then vm_map_unwire() looked at the page tables to find the page to unwire. This is no longer needed with the vm_object_unwire() introduction, which follows the objects shadow chain. Also eliminate OBJPR_NOTWIRED flag for vm_object_page_remove(), which is now redundant, we do not remove wired pages. Reported by: trasz, Dmitry Sivachenko <trtrmitya@gmail.com> Suggested and reviewed by: alc Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-25 18:29:06 +00:00
Michael Tuexen	4ff815b71c	Move including netinet/icmp6.h around to avoid a problem when including netinet/icmp6.h and net/netmap.h. Both use ni_flags... This allows to build multistack with SCTP support. MFC after: 1 week	2015-07-25 18:26:09 +00:00
Michael Gmelin	46f07718f7	cyapa(4), driver for the Cypress APA I2C trackpad Differential Revision: https://reviews.freebsd.org/D3068 Reviewed by: kib, wblock Approved by: kib Relnotes: yes	2015-07-25 18:14:35 +00:00
Edward Tomasz Napierala	371583f6ca	Use double newlines consistently. MFC after: 1 month Sponsored by: The FreeBSD Foundation	2015-07-25 16:20:04 +00:00
Kristof Provost	fc4443a1d5	Remove stale comment. The IPv6 pseudo header checksum was added by bz in r235961. Sponsored by: Essen FreeBSD Hackathon	2015-07-25 16:14:55 +00:00
Konstantin Belousov	6fd04eff66	With the removal of b_saveaddr in the r285819, b_data must be reset to b_kvabase when the buffer is reclaimed. Otherwise, if b_data for the mapped buffer was adjusted with the page-offset portion of b_offset, nothing would re-adjust the b_data, which breaks buffer management code which expects page-aligned b_data (see e.g. bpman_qenter(), which skips partial pages). Fix a minor issue with the GB_KVAALLOC requests, which could result in returning the mapped buffer if the reused buffer is mapped and has the right amount of KVA reserved. Improve assertion in the vfs_buf_check_mapped() to catch unmapped buffers which have their b_data incorrectly adjusted with offset. Reported and tested by: pho (previous version) Reviewed by: jeff (previous version) Sponsored by: The FreeBSD Foundation	2015-07-25 15:00:14 +00:00
Edward Tomasz Napierala	933333caf8	Document md_root in loader(8). The md(4) manual page mentions it, but it's hard to find and easy to miss. Reviewed by: wblock@ MFC after: 2 weeks Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3183	2015-07-25 13:02:41 +00:00
Oleksandr Tymoshenko	4f5f0f288f	Fix color mapping for TDA19988. Values for VIP_CNTRL_1 and VIP_CNTRL_2 registers were mixed up	2015-07-25 03:19:02 +00:00
Oleksandr Tymoshenko	b8397a9f01	Synchronize PIN input/output modes with gnu/dts/include/dt-bindings/pinctrl/am33xx.h gpio driver requires exact value to match SoC pin mode with GPIO pin direction	2015-07-25 03:03:32 +00:00
Oleksandr Tymoshenko	5625a3e560	If there is panel info in DTB do not wait for HDMI event and setup framebuffer immediately	2015-07-25 02:59:45 +00:00
Oleksandr Tymoshenko	7339f7821b	OF_getencprop_alloc shouldn't be used to get string value. If string length + 1 is not divisible by 4 this function returns NULL property value. Otherwise - string with each 4 letters inverted	2015-07-25 00:58:50 +00:00
Xin LI	1a7c14aec7	Fix a typo in comment. Submitted by: Yanhui Shen via twitter MFC after: 3 days	2015-07-24 22:13:39 +00:00
Alan Cox	d8b56c8eab	Add a comment discussing the appropriate use of the atomic_() functions with acquire and release semantics versus the mb() functions on amd64 processors. Reviewed by: bde (an earlier version), kib Sponsored by: EMC / Isilon Storage Division	2015-07-24 19:43:18 +00:00
Marius Strobl	86fb540033	- Since r253161, uart_intr() abuses FILTER_SCHEDULE_THREAD for signaling uart_bus_attach() during its test that 20 iterations weren't sufficient for clearing all pending interrupts, assuming this means that hardware is broken and doesn't deassert interrupts. However, under pressure, 20 iterations also can be insufficient for clearing all pending interrupts, leading to a panic as intr_event_handle() tries to schedule an interrupt handler not registered. Solve this by introducing a flag that is set in test mode and otherwise restores pre-r253161 behavior of uart_intr(). The approach of additionally registering uart_intr() as handler as suggested in PR 194979 is not taken as that in turn would abuse special pccard and pccbb handling code of intr_event_handle(). [1] - Const'ify uart_driver_name. - Fix some minor style bugs. PR: 194979 [1] Reviewed by: marcel (earlier version) MFC after: 3 days	2015-07-24 17:01:16 +00:00
Ed Maste	119b75925c	Add RISC-V ELF machine type definition EM_RISCV is now officially registered as e_machine 243. MFC after: 1 month Sponsored by: The FreeBSD Foundation	2015-07-24 16:52:21 +00:00
Marius Strobl	e18e2adaae	- In mpt_send_handshake_cmd(), use bus_space_write_stream_4(9) for writing raw data to the doorbell offset in order to clarify the intent and for avoiding unnecessarily converting the endianness back and forth. Unfortunately, the same can't be done in mpt_recv_handshake_reply() as 16-bit data needs to be read using 32-bit bus accessors. - In mpt_recv_handshake_reply(), get rid of a redundant variable. MFC after: 1 fortnight	2015-07-24 16:00:35 +00:00
Marius Strobl	7815d3948c	o Revert the other functional half of r239864, i. e. the merge of r134227 from x86 to use smp_ipi_mtx spin lock not only for smp_rendezvous_cpus() but also for the MD cache invalidation, TLB demapping and remote register reading IPIs due to the following reasons: - The cross-IPI SMP deadlock x86 otherwise is subject to can't happen on sparc64. That's because on sparc64, spin locks don't disable interrupts completely but only raise the processor interrupt level to PIL_TICK. This means that IPIs still get delivered and direct dispatch IPIs such as the cache invalidation etc. IPIs in question are still executed. - In smp_rendezvous_cpus(), smp_ipi_mtx is held not only while sending an IPI_RENDEZVOUS, but until all CPUs have processed smp_rendezvous_action(). Consequently, smp_ipi_mtx may be locked for an extended amount of time as queued IPIs (as opposed to the direct ones) such as IPI_RENDEZVOUS are scheduled via a soft interrupt. Moreover, given that this soft interrupt is only delivered at PIL_RENDEZVOUS, processing of smp_rendezvous_action() on a target may be interrupted by f. e. a tick interrupt at PIL_TICK, in turn leading to the target in question trying to send an IPI by itself while IPI_RENDEZVOUS isn't fully handled, yet, and, thus, resulting in a deadlock. o As mentioned in the commit message of r245850, on at least some sun4u platforms concurrent sending of IPIs by different CPUs is fatal. Therefore, hold the reintroduced MD ipi_mtx also while delivering cross-traps via MI helpers, i. e. ipi_{all_but_self,cpu,selected}(). o Akin to x86, let the last CPU to process cpu_mp_bootstrap() set smp_started instead of the BSP in cpu_mp_unleash(). This ensures that all APs actually are started, when smp_started is no longer 0. o In all MD and MI IPI helpers, check for smp_started == 1 rather than for smp_cpus > 1 or nothing at all. This avoids races during boot causing IPIs trying to be delivered to APs that in fact aren't up and running, yet. While at it, move setting of the cpu_ipi_{selected,single}() pointers to the appropriate delivery functions from mp_init() to cpu_mp_start() where it's better suited and allows to get rid of the global isjbus variable. o Given that now concurrent IPI delivery no longer is possible, also nuke the delays before completely disabling interrupts again in the CPU-specific cross-trap delivery functions, previously giving other CPUs a window for sending IPIs on their part. Actually, we now should be able to entirely get rid of completely disabling interrupts in these functions. Such a change needs more testing, though. o In {s,}tick_get_timecount_mp(), make the {s,}tick variable static. While not necessary for correctness, this avoids page faults when accessing the stack of a foreign CPU as {s,}tick now is locked into the TLBs as part of static kernel data. Hence, {s,}tick_get_timecount_mp() always execute as fast as possible, avoiding jitter. PR: 201245 MFC after: 3 days	2015-07-24 15:13:21 +00:00
Randall Stewart	5f98acb594	Fix silly syntax error emacs chugged in for me.. gesh. MFC after: 3 weeks	2015-07-24 14:13:43 +00:00
Randall Stewart	c616859963	Fix an issue with MAC OS locking and also optimize the case where we are sending back a stream-reset and a sack timer is running, in that case we should just send the SACK. MFC after: 3 weeks	2015-07-24 14:09:03 +00:00
Ed Schouten	4615998165	Implement the basic system calls that operate on pathnames. Summary: Unlike FreeBSD, CloudABI does not use null terminated strings for its pathnames. Introduce a function called copyin_path() that can be used by all of the filesystem system calls that use pathnames. This change already implements the system calls that don't depend on any additional functionality (e.g., conversion of struct stat). Also implement the socket system calls that operate on pathnames, namely the ones used by the C library functions bindat() and connectat(). These don't receive a 'struct sockaddr_un', but just the pathname, meaning they could be implemented in such a way that they don't depend on the size of sun_path. For now, just use the existing interfaces. Add a missing #include to cloudabi_syscalldefs.h to get this code to build, as one of its macros depends on UINT64_C(). Test Plan: These implementations have already been tested in the CloudABI branch on GitHub. They pass all of the tests. Reviewers: kib, pjd Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3097	2015-07-24 07:46:02 +00:00
Warner Losh	d2e3ed5af6	Panic when a device is trying to recursively acquire rather than hang indefinitely. Improve error messages from other panics.	2015-07-24 04:56:46 +00:00
Sergey Kandaurov	ef88ae77ea	Call ksem_get() with initialized 'rights'. ksem_get() consumes fget(), and it's mandatory there. Reported by: truckman Reviewed by: mjg	2015-07-23 23:18:03 +00:00
Jeff Roberson	fade8dd714	Refactor unmapped buffer address handling. - Use pointer assignment rather than a combination of pointers and flags to switch buffers between unmapped and mapped. This eliminates multiple flags and generally simplifies the logic. - Eliminate b_saveaddr since it is only used with pager bufs which have their b_data re-initialized on each allocation. - Gather up some convenience routines in the buffer cache for manipulating buf space and buf malloc space. - Add an inline, buf_mapped(), to standardize checks around unmapped buffers. In collaboration with: mlaier Reviewed by: kib Tested by: pho (many small revisions ago) Sponsored by: EMC / Isilon Storage Division	2015-07-23 19:13:41 +00:00
Jim Harris	cbdec09c1c	nvme: ensure csts.rdy bit is cleared before returning from nvme_ctrlr_disable PR: 200458 MFC after: 3 days Sponsored by: Intel	2015-07-23 15:50:39 +00:00
Jim Harris	de9a58f4ee	nvme: properly handle case where pci_alloc_msix does not alloc all vectors Reported by: Sean Kelly <smkelly@smkelly.org> MFC after: 3 days Sponsored by: Intel	2015-07-23 15:35:08 +00:00
Ed Schouten	fef97e09d9	Allow us to create UNIX sockets and socketpairs in CloudABI processes.	2015-07-23 13:52:53 +00:00
Ed Schouten	cf6b9e9b07	Allow cap_rights_{set,clear,is_set} to be called with no arguments. In the CloudABI code I sometimes call into cap_rights_* without providing any arguments. Though one could argue that this doesn't make sense, in this specific case it's hard to avoid, as the rights that should be tested against are forwarded by a couple of wrapper macros.	2015-07-23 11:11:01 +00:00
Jeff Roberson	1c1ddc0351	- Don't defeat the FIFO nature of the buffer cache by eliminating the most recently used buffer when we are under paging pressure. This is a perversion of the buffer and page replacement algorithms and recent improvements to the page daemon have rendered it unnecessary. In the event that low-memory deadlocks become an issue it would be possible to make a daemon or event handler that performs a similar action on the oldest buffers rather than the newest. Since the buf cache is analogous to the page cache and some minimum working set is desired another possibility is to simply shrink the minimum working set which has less downside now that file pages are not directly mapped. Sponsored by: EMC / Isilon Reviewed by: alc, kib (with some minor objection) Tested by: pho	2015-07-23 02:20:41 +00:00
Conrad Meyer	6b8c5d92a4	vt: cpu logos: Correct reversed 0/1 beastie descriptions Differential Revision: https://reviews.freebsd.org/D3158 Approved by: markj (mentor) Obtained from: Pavel Timofeev MFC after: 1 week	2015-07-22 23:30:54 +00:00
Conrad Meyer	f39130e75d	vt: Change default CPU logo to Orb Differential Revision: https://reviews.freebsd.org/D3156 Approved by: markj (mentor) MFC after: 1 week	2015-07-22 23:23:12 +00:00
Conrad Meyer	6d2b01fc54	vt: Default to cpu logos off Apologies, this was how it was supposed to land. Mea culpa. Differential Revision: https://reviews.freebsd.org/D3157 Reviewed by: gnn, hiren Approved by: markj (mentor) MFC after: 1 week	2015-07-22 23:19:53 +00:00
Conrad Meyer	8ef2f53c59	vt_core.c: Use do/while to highlight missed semi-colon errors Also, fix some nearby #define whitespace while here. (Style cleanup for r285794.) Suggested by: jmg Differential Revision: https://reviews.freebsd.org/D3154 Approved by: markj (mentor) MFC after: 1 week Sponsored by: EMC / Isilon Storage Division	2015-07-22 18:50:47 +00:00
Jung-uk Kim	0594dadeb8	Catch up with ACPICA 20150717.	2015-07-22 16:26:17 +00:00
Andrew Rybchenko	e31b688a57	sfxge: added fallbacks for pre 4.2.1 firmware support Driver must be able to start against older firmware that is missing recently added MCDI calls, otherwise firmware upgrade will not be possible. Submitted by: Richard Houldsworth <rhouldsworth at solarflare.com> Sponsored by: Solarflare Communications, Inc. MFC after: 2 days Differential Revision: https://reviews.freebsd.org/D3145	2015-07-22 16:25:18 +00:00
Jung-uk Kim	fe0f0bbb19	Merge ACPICA 20150717.	2015-07-22 16:25:07 +00:00
Conrad Meyer	5f7d6682c5	vt: Unbreak build on no-splash configurations PR: 201751 Differential Revision: https://reviews.freebsd.org/D3151 Tested by: Andrey Fesenko Approved by: markj (mentor) MFC after: 1 week	2015-07-22 15:30:10 +00:00
Randall Stewart	7cca17758c	Fix several problems with Stream Reset. 1) We were not handling (or sending) the IN_PROGRESS case if the other side (or our side) was not able to reset (awaiting more data). 2) We would improperly send a stream-reset when we should not. Not waiting until the TSN had been assigned when data was inqueue. Reviewed by: tuexen	2015-07-22 11:30:37 +00:00
Ed Schouten	c989441af6	Regenerate system call table.	2015-07-22 10:05:46 +00:00
Ed Schouten	73dcd7db56	Import upstream changes to the system call definitions. Support has been added for providing the scope of a futex operation, whether the futex is local to the process or shared between processes.	2015-07-22 10:04:53 +00:00
Zbigniew Bodek	0af6011a92	Introduce support for MSI-X interrupts in AHCI - Allocate resources for MSI-X table and PBA if necessary - Add function ahci_free_mem() to free all resources Reviewed by: jhb, mav Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3009	2015-07-22 09:46:22 +00:00
Randall Stewart	f260c1b939	Fix inverted logic bug that David Wolfskill found (thanks David!) MFC after: 3 Weeks	2015-07-22 09:29:50 +00:00
Konstantin Belousov	c48c590f63	Remove duplicate and useless declarations. Submitted by: bde	2015-07-22 09:12:40 +00:00
Ed Schouten	8bc7851803	Add Makefiles for CloudABI kernel modules. Place all of the machine/pointer size independent code in a kernel module called 'cloudabi'. All of the 64-bit specific code goes in a separate module called 'cloudabi64'. The latter is only enabled on amd64, as it is the only architecture supported.	2015-07-22 07:32:49 +00:00
Wei Hu	5f302628d0	Do not enable UDP checksum offloading when running on the Hyper-V on Windows Server 2012 and earlier hosts. Submitted by: whu Reviewed by: royger Approved by: royger MFC after: 3 days Relnotes: No Sponsored by: Microsoft OSTC Differential Revision: https://reviews.freebsd.org/D3086	2015-07-22 05:05:01 +00:00
Luiz Otavio O Souza	315dbfb053	Cosmetic change. When printing the child's mapped pins, use the plural only when necessary. Reported by: Daniel O'Connor <darius@dons.net.au>, Sulev-Madis Silber (ketas)	2015-07-22 04:18:33 +00:00
John Baldwin	9a2d6ab990	Various changes to the registers displayed in DDB for x86. - Fix segment registers to only display the low 16 bits. - Remove unused handlers and entries for the debug registers. - Display xcr0 (if valid) in 'show sysregs'. - Add '0x' prefix to MSR values to match other values in 'show sysregs'. - MFamd64: Display various MSRs in 'show sysregs'. - Add a 'show dbregs' to display the value of debug registers. - Dynamically size the column width for register values to properly align columns on 64-bit platforms. - Display %gs for i386 in 'show registers'. Differential Revision: https://reviews.freebsd.org/D2784 Reviewed by: kib, markj MFC after: 2 weeks	2015-07-22 01:09:02 +00:00
Mark Johnston	d258fd1d98	Remove checks for a NULL return value from M_WAITOK allocations.	2015-07-21 23:44:36 +00:00
Xin LI	47a8e86509	Fix resource exhaustion due to sessions stuck in LAST_ACK state. Submitted by: Jonathan Looney (Juniper SIRT) Reviewed by: lstewart Security: CVE-2015-5358 Security: SA-15:13.tcp	2015-07-21 23:42:15 +00:00
Mark Johnston	a5cbf8b9c0	Let the unwinder handle faults during function prologues or epilogues. The i386 and amd64 DDB stack unwinders contain code to detect and handle the case where the first frame is not completely set up or torn down. This code was accidentally unused however, since db_backtrace() was never called with a non-NULL trap frame. This change fixes that. Also remove get_rsp() from the amd64 code. It appears to have come from i386, which needs to take into account whether the exception triggered a CPL switch, since SS:ESP is only pushed onto the stack if so. On amd64, SS:RSP is pushed regardless, so get_rsp() was doing the wrong thing for kernel-mode exceptions. As a result, we can also remove custom print functions for these registers. Reviewed by: jhb Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2881	2015-07-21 23:22:23 +00:00
Mark Johnston	f8a757d016	Improve stack unwinding on i386 and amd64 after an IP fault. If we can't find a symbol corresponding to the faulting instruction, assume that the previously-executed function is a call and attempt to find the calling function using the return address on the stack. Otherwise we end up associating the last stack frame with the current call, which is incorrect and causes the unwinder to skip printing of the calling function, resulting in a confusing backtrace. Reviewed by: jhb Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2859	2015-07-21 23:13:11 +00:00
Mark Johnston	e31a60b486	Don't return undefined symbols to a DDB symbol lookup. Undefined symbols have a value of zero, so it makes no sense to return such a symbol when performing a lookup by value. This occurs for example when unwinding the stack after calling a NULL function pointer, and we confusingly report the faulting function as uart_sab82532_class() on amd64. Convert db_print_loc_and_inst() to only attempt disassembly if we managed to find a symbol corresponding to the IP. Otherwise we may fault and re-enter the debugger. Reviewed by: jhb Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2858	2015-07-21 23:07:55 +00:00
Mark Johnston	1a5bee0849	Remove some dead code from DDB's amd64 stack unwinder. The amd64 port copied some code from i386 to fetch function arguments and display them in backtraces. However, it was commented out and can't easily be implemented since the function arguments are passed in registers rather than on the stack in amd64. Remove it in preparation for some bug fixes in this area. Reviewed by: jhb Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2857	2015-07-21 23:03:21 +00:00
Konstantin Belousov	e637a6e3f9	The smp_rendezvous_cpus() function should ensure that all accesses done by the functions called on other CPUs, are visible to the caller. Pair otherwise useless acquire on smp_rv_waiters[3] with a release add to ensure synchronized with relation, which guarantees visibility. Reviewed by: alc Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-21 22:56:46 +00:00
Ermal Luçi	705f4d9c6a	IPSEC, remove variable argument function, it's already due. Differential Revision: https://reviews.freebsd.org/D3080 Reviewed by: gnn, ae Approved by: gnn(mentor)	2015-07-21 21:46:24 +00:00
Eric Joyner	39020fdfa0	Fix for a customer issue with ixl(4): - Add required MAC/VLAN filter when adding an LAA - Fix bug where code did not check for I40E_SUCCESS from a successful i40e_validate_mac_address() call in ixl_init_locked(), when setting an LAA. PR: 201240 Differential Revision: https://reviews.freebsd.org/D3111 Submitted by: Gregory Rose <gregory.v.rose@intel.com> Reviewed by: gnn, rstone Approved by: gnn MFC after: 2 weeks	2015-07-21 21:07:18 +00:00
Jim Harris	70fb74bd12	nvd: set d_delmaxsize to full capacity of NVMe namespace The NVMe specification has no ability to specify a maximum delete size that is less than the full capacity of the namespace - so just using the namespace size is the correct value here. This fixes reported issues where ZFS trim on init looked like it was hanging the system - previously the default I/O max size (128KB on Intel NVMe controllers) was used for delete operations which worked out to only about 8MB/s. With this patch I can add an 800GB DC P3700 drive to a ZFS pool in about 15-20 seconds. Reported by: Dylan Just <dylan@techtangents.com> MFC after: 3 days Sponsored by: Intel	2015-07-21 20:53:21 +00:00
Conrad Meyer	75ac3a7359	vt: Draw logos per CPU core This feature is inspired by another Unix-alike OS commonly found on airplane headrests. A number of beasties[0] are drawn at top of framebuffer during boot, based on the number of active SMP CPUs[1]. Console buffer output continues to scroll in the screen area below beastie(s)[2]. After some time[3] has passed, the beasties are erased leaving the entire terminal for use. Includes two 80x80 vga16 beastie graphics and an 80x80 vga16 orb graphic. (The graphics are RLE compressed to save some space -- 3x 3200 bytes uncompressed, or 4208 compressed.) [0]: The user may select the style of beastie with kern.vt.splash_cpu_style=(0|1|2) [1]: Or the number may be overridden with tunable kern.vt.splash_ncpu. [2]: https://www.youtube.com/watch?v=UP2jizfr3_o [3]: Configurable with kern.vt.splash_cpu_duration (seconds, def. 10). Differential Revision: https://reviews.freebsd.org/D2181 Reviewed by: dumbbell, emaste Approved by: markj (mentor) MFC after: 2 weeks	2015-07-21 20:33:36 +00:00
Conrad Meyer	bcfb2e3dd2	vt: De-static VT_SYSCTL_INT-defined objects Explicitly mark existing VT_SYSCTL_INTs static. This is in preparation for D2181. Reviewed by: dumbbell, emaste Approved by: markj (mentor) MFC after: 1 week	2015-07-21 20:30:06 +00:00
Andrew Turner	4027d3d62a	Teach the GICv2 driver about the Qualcomm GICv2 compatible string. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-21 18:08:10 +00:00
Zbigniew Bodek	3ed97a1a52	Add some more explanation to r285752 Add brief commentary to vendor-specific devid function in ITS and remove redundant spaces by the way. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation	2015-07-21 17:14:24 +00:00
Zbigniew Bodek	9920b3aa95	Don't allow malloc() to wait for resource while holding a lock in ITS malloc() should not go to sleep in case of lack of resource while the kernel thread is holding a non-sleepable lock. - change malloc() flags to M_NOWAIT in such cases - implement lpi_free_chunk() routine as it will be needed when ITT allocation fails in its_device_alloc_locked() - do not increase verbosity of this code since upper layers will communicate an error if the interrupt setup fails Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3121	2015-07-21 15:28:07 +00:00
Ed Schouten	072cb63ddc	Make clock_gettime() and clock_getres() work for CloudABI programs. Though the standard C library uses a 'struct timespec' using a 64-bit 'time_t', there is no need to use such a type at the system call level. CloudABI uses a simple 64-bit unsigned timestamp in nanoseconds. This is sufficient to express any time value from 1970 to 2554. The CloudABI low-level interface also supports fetching timestamp values with a lower precision. Instead of overloading the clock ID argument for this purpose, the system call provides a precision argument that may be used to specify the maximum slack. The current system call implementation does not use this information, but it's good to already have this available. Expose cloudabi_convert_timespec(), as we're going to need this for fstat() as well. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-21 15:08:13 +00:00
Zbigniew Bodek	1fe6a1a25a	Add support for vendor specific function for PCI devid acquisition in ITS It is possible that some HW will use different PCI devids, hence allow to replace the default domain:bus:slot:func schema by implementing and registering custom function. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3118	2015-07-21 14:47:23 +00:00
Konstantin Belousov	01f5e0866b	The part of r285680 which removed release semantic for two stores to it_need was wrong [*]. Restore the releases and add a comment explaining why it is needed. Noted by: alc [*] Reviewed by: bde [*] Sponsored by: The FreeBSD Foundation	2015-07-21 14:39:34 +00:00
Ed Schouten	d0da90b198	Describe COMPAT_CLOUDABI64 in the amd64 configuration NOTES file.	2015-07-21 12:53:47 +00:00
Zbigniew Bodek	52b584bc15	Implement get_cyclecount() on ARM64 Use Virtual Counter register associated with Generic Timer to read the cyclecount. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3134	2015-07-21 12:50:45 +00:00
Ed Schouten	21d30b29d5	Make thread creation work for CloudABI processes. Summary: Remove the stub system call that was put in place during the system call import and replace it by a target-dependent version stored in sys/amd64. Initialize the thread in a way similar to cpu_set_upcall_kse(). We provide the entry point with two arguments: the thread ID and the argument pointer. Test Plan: Thread creation still seems to work, both for FreeBSD and CloudABI binaries. Reviewers: dchagin, mjg, kib Reviewed By: kib Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3110	2015-07-21 12:47:15 +00:00
Zbigniew Bodek	13aaea2fd7	Improve ARM64 CPU_MATCH Add a method to identify CPU based on RAW MIDR value. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3117	2015-07-21 12:15:00 +00:00
Sergey Kandaurov	94df6fad1d	Fix sb_state constant names as used e.g. to display in DDB ``show sockbuf''. MFC after: 1 week	2015-07-21 09:57:13 +00:00
Randall Stewart	c0d1be08f6	When a tunneling protocol is being used with UDP we must release the lock on the INP before calling the tunnel protocol, else a LOR may occur (it does with SCTP for sure). Instead we must acquire a ref count and release the lock, taking care to allow for the case where the UDP socket has gone away and not unlocking since the refcnt decrement on the inp will do the unlock in that case. Reviewed by: tuexen MFC after: 3 weeks	2015-07-21 09:54:31 +00:00
Hiren Panchasara	4d5e6ef665	Remove a couple of TUNABLE_INT() calls which are unnecessary after r267961. r267961 did remove them but they "reappeared" when ixgbe(4) rewrite happened in r280182. Sponsored by: Limelight Networks	2015-07-21 06:48:36 +00:00
Konstantin Belousov	9b3df93bf1	Typo in comment.	2015-07-20 19:51:41 +00:00
Alexander Motin	d575325b81	Increase output amp on ASUS UX31A by +5dB. While there, implement couple helper functions.	2015-07-20 17:48:00 +00:00
Ed Schouten	62c31cffae	Make forking of CloudABI processes work. Just like FreeBSD+Capsicum, CloudABI uses process descriptors. Return the file descriptor number to the parent process. To the child process we both return a special value for the file descriptor number (CLOUDABI_PROCESS_CHILD). We also return the thread ID of the new thread in the copied process, so the threading library can reinitialize itself. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-20 13:46:22 +00:00
Ed Schouten	5a170c1b0e	Add an API for easily creating userspace threads in kernelspace. This change refactors the existing create_thread() function to be more generic. It replaces almost all of its arguments by a callback that can be used to extract the thread ID and copy it out to the right place, but also to perform additional initialization steps, such as setting the trapframe. This also makes the difference between thr_new() and thr_create() more clear in my opinion. This function is going to be used by the CloudABI compatibility layer. It looks like the OpenSolaris compatibility framework already provides a function called thread_create(). Rename this function to do_thread_create() and use a macro to deal with the namespacing conflict. A similar approach is already used for thread_exit(). MFC after: 1 month	2015-07-20 10:20:04 +00:00
Alexander Motin	d3e2e28e74	Fix typo in comment. Submitted by: Masao Uebayashi	2015-07-20 09:37:42 +00:00
Marko Zec	22a9384098	Prevent null-pointer dereferencing. MFC after: 3 days	2015-07-20 08:21:51 +00:00
Andrey V. Elsukov	af9aa0a837	Add helper functions for IP checksum adjusting. Use these functions in dummynet code and for setdscp. This fixes wrong checksums in some cases. Obtained from: Yandex LLC MFC after: 2 weeks Sponsored by: Yandex LLC	2015-07-20 07:26:31 +00:00
Andrey V. Elsukov	30aee13117	Add LLE event handler to report ND6 events to userland via rtsock. Obtained from: Yandex LLC MFC after: 2 weeks Sponsored by: Yandex LLC	2015-07-20 06:58:32 +00:00
Andrey V. Elsukov	585753c432	Invoke LLE event handler when entry is deleted. MFC after: 2 weeks Sponsored by: Yandex LLC	2015-07-20 06:54:50 +00:00
Mark Johnston	bd2519480e	Declare lockstat_enabled even when KDTRACE_HOOKS is not defined. Reported by: bz X-MFC-With: r285704	2015-07-20 04:41:25 +00:00
Marcel Moolenaar	be00e09818	Check the hw.proto.attach environment variable for devices that proto(4) should attach to instead of the normal driver. Document the variable.	2015-07-19 23:37:45 +00:00
Mark Johnston	97cc6870f6	Don't increment the spin count until after the first attempt to acquire a rwlock read lock. Otherwise the lockstat:::rw-spin probe will fire spuriously. MFC after: 1 week	2015-07-19 22:26:02 +00:00
Kirk McKusick	1b79b9498b	Restructure code for readability improvement. No functional change. Reviewed by: kib	2015-07-19 22:25:16 +00:00
Mark Johnston	de2c95cc00	Consistently use a reader/writer flag for lockstat probes in rwlock(9) and sx(9), rather than using the probe function name to determine whether a given lock is a read lock or a write lock. Update lockstat(1) accordingly.	2015-07-19 22:24:33 +00:00
Mark Johnston	32cd0147fa	Implement the lockstat provider using SDT(9) instead of the custom provider in lockstat.ko. This means that lockstat probes now have typed arguments and will utilize SDT probe hot-patching support when it arrives. Reviewed by: gnn Differential Revision: https://reviews.freebsd.org/D2993	2015-07-19 22:14:09 +00:00
Mark Murray	eda4aaeb3f	Fix some untidy logic. I committed the wrong local fix; please pass the pointy hat. Approved by: so (/dev/random blanket)	2015-07-19 18:07:35 +00:00
Luigi Rizzo	847adfb7b3	add a use count so the netmap module cannot be unloaded while in use.	2015-07-19 18:07:25 +00:00
Luigi Rizzo	10b8ef3d6a	properly destroy persistent vale ports	2015-07-19 18:06:30 +00:00
Luigi Rizzo	9694aad375	do not free NULL if pipe allocation fails	2015-07-19 18:05:49 +00:00
Luigi Rizzo	05f7605789	release a reference when stopping a monitor	2015-07-19 18:04:51 +00:00
Luigi Rizzo	85fe4e7c6b	small documentation update	2015-07-19 17:54:42 +00:00
Andrew Turner	70888b7ed5	Fix atomic_store_64, it should write the value passed in, not the value read by the load. Pointy Hat: andrew	2015-07-19 16:55:47 +00:00
Mark Murray	f703e79990	Remove out-of-date comments. Approved by: so (/dev/random blanket)	2015-07-19 16:05:34 +00:00
Mark Murray	dbefaadca8	Fix the read blocking so that it is interruptible and slow down the rate of console warning spamming while blocked. Approved by: so (/dev/random blanket)	2015-07-19 16:05:30 +00:00
Mark Murray	d657959305	Clarify the intent of the RANDOM_* options. Approved by: so (/dev/random blanket)	2015-07-19 16:05:26 +00:00
Mark Murray	95b184a048	Optimise the buffer-size calculation. It was possible to get one block too many. Approved by: so (/dev/random blanket)	2015-07-19 16:05:23 +00:00
Andrew Turner	a612bbfa12	Clean up the style of the armv6 atomic code. Sponsored by: ABT Systems Ltd	2015-07-19 15:44:51 +00:00
Andrew Turner	d6a2102846	Sort the ARM atomic functions to be in alphabetical order. Sponsored by: ABT Systems Ltd	2015-07-19 13:10:47 +00:00
Konstantin Belousov	a8e1bc2e14	Revert bit of the r285627, locore.s does not need include of opt_kstack_pages.h. The asm gets the right KSTACK_PAGES from the assym.s. Reported by: bz Sponsored by: The FreeBSD Foundation	2015-07-19 10:45:58 +00:00
Marcelo Araujo	f19e47d691	Add support to the jail framework to be able to mount linsysfs(5) and linprocfs(5). Differential Revision: D2846 Submitted by: Nikolai Lifanov <lifanov@mail.lifanov.com> Reviewed by: jamie	2015-07-19 08:52:35 +00:00
John-Mark Gurney	02bee582d0	move the prototype to the lib.h header.. This makes more sense, and it's an API between boot2.c and arm_init.S which calls it..	2015-07-18 22:47:46 +00:00
John-Mark Gurney	c09626461f	other fixes to make boot2 compile for IXP... Properly end the asm sections, and for some reason, main needs a prototype... If someone has a better fix, I'm all ears... Pointed out by: Berislav Purgar	2015-07-18 20:21:25 +00:00
John-Mark Gurney	8d0440e04b	revert r278579, this is in a different compile environment than the kernel, and needs to be named cpu_id... Pointed out by: Berislav Purgar	2015-07-18 20:19:51 +00:00
Konstantin Belousov	283dfee925	Further cleanup after r285607. Remove useless release semantic for some stores to it_need. For stores where the release is needed, add a comment explaining why. Fence after the atomic_cmpset() op on the it_need should be acquire only, release is not needed (see above). The combination of atomic_cmpset() + fence_acq() is better expressed there as atomic_cmpset_acq(). Use atomic_cmpset() for swi' ih_need read and clear. Discussed with: alc, bde Reviewed by: bde Comments wording provided by: bde Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-18 19:59:29 +00:00
Ian Lepore	e7b25f9168	Deselect the sd card before re-selecting it when working around a problem with some cards that causes them to become deselected after probing for switch capabilities. The old workaround fixes the behavior with some cards, but causes problems with the cards that behave correctly and don't become deselected. Forcing a deselect then reselect appears to work correctly with all cards in initial testing.	2015-07-18 16:56:51 +00:00
Luigi Rizzo	a6e8e92404	fix a typo in a comment	2015-07-18 15:28:32 +00:00
Konstantin Belousov	b4490c6e93	The si_status field of the siginfo_t, provided by the waitid(2) and SIGCHLD signal, should keep full 32 bits of the status passed to the _exit(2). Split the combined p_xstat of the struct proc into the separate exit status p_xexit for normal process exit, and signalled termination information p_xsig. Kernel-visible macro KW_EXITCODE() reconstructs old p_xstat from p_xexit and p_xsig. p_xexit contains complete status and copied out into si_status. Requested by: Joerg Schilling Reviewed by: jilles (previous version), pho Tested by: pho Sponsored by: The FreeBSD Foundation	2015-07-18 09:02:50 +00:00
Kevin Lo	ddee45244d	Since the IETF has redefined the meaning of the tos field to accommodate a set of differentiated services, set IPTOS_PREC_* macros using IPTOS_DSCP_* macro definitions. While here, add IPTOS_DSCP_VA macro according to RFC 5865. Differential Revision: https://reviews.freebsd.org/D3119 Reviewed by: gnn	2015-07-18 06:48:30 +00:00
Mark Johnston	c6d48c8752	Fix the !KDTRACE_HOOKS build. X-MFC-With: r285664	2015-07-18 04:38:11 +00:00
Mark Johnston	e2b25737ee	Pass the lock object to lockstat_nsecs() and return immediately if LO_NOPROFILE is set. Some timecounter handlers acquire a spin mutex, and we don't want to recurse if lockstat probes are enabled. PR: 201642 Reviewed by: avg MFC after: 3 days	2015-07-18 00:57:30 +00:00
Mark Johnston	efe8b26b82	Modify lockstat_nsecs() to just return unless lockstat probes are actually enabled. The cost of a timecounter read can be quite significant, and the problem became more apparent after r284297, since that change resulted in a call to lockstat_nsecs() for each acquisition of an rwlock read lock. PR: 201642 Reviewed by: avg Tested by: Jason Unovitch MFC after: 3 days Differential Revision: https://reviews.freebsd.org/D3073	2015-07-18 00:22:00 +00:00
Benno Rice	eacbeb2b95	Merge driver for PMC Sierra's range of SAS/SATA HBAs. Submitted by: Achim Leubner <Achim.Leubner@pmcs.com> Reviewed by: scottl	2015-07-17 23:30:43 +00:00
Ed Schouten	fd054c2df9	Undo r285656. It turns out that the CDDL sources already introduce a function called thread_create(). I'll investigate what we can do to make these functions coexist. Reported by: Ivan Klymenko	2015-07-17 22:26:45 +00:00
Benno Rice	a650d8699f	Enable pms module on amd64 for now.	2015-07-17 20:30:30 +00:00
Benno Rice	5894064d12	Disable debugging. Submitted by: Vasanthalakshmi Tharmarajan <Vasanthalakshmi.Tharmarajan@pmcs.com> Reviewed by: scottl	2015-07-17 20:29:47 +00:00
Patrick Kelsey	d57724fd46	Check TCP timestamp option flag so that the automatic receive buffer scaling code does not use an uninitialized timestamp echo reply value from the stack when timestamps are not enabled. Differential Revision: https://reviews.freebsd.org/D3060 Reviewed by: hiren Approved by: jmallett (mentor) MFC after: 3 days Sponsored by: Norse Corp, Inc.	2015-07-17 17:36:33 +00:00
Ed Schouten	82a3d2cbfc	Add an API for easily creating userspace threads in kernelspace. This change refactors the existing create_thread() function to be more generic. It replaces almost all of its arguments by a callback that can be used to extract the thread ID and copy it out to the right place, but also to perform additional initialization steps, such as setting the trapframe. This also makes the difference between thr_new() and thr_create() more clear in my opinion. This function is going to be used by the CloudABI compatibility layer. Reviewed by: kib MFC after: 1 month	2015-07-17 16:34:01 +00:00
Zbigniew Bodek	b8bbefed30	Fix possible coherency issues between PEs related to I-cache Based on B.2.3.4: Synchronization and coherency issues between data and instruction accesses. To ensure that modified instructions are visible to all PEs (Processing Elements) in a shareability domain one needs to perform the following sequence: 1. Clean D-cache 2. Ensure the visibility of data cleaned from cache 3. Invalidate I-cache 4. Ensure completion 5. In SMP system PE must issue isb to ensure execution of the modified instructions Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3106	2015-07-17 14:33:47 +00:00
Zbigniew Bodek	ab89029bd0	Fix secondary stacks calculation on ARM64 Secondary stack calculation is modified to provide stack_top = secondary_stacks + (cpu_id) * PAGE_SIZE * KSTACK_PAGES because on ARM64 the stack grows to lower memory addresses. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3107	2015-07-17 14:08:08 +00:00
Zbigniew Bodek	d5dfc8ad00	Increase DMAP (Direct Map) size on ARM64 Previous DMAP size was too small for systems with more than 64GB of RAM. Increase it to 128GB to support ThunderX CRB. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3113	2015-07-17 13:58:00 +00:00
Ed Schouten	6256e57ba9	Implement CloudABI memory management system calls. Add support for the <sys/mman.h> functions by wrapping around our own implementations. There are no kern_() variants of these system calls, but we also don't need them in this case. It is sufficient to just call into the sys_() functions. Differential Revision: https://reviews.freebsd.org/D3033 Reviewed by: brooks	2015-07-17 09:00:38 +00:00
Navdeep Parhar	a1ed88571f	cxgbe(4): Ask the firmware for the start of the RSS slice for a port and save it for later. This enables direct manipulation of the indirection tables (although the stock driver doesn't do that right now). MFC after: 1 month	2015-07-17 06:46:18 +00:00
Konstantin Belousov	888e282ab4	When checking for the valid value of the frame pointer, verify that it belongs to the kernel stack address range for the thread. Right now, code checks that new frame is not farther than KSTACK_PAGES pages from the current frame, which allows the address to point past the top of the stack. Reviewed by: andrew, emaste, markj Differential revision: https://reviews.freebsd.org/D3108 Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-16 19:40:18 +00:00
Ed Schouten	6e5fcd99df	Add a sysentvec for CloudABI on x86-64. Summary: For CloudABI we need to put two things on the stack of new processes: the argument data (a binary blob; not strings) and a startup data structure. The startup data structure contains interesting things such as a pointer to the ELF program header, the thread ID of the initial thread, a stack smashing protection canary, and a pointer to the argument data. Fetching system call arguments and setting the return value is similar to FreeBSD. The only differences are that system call 0 does not exist and that we call into cloudabi_convert_errno() to convert the error code. We also need this function in a couple of other places, so we'd better reuse it here. Reviewers: dchagin, kib Reviewed By: kib Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3098	2015-07-16 18:24:06 +00:00
Sean Bruno	f46fb03de7	Add an adapter CORE lock in the DDB hook em_dump_queue to avoid WITNESS panic in em_init_locked() while debugging. MFC after: 2 weeks Sponsored by: Limelight Networks	2015-07-16 16:32:57 +00:00
Hans Petter Selasky	a529288d65	Optimise the DWC OTG host mode driver's transmit path: 1) Use the TX FIFO empty interrupts to poll the transmit FIFO usage, instead of using own software counters and waiting for SOF interrupts. Assume that enough FIFO space is available to execute one USB OUT transfer of any kind when the TX FIFO is empty. 2) Use the host channel halted event to asynchronously wait for host channels to be disabled instead of waiting for SOF interrupts. This results in less turnaround time for re-using host channels and at the same time increases the performance. The network transmit performance measured by "iperf" for the "RPi-B v1 2011/12" board, increased from 45MBit/s to 65Mbit/s after applying the changes above. No regressions seen using: - High Speed (BULK, CONTROL, INTERRUPT) - Full Speed (All transfer types) - Low Speed (Control and Interrupt) MFC after: 1 month Submitted by: Daisuke Aoyama <aoyama@peach.ne.jp>	2015-07-16 16:08:40 +00:00
Mateusz Guzik	2919a0c5c1	fd: partially deduplicate fdescfree and fdescfree_remapped This also moves vrele of cdir/rdir/jdir vnodes earlier, which should not matter.	2015-07-16 15:26:37 +00:00
Mateusz Guzik	cd672ca60f	Get rid of lim_update_thread and cred_update_thread. Their primary use was in thread_cow_update to free up old resources. Freeing had to be done with proc lock held and _cow_ funcs already knew how to free old structs.	2015-07-16 14:30:11 +00:00
Mateusz Guzik	752fc07d33	vfs: implement v_holdcnt/v_usecount manipulation using atomic ops Transitions 0->1 and 1->0 (which decide e.g. on putting the vnode on the free list) of either counter are still guarded with vnode interlock. Reviewed by: kib (earlier version) Tested by: pho	2015-07-16 13:57:05 +00:00
Andrew Turner	8fa2222f46	Split out the arm and armv6 parts of atomic.h to new files. While here use __ARM_ARCH to determine which revision of the architecture is applicable. Sponsored by: ABT Systems Ltd	2015-07-16 13:33:03 +00:00
Konstantin Belousov	1ef630fb33	Fix warnings about unused functions for UP build. Sponsored by: The FreeBSD Foundation	2015-07-16 12:16:42 +00:00
Christian Brueffer	16858c207b	Actually recognize all Intel Lynx Point devices we have device IDs for. PR: 195851 Submitted by: ftigeot@wolfpond.org MFC after: 1 week	2015-07-16 11:14:59 +00:00
Zbigniew Bodek	721555e7ee	Fix KSTACK_PAGES issue when the default value was changed in KERNCONF If KSTACK_PAGES was changed to anything else than the default, the value from param.h was taken instead in some places and the value from KERNCONF in some others. This resulted in inconsistency which caused corruption in SMP environment. Ensure all places where KSTACK_PAGES are used the opt_kstack_pages.h is included. The file opt_kstack_pages.h could not be included in param.h because it was breaking the toolchain compilation. Reviewed by: kib Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3094	2015-07-16 10:46:52 +00:00
Zbigniew Bodek	1038d102c4	Set-up proper TCR values for memory related to Translation Table Walking This commit adds proper cache and shareability attributes to the TCR register. Set memory attributes to Normal, outer and inner cacheable WBWA. Set shareability to inner and outer shareable when SMP is enabled. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3093	2015-07-16 10:22:57 +00:00
Kevin Lo	f7c698e20d	Fix typo in register definition. Submitted by: James Hung Reviewed by: sbruno	2015-07-16 08:03:23 +00:00
Ed Schouten	457f7e23b1	Implement CloudABI's exec() call. Summary: In a runtime that is purely based on capability-based security, there is a strong emphasis on how programs start their execution. We need to make sure that we execute a new program with an exact set of file descriptors, ensuring that credentials are not leaked into the process accidentally. Providing the right file descriptors is just half the problem. There also needs to be a framework in place that gives meaning to these file descriptors. How does a CloudABI mail server know which of the file descriptors corresponds to the socket that receives incoming emails? Furthermore, how will this mail server acquire its configuration parameters, as it cannot open a configuration file from a global path on disk? CloudABI solves this problem by replacing traditional string command line arguments by a tree-like data structure consisting of scalars, sequences and mappings (similar to YAML/JSON). In this structure, file descriptors are treated as a first-class citizen. When calling exec(), file descriptors are passed on to the new executable if and only if they are referenced from this tree structure. See the cloudabi-run(1) man page for more details and examples (sysutils/cloudabi-utils). Fortunately, the kernel does not need to care about this tree structure at all. The C library is responsible for serializing and deserializing, but also for extracting the list of referenced file descriptors. The system call only receives a copy of the serialized data and a layout of what the new file descriptor table should look like: int proc_exec(int execfd, const void *data, size_t datalen, const int *fds, size_t fdslen); This change introduces a set of fd*_remapped() functions: - fdcopy_remapped() pulls a copy of a file descriptor table, remapping all of the file descriptors according to the provided mapping table. 
- fdinstall_remapped() replaces the file descriptor table of the process by the copy created by fdcopy_remapped(). - fdescfree_remapped() frees the table in case we aborted before fdinstall_remapped(). We then add a function exec_copyin_data_fds() that builds on top of these functions. It copies in the data and constructs a new remapped file descriptor. This is used by cloudabi_sys_proc_exec(). Test Plan: cloudabi-run(1) is capable of spawning processes successfully, providing it data and file descriptors. procstat -f seems to confirm all is good. Regular FreeBSD processes also work properly. Reviewers: kib, mjg Reviewed By: mjg Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3079	2015-07-16 07:05:42 +00:00
Justin Hibbits	96f3c2adbe	Fix userland program exception handling for powerpc64. It appears that the linker will not handle 64-bit relocations at addresses that are not aligned to 8-byte boundaries. Prior to this change the line: .llong generictrap was aligned to a 4-byte address, and the linker replaced that with an 8-byte 0x0. Aligning that address to 8 bytes caused the linker to generate the proper relocation. As a follow-through, the dblow from trap_subr33.S used the code sequence 'lwz %r1, TRAP_GENTRAP(0)', so this reproduces the analogue of that for 64-bit.	2015-07-16 05:13:08 +00:00
Neel Natu	62145ff347	If uart interrupts are not functioning then schedule the callout to do the polling at device attach time [1]. Add tunables 'debug.uart_force_poll' and 'debug.uart_poll_freq' to control uart polling. Submitted by: Aleksey Kuleshov (rndfax@yandex.ru) [1]	2015-07-16 04:15:22 +00:00
Konstantin Belousov	70a3efc14f	Do not use atomic_swap_int(9), it is not available on all architectures. Atomic_cmpset_int(9) is a direct replacement, due to loop. The change fixes arm, arm64, mips and sparc64, which lack atomic_swap(). Suggested and reviewed by: alc Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-15 21:44:16 +00:00
Konstantin Belousov	615b6ea2c8	Reset non-zero it_need indicator to zero atomically with fetching its current value. It is believed that the change is the real fix for the issue which was covered over by the r252683. With the current code, if the interrupt handler sets it_need between read and consequent reset, the update could be lost and ithread_execute_handlers() would not be called in response to the lost update. The r252683 could have hidden the issue since at the moment of commit, atomic_load_acq_int() did locked cmpxchg on the variable, which puts the cache line into the exclusive owned state and clears store buffers. Then the immediate store of zero has very high chance of reusing the exclusive state of the cache line and make the load and store sequence operate as atomic swap. For now, add the acq+rel fence immediately after the swap, to not disturb current (but excessive) ordering. Acquire is needed for the ih_need reads after the load, while release does not serve a useful purpose [*]. Reviewed by: alc Noted by: alc [*] Discussed with: bde Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-15 17:36:35 +00:00
Konstantin Belousov	03bbcb2f0c	Style. Remove excessive brackets. Compare non-boolean with zero. Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-15 17:14:05 +00:00
Andrew Turner	63e8633e80	Fix an infinite loop when a node doesn't have an interrupt-parent property. Submitted by: Aleksey Kuleshov <rndfax@yandex.ru> Differential Revision: https://reviews.freebsd.org/D3041	2015-07-15 13:28:25 +00:00
Alexander Motin	7dbe8f175b	MULTI_ID supported does not mean it is used.	2015-07-15 12:04:12 +00:00
Ed Schouten	952c6e1010	Implement the trivial socket system calls: shutdown() and listen().	2015-07-15 11:27:34 +00:00
Zbigniew Bodek	b49baf8065	Add identify_cpu() to ARM64 init_secondary routine Identify current CPU. This is necessary to setup affinity registers and to provide support for runtime chip identification. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3095	2015-07-15 09:24:45 +00:00
Ed Schouten	4fa92fb538	Make posix_fallocate() and posix_fadvise() work. We can map these system calls directly to the FreeBSD counterparts. The other filesystem related system calls will be sent out for review separately, as they are a bit more complex to get right.	2015-07-15 09:14:06 +00:00
Allan Jude	ce808c7ad8	Add a new option to gpart(8) to fix Lenovo BIOS boot issue PR: 184910 Reviewed by: ae, wblock Approved by: marcel MFC after: 3 days Relnotes: yes Sponsored by: ScaleEngine Inc. Differential Revision: https://reviews.freebsd.org/D3065	2015-07-15 02:23:55 +00:00
Patrick Kelsey	2ec930efea	Revert inadvertent change to amd64/GENERIC.	2015-07-15 01:04:54 +00:00
Patrick Kelsey	8aa7fdbd78	Add netmap support for ixgbe SRIOV VFs (that is, to if_ixv). Differential Revision: https://reviews.freebsd.org/D2923 Reviewed by: erj, gnn Approved by: jmallett (mentor) Sponsored by: Norse Corp, Inc.	2015-07-15 01:02:01 +00:00
Hiren Panchasara	fd3e9bafbd	Remove FreeBSD version check for deprecated M_FLOWID. Reviewed by: erj Sponsored by: Limelight Networks	2015-07-15 01:01:17 +00:00
Patrick Kelsey	c8ed84db3a	Fix ixgbe SRIOV VF (if_ixv) initialization bugs. The MAC address for an if_ixv instance can now be set at creation time, and the receive ring tail pointer is correctly initialized (previously, things still worked because the receive ring tail pointer was being fixed up as a side effect of other activity). Differential Revision: https://reviews.freebsd.org/D2922 Reviewed by: erj, gnn Approved by: jmallett (mentor) Sponsored by: Norse Corp, Inc.	2015-07-15 00:35:50 +00:00
Ed Schouten	bc41a24735	Fix the build after breaking it in r285549. I performed the commit on a different system than the one where I wrote the change. After pulling in the change from Phabricator, I didn't notice that a single chunk did not apply. Approved by: secteam (implicit, as intended change was approved) Pointy hat to: me	2015-07-14 20:45:24 +00:00
Andrew Turner	f3856d8fcb	Also accept "ok" to enable a device, some vendor device trees use this when they mean "okay"	2015-07-14 19:11:16 +00:00
Ed Schouten	707d98fe2f	Implement the CloudABI random_get() system call. The random_get() system call works similar to getentropy()/getrandom() on OpenBSD/Linux. It fills a buffer with random data. This change introduces a new function, read_random_uio(), that is used to implement read() on the random devices. We can call into this function from within the CloudABI compatibility layer. Approved by: secteam Reviewed by: jmg, markm, wblock Obtained from: https://github.com/NuxiNL/freebsd Differential Revision: https://reviews.freebsd.org/D3053	2015-07-14 18:45:15 +00:00
Mark Johnston	02d131ad11	Fix some error-handling bugs when core dump compression is enabled: - Ensure that core dump parameters are initialized in the error path. - Don't call gzio_fini() on a NULL stream. Reported by: rpaulo	2015-07-14 18:24:05 +00:00
Ed Schouten	460ac6370a	Regenerate system call table for r285540.	2015-07-14 15:12:24 +00:00
Ed Schouten	1eb7c7cae3	Implement thread_tcb_set() and thread_yield(). The first system call is used to set the user TLS address. Right now this system call is invoked by the C library for both the initial thread and additional threads unconditionally, but in the future we'll only call this if the architecture does not support this. On recent x86-64 CPUs we could use the WRFSBASE instruction. This system call was erroneously placed in sys/compat/cloudabi64, even though it does not depend on any pointer size dependent datastructure. Move it to the right place. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-14 15:11:50 +00:00
Ed Schouten	03744d7c8d	Implement {,p}{read,write}{,v}(). Add a routine similar to copyinuio() and freebsd32_copyinuio() that copies in CloudABI's struct iovecs. These are then translated into FreeBSD format and placed in a 'struct uio', so we can call into the kern_*() functions. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-14 14:33:21 +00:00
Andrew Turner	b7fbd410ab	Set memory to be inner-shareable. This isn't needed on device memory as the MMU will ignore the attribute there, however it simplifies the code to always set it. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-14 12:37:47 +00:00
Ed Schouten	f9675092b8	Let proc_raise() call into pksignal() directly. Summary: As discussed with kib@ in response to r285404, don't call into kern_sigaction() within proc_raise() to reset the signal to the default action before delivery. We'd better do that during image execution. Change the code to simply use pksignal(), so we don't waste cycles on functions like pfind() to look up the currently running process itself. Test Plan: This change has also been pushed into the cloudabi branch on GitHub. The raise() tests still seem to pass. Reviewers: kib Reviewed By: kib Subscribers: imp Differential Revision: https://reviews.freebsd.org/D3076	2015-07-14 12:16:14 +00:00
Zbigniew Bodek	d1be8e59e2	Fix secondary PIC initialization order Call arm_init_secondary before any other PIC-related functions are called. This is necessary for GICv3 where PIC_INIT_SECONDARY allocates resources needed for all further operations. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3066	2015-07-14 12:02:56 +00:00
Zbigniew Bodek	b7ac293f44	Fix intr_machdep.c for ARM64 On ARMv8 IPIs are mapped to 0-15. Incrementing the number by 16 is wrong, because it sets a reserved bit in the IPI register. This patch removes all "+16" to comply with specs. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3029	2015-07-14 11:59:43 +00:00
Christian Brueffer	f4c1eac7cd	Spell crypto correctly.	2015-07-14 10:47:56 +00:00
Hiren Panchasara	df7b11fa09	Expose full 32bit RSS hash from card regardless of whether RSS is defined or not. When doing multiqueue, we are all setup to have full 32bit RSS hash from the card. We do not need to hide that under "ifdef RSS" and should expose that by default so others like lagg(4) can use that and avoid hashing the traffic by themselves. While here, delete the FreeBSD version check and use of deprecated M_FLOWID. Reviewed by: adrian, erj MFC after: 1 week Sponsored by: Limelight Networks	2015-07-14 09:13:18 +00:00
Navdeep Parhar	c7dbd80213	cxgbe(4): Update T4 and T5 firmwares to 1.14.2.0. Obtained from: Chelsio Communications MFC after: 3 days	2015-07-14 08:02:05 +00:00
John-Mark Gurney	577f7474b0	Fix XTS, and name things a bit better... Though confusing, GCM using ICM_BLOCK_LEN, but ICM does not is correct... GCM is built on ICM, but uses a function other than swcr_encdec... swcr_encdec cannot handle partial blocks which is why it must still use AES_BLOCK_LEN and is why XTS was broken by the commit... Thanks to the tests for helping make sure I didn't break GCM w/ an earlier patch... I did run the tests w/o this patch, and need to figure out why they did not fail, clearly more tests are needed... Prodded by: peter	2015-07-14 07:45:18 +00:00
John-Mark Gurney	e0b231cbc8	fix typos.. Submitted by: brueffer	2015-07-14 06:34:57 +00:00
Adrian Chadd	85b543e06d	Populate hw.model with the CPU model information. Now you see something like: # sysctl hw.model hw.model: Atheros AR9330 rev 1 Tested: * Carambola 2, AR9331 SoC	2015-07-14 05:14:10 +00:00
John-Mark Gurney	b65946c631	cryptodev is not needed for TCP_SIGNATURE... Comment that cryptodev shouldn't be used unless you know what you're doing... The various arm/mips and one powerpc configs that have cryptodev in them need to be addressed, audited if they provide benefit and removed if they don't...	2015-07-14 05:09:58 +00:00
Conrad Meyer	0c40f3532d	Fix cleanup race between unp_dispose and unp_gc unp_dispose and unp_gc could race to teardown the same mbuf chains, which can lead to dereferencing freed filedesc pointers. This patch adds an IGNORE_RIGHTS flag on unpcbs marking the unpcb's RIGHTS as invalid/freed. The flag is protected by UNP_LIST_LOCK. To serialize against unp_gc, unp_dispose needs the socket object. Change the dom_dispose() KPI to take a socket object instead of an mbuf chain directly. PR: 194264 Differential Revision: https://reviews.freebsd.org/D3044 Reviewed by: mjg (earlier version) Approved by: markj (mentor) Obtained from: mjg MFC after: 1 month Sponsored by: EMC / Isilon Storage Division	2015-07-14 02:00:50 +00:00
Mateusz Guzik	6161705823	exec: textvp -> oldtextvp; binvp -> newtextvp This makes it consistent with the rest of the naming in do_execve. No functional changes.	2015-07-14 01:13:37 +00:00
Mateusz Guzik	853be5ffef	exec plug a redundant vref + vrele of the image vnode	2015-07-14 00:43:08 +00:00
Mateusz Guzik	e94e50af1d	racct: perform a lockless check for p_throttled This reduces proc lock contention. Reviewed by: trasz	2015-07-13 22:52:11 +00:00
Alexander Motin	d4f3ad3a26	Switch initiator IDs in target mode to the same address space as target IDs in initiator mode -- index in port database instead of handlers. This makes initiator IDs persist across role changes and firmware resets, when handlers previously assigned by firmware are lost and reused. Sponsored by: iXsystems, Inc.	2015-07-13 21:01:24 +00:00
Luiz Otavio O Souza	fb54940587	Bring a few simplifications to a10_gpio: o Return the real hardware state in gpio_pin_getflags() instead of keep the last state in an internal table. Now the driver returns the real state of pins (input/output and pull-up/pull-down) at all times. o Use a spin mutex. This is required by interrupts and the 1-wire code. o Use better variable names and place parentheses around them in MACROS. o Do not lock the driver when returning static data. Tested with gpioled(4) and DS1820 (1-wire) sensors on banana pi.	2015-07-13 18:19:26 +00:00
Conrad Meyer	c578e0fb48	pipe_direct_write: Fix mismatched pipelock/unlock If a signal is caught in pipelock, causing it to fail, pipe_direct_write should not try to pipeunlock. Reported by: pho Differential Revision: https://reviews.freebsd.org/D3069 Reviewed by: kib Approved by: markj (mentor) MFC after: 1 week Sponsored by: EMC / Isilon Storage Division	2015-07-13 17:45:22 +00:00
Alexander Motin	391f03dafb	Make role sysctl handling from r284727 less strict.	2015-07-13 15:51:28 +00:00
Alexander Motin	e68eef1442	Unify port database use for target and initiator roles. Aside from cleaner and more consistent code, this allows ports to be both target and initiator same time, and easily switch from any role to any. Sponsored by: iXsystems, Inc.	2015-07-13 15:11:05 +00:00
Luigi Rizzo	5f94000ee4	set the refcount for the structure (dropped by mistake in the last commit).	2015-07-13 10:23:52 +00:00
Mark Murray	b712101cf7	Rework the read routines to keep the PRNG sources happy. These work in units of crypto blocks, so must have adequate space to write. This means needing to be careful about buffers and keeping track of external read request length. Approved by: so (/dev/random blanket)	2015-07-13 08:38:21 +00:00
Adrian Chadd	d3f9e6a743	Fixes the RF switch state polling by comparing with the revision of the PHY instead of the revision of the RADIO. This fixes the RF switch state polling. This is from DragonflyBSD, Commit 202e28d1f65e9f35df6032400df3242a3bafb483 Obtained from: DragonflyBSD	2015-07-13 05:13:39 +00:00
Ian Lepore	3f3def246a	Add PRINTF_BUFR_SIZE=128 to avoid interleaved output.	2015-07-12 19:58:12 +00:00
Ian Lepore	969fc29e0b	Use the monotonic (uptime) counter rather than time-of-day to measure elapsed time between ntp_adjtime() clock offset adjustments. This eliminates spurious frequency steering after a large clock step (such as a 1970->2015 step on a system with no battery-backed clock hardware). This problem was discovered after the import of ntpd 4.2.8, which does things in a slightly different (but still correct) order than the 4.2.4 we had previously. In particular, 4.2.4 would step the clock then immediately after use ntp_adjtime() to set the frequency and offset to zero, which captured the post-step time-of-day as a side effect. In 4.2.8, ntpd sets frequency and offset to zero before any initial clock step, capturing the time as 1970-ish, then when it next calls ntp_adjtime() it's with a non-zero offset measurement. This non-zero value gets multiplied by the apparent 45-year interval, which blows up into a completely bogus frequency steer. That gets clamped to 500ppm, but that's still enough to make the clock drift so fast that ntpd has to keep stepping it every few minutes to compensate.	2015-07-12 18:38:17 +00:00
Zbigniew Bodek	686836faca	Add ARM64TODO comments to ACPI PCI stubs This will make searching for missing functionalities easier.	2015-07-12 18:32:16 +00:00
Mark Murray	3aa77530ca	* Address review (and add a bit myself). - Tweak man page. - Remove all mention of RANDOM_FORTUNA. If the system owner wants YARROW or DUMMY, they ask for it, otherwise they get FORTUNA. - Tidy up headers a bit. - Tidy up declarations a bit. - Make static in a couple of places where needed. - Move Yarrow/Fortuna SYSINIT/SYSUNINIT to randomdev.c, moving us towards a single file where the algorithm context is used. - Get rid of random_*_process_buffer() functions. They were only used in one place each, and are better subsumed into those places. - Remove *_post_read() functions as they are stubs everywhere. - Assert against buffer size illegalities. - Clean up some silly code in the randomdev_read() routine. - Make the harvesting more consistent. - Make some requested argument name changes. - Tidy up and clarify a few comments. - Make some requested comment changes. - Make some requested macro changes. * NOTE: the thing calling itself a 'unit test' is not yet a proper unit test, but it helps me ensure things work. It may be a proper unit test at some time in the future, but for now please don't make any assumptions or hold any expectations. Differential Revision: https://reviews.freebsd.org/D2025 Approved by: so (/dev/random blanket)	2015-07-12 18:14:38 +00:00
Zbigniew Bodek	e7c14c38ba	Implement stubs for ACPI PCI routines ACPI driver requires special functions to be provided by machdep code. Add temporary stubs to satisfy the compiler when both "pci" and "acpi" are enabled in the kernel configuration file. Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3028	2015-07-12 17:28:31 +00:00
Bjoern A. Zeeb	97fc027722	Try to unbreak the build after r285390 removing the obsolete static declaration.	2015-07-12 00:26:22 +00:00
Luiz Otavio O Souza	a8921b867f	Return the FDT node of the GPIO controller to gpiobus. It is used by the children of gpiobus.	2015-07-11 21:09:43 +00:00
Ed Schouten	4f1905177a	Implement normal and abnormal process termination. CloudABI does not provide an explicit kill() system call, for the reason that there is no access to the global process namespace. Instead, it offers a raise() system call that can at least be used to terminate the process abnormally. CloudABI does not support installing signal handlers. CloudABI's raise() system call should behave as if the default policy is set up. Call into kern_sigaction(SIG_DFL) before calling sys_kill() to force this. Obtained from: https://github.com/NuxiNL/freebsd	2015-07-11 19:41:31 +00:00
Ed Schouten	a4001f4cb9	Use FDDUP_NORMAL instead of hardcoding value 0. Proposed by: mjg	2015-07-11 18:53:30 +00:00
Ed Schouten	329d1bca7f	Add missing function parameter. A function parameter got added in r285356, meaning that the call to kern_dup() needs to be patched up.	2015-07-11 18:39:16 +00:00
Justin Hibbits	20b6ee617f	cpu_number and cpu_swapout are never used, and only defined in powerpc.	2015-07-11 17:33:50 +00:00
Mateusz Guzik	b34be824a0	linprocfs: vref the vnode passed to vn_fullpath	2015-07-11 16:44:28 +00:00
Mateusz Guzik	c634b75204	vfs: always clear VI_OWEINACT in consumers bumping v_usecount Previously vputx would detect the condition and clear the flag. With this change it is invalid to have both v_usecount > 0 and the flag set. Assert the condition is met in all relevant places. Reviewed by: kib	2015-07-11 16:28:55 +00:00
Mateusz Guzik	2d1ca3cdff	vfs: move si_usecount manipulation to dedicated functions Reviewed by: kib	2015-07-11 16:28:12 +00:00
Mateusz Guzik	8a08cec166	Create a dedicated function for ensuring that cdir and rdir are populated. Previously several places were doing it on its own, partially incorrectly (e.g. without the filedesc locked) or even actively harmful by populating jdir or assigning rootvnode without vrefing it. Reviewed by: kib	2015-07-11 16:22:48 +00:00
Mateusz Guzik	f0725a8e1e	Move chdir/chroot-related fdp manipulation to kern_descrip.c Prefix exported functions with pwd_. Deduplicate some code by adding a helper for setting fd_cdir. Reviewed by: kib	2015-07-11 16:19:11 +00:00
Andrew Turner	70915d1289	Always send a SIGSEGV on a map failure. Use the code to tell the reason for the signal. Sponsored by: ABT Systems Ltd	2015-07-11 16:02:06 +00:00
Adrian Chadd	871ef8b0d8	Regenerate syscalls.	2015-07-11 15:22:11 +00:00
Adrian Chadd	6520495abc	Add an initial NUMA affinity/policy configuration for threads and processes. This is based on work done by jeff@ and jhb@, as well as the numa.diff patch that has been circulating when someone asks for first-touch NUMA on -10 or -11. * Introduce a simple set of VM policy and iterator types. * tie the policy types into the vm_phys path for now, mirroring how the initial first-touch allocation work was enabled. * add syscalls to control changing thread and process defaults. * add a global NUMA VM domain policy. * implement a simple cascade policy order - if a thread policy exists, use it; if a process policy exists, use it; use the default policy. * processes inherit policies from their parent processes, threads inherit policies from their parent threads. * add a simple tool (numactl) to query and modify default thread/process policies. * add documentation for the new syscalls, for numa and for numactl. * re-enable first touch NUMA again by default, as now policies can be set in a variety of methods. This is only relevant for very specific workloads. This doesn't pretend to be a final NUMA solution. The previous defaults in -HEAD (with MAXMEMDOM set) can be achieved by 'sysctl vm.default_policy=rr'. This is only relevant if MAXMEMDOM is set to something other than 1. Ie, if you're using GENERIC or a modified kernel with non-NUMA, then this is a glorified no-op for you. Thank you to Norse Corp for giving me access to rather large (for FreeBSD!) NUMA machines in order to develop and verify this. Thank you to Dell for providing me with dual socket sandybridge and westmere v3 hardware to do NUMA development with. Thank you to Scott Long at Netflix for providing me with access to the two-socket, four-domain haswell v3 hardware. Thank you to Peter Holm for running the stress testing suite against the NUMA branch during various stages of development! 
Tested: * MIPS (regression testing; non-NUMA) * i386 (regression testing; non-NUMA GENERIC) * amd64 (regression testing; non-NUMA GENERIC) * westmere, 2 socket (thankyou norse!) * sandy bridge, 2 socket (thankyou dell!) * ivy bridge, 2 socket (thankyou norse!) * westmere-EX, 4 socket / 1TB RAM (thankyou norse!) * haswell, 2 socket (thankyou norse!) * haswell v3, 2 socket (thankyou dell) * haswell v3, 2x18 core (thankyou scott long / netflix!) * Peter Holm ran a stress test suite on this work and found one issue, but has not been able to verify it (it doesn't look NUMA related, and he only saw it once over many testing runs.) * I've tested bhyve instances running in fixed NUMA domains and cpusets; all seems to work correctly. Verified: * intel-pcm - pcm-numa.x and pcm-memory.x, whilst selecting different NUMA policies for processes under test. Review: This was reviewed through phabricator (https://reviews.freebsd.org/D2559) as well as privately and via emails to freebsd-arch@. The git history with specific attributes is available at https://github.com/erikarn/freebsd/ in the NUMA branch (https://github.com/erikarn/freebsd/compare/local/adrian_numa_policy). This has been reviewed by a number of people (stas, rpaulo, kib, ngie, wblock) but not achieved a clear consensus. My hope is that with further exposure and testing more functionality can be implemented and evaluated. Notes: * The VM doesn't handle unbalanced domains very well, and if you have an overly unbalanced memory setup whilst under high memory pressure, VM page allocation may fail leading to a kernel panic. This was a problem in the past, but it's much more easily triggered now with these tools. * This work only controls the path through vm_phys; it doesn't yet strongly/predictably affect contigmalloc, KVA placement, UMA, etc. So, driver placement of memory isn't really guaranteed in any way. That's next on my plate. Sponsored by: Norse Corp, Inc.; Dell	2015-07-11 15:21:37 +00:00
Konstantin Belousov	cf88021ab1	Do not allow creation of the dirty buffers for the dead buffer objects, i.e. for buffer objects which vnode was reclaimed. Buffer cache cannot write such buffers. Return the error and discard the buffer immediately on write attempt. BO_DIRTY now always set during vnode reclamation, since it is used not only for the INVARIANTS checks. Do allow placement of the clean buffers on dead bufobj list, otherwise filesystems cannot use bufcache at all after the devvp reclaim. Reported and tested by: trasz Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-11 11:21:56 +00:00
John-Mark Gurney	2ff9c4f915	Complete the move that was started w/ r263218.. For some reason I didn't delete the files, so that means we need to bring the changes in r282726 to the correct files.. make tinderbox completed with this patch...	2015-07-11 03:12:34 +00:00
Pawel Jakub Dawidek	4273d41299	Spoil event can happen for some time now even on providers opened exclusively (on the media change event). Update GELI to handle that situation. PR: 201185 Submitted by: Matthew D. Fuller	2015-07-10 19:27:19 +00:00
Luigi Rizzo	4af7aed7c6	assorted algorithmic fixes from Paolo Valente (one of my qfq coauthors): - use 1ULL to avoid shift truncations - recompute the sum of weight dynamically to provide better fairness - fix an erroneous constant in the computation of the slot - preserve timestamp correctness when the old timestamp is stale.	2015-07-10 19:24:36 +00:00
Luigi Rizzo	e38e277fc4	one more warning suppression when compiling the test code in userspace.	2015-07-10 19:18:49 +00:00
Luigi Rizzo	e25716b7cc	add code to compute fairness indexes; cleanups to remove compile warnings.	2015-07-10 18:10:40 +00:00
Luigi Rizzo	8fd44c9395	staticize functions only used in netmap.c (detected by jenkins run with gcc 4.9) Update documentation on the use of netmap_priv_d, rename the refcount and use the same structure in FreeBSD and linux No functional changes.	2015-07-10 16:05:24 +00:00
Ed Schouten	ea566832d7	Add missing const keyword to kern_sigaction()'s 'act' parameter. This structure is not modified by the function. Also add const to sigact_flag_test(), as it is called by kern_sigaction().	2015-07-10 14:39:46 +00:00
Mateusz Guzik	9a1ad66fb5	fd: further cleanup of kern_dup - make mode enum start from 0 so that the assertion covers all cases [1] - rename prefix _CLOEXEC flag with _FLAG - postpone fhold on the old file descriptor, which eliminates the need to fdrop in error cases. - fixup FDDUP_FCNTL check missed in the previous commit This removes 'fp == oldfde->fde_file' assertion which had little value. kern_dup only calls fd-related functions which cannot drop the lock or a whole lot of races would be introduced. Noted by: kib [1]	2015-07-10 13:54:03 +00:00
Mateusz Guzik	5fe97c20dc	fd: split kern_dup flags argument into actual flags and a mode Tidy up the code inside to switch on the mode.	2015-07-10 11:01:30 +00:00
Konstantin Belousov	85237e335d	Convert between abridged (from FXSAVE) and unabridged (from FSAVE) versions of the x87 tags. The conversion is naive, used abridged tag is converted to valid unabridged, without additional checks for zero and special values. Noted by: bde Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-10 09:20:13 +00:00
Konstantin Belousov	bcfc2be186	Duplicate the copyright from the i386/i386/machdep.c into i386/include/frame.h after a code was moved from machdep.c to frame.h in r284925. Use include guards style similar to other guards. Noted by: bde Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-10 09:15:06 +00:00
Konstantin Belousov	e8677f3885	Change the mb() use in the sched_ule tdq_notify() and sched_idletd() to more C11-ish atomic_thread_fence_seq_cst(). Note that on PowerPC, which currently uses lwsync for mb(), the change actually fixes the missed store/load barrier, intended by r271604 [*]. Reviewed by: alc Noted by: alc [*] Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-10 08:54:12 +00:00
Luigi Rizzo	0fdeab7bc5	add netmap dependency when compiled as a module	2015-07-10 07:13:14 +00:00
Ed Schouten	47a84387ad	Let listen() return EDESTADDRREQ when not bound. We currently return EINVAL when calling listen() on a UNIX socket that has not been bound to a pathname. If my interpretation of POSIX is correct, we should return EDESTADDRREQ: "The socket is not bound to a local address, and the protocol does not support listening on an unbound socket." Return EDESTADDRREQ instead when not bound and not connected. Differential Revision: https://reviews.freebsd.org/D3038 Reviewed by: gnn, network	2015-07-10 06:47:14 +00:00
Luigi Rizzo	847bf38369	Sync netmap sources with the version in our private tree. This commit contains large contributions from Giuseppe Lettieri and Stefano Garzarella, is partly supported by grants from Verisign and Cisco, and brings in the following: - fix zerocopy monitor ports and introduce copying monitor ports (the latter are lower performance but give access to all traffic in parallel with the application) - exclusive open mode, useful to implement solutions that recover from crashes of the main netmap client (suggested by Patrick Kelsey) - revised memory allocator in preparation for the 'passthrough mode' (ptnetmap) recently presented at bsdcan. ptnetmap is described in S. Garzarella, G. Lettieri, L. Rizzo; Virtual device passthrough for high speed VM networking, ACM/IEEE ANCS 2015, Oakland (CA) May 2015 http://info.iet.unipi.it/~luigi/research.html - fix rx CRC handling on ixl - add module dependencies for netmap when building drivers as modules - minor simplifications to device-specific routines (txsync, rxsync) - general code cleanup (remove unused variables, introduce macros to access rings and remove duplicate code). Applications do not need to be recompiled, unless of course they want to use the new features (monitors and exclusive open). Those willing to try this code on stable/10 can just update the sys/dev/netmap/, sys/net/netmap with the version in HEAD and apply the small patches to individual device drivers. MFC after: 1 month Sponsored by: (partly) Verisign, Cisco	2015-07-10 05:51:36 +00:00
George V. Neville-Neil	0b75d21e18	Summary: Fix LINT build. The names of the new AES modes were not correctly used under the REGRESSION kernel option.	2015-07-10 02:23:50 +00:00
Dimitry Andric	4afafe0c86	Fix swapped copyin(9) arguments in cxgb's iwch_arm_cq() function. Detected by clang 3.7.0 with the warning: sys/dev/cxgb/ulp/iw_cxgb/iw_cxgb_provider.c:309:18: error: variable 'rptr' is uninitialized when used here [-Werror,-Wuninitialized] chp->cq.rptr = rptr; ^~~~ MFC after: 1 week	2015-07-09 22:13:23 +00:00
Mariusz Zaborski	306a82f8f4	Rename zfs nvpair files to not collide with our nvlist. PR: 201356 Approved by: pjd (mentor)	2015-07-09 21:53:40 +00:00
Andrew Turner	3303004f1a	Remove checks for __ARM_EABI__, we only build for EABI now. Sponsored by: ABT Systems Ltd	2015-07-09 21:02:40 +00:00
Andrew Turner	6c50960be6	Add support for __aeabi_memclr4, clang 3.7 calls it. Sponsored by: ABT Systems Ltd	2015-07-09 20:54:38 +00:00
George V. Neville-Neil	16de9ac1b5	Add support for AES modes to IPSec. These modes work both in software only mode and with hardware support on systems that have AESNI instructions. Differential Revision: D2936 Reviewed by: jmg, eri, cognet Sponsored by: Rubicon Communications (Netgate)	2015-07-09 18:16:35 +00:00
Andrew Turner	bf1717e566	Clear the carry bit on the saved program state register when asked to clear the return value, it's used to indicate an error. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 17:26:56 +00:00
Mateusz Guzik	318b946321	vfs: cosmetic changes to namei and namei_handle_root - don't initialize cnp during declaration - don't test error/!error, compare to 0 instead	2015-07-09 17:17:26 +00:00
Mateusz Guzik	d177f49f6f	vfs: simplify error handling in namei The logic is reorganised so that there is one exit point prior to the lookup loop. This is an intermediate step to making audit logging functions use found vnode instead of translating ni_dirfd on their own. ni_startdir validation is removed. The only in-tree consumer is nfs which already makes sure it is a directory. Reviewed by: kib	2015-07-09 16:32:58 +00:00
Ermal Luçi	56844a6203	Correct issue presented in r285051, apparently neither clang nor gcc complain about this. But clang inits the var to NULL correctly while gcc on at least mips does not. Correct the undefined behavior by initializing the variable properly. PR: 201371 Differential Revision: https://reviews.freebsd.org/D3036 Reviewed by: gnn Approved by: gnn(mentor)	2015-07-09 16:28:36 +00:00
Ed Schouten	2491302a04	Add implementations for some of the CloudABI file descriptor system calls. All of the CloudABI system calls that operate on file descriptors of an arbitrary type are prefixed with fd_. This change adds wrappers for most of these system calls around their FreeBSD equivalents. The dup2() system call present on CloudABI deviates from POSIX, in the sense that it can only be used to replace existing file descriptor. It cannot be used to create new ones. The reason for this is that this is inherently thread-unsafe. Furthermore, there is no need on CloudABI to use fixed file descriptor numbers. File descriptors 0, 1 and 2 have no special meaning. This change exposes the kern_dup() through <sys/syscallsubr.h> and puts the FDDUP_* flags in <sys/filedesc.h>. It then adds a new flag, FDDUP_MUSTREPLACE to force that file descriptors are replaced -- not allocated. Differential Revision: https://reviews.freebsd.org/D3035 Reviewed by: mjg	2015-07-09 16:07:01 +00:00
Mateusz Guzik	efdc25304c	fd: prepare do_dup for being exported - rename it to kern_dup. - prefix flags with FD - assert that correct flags were passed	2015-07-09 15:19:45 +00:00
Mateusz Guzik	d19ba50e12	vfs: avoid spurious vref/vrele for absolute lookups namei used to vref fd_cdir, which was immediately vrele'd on entry to the loop. Check for absolute lookup and vref the right vnode the first time. Reviewed by: kib	2015-07-09 15:06:58 +00:00
Mateusz Guzik	a03f1b2970	vfs: plug a use-after-free of fd_rdir in namei fd_rdir vnode was stored in ni_rootdir without refing it in any way, after which the filedesc lock was being dropped. The vnode could have been freed by mountcheckdirs or another thread doing chroot. VREF the vnode while the lock is held. Reviewed by: kib MFC after: 1 week	2015-07-09 15:06:24 +00:00
Andrew Turner	b2b5507779	Add support for SMP. This uses the FDT data to find the CPUs to start on, and psci to start them. I expect ACPI support to be added later. This has been tested on qemu with 2 cpus as that is the current value of MAXCPUS. This is expected to be increased in the future as FreeBSD has been tested on 48 cores on the Cavium ThunderX hardware. Partially based on a patch from Robin Randhawa from ARM. Approved by: ABT Systems Ltd Relnotes: yes Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3024	2015-07-09 13:23:29 +00:00
Andrew Turner	3ad7e84ef5	Add logging of synchronous exceptions. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 13:07:12 +00:00
Andrew Turner	7df38eabdc	Add the definition of the shareable bits in the pagetables Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 12:56:09 +00:00
Andrew Turner	144aa0b7f5	Clean up the types used in <machine/ucontext.h> on arm64. As some ports include this file without first including the headers needed for uint32_t and the like use the __foo type. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-09 12:51:50 +00:00
Ed Schouten	3a41ec6af7	Don't clobber td->td_retval[0] in proc_reap(). While writing tests for CloudABI, I noticed that close() on process descriptors returns the process ID of the child process. This is interesting, as close() is only allowed to return 0 or -1. It turns out that we clobber td->td_retval[0] in proc_reap(), so that wait*() properly returns the process ID. Change proc_reap() to leave td->td_retval[0] alone. Set the return value in kern_wait6() instead, by keeping track of the PID before we (potentially) reap the process. Differential Revision: https://reviews.freebsd.org/D3032 Reviewed by: kib	2015-07-09 12:04:45 +00:00
Zbigniew Bodek	6c03ba71f8	Rework CPU identification on ARM64 This commit reworks the code responsible for identification of the CPUs during runtime. It is necessary to provide a way for workarounds and errata to be applied only for certain HW versions. The copy of MIDR is now stored in pcpu to provide a fast and convenient way for assembly code to read it (pcpu is used quite often so there is a chance it's inside the cache). The MIDR is also a better way of identification than using user-friendly cpu_desc structure, because it can be compiled into comparison of single u32 with only one access to the memory - this is crucial for some errata which are called from performance-critical places. Changes in cpu_identify make this function safe to be called on non-boot CPUs. New function CPU_MATCH was implemented which returns boolean value based on matching masked MIDR with chip identification. Example of usage: printf("is thunder: %d\n", CPU_MATCH(CPU_IMPL_MASK | CPU_PART_MASK, CPU_IMPL_CAVIUM, CPU_PART_THUNDER, 0, 0)); printf("is generic: %d\n", CPU_MATCH(CPU_IMPL_MASK | CPU_PART_MASK, CPU_IMPL_ARM, CPU_PART_FOUNDATION, 0, 0)); Reviewed by: andrew Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3030	2015-07-09 11:32:29 +00:00
Konstantin Belousov	fcb5b3a419	Cover a race between doselwakeup() and selfdfree(). If doselwakeup() loop finds the selfd entry and clears its sf_si pointer, which is handled by selfdfree() in parallel, NULL sf_si makes selfdfree() free the memory. The result is the race and accesses to the freed memory. Refcount the selfd ownership. One reference is for the sf_link linkage, which is unconditionally dereferenced by selfdfree(). Another reference is for sf_threads, both selfdfree() and doselwakeup() race to deref it, the winner unlinks and then frees the selfd entry. Reported by: Larry Rosenman <ler@lerctr.org> Tested by: Larry Rosenman <ler@lerctr.org>, pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks	2015-07-09 09:22:21 +00:00
Ed Schouten	39b160b3e2	Add forward declaration of struct thread. This structure is used in some of the functions in this header, but we don't depend on any header that pulls it in.	2015-07-09 07:31:40 +00:00
Ed Schouten	f355e810cf	Generate CloudABI system call table with proper $FreeBSD$ tags.	2015-07-09 07:21:33 +00:00
Ed Schouten	6d338f9a81	Import the CloudABI datatypes and create a system call table. CloudABI is a pure capability-based runtime environment for UNIX. It works similar to Capsicum, except that processes already run in capabilities mode on startup. All functionality that conflicts with this model has been omitted, making it a compact binary interface that can be supported by other operating systems without too much effort. CloudABI is 'secure by default'; the idea is that it should be safe to run arbitrary third-party binaries without requiring any explicit hardware virtualization (Bhyve) or namespace virtualization (Jails). The rights of an application are purely determined by the set of file descriptors that you grant it on startup. The datatypes and constants used by CloudABI's C library (cloudlibc) are defined in separate files called syscalldefs_mi.h (pointer size independent) and syscalldefs_md.h (pointer size dependent). We import these files in sys/contrib/cloudabi and wrap around them in cloudabi*_syscalldefs.h. We then add stubs for all of the system calls in sys/compat/cloudabi or sys/compat/cloudabi64, depending on whether the system call depends on the pointer size. We only have nine system calls that depend on the pointer size. If we ever want to support 32-bit binaries, we can simply add sys/compat/cloudabi32 and implement these nine system calls again. The next step is to send in code reviews for the individual system call implementations, but also add a sysentvec, to allow CloudABI executables to be started through execve(). More information about CloudABI: - GitHub: https://github.com/NuxiNL/cloudlibc - Talk at BSDCan: https://www.youtube.com/watch?v=SVdF84x1EdA Differential Revision: https://reviews.freebsd.org/D2848 Reviewed by: emaste, brooks Obtained from: https://github.com/NuxiNL/freebsd	2015-07-09 07:20:15 +00:00
John-Mark Gurney	275a0a97ed	upon further examination, it turns out that _unregister_all already provides the guarantee that no threads will be in the _newsession code.. This is provided by the CRYPTODRIVER lock... This makes the pause unneeded...	2015-07-08 22:48:41 +00:00
Mateusz Guzik	06d1ada870	seq: use seq_consistent_nomb in seq_consistent Constify seqp argument for seq_consistent_nomb. No functional changes.	2015-07-08 22:21:25 +00:00
Zbigniew Bodek	0a3f65a107	Style cleanups after r285270 There should be no semicolons in added macro definitions. Define empty macro as "do {} while (0)". Pointed out by: jmg	2015-07-08 22:09:47 +00:00
John-Mark Gurney	e808e13b8b	Now that aesni won't reuse fpu contexts (D3016), add seatbelts to the fpu code to prevent other reuse of the contexts in the future... Differential Revision: https://reviews.freebsd.org/D3015 Reviewed by: kib, gnn	2015-07-08 19:26:36 +00:00
John-Mark Gurney	9d38fd076e	address an issue where consumers, like IPsec, can reuse the same session in multiple threads w/o locking.. There was a single fpu context shared per session, if multiple threads were using the session, and both migrated away, they could corrupt each other's fpu context... This patch adds a per cpu context and a lock to protect it... It also tries to better address unloading of the aesni module... The pause will be removed once the OpenCrypto Framework provides a better method for draining callers into _newsession... I first discovered the fpu context sharing issue w/ a flood ping over an IPsec tunnel between two bhyve machines... The patch in D3015 was used to verify that this fix does fix the issue... Reviewed by: gnn, kib (both earlier versions) Differential Revision: https://reviews.freebsd.org/D3016	2015-07-08 19:15:29 +00:00
Konstantin Belousov	f4b5a9725a	Reimplement the ordering requirements for the timehands updates, and for timehands consumers, by using fences. Ensure that the timehands->th_generation reset to zero is visible before the data update is visible [*]. tc_setgen() allowed data update writes to become visible before generation (but not on TSO architectures). Remove tc_setgen(), tc_getgen() helpers, use atomics inline [**]. Noted by: alc [*] Requested by: bde [**] Reviewed by: alc, bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:42:08 +00:00
Konstantin Belousov	261fda00cd	Use atomic_thread_fence_rel() to ensure ordering in the seq_write_begin(), instead of the load_rmb/rmb_load functions. The update does not need to be atomic due to the write lock owned. Similarly, in seq_write_end(), update of seqp need not be atomic. Only store must be atomic with release. For seq_read(), the natural operation is the load acquire of the sequence value, express this directly with atomic_load_acq_int() instead of using custom partial fence implementation atomic_load_rmb_int(). In seq_consistent, use atomic_thread_fence_acq() which provides the desired semantic of ordering reads before fence before the re-reading of seqp, instead of custom atomic_rmb_load_int(). Reviewed by: alc, bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:37:08 +00:00
Konstantin Belousov	8954a9a4e6	Add the atomic_thread_fence() family of functions with intent to provide a semantic defined by the C11 fences with corresponding memory_order. atomic_thread_fence_acq() gives r | r, w, where r and w are read and write accesses, and | denotes the fence itself. atomic_thread_fence_rel() is r, w | w. atomic_thread_fence_acq_rel() is the combination of the acquire and release in single operation. Note that reads after the acq+rel fence could be made visible before writes preceding the fence. atomic_thread_fence_seq_cst() orders all accesses before/after the fence, and the fence itself is globally ordered against other sequentially consistent atomic operations. Reviewed by: alc Discussed with: bde Sponsored by: The FreeBSD Foundation MFC after: 3 weeks	2015-07-08 18:12:24 +00:00
Alan Cox	22cf98d1f3	The intention of r254304 was to scan the active queue continuously. However, I've observed the active queue scan stopping when there are frequent free page shortages and the inactive queue is steadily refilled by other mechanisms, such as the sequential access heuristic in vm_fault() or madvise(2). To remedy this problem, record the time of the last active queue scan, and always scan a number of pages proportional to the time since the last scan, regardless of whether that last scan was a timeout-triggered ("pass == 0") or free-page-shortage-triggered ("pass > 0") scan. Also, on a timeout-triggered scan, allow a full scan of the active queue when the system is short of inactive pages. Reviewed by: kib MFC after: 6 weeks Sponsored by: EMC / Isilon Storage Division	2015-07-08 17:45:59 +00:00
Andrew Turner	6bae05d951	Correctly set __WCHAR_MIN, there is no __UINT_MIN, it's 0. Sponsored by: ABT Systems Ltd	2015-07-08 16:18:28 +00:00
Andrew Turner	ded32d88f1	Add support for ipi_all_but_self on arm64. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-08 15:32:59 +00:00
Andrew Turner	80ad08a3e9	Add an implementation of savectx that doesn't just call panic. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-08 14:07:06 +00:00
Zbigniew Bodek	4981c60e07	Add memory barrier to bus_dmamap_sync() On platforms which are fully IO-coherent, the map might be null. We need to guarantee that all data is observable after the sync operation is called. Add a memory barrier to ensure that on ARM. Reviewed by: andrew, kib Obtained from: Semihalf Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D3012	2015-07-08 13:52:59 +00:00
Konstantin Belousov	69d11def74	Handle copyout for the fcntl(F_OGETLK) using oflock structure. Otherwise, kernel overwrites a word past the destination. Submitted by: walter@pelissero.de PR: 196718 MFC after: 1 week	2015-07-08 13:19:13 +00:00
Andrew Turner	cb02f6b942	Send the correct signal when vm_fault fails. While here also set the code and address fields. Sponsored by: ABT Systems Ltd	2015-07-08 12:42:44 +00:00
John-Mark Gurney	a13589bc47	unroll the loop slightly... This improves performance enough to justify, especially for CBC performance where we can't pipeline.. I don't happen to have my measurements handy though... Sponsored by: Netflix, Inc.	2015-07-07 20:31:09 +00:00
Mark Johnston	620711e033	Fix an incorrect assertion in witness. The number of available lock list entries for a thread is LOCK_CHILDCOUNT, and each entry can record up to LOCK_NCHILDREN locks. When iterating over the locks held by a thread, a bound on the loop index is therefore given by LOCK_CHILDCOUNT * LOCK_NCHILDREN; WITNESS_COUNT is an unrelated constant. Reviewed by: jhb MFC after: 1 week Sponsored by: EMC / Isilon Storage Division Differential Revision: https://reviews.freebsd.org/D2974	2015-07-07 19:29:18 +00:00
Luiz Otavio O Souza	f935da6fee	Add the Banana Pi DTS. The Banana Pi support is in progress and this is intended to help the early adopters.	2015-07-07 19:01:54 +00:00
John-Mark Gurney	748a12e2c3	we may get here w/ non-sleepable locks held, so switch to _NOWAIT when doing this memory allocation... Reviewed by: ae	2015-07-07 18:45:32 +00:00
Ed Maste	906451276a	Avoid creating invalid UEFI device path The UEFI loader on the 10.1 release install disk (disc1) modifies an existing EFI_DEVICE_PATH_PROTOCOL instance in an apparent attempt to truncate the device path. In doing so it creates an invalid device path. Perform the equivalent action without modification of structures allocated by firmware. PR: 197641 MFC After: 1 week Submitted by: Chris Ruffin <chris.ruffin@intel.com>	2015-07-07 18:44:27 +00:00
Luiz Otavio O Souza	f4b61a34e5	Add the GMAC entries to sun7i (A20) DTS. While here make EMAC disabled unless explicitly enabled.	2015-07-07 18:32:23 +00:00
Takanori Watanabe	99043514c6	Fix an rfcomm_sppd regression that I could reproduce. To reproduce it, take two machines running FreeBSD and run rfcomm_sppd -c 3 -S on one side and rfcomm_sppd -a ${PEER} -c 3 on the other.	2015-07-07 15:56:51 +00:00
Pedro F. Giffuni	9129dd59be	Relocate sched_random() within the SMP section. Place sched_random nearer to where it's first used: moving the code nearer to where it is used makes the code easier to read and we can reduce the initial "#ifdef SMP" island. Reword a little the comment and clean some whitespaces while here.	2015-07-07 15:22:29 +00:00
Achim Leubner	4e1bc9a039	Driver 'pmspcv' added. Supports PMC-Sierra PM8001/8081/8088/8089/8074/8076/8077 SAS/SATA HBA Controllers.	2015-07-07 13:17:02 +00:00
Michael Tuexen	29b9533b43	Export the ssthresh value per SCTP path via the sysctl interface. MFC after: 1 month	2015-07-07 06:34:28 +00:00
Adrian Chadd	54991f37c9	Attempt to make 5GHz HT/40 work on the 6xxx series NICs. The 6205 (Taylor Peak) in the Lenovo X230 works fine in 5GHz 11a and 11n HT20, but not 11n HT40. The NIC goes RX deaf the moment HT40 is configured. It's so RX deaf that it doesn't even hear beacons and the firmware sends "BEACON MISS" events. That's pretty deaf. I tried configuring up the HT40 flags in monitor mode and it worked - so I assumed that doing the transition from 20 -> 40MHz channel configuration when going auth->assoc (ie, after the NIC has been partially configured) is a problem. So for now, let's just always set them if they're available. Tested: * Intel 5300, STA mode, 5GHz HT/40 AP; 2GHz HT/20 AP * Intel 6205, STA mode, 5GHz HT/40, HT20, 11a AP; 2GHz HT/20 AP This was pointed out to me by coworkers trying to use FreeBSD-HEAD in the office on their Thinkpad T420p laptops. TODO: * I don't like how the HT40 flags are configured - the whole interop/ protection config should be re-checked. Notably, I think curhtprotmode is 0 in a lot of cases, which means "no interoperability" and i think that's busted. Sponsored by: Norse Corp, Inc.	2015-07-07 03:51:29 +00:00
Justin Hibbits	7f78865ba0	Enable the wireless on attach. This comes from the archives of "forgotten in the original commit, and probably pointless now because nobody uses it, and the driver's broken anyway."	2015-07-07 02:42:48 +00:00
Justin Hibbits	44027a8321	style(9) cleanups. Don't use PRIxPTR, these registers are 32-bits, cast to u_long instead. Pointed out by: bde	2015-07-07 02:37:29 +00:00
Navdeep Parhar	9af71ab3bc	cxgbe(4): Add a new knob that controls the congestion response of netmap rx queues. The default is to drop rather than backpressure. This decouples the congestion settings of NIC and netmap rx queues. MFC after: 3 days	2015-07-06 20:56:59 +00:00
Navdeep Parhar	41f7622b64	cxgbe(4): Do not override the global defaults for congestion drops. The hw.cxgbe.cong_drop knob is not affected by this change because the driver sets up congestion drop on a per-queue basis. MFC after: 3 days	2015-07-06 20:28:42 +00:00
Neel Natu	5e4f29c037	Move the 'devmem' device nodes from /dev/vmm to /dev/vmm.io Some external tools just do a 'ls /dev/vmm' to figure out the bhyve virtual machines on the host. These tools break if the devmem device nodes also appear in /dev/vmm. Requested by: grehan	2015-07-06 19:41:43 +00:00
John-Mark Gurney	5a550cca9a	Fix for non-random IV's when CRD_F_IV_PRESENT and CRD_F_IV_EXPLICIT flags are not specified... This bug was introduced in r275732... This only affects IPsec ESP only policies w/ the aesni module loaded, other subsystems specify one or both of the flags... Reviewed by: gnn, delphij, eri	2015-07-06 19:30:29 +00:00
John-Mark Gurney	bcc0b68477	remove _NORMAL flag which isn't supposed to be used w/ _alloc_ctx... Reviewed by: kib (a while ago)	2015-07-06 19:17:56 +00:00
Mateusz Guzik	aa0e2887f4	tty: replace several curthread->td_proc with stored curproc No functional changes.	2015-07-06 18:53:56 +00:00
Zbigniew Bodek	1ae9c994c8	Introduce ITS support for ARM64 Add ARM ITS (Interrupt Translation Services) support required to bring-up message signalled interrupts on some ARM64 platforms. Obtained from: Semihalf Sponsored by: The FreeBSD Foundation	2015-07-06 18:27:41 +00:00
Andrew Turner	b67d1aad6f	Add more tlb invalidations. We currently invalidate when we may not need to, but with this I can boot on a simulator that models the tlb. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-06 18:27:18 +00:00
Luiz Otavio O Souza	1d7a730974	When initializing the (unused) TX descriptors it is not necessary to set the chain bit. Obtained from: NetBSD	2015-07-06 17:13:17 +00:00
Luiz Otavio O Souza	ff0752c870	Use uint32_t consistently to store registers values. Always use unsigned numbers to avoid undefined behavior on (1 << 31). Remove unused variables and some stray semicolons. No functional changes.	2015-07-06 16:45:48 +00:00
Patrick Kelsey	6f99ea0520	Don't acquire sysctlmemlock in userland_sysctl() when the old value pointer is NULL, as in that case there are no userland pages that could potentially be wired. It is common for old to be NULL and oldlenp to be non-NULL in calls to userland_sysctl(), as this is used to probe for the length of a variable-length sysctl entry before retrieving a value. Note that it is typical for such calls to be made with an uninitialized value in oldlenp, so sysctlmemlock was essentially being acquired at random (depending on the uninitialized value in oldlenp being > PAGE_SIZE or not) for these calls prior to this patch. Differential Revision: https://reviews.freebsd.org/D2987 Reviewed by: mjg, kib Approved by: jmallett (mentor) MFC after: 1 month	2015-07-06 16:07:21 +00:00
Konstantin Belousov	9889bbac23	Mutex memory is not zeroed, add MTX_NEW. Reported and tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-06 14:09:00 +00:00
Andrey V. Elsukov	280d77a3bb	Fill the port and protocol information in the SADB_ACQUIRE message in case when security policy has it as required by RFC 2367. PR: 192774 Differential Revision: https://reviews.freebsd.org/D2972 MFC after: 1 week	2015-07-06 12:40:31 +00:00
Steven Hartland	64ce6b09a3	Correct bit offsets for ahci quirks Fix bit offsets causing incorrect quirks being reported on boot for ahci introduced by r280184. MFC after: 3 days Sponsored by: Multiplay	2015-07-06 09:44:07 +00:00
Justin Hibbits	3f3cffedce	Merge booke and aim interrupt.c files. Summary: Both booke and AIM interrupt.c files contain nearly identical code. This merges the two files, to reduce duplication. Reviewers: #powerpc, marcel Reviewed By: marcel Subscribers: imp Differential Revision: https://reviews.freebsd.org/D2991	2015-07-06 05:08:57 +00:00
Luiz Otavio O Souza	a5221d68dc	Fix the sent packets statistics for if_dwc.	2015-07-06 03:06:13 +00:00
Patrick Kelsey	e9617c305c	Fix if_loop so bpfwrite() can use it regardless of the state of bd_hdrcmplt. As if_loop does not use link-level headers, its behavior when used by bpfwrite() should be the same regardless of the state of bd_hdrcmplt. Without this change, libpcap (and other BPF users that work like it) fail when writing to loopback interfaces. Differential Revision: https://reviews.freebsd.org/D2989 Reviewed by: gnn, melifaro Approved by: jmallett (mentor) MFC after: 3 days	2015-07-06 02:12:49 +00:00
Mark Johnston	947401dd50	Move the comment describing namei(9) back to namei()'s definition. MFC after: 3 days	2015-07-05 22:56:41 +00:00
Mark Johnston	8bbd1f25b1	Remove a stale descriptive comment for gbincore(). The splay trees referenced in the comment were converted to path-compressed tries in r250551. MFC after: 3 days	2015-07-05 22:44:41 +00:00
Mark Johnston	5f34e93c58	Check suspendability on the mountpoint returned by VOP_GETWRITEMOUNT. This obviates the need for a MNTK_SUSPENDABLE flag, since passthrough filesystems like nullfs and unionfs no longer need to inherit this information from their lower layer(s). This change also restores the pre-r273336 behaviour of using the presence of a susp_clean VFS method to request suspension support. Reviewed by: kib, mjg Differential Revision: https://reviews.freebsd.org/D2937	2015-07-05 22:37:33 +00:00
Mark Johnston	010ba3842c	Add a local variable initialization needed in the OBJT_DEFAULT case. Reviewed by: kib Differential Revision: https://reviews.freebsd.org/D2992	2015-07-05 22:26:19 +00:00
Mateusz Guzik	f131759f54	fd: make 'rights' a mandatory argument to fget* functions	2015-07-05 19:05:16 +00:00
Andrew Turner	5f8583891f	Add the kernel functions needed to enable threading. Sponsored by: ABT Systems Ltd	2015-07-05 18:16:06 +00:00
Bjoern A. Zeeb	31c98473c1	Fix GENERIC64 and LINT64 powerpc builds after r285144.	2015-07-05 15:30:16 +00:00
Ian Lepore	1a6570fb1f	Enable ipsec by default on all armv6 platforms.	2015-07-05 14:16:31 +00:00
Ian Lepore	b108902461	Ensure all the required files get built when you include the IPSEC option.	2015-07-05 14:15:58 +00:00
Alexander Motin	d1f4058735	Make first step toward supporting target and initiator roles same time. To avoid conflicts between target and initiator devices in CAM, make CTL use target ID reported by HBA as its initiator_id in XPT_PATH_INQ. That target ID is known to never be used for initiator role, so it won't conflict. For Fibre Channel and FireWire HBAs this specific ID choice is irrelevant since all target IDs there are virtual. Same time for SPI HBAs it seems could be even requirement to use same target ID for both initiator and target roles. While there are some more things to polish in isp(4) driver, first tests of using both roles same time on the same port appeared successful: # camcontrol devlist -v scbus0 on isp0 bus 0: <FREEBSD CTLDISK 0001> at scbus0 target 1 lun 0 (da20,pass21) <> at scbus0 target 256 lun 0 (ctl0) <> at scbus0 target -1 lun ffffffff (ctl1)	2015-07-05 03:38:58 +00:00
Alexander Motin	766a65a50d	Remove extra level of target ID indirection (isp_dev_map). FreeBSD never had limitation on number of target IDs, and there is no any other requirement to allocate them densely. Since slots of port database already populated just sequentially, there is no much need for another indirection to allocate sequentially too.	2015-07-05 02:09:46 +00:00
George V. Neville-Neil	aaa7cbfe1a	Summary: Add missing files necessary to build with IPSEC and crypto	2015-07-04 21:32:44 +00:00
George V. Neville-Neil	0661a7c224	Fix up tabs vs. spaces	2015-07-04 20:31:06 +00:00
Justin Hibbits	0936003e3d	Use the correct type for physical addresses. On Book-E, physical addresses are actually 36-bits, not 32-bits. This is currently worked around by ignoring the top bits. However, in some cases, the boot loader configures CCSR to something above the 32-bit mark. This is stage 1 in updating the pmap to handle 36-bit physaddr.	2015-07-04 19:00:38 +00:00
Alexander Motin	8656f200dc	Change comment added in r284540. This appeared to be not card's issue, but driver's, though solution is the same so far.	2015-07-04 18:51:54 +00:00
Alexander Motin	6bef0aa0c6	Drop discovered targets when initiator role is disabled.	2015-07-04 18:38:46 +00:00
Justin Hibbits	398973f809	Add machine check register printing This will print out the Memory Subsystem Status Register on MPC745x (G4+ class), and the Machine Check Status Register on Book-E class CPUs, to aid in debugging machine checks. Other relevant registers, for other CPUs, can be added in the future.	2015-07-04 18:16:41 +00:00
George V. Neville-Neil	3839369c03	Enable IPSEC in all GENERIC kernels. Universe and kernel build tests passed 4 July 2015 PR: 128030 Sponsored by: Rubicon Communications (Netgate)	2015-07-04 17:37:00 +00:00
Mariusz Zaborski	54f98da930	Move the nvlist source and private includes from sys/kern to separate directory sys/contrib/libnv. The goal of this operation is to NOT install header files which shouldn't be used outside the nvlist library. Approved by: pjd (mentor)	2015-07-04 16:33:37 +00:00
Luiz Otavio O Souza	06844c0f7e	Install loader.rc with ARM u-boot loader (ubldr). loader.rc is responsible for reading and processing loader.conf variables. This fixes the issue of loader.conf being silently ignored. MFC after: 3 days	2015-07-04 16:19:38 +00:00
Mateusz Guzik	9ca30b0e06	vfs: use shared vnode locking when looking up ".." in vop_stdvptocnp Briefly discussed with: kib	2015-07-04 15:46:39 +00:00
Mateusz Guzik	dba0bec2bb	fd: de-k&r-ify functions + some whitespace fixes No functional changes.	2015-07-04 15:42:03 +00:00
Mateusz Guzik	ee5f66f820	sysctl: get rid of sysctl_lock/unlock Inline their contents into the only consumer.	2015-07-04 14:44:39 +00:00
Mariusz Zaborski	07929e4e77	Remove non-existent dnvlist functions. Approved by: pjd (mentor)	2015-07-04 10:33:33 +00:00
John-Mark Gurney	64ff224dd5	improve dependencies for this module a bit... not great, but at least gives some basics... I would add them to DPSRC, but due to the intrinsics headers, they can't be added...	2015-07-04 08:16:32 +00:00
Mateusz Guzik	d5fc115a1a	sysctl: remove a debugging printf which crept in with r285125	2015-07-04 07:01:43 +00:00
Mateusz Guzik	b8633775a8	sysctl: switch sysctllock to a sleepable rmlock The lock is almost never taken for writing.	2015-07-04 06:54:15 +00:00
Warner Losh	8cad626fbb	Cache _MPATH and pass it down into the modules build. Some NFS setups make the find it does extremely expensive, so compute it only once. Also make sure the 'traditional' module building method works at the expense of a bit of duplicated code.	2015-07-04 05:43:45 +00:00
Adrian Chadd	3e307100f6	Quieten the scorpion SoC/WMAC reset path. Stuff the non-error stuff under HALDEBUG().	2015-07-04 03:15:42 +00:00
Adrian Chadd	35aae619d7	Call the WMAC DDR flush before handling an interrupt for the Atheros AHB (internally) connected MAC. TODO: * verify the interrupt was for us before doing the DDR flush.	2015-07-04 03:07:28 +00:00
Adrian Chadd	c37904e8ba	Reshuffle all of the DDR flush operations into a single switch/mux, and start teaching subsystems about it. The Atheros MIPS platforms don't guarantee any kind of FIFO consistency with interrupts in hardware. So software needs to do a flush when it receives an interrupt and before it calls the interrupt handler. There are new ones for the QCA934x and QCA955x, so do a few things: * Get rid of the individual ones (for ethernet and IP2); * Create a mux and enum listing all the variations on DDR flushes; * replace the uses of IP2 with the relevant one (which will typically be "PCI" here); * call the USB DDR flush before calling the real USB interrupt handlers; * call the ethernet one upon receiving an interrupt that's for us, rather than never calling it during operation. Tested: * QCA9558 (TP-Link archer c7 v2) * AR9331 (Carambola 2) TODO: * PCI, USB, ethernet, etc need to do a double-check to see if the interrupt was truly for them before doing the DDR. For now I prefer "correct" over "fast".	2015-07-04 03:05:57 +00:00
Adrian Chadd	52f5515397	Wake up the hardware before doing anything in sysctl. This stops the panics that occur on MIPS platforms when doing say, 'sysctl dev.ath.0' whilst the MAC is asleep. The MIPS platform is rather unforgiving in getting power-save register access wrong and you will get all kinds of odd failures if you don't have things woken up at the right times. Tested: * QCA9558 (TP-Link Archer C7 v2) * AR9331 (Carambola 2) .. with no VAPs configured and ath0 down (thus the MAC is definitely asleep.) PR: kern/201117	2015-07-04 02:59:30 +00:00
Rick Macklem	2a3508eb48	If a "principal" argument isn't provided for a Kerberized NFS mount, the kernel would generate a bogus one with a ":/<path>" suffix. This would only occur for the case where there was no explicit "principal" argument and the getaddrinfo() call in mount_nfs.c failed to return a canonical name for the server. This patch fixes this unusual case. PR: 201073 Submitted by: masato@itc.naist.jp MFC after: 2 weeks	2015-07-03 22:11:07 +00:00
George V. Neville-Neil	987de84445	New AES modes for IPSec, user space components. Update setkey and libipsec to understand aes-gcm-16 as an encryption method. A partial commit of the work in review D2936. Submitted by: eri Reviewed by: jmg MFC after: 2 weeks Sponsored by: Rubicon Communications (Netgate)	2015-07-03 20:09:14 +00:00
Andrey V. Elsukov	cb207f93ca	Keep IPv6 address specified by IPV6_PKTINFO socket option in kernel internal form to be able handle link-local IPv6 addresses. Reported by: kp Tested by: kp	2015-07-03 19:01:38 +00:00
Luiz Otavio O Souza	50ad20b383	Add the routines to activate the GMAC clock and setup the GMAC mode. Tested on Cubieboard 2 and Banana pi.	2015-07-03 18:39:25 +00:00
Luiz Otavio O Souza	9639a6c7c9	Rename a10_emac_gpio_config() to a10_gpio_ethernet_activate() to make the change to GMAC easier on A20 SoCs. On A10 only the EMAC controller is available (fast ethernet), but on A20 there is also GMAC a high (or better) performant controller (gigabit ethernet). On A20 the both controllers uses the same pins to talk to the ethernet PHY (MII or RGMII) and they can be selected by the GPIO pin mux. There is work in progress to bring in GMAC support.	2015-07-03 17:54:41 +00:00
Luiz Otavio O Souza	2e33d3583d	Remove duplicate and unnecessary includes. While here remove an unused and wrong define.	2015-07-03 17:09:27 +00:00
Marcel Moolenaar	194709a520	Remove commented-out and non-existent cbus(4) attachment for uart(4).	2015-07-03 16:02:06 +00:00
Marcel Moolenaar	472aa69925	Allow proto(4) to be compiled into the kernel.	2015-07-03 15:56:00 +00:00
Ermal Luçi	c1fc5e9601	Reduce overhead of IPSEC for traffic generated from host When IPSEC is enabled on the kernel the forwarding path has an optimization to not enter the code paths for checking security policies but first checks if there is any security policy active at all. The patch introduces the same optimization but for traffic generated from the host itself. This reduces the overhead by 50% on my tests for generated host traffic without an SP active. Differential Revision: https://reviews.freebsd.org/D2980 Reviewed by: ae, gnn Approved by: gnn(mentor)	2015-07-03 15:31:56 +00:00
Ruslan Bukin	4ebd95ae06	o Add a description for virtio block device implemented in PISM (Bluespec C-interface device) o Add a kernel config Sponsored by: HEIF5	2015-07-03 14:46:57 +00:00
Ruslan Bukin	4934834d6b	Allow BERI virtio-platform code to operate with no PIO devices specified. We will use it with Bluespec simulator of CHERI processor for invalidating caches only.	2015-07-03 14:27:28 +00:00
Ruslan Bukin	116b8d2b0b	Add 'prewrite' method allowing us to run some platform-specific code before each write happens, e.g. write-back caches. This will help booting in Bluespec simulator of CHERI processor.	2015-07-03 14:13:16 +00:00
Luiz Otavio O Souza	7ec8c789c3	Add AHCI attachment code for Allwinner A10/A20 SoCs. The Allwinner SoC has an AHCI device on its internal main bus rather than the PCI bus. This SoC is somewhat underdocumented, and its SATA controller is no exception. The methods to support this chip were harvested from the Linux Allwinner SDK, and then constants invented to describe what's going on based on low-level constants contained in the SATA standard and guess work. This SoC requires a specific AHCI channel setup in order to start the operations on the channel properly. Clock setup and AHCI channel setup idea came from NetBSD. Tested on Cubieboard 2 and Banana pi (and attachment on Cubieboard by Pratik Singhal). Differential Revision: https://reviews.freebsd.org/D737 Submitted by: imp Reviewed by: imp, ganbold, mav, andrew	2015-07-03 14:11:01 +00:00
Roger Pau Monné	6a8e9695ba	netfront: preserve configuration across migrations Try to preserve the xn configuration when migrating. This is not always possible since the backend might not have the same set of options available, in which case we will try to preserve as many as possible. MFC after: 2 weeks PR: 183139 Reported by: mcdouga9@egr.msu.edu Sponsored by: Citrix Systems R&D	2015-07-03 12:09:05 +00:00
Hans Petter Selasky	49557d2481	Fix broken implementation of "kvasprintf()" function by adding missing kmalloc() call. Make function global instead of static inline to fix compiler warnings about passing variable argument lists to inline functions. MFC after: 1 week Sponsored by: Mellanox Technologies	2015-07-03 11:16:20 +00:00
Bjoern A. Zeeb	bfbc08b848	Move comment to the right position. PR: 152791 Submitted by: vangyzen (as part of the functional change) MFC after: 3 days	2015-07-03 09:53:56 +00:00
Adrian Chadd	ef19855701	Oops - fix typo.	2015-07-03 07:00:24 +00:00
Simon J. Gerraty	96a11afdff	Updated depends	2015-07-03 06:11:54 +00:00
Adrian Chadd	b14a705362	Add initial support for the TP-Link Archer C7 v2. The SoC, the flash, the ethernet ports and ethernet switch all work. The USB works. The 11ac PCIe NIC internally is at least seen by the PCIE RC, but I haven't tried using it yet. There's no driver and I haven't yet swapped it out for a non-11ac chip. The on-chip 2GHz wifi works, but there are some data errors that get thrown up in STA mode when scanning. I have a feeling I have to finish the DDR flush code out and have it run correctly on the shared interrupts; that'll take a bit of time to get right. But if you're after an updated piece of hardware, the Archer C7 v2 is certainly there, and you can replace the 11ac NIC with a 3x3 Atheros PCIe device (eg AR9380, AR9390, AR9580, etc) and it'll "just work". Tested: * TP-Link archer c7v2.	2015-07-03 06:09:56 +00:00
Adrian Chadd	212faba17d	Add pcb1 to the QCA955x. The Tp-link Archer-C7v2 unit has a QCA9558 internally but hangs the QCA988x 11ac PCIe NIC off of PCI RC #1, not #0. So I actually finally /do/ have a board to verify whether PCIe is working. Grr. Tested: * TP-Link Archer-C7v2.	2015-07-03 06:06:44 +00:00
Marcel Moolenaar	42d3ab5d1b	Implement unload and sync operations.	2015-07-03 05:44:58 +00:00
Adrian Chadd	9d0e5a1718	Enable setting the QCA955x GPIO output mux configuration. It's not used by any boards yet, but it's going to creep up soon as more boards show up.	2015-07-03 03:34:21 +00:00
Adrian Chadd	3facd56c71	Add register defines for the QCA955x DDR flush and GPIO control.	2015-07-03 03:32:54 +00:00
Marcel Moolenaar	89abdea8f0	Add create, destroy and load of memory descriptors.	2015-07-03 01:52:22 +00:00
Warner Losh	12f05b8446	Kill MFILES and find things automatically. It turned out to be only lightly used. Find the proper .m file when we depend on *_if.[ch] in the srcs line, with seat-belts for false positive matches. This uses make's path mechanism. A further refinement would be to calculate this once, and then pass the resulting _MPATH to modules submakes. Differential Revision: https://reviews.freebsd.org/D2327	2015-07-03 01:50:26 +00:00
Rick Macklem	d189dcb6e2	Alex Burlyga reported a POLA violation for the new NFS client as compared to the old NFS client via email to the freebsd-fs@ mailing list. For the new client, when multiple clients attempted to create a symbolic link concurrently, more than one client would report success instead of EEXIST. This was caused by code in the new client that mapped EEXIST to OK assuming it was caused by a retried RPC request. Since the old client did not do this, the patch defaults to the old behaviour and permits the new behaviour to be enabled via a sysctl. Reported by: alex.burlyga.ietf@gmail.com Tested by: alex.burlyga.ietf@gmail.com MFC after: 2 weeks	2015-07-03 01:15:21 +00:00
Mariusz Zaborski	dc619c2f57	Add stddef.h for size_t typedef. Approved by: pjd (mentor)	2015-07-02 21:46:07 +00:00
Marcel Moolenaar	3a232946f7	Add an ISA/ACPI bus attachment to proto(4).	2015-07-02 19:21:29 +00:00
Mateusz Guzik	e2f5418e73	sysvshm: fix up some whitespace issues and spurious initialisation	2015-07-02 19:14:30 +00:00
Mateusz Guzik	77a26248a3	sysvshm: don't lock proc when calculating attach_va vm_daddr is constant and RLIMIT_DATA can be obtained from thread's copy of rlimits.	2015-07-02 19:03:44 +00:00
Mateusz Guzik	0be3a191a4	sysvshm: fix shmrealloc The code was supposed to initialize new segs in newsegs array, but used the old pointer.	2015-07-02 19:00:22 +00:00
Mateusz Guzik	cd336bad26	vm: don't lock proc around accesses to vm_{t,d}addr and RLIMIT_DATA in sys_mmap vm_{t,d}addr are constant and we can use thread's copy of resource limits	2015-07-02 18:30:12 +00:00
Ermal Luçi	d14122b078	Avoid doing multiple route lookups for the same destination IP during forwarding ip_forward() does a route lookup for testing this packet can be sent to a known destination, it also can do another route lookup if it detects that an ICMP redirect is needed, it forgets all of this and handovers to ip_output() to do the same lookup yet again. This optimisation just does one route lookup during the forwarding path and handovers that to be considered by ip_output(). Differential Revision: https://reviews.freebsd.org/D2964 Approved by: ae, gnn(mentor) MFC after: 1 week	2015-07-02 18:10:41 +00:00
Andrew Turner	d2676f552e	Remove an unneeded define and old comment referencing amd64.	2015-07-02 16:13:29 +00:00
Andrew Turner	b9b3574474	Remove an old comment, the cache is enabled. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-02 15:26:40 +00:00
Konstantin Belousov	be930a2021	Account for the main process stack being one page below the highest user address when ABI uses shared page. Note that the change is no-op for correctness, since shared page does not fault. The mapping for the shared page is installed at the address space creation, the page is unmanaged and its pte/pv entry cannot be reclaimed. Submitted by: Oliver Pinter Review: https://reviews.freebsd.org/D2954 MFC after: 1 week	2015-07-02 15:22:13 +00:00
Andrew Turner	40fc1dffc3	Use pmap_load to load table entries. This simplifies finding places where we access the tables. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-02 15:17:30 +00:00
Konstantin Belousov	6fdfd88220	Use single instance of the identical INKERNEL() and PMC_IN_KERNEL() macros on amd64 and i386. Move the definition to machine/param.h. kgdb defines INKERNEL() too, the conflict is resolved by renaming kgdb version to PINKERNEL(). On i386, correct the lowest kernel address. After the shared page was introduced, USRSTACK no longer points to the last user address + 1 Submitted by: Oliver Pinter Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-02 14:37:21 +00:00
Andrew Turner	a380ef6a02	Enable kernel debugging on arm64, other than GDB as it fails to build. Sponsored by: ABT Systems Ltd	2015-07-02 14:35:30 +00:00
Konstantin Belousov	1965f86c72	Vnode is not referenced by the vfs_domount() at the point where asserts are made. Remove them, since we might dereference freed memory. Leaked locks are asserted by the syscall return code anyway. Reported and tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-07-02 14:31:47 +00:00
Alexander Motin	b9b4269c1d	Fix couple panics on forced unmount of backing file. MFC after: 1 week Sponsored by: iXsystems, Inc.	2015-07-02 12:53:22 +00:00
Pawel Jakub Dawidek	fefb6a143a	Properly propagate errors in metadata reading. PR: 198860 Submitted by: Matthew D. Fuller	2015-07-02 10:57:34 +00:00
Pawel Jakub Dawidek	edaa9008ff	Allow to omit keyfile number for the first keyfile.	2015-07-02 10:55:32 +00:00
Andriy Gapon	74f75cb1bd	zfs_mount(MS_REMOUNT): protect zfs_(un)register_callbacks calls We now take z_teardown_lock as a writer to ensure that there is no I/O while the filesystem state is in a flux. Also, zfs_suspend_fs() -> zfsvfs_teardown() call zfs_unregister_callbacks() and zfs_resume_fs() -> zfsvfs_setup() call zfs_unregister_callbacks(). Previously there was no synchronization between those calls and the calls in the re-mounting case. That could lead to concurrent execution and a crash. PR: 180060 Differential Revision: https://reviews.freebsd.org/D2865 Suggested by: mahrens Reviewed by: delphij, pho, mahrens, will MFC after: 13 days Sponsored by: ClusterHQ	2015-07-02 08:32:02 +00:00
Alexander Motin	556e4c9a83	Disable port multiplier support on Marvell 88SE61xx chips. According to report, some recent unrelated changes in the driver triggered timeouts when testing for absent port multiplier. Cause of this behavior change is unclear, but since these chips are old, rare and buggy, it is easier to just disable port multiplier support, same as done in Linux. Reported by: bar MFC after: 3 days	2015-07-02 08:25:45 +00:00
Luiz Otavio O Souza	5ee00411e4	Add DMA support for Allwinner MMC controller. DMA handles all data transfers up to 128K or 16 segments and fallback to pio mode when DMA requirements are not met. The read performance has improved greatly while the write performance also showed some improvement but seems limited by the card type and quality. Submitted by: Pratik Singhal <pratiksinghal@freebsd.org> Sponsored by: Google Summer of Code 2015 Tested on: A10 (cubieboard) and A20 (cubieboard 2 and banana pi)	2015-07-01 23:27:01 +00:00
Andrew Turner	c950fb6b67	Fix the logic for when to restore the VFP registers. It should restore them when a different thread last used them, or when the thread was last run on a different cpu. Obtained from: ABT Systems Ltd Sponsored by: The FreeBSD Foundation	2015-07-01 17:27:44 +00:00
Konstantin Belousov	3ce8c94f29	Disallow a debugger on 64bit system to set fs/gs bases of the 32bit process beyond the end of the process address space. Such setting is not dangerous to the kernel integrity, but it causes confusing application misbehaviour. Sponsored by: The FreeBSD Foundation MFC after: 12 days	2015-07-01 16:37:03 +00:00
Ruslan Bukin	b78ee15e9f	First cut of DTrace for AArch64. Reviewed by: andrew, emaste Sponsored by: ARM Limited Differential Revision: https://reviews.freebsd.org/D2738	2015-07-01 15:51:11 +00:00
Christian Brueffer	8b47cda2f7	Use the correct le*dec function to decode a 16bit type. PR: 194228 Submitted by: David Horwitt MFC after: 2 weeks	2015-07-01 14:54:13 +00:00
Ruslan Bukin	0ff41755cd	Add a central location for exclusion checks. We check here if function is excluded from FBT instrumentation. Reviewed by: andrew, emaste, markj Differential Revision: https://reviews.freebsd.org/D2899	2015-07-01 14:09:59 +00:00
Navdeep Parhar	fd215e45eb	cxgbe(4): request an automatic tx update when a netmap tx queue idles. The NIC tx queues already do this. MFC after: 1 week Differential Revision:	2015-07-01 00:34:14 +00:00
Li-Wen Hsu	4889014996	- Fix `make depend` in sys/modules Differential Revision: https://reviews.freebsd.org/D2951 Approved by: delphij	2015-06-30 19:35:14 +00:00
Navdeep Parhar	9523d1bfc3	Fix leak in tcp_lro_rx. Simply clearing M_PKTHDR isn't enough, any tags hanging off the header need to be freed too. Differential Revision: https://reviews.freebsd.org/D2708 Reviewed by: ae@, hiren@	2015-06-30 17:19:58 +00:00
Mark Murray	c4f9c760c9	Updated random(4) boot/shutdown scripting. Fix the man pages as well. Differential Revision: https://reviews.freebsd.org/D2924 Approved by: so (delphij)	2015-06-30 17:09:41 +00:00
Mark Murray	d1b06863fb	Huge cleanup of random(4) code. * GENERAL - Update copyright. - Make kernel options for RANDOM_YARROW and RANDOM_DUMMY. Set neither to ON, which means we want Fortuna - If there is no 'device random' in the kernel, there will be NO random(4) device in the kernel, and the KERN_ARND sysctl will return nothing. With RANDOM_DUMMY there will be a random(4) that always blocks. - Repair kern.arandom (KERN_ARND sysctl). The old version went through arc4random(9) and was a bit weird. - Adjust arc4random stirring a bit - the existing code looks a little suspect. - Fix the nasty pre- and post-read overloading by providing explicit functions to do these tasks. - Redo read_random(9) so as to duplicate random(4)'s read internals. This makes it a first-class citizen rather than a hack. - Move stuff out of locked regions when it does not need to be there. - Trim RANDOM_DEBUG printfs. Some are excess to requirement, some behind boot verbose. - Use SYSINIT to sequence the startup. - Fix init/deinit sysctl stuff. - Make relevant sysctls also tunables. - Add different harvesting "styles" to allow for different requirements (direct, queue, fast). - Add harvesting of FFS atime events. This needs to be checked for weighing down the FS code. - Add harvesting of slab allocator events. This needs to be checked for weighing down the allocator code. - Fix the random(9) manpage. - Loadable modules are not present for now. These will be re-engineered when the dust settles. - Use macros for locks. - Fix comments. * src/share/man/... - Update the man pages. * src/etc/... - The startup/shutdown work is done in D2924. * src/UPDATING - Add UPDATING announcement. * src/sys/dev/random/build.sh - Add copyright. - Add libz for unit tests. * src/sys/dev/random/dummy.c - Remove; no longer needed. Functionality incorporated into randomdev.. live_entropy_sources.c live_entropy_sources.h - Remove; content moved. - move content to randomdev.[ch] and optimise.
* src/sys/dev/random/random_adaptors.c src/sys/dev/random/random_adaptors.h - Remove; plugability is no longer used. Compile-time algorithm selection is the way to go. * src/sys/dev/random/random_harvestq.c src/sys/dev/random/random_harvestq.h - Add early (re)boot-time randomness caching. * src/sys/dev/random/randomdev_soft.c src/sys/dev/random/randomdev_soft.h - Remove; no longer needed. * src/sys/dev/random/uint128.h - Provide a fake uint128_t; if a real one ever arrived, we can use that instead. All that is needed here is N=0, N++, N==0, and some localised trickery is used to manufacture a 128-bit 0ULLL. * src/sys/dev/random/unit_test.c src/sys/dev/random/unit_test.h - Improve unit tests; previously the testing human needed clairvoyance; now the test will do a basic check of compressibility. Clairvoyant talent is still a good idea. - This is still a long way off a proper unit test. * src/sys/dev/random/fortuna.c src/sys/dev/random/fortuna.h - Improve messy union to just uint128_t. - Remove unneeded 'static struct fortuna_start_cache'. - Tighten up arithmetic. - Provide a method to allow eternal junk to be introduced; harden it against blatant by compress/hashing. - Assert that locks are held correctly. - Fix the nasty pre- and post-read overloading by providing explicit functions to do these tasks. - Turn into self-sufficient module (no longer requires randomdev_soft.[ch]) * src/sys/dev/random/yarrow.c src/sys/dev/random/yarrow.h - Improve messy union to just uint128_t. - Remove unneeded 'static struct start_cache'. - Tighten up arithmetic. - Provide a method to allow eternal junk to be introduced; harden it against blatant by compress/hashing. - Assert that locks are held correctly. - Fix the nasty pre- and post-read overloading by providing explicit functions to do these tasks. - Turn into self-sufficient module (no longer requires randomdev_soft.[ch]) - Fix some magic numbers elsewhere used as FAST and SLOW.
Differential Revision: https://reviews.freebsd.org/D2025 Reviewed by: vsevolod,delphij,rwatson,trasz,jmg Approved by: so (delphij)	2015-06-30 17:00:45 +00:00
Konstantin Belousov	6ef120027f	Do not calculate the stack's bottom address twice. Submitted by: Olivér Pintér Review: https://reviews.freebsd.org/D2953 MFC after: 1 week	2015-06-30 15:22:47 +00:00
Hiren Panchasara	f85680793b	Avoid a situation where we do not set persist timer after a zero window condition. If you send a 0-length packet, but there is data in the socket buffer, and neither the rexmt nor the persist timer is already set, then activate the persist timer. PR: 192599 Differential Revision: D2946 Submitted by: jlott at averesystems dot com Reviewed by: jhb, jch, gnn, hiren Tested by: jlott at averesystems dot com, jch MFC after: 2 weeks	2015-06-29 21:23:54 +00:00
Christian Brueffer	9f026a420b	Set the initial system time to a sane (as in: not end of 21st century) value when booting on a PC with CMOS clock set to a year before 2000. This uses 1980 (instead of 1970 as in the initial patch) as pivot year as suggested by imp in the PR followup. PR: 195703 Submitted by: cs@soi.spb.ru Reviewed by: imp MFC after: 1 week	2015-06-29 17:02:09 +00:00


105579 Commits