freebsd-dev

Author	SHA1	Message	Date
Warner Losh	cfb43eb12e	Tweak performance of nda completions Use xpt_done_direct in preference to xpt_done when completing a successful I/O. Continue to use xpt_done when there's an error, or for completion of the submission of a CCB. This eliminates a context switch to the cam_doneq thread. Sponsored by: Netflix Suggested by: scottl@	2017-09-28 01:27:00 +00:00
Rick Macklem	bd290946e9	Fix a memory leak that occurred in the pNFS client. When a "pnfs" NFSv4.1 mount was unmounted, it didn't free up the layouts and deviceinfo structures. This leak only affects "pnfs" mounts and only when the mount is umounted. Found while testing the pNFS Flexible File layout client code. MFC after: 2 weeks	2017-09-27 23:23:41 +00:00
John Baldwin	c2dc6d5db1	Use UMA_ALIGNOF() for name cache UMA zones. This fixes kernel crashes due to misaligned accesses to the 64-bit time_t embedded in struct namecache_ts in MIPS n32 kernels. MFC after: 1 week Sponsored by: DARPA / AFRL	2017-09-27 23:18:57 +00:00
John Baldwin	14c510c0cf	Add UMA_ALIGNOF(). This is a wrapper around _Alignof() that sets the alignment for a zone to the alignment required by a given type. This allows the compiler to determine the proper alignment rather than having the programmer try to guess. Discussed on: arch@ MFC after: 1 week Sponsored by: DARPA / AFRL	2017-09-27 23:15:33 +00:00
Landon J. Fuller	89294a783a	bhnd: Add support for supplying bus I/O callbacks when initializing an EROM parser. This allows us to use the EROM parser API in cases where the standard bus space I/O APIs are unsuitable. In particular, this will allow us to parse the device enumeration table directly from bhndb(4) drivers, prior to full attach and configuration of the bridge. Approved by: adrian (mentor) Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D12510	2017-09-27 19:48:34 +00:00
Landon J. Fuller	8e35bf8319	bhnd: Implement bhnd(4) platform device registration. Add bhnd(4) API for explicitly registering BHND platform devices (ChipCommon, PMU, NVRAM, etc) with the bus, rather than walking the newbus hierarchy to discover platform devices. These devices are now also refcounted; attempting to deregister an actively used platform device will return EBUSY. This resolves a lock ordering incompatibility with bwn(4)'s firmware loading threads; previously it was necessary to acquire Giant to protect newbus access when locating and querying the NVRAM device. Approved by: adrian (mentor) Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D12392	2017-09-27 19:44:23 +00:00
Warner Losh	922f5528bd	Since the human readable name is actually ignored, and not matching a 'human' pnp string, change it to #, the name reserved for fields that are ignored.	2017-09-27 19:22:10 +00:00
Warner Losh	4af7ac3c13	Improve description of the PNP string a bit.	2017-09-27 19:21:52 +00:00
Conrad Meyer	683d1dddf7	Unrevert r324059 With a colon and bogus name ("#") added to appease the simplistic parser used in kldxref. Sponsored by: Dell EMC Isilon	2017-09-27 19:14:00 +00:00
Mark Johnston	47f11baaca	Use C99 initializers for DTrace provider methods. This makes the definitions easier to read and more cscope-friendly. MFC after: 1 week	2017-09-27 17:46:38 +00:00
David C Somayajulu	c06073f5dc	Tx Ring Shadow Consumer Index Register needs to be cleared prior to passing it's physical address to the FW during Tx Create Context. MFC after:3 days	2017-09-27 17:46:11 +00:00
Fedor Uporov	72530f91cf	Add check to avoid raw inode iblocks fields overflow in case of huge_file feature. Use the Linux logic for now. Reviewed by: pfg (mentor) Approved by: pfg (mentor) MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D12131	2017-09-27 16:12:13 +00:00
Conrad Meyer	0e492f2faa	Remove PNP metadata from drm2 drivers until kldxref problem is resolved Reported by: np Sponsored by: Dell EMC Isilon	2017-09-27 14:59:18 +00:00
Michael Tuexen	09c53cb6cc	Remove unused function. MFC after: 1 week	2017-09-27 13:05:23 +00:00
Emmanuel Vadot	5e254379a8	vfs_export: Simplify vfs_export_lookup If the filesystem is not exported directly return NULL. If no address is given and filesystem is exported using some default one return it directly, if it doesn't have a default one directly return NULL. Reviewed by: kib, bapt MFC after: 1 week Sponsored by: Gandi.net Differential Revision: https://reviews.freebsd.org/D12505	2017-09-27 09:39:16 +00:00
Sepherosa Ziehau	61d03cd420	kernel: Bump __FreeBSD_version for the removal of M_HASHTYPE_RSS_UDP_IPV4_EX Sponsored by: Microsoft	2017-09-27 06:33:55 +00:00
Sepherosa Ziehau	f870aef090	mbuf: Remove UDP_IPV4_EX, which was never defined. Add comment to explain the IPV6_EX suffix. The confusion about these RSS hash type probably stems from the facts that they were never widely implemented by hardwares. Reviewed by: rwatson Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12453	2017-09-27 06:31:35 +00:00
Sepherosa Ziehau	03b04fd4f3	ixl: Fix mbuf hash type settings. IPV6_EXs in RSS never mean fragment. They mean: "- Home address from the home address option in the IPv6 destination options header. If the extension header is not present, use the Source IPv6 Address. - IPv6 address that is contained in the Routing-Header-Type-2 from the associated extension header. If the extension header is not present, use the Destination IPv6 Address." UDP_IPV4_EX is an invalid RSS hash type, which will be removed. Quoted from: https://docs.microsoft.com/en-us/windows-hardware/drivers/network/rss-hashing-types#ndishashipv6ex Reviewed by: erj Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12450	2017-09-27 05:59:54 +00:00
Sepherosa Ziehau	fc572e261f	tcp: Don't "negotiate" MSS. _NO_ OSes actually "negotiate" MSS. RFC 879: "... This Maximum Segment Size (MSS) announcement (often mistakenly called a negotiation) ..." This negotiation behaviour was introduced 11 years ago by r159955 without any explaination about why FreeBSD had to "negotiate" MSS: In syncache_respond() do not reply with a MSS that is larger than what the peer announced to us but make it at least tcp_minmss in size. Sponsored by: TCP/IP Optimization Fundraise 2005 The tcp_minmss behaviour is still kept. Syncookie fix was prodded by tuexen, who also helped to test this patch w/ packetdrill. Reviewed by: tuexen, karels, bz (previous version) MFC after: 2 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12430	2017-09-27 05:52:37 +00:00
Sepherosa Ziehau	2be266caf2	hyperv/hn: Fix UDP checksum offload issue in Azure. UDP checksum offload does not work in Azure if following conditions are met: - sizeof(IP hdr + UDP hdr + payload) > 1420. - IP_DF is not set in IP hdr Use software checksum for UDP datagrams falling into this category. Add two tunables to disable UDP/IPv4 and UDP/IPv6 checksum offload, in case something unexpected happened. MFC after: 1 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12429	2017-09-27 05:44:50 +00:00
Sepherosa Ziehau	c49d47daf3	hyperv/hn: Set tcp header offset for CSUM/LSO offloading. No observable effect; better safe than sorry. MFC after: 1 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12417	2017-09-27 04:42:40 +00:00
Mateusz Guzik	a79d52d739	sysctl: remove target buffer read/write checks prior to calling the handler Said checks were inherently racy anyway as jokers could unmap target areas before the handler got around to accessing them. This saves time by avoiding locking the address space. MFC after: 1 week	2017-09-27 01:31:52 +00:00
Mateusz Guzik	956713cb74	Annotate sysctlmemlock with __exclusive_cache_line. MFC after: 1 week	2017-09-27 01:27:43 +00:00
Mateusz Guzik	2f1ddb89fc	mtx: drop the tid argument from _mtx_lock_sleep tid must be equal to curthread and the target routine was already reading it anyway, which is not a problem. Not passing it as a parameter allows for a little bit shorter code in callers. MFC after: 1 week	2017-09-27 00:57:05 +00:00
Rick Macklem	a8462c582c	Add major and minor version arguments to nfscl_reqstart(). This patch adds "vers" and "minorvers" arguments to nfscl_reqstart(). The patch always passes them in as "0" and that implies no change in semantics. These arguments will be used by a future commit that adds support for the Flexible File Layout.	2017-09-26 23:42:44 +00:00
John Baldwin	800c3e80de	Don't defer wakeup()s for completed journal workitems. Normally wakeups() are performed for completed softupdates work items in workitem_free() before the underlying memory is free()'d. complete_jseg() was clearing the "wakeup needed" flag in work items to defer the wakeup until the end of each loop iteration. However, this resulted in the item being free'd before it's address was used with wakeup(). As a result, another part of the kernel could allocate this memory from malloc() and use it as a wait channel for a different "event" with a different lock. This triggered an assertion failure when the lock passed to sleepq_add() did not match the existing lock associated with the sleep queue. Fix this by removing the code to defer the wakeup in complete_jseg() allowing the wakeup to occur slightly earlier in workitem_free() before free() is called. The main reason I can think of for deferring a wakeup() would be to avoid waking up a waiter while holding a lock that the waiter would need. However, no locks are dropped in between the wakeup() in workitem_free() and the end of the loop in complete_jseg() as far as I can tell. In general I think it is not safe to do a wakeup() after free() as one cannot control how other parts of the kernel that might reuse the address for a different wait channel will handle spurious wakeups. Reported by: pho Reviewed by: kib MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D12494	2017-09-26 23:24:15 +00:00
Conrad Meyer	916616c4c5	Add PNP metadata to more drivers GPUs: radeonkms, i915kms NICs: if_em, if_igb, if_bnxt This metadata isn't used yet, but it will be handy to have later to implement automatic module loading. Reviewed by: imp, mmacy Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12488	2017-09-26 23:23:58 +00:00
Conrad Meyer	fe182ba1d0	aesni(4): Add support for x86 SHA intrinsics Some x86 class CPUs have accelerated intrinsics for SHA1 and SHA256. Provide this functionality on CPUs that support it. This implements CRYPTO_SHA1, CRYPTO_SHA1_HMAC, and CRYPTO_SHA2_256_HMAC. Correctness: The cryptotest.py suite in tests/sys/opencrypto has been enhanced to verify SHA1 and SHA256 HMAC using standard NIST test vectors. The test passes on this driver. Additionally, jhb's cryptocheck tool has been used to compare various random inputs against OpenSSL. This test also passes. Rough performance averages on AMD Ryzen 1950X (4kB buffer): aesni: SHA1: ~8300 Mb/s SHA256: ~8000 Mb/s cryptosoft: ~1800 Mb/s SHA256: ~1800 Mb/s So ~4.4-4.6x speedup depending on algorithm choice. This is consistent with the results the Linux folks saw for 4kB buffers. The driver borrows SHA update code from sys/crypto sha1 and sha256. The intrinsic step function comes from Intel under a 3-clause BSDL.[0] The intel_sha_extensions_sha<foo>_intrinsic.c files were renamed and lightly modified (added const, resolved a warning or two; included the sha_sse header to declare the functions). [0]: https://software.intel.com/en-us/articles/intel-sha-extensions-implementations Reviewed by: jhb Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12452	2017-09-26 23:12:32 +00:00
David C Somayajulu	61fb8de9f1	Fix delete all multicast addresses Submitted by:Anand.Khoje@cavium.com MFC after:5 days	2017-09-26 20:53:25 +00:00
Emmanuel Vadot	ff8241f7f0	a10_gpio: Enable all needed clocks Do not enable only the first clock, enable them all.	2017-09-26 20:23:09 +00:00
Emmanuel Vadot	9980df7daa	a10_ehci: Enable all clocks and reset a10_ehci can have multiple clocks and reset, enable them all instead of only the first one.	2017-09-26 19:21:43 +00:00
Emmanuel Vadot	de355bea02	aw_usbphy: Only reroute OTG for phy0 We only need to route OTG port to host mode on phy0 and if no VBUS is present on the port, otherwise leave the port in periperal mode.	2017-09-26 19:20:50 +00:00
Emmanuel Vadot	1eca1d26fd	aw_usbphy: Fix write of unknown register Some SoC require a write to a unknown register to work corectly. This write should be in the pmu region not in the phy ctrl one. Reported by: Mark Millard (markmi@dsl-only.net)	2017-09-26 19:19:44 +00:00
Conrad Meyer	255811d758	opencrypto: Use C99 initializers for auth_hash instances A misordering in the Via padlock driver really strongly suggested that these should use C99 named initializers. No functional change. Sponsored by: Dell EMC Isilon	2017-09-26 17:52:52 +00:00
Conrad Meyer	3693b18840	opencrypto: Loosen restriction on HMAC key sizes Theoretically, HMACs do not actually have any limit on key sizes. Transforms should compact input keys larger than the HMAC block size by using the transform (hash) on the input key. (Short input keys are padded out with zeros to the HMAC block size.) Still, not all FreeBSD crypto drivers that provide HMAC functionality handle longer-than-blocksize keys appropriately, so enforce a "maximum" key length in the crypto API for auth_hashes that previously expressed a requirement. (The "maximum" is the size of a single HMAC block for the given transform.) Unconstrained auth_hashes are left as-is. I believe the previous hardcoded sizes were committed in the original import of opencrypto from OpenBSD and are due to specific protocol details of IPSec. Note that none of the previous sizes actually matched the appropriate HMAC block size. The previous hardcoded sizes made the SHA tests in cryptotest.py useless for testing FreeBSD crypto drivers; none of the NIST-KAT example inputs had keys sized to the previous expectations. The following drivers were audited to check that they handled keys up to the block size of the HMAC safely: Software HMAC: * padlock(4) * cesa * glxsb * safe(4) * ubsec(4) Hardware accelerated HMAC: * ccr(4) * hifn(4) * sec(4) (Only supports up to 64 byte keys despite claiming to support SHA2 HMACs, but validates input key sizes) * cryptocteon (MIPS) * nlmsec (MIPS) * rmisec (MIPS) (Amusingly, does not appear to use key material at all -- presumed broken) Reviewed by: jhb (previous version), rlibby (previous version) Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12437	2017-09-26 16:18:10 +00:00
Andriy Gapon	443efc868c	fix r324011, MFV of r323535, 8585 improve batching done in zil_commit() I managed to commit an older version of the change. Plus, even the latest version was not ready for userland compilation. Reported by: "O. Hartmann" <ohartmann@walstatt.org>, cy MFC after: 1 week X-MFC with: r324011	2017-09-26 15:38:16 +00:00
Andriy Gapon	c13f1d82c8	MFV r323535: 8585 improve batching done in zil_commit() FreeBSD notes: - this MFV reverts FreeBSD commit r314549 to make the merge easier - at present our emulation of cv_timedwait_hires is rather poor, so I elected to use cv_timedwait_sbt directly Please see the differential revision for details. Unfortunately, I did not get any positive reviews, so there could be bugs in the FreeBSD-specific piece of the merge. Hence, the long MFC timeout. illumos/illumos-gate@1271e4b10d `1271e4b10d` https://www.illumos.org/issues/8585 The current implementation of zil_commit() can introduce significant latency, beyond what is inherent due to the latency of the underlying storage. The additional latency comes from two main problems: 1. When there's outstanding ZIL blocks being written (i.e. there's already a "writer thread" in progress), then any new calls to zil_commit() will block waiting for the currently oustanding ZIL blocks to complete. The blocks written for each "writer thread" is coined a "batch", and there can only ever be a single "batch" being written at a time. When a batch is being written, any new ZIL transactions will have to wait for the next batch to be written, which won't occur until the current batch finishes. As a result, the underlying storage may not be used as efficiently as possible. While "new" threads enter zil_commit() and are blocked waiting for the next batch, it's possible that the underlying storage isn't fully utilized by the current batch of ZIL blocks. In that case, it'd be better to allow these new threads to generate (and issue) a new ZIL block, such that it could be serviced by the underlying storage concurrently with the other ZIL blocks that are being serviced. 2. Any call to zil_commit() must wait for all ZIL blocks in its "batch" to complete, prior to zil_commit() returning. The size of any given batch is proportional to the number of ZIL transaction in the queue at the time that the batch starts processing the queue; which doesn't occur until the previous batch completes. Thus, if there's a lot of transactions in the queue, the batch could be composed of many ZIL blocks, and each call to zil_commit() will have to wait for all of these writes to complete (even if the thread calling zil_commit() only cared about one of the transactions in the batch). Reviewed by: Brad Lewis <brad.lewis@delphix.com> Reviewed by: Matt Ahrens <mahrens@delphix.com> Reviewed by: George Wilson <george.wilson@delphix.com> Approved by: Dan McDonald <danmcd@joyent.com> Author: Prakash Surya <prakash.surya@delphix.com> MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12355	2017-09-26 11:04:08 +00:00
Conrad Meyer	a317fb03c2	crypto(9): Use a more specific error code when a capable driver is not found When crypto_newsession() is given a request for an unsupported capability, raise a more specific error than EINVAL. This allows cryptotest.py to skip some HMAC tests that a driver does not support. Reviewed by: jhb, rlibby Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12451	2017-09-26 01:31:49 +00:00
John Baldwin	09f3bb8756	Log signal number passed to PT_STEP requests in KTR_PTRACE traces. MFC after: 1 week	2017-09-25 20:38:55 +00:00
John Baldwin	5b01ccb01e	Use tmpfs_print for tmpfs FIFOs. Reviewed by: kib (part of a larger patch)	2017-09-25 20:26:16 +00:00
Toomas Soome	f6eef0f9a2	libefi: efipart_floppy() will should not pass acpi pointer if the HID test fails The current efipart_floppy() implementation is leaking the acpi pointer.	2017-09-25 19:49:56 +00:00
Ian Lepore	a78b4d1462	Use nstosbt() instead of multiplying by SBT_1NS to avoid roundoff errors. Differential Revision: https://reviews.freebsd.org/D11779	2017-09-25 15:03:27 +00:00
Marcin Wojtas	40c2a6da0d	Fix gcc compilation issues in the mvneta driver Compiling mvneta driver with gcc unveiled two issues, that required fixing. Reported by: andrew Obtained from: Semihalf	2017-09-25 02:06:51 +00:00
Alan Cox	43cc906f40	Change vm_page_try_to_free() to require a managed page. Essentially, vm_page_try_to_free() is testing conditions, like clean versus dirty, that only vary in managed pages. Suggested by: kib Reviewed by: markj X-MFC after: never	2017-09-24 23:35:01 +00:00
Alan Cox	01c0757bae	Modernize the use of vm_page_unwire(). Since r288122, vm_page_unwire() has returned TRUE when the wire count transitions to zero, eliminating the need for callers to inspect the page's wire count. MFC after: 1 week	2017-09-24 22:29:11 +00:00
Pedro F. Giffuni	2c75d7b08d	Small style(9) issue: spaces vs TAB.	2017-09-24 20:57:03 +00:00
Rick Macklem	ce8d06fe87	Change a panic to an error return. There was a panic() in the NFS server's write operation that didn't need to be a panic() and could just be an error return. This patch makes that change. Found by code inspection during development of the pNFS service. MFC after: 2 weeks	2017-09-24 20:05:48 +00:00
Conrad Meyer	a523de2365	g_resize_provider_event: Do not invoke orphan method twice Like r266444, g_resize_provider_event can attempt to orphan an already orphaned geom_dev consumer. This will cause a panic in g_dev_orphan. Apply the same fix as was applied to g_orphan_register. Reviewed by: ae Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12469	2017-09-24 19:59:26 +00:00
Rick Macklem	c36e087097	Remove 0 filling from nfsm_uiombuflist(). nfsm_uiombuflist() zero filled the mbuf list to a multiple of 4bytes as required for XDR. Unfortunately that modified an mbuf list after it was m_copym()'d and was broken. This patch removes the zero filling code. Since nfsm_uiombuflist() is not yet used in head/current, this has no effect on users. The function will be used by a future commit of code that adds Flex File Layout support.	2017-09-24 19:43:31 +00:00
Alan Cox	494c6e43d3	Optimize vm_page_try_to_free(). Specifically, the call to pmap_remove_all() can be avoided when the page's containing object has a reference count of zero. (If the object has a reference count of zero, then none of its pages can possibly be mapped.) Address nearby style issues in vm_page_try_to_free(), and change its return type to "bool". Reviewed by: kib, markj MFC after: 1 week	2017-09-24 16:50:10 +00:00
Andrew Turner	5493a64ac4	Remove the VIRT kernel config, it's now useable through GENERIC. Sponsored by: DARPA, AFRL	2017-09-24 13:28:24 +00:00
Scott Long	867aa8cd99	Add the ability to report and set debug flags as text strings instead of just integer flags. Report both for convenience. Submitted by: Eygene Ryabinkin (manpage) Sponsored by: Netflix	2017-09-24 13:14:50 +00:00
Andrew Turner	6b81683f9e	Add i.MX6 and Xilinx to GENERIC. Merge in the missing devices from the IMX6 and ZEDBOARD kernel configs. The Freescale sdma device has been renamed to fslsdma to mark it as a platform specific driver. Reviewed by: ian Sponsored by: DARPA, AFRL Differential Revision: https://reviews.freebsd.org/D11564	2017-09-24 09:33:08 +00:00
Ilya Bakulin	d91f1a1094	Rename sdhci_cam_start_slot() into sdhci_start_slot() This change allows to just call sdhci_start_slot() in SDHCI drivers and not to think about which stack handles the operation. As a side effect, this will also fix MMCCAM with sdhci_acpi driver. Approved by: imp (mentor) Differential Revision: https://reviews.freebsd.org/D12471	2017-09-24 09:05:35 +00:00
Cy Schubert	27cb792d48	Fix typo from r323945. Reported by: Gary Jennejohn <gljennjohn@gmail.com> Point hat to: cy (me)	2017-09-24 03:33:26 +00:00
Alan Cox	92993af469	Since the page "frame" doesn't belong to a vm object, it can't be paged out. Since it can't be paged out, it is never actually enqueued in a paging queue. Nonetheless, passing PQ_INACTIVE to vm_page_unwire() creates the appearance that the page "frame" is being enqueued in the inactive queue. As of r288122, we can avoid this false impression by passing PQ_NONE. MFC after: 1 week	2017-09-24 02:50:59 +00:00
Conrad Meyer	f41b85a63c	ddb(4): Add 'show badstacks' command to show witness badstacks Add a DDB command that mirrors sysctl debug.witness.badstacks. Reapply r323935 after fixing trivial deficiency. I forgot to compile with WITNESS enabled. Thanks emaste@ for fixing the build while I was asleep. Reported by: rstone Reviewed by: rstone (previous version) Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12468	2017-09-23 17:48:49 +00:00
Stephen Hurd	59059cda6a	bnxt: Choose better HW LRO defaults for performance 1) Choose correct Firmware options for HW LRO for best performance 2) Delete TBD and other comments which are not required. 3) Added sysctl interface to enable / disable / modify different factors of HW LRO. 4) Disabled HW LRO by default to avoid issues with packet forwarding This allows much better control over the LRO configuration via sysctls, and uses much better defaults. Hardware LRO can now be enabled/disabled independantly from the software LRO, and the tuning parameters are exposed. manpage updates coming soon. Submitted by: Bhargava Chenna Marreddy <bhargava.marreddy@broadcom.com> Reviewed by: shurd, sbruno Approved by: sbruno (mentor) Sponsored by: Broadcom Limited Differential Revision: https://reviews.freebsd.org/D12223	2017-09-23 16:59:37 +00:00
Stephen Hurd	1225d9da9f	Have ifmp_ring_enqueue() abdicate instead of switch to a consumer Move TX out of the enqueue() path. As a result, we need to have ifmp_ring_check_drainage() pick up from the abdicate state. We also need to either enqueue the TX task, or check drainage after calling ifmp_ring_enqueue() to ensure it's sent. This change results in a 30% small packet forwarding improvement. Reviewed by: olivier, sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12439	2017-09-23 16:46:30 +00:00
Mariusz Zaborski	290e7bac6e	After the r317886 support for TFTP and NFS can be enable simultaneously. The cleanup of this distinction was done in the r318988, but this Makefile was omitted. Submitted by: kczekirda@	2017-09-23 12:44:42 +00:00
Ed Maste	4c087f8a83	Revert r323935 as it broke the build subr_witness.c:2577:4: error: use of undeclared identifier 'req' req->oldidx = 0; ^	2017-09-23 12:35:46 +00:00
Scott Long	55f1f05248	Garbage collect usued fields Sponsored by: Netflix	2017-09-23 08:26:42 +00:00
Cy Schubert	37c50e570d	Correct two misspellings. Also align */.	2017-09-23 06:00:17 +00:00
Stephen Hurd	d57a78580e	Make struct grouptask gt_name member a char array Previously, it was just a pointer which was copied, but some callers pass in a stack variable which will go out of scope. Add GROUPTASK_NAMELEN macro (32) and snprintf() the name into it, using "grouptask" if name is NULL. We can now safely include gtask->gt_name in console messages. Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12449	2017-09-23 01:39:16 +00:00
Stephen Hurd	f4d2154e0c	Make the rx budget a tunable This allows tuning the rx budget for special load profiles as well as more easily testing to determine sane defaults. Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12445	2017-09-23 01:37:01 +00:00
Stephen Hurd	20f63282f8	Chain mbufs before passing to if_input() Build a list of mbufs to pass to if_input() after LRO. Results in 12% small packet forwarding rate improvement. Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12444	2017-09-23 01:35:14 +00:00
Stephen Hurd	c5cf217261	Some small packet performance improvements If the packet is smaller than MTU, disable the TSO flags. Move TCP header parsing inside the IS_TSO?() test. Add a new IFLIB_NEED_ZERO_CSUM flag to indicate the checksums need to be zeroed before TX. Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12442	2017-09-23 01:33:20 +00:00
Conrad Meyer	6fec2d2cce	ddb(4): Add 'show badstacks' command to show witness badstacks Add a DDB command that mirrors sysctl debug.witness.badstacks. Reported by: rstone Reviewed by: rstone Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12468	2017-09-22 20:01:12 +00:00
John Baldwin	64f73a4c67	Detect NEON and set HWCAP_NEON if present. Reviewed by: andrew, ian MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D12389	2017-09-22 17:58:57 +00:00
John Baldwin	1e159074cb	Correct HWCAP_VFP3* values to match Linux. Reviewed by: andrew, ian MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D12388	2017-09-22 17:57:38 +00:00
Konstantin Belousov	55e5a5c1f4	Fix 32bit build. Reported by: emaste Sponsored by: The FreeBSD Foundation	2017-09-22 16:42:41 +00:00
Kirk McKusick	75e3597abb	Continuing efforts to provide hardening of FFS, this change adds a check hash to cylinder groups. If a check hash fails when a cylinder group is read, no further allocations are attempted in that cylinder group until it has been fixed by fsck. This avoids a class of filesystem panics related to corrupted cylinder group maps. The hash is done using crc32c. Check hases are added only to UFS2 and not to UFS1 as UFS1 is primarily used in embedded systems with small memories and low-powered processors which need as light-weight a filesystem as possible. Specifics of the changes: sys/sys/buf.h: Add BX_FSPRIV to reserve a set of eight b_xflags that may be used by individual filesystems for their own purpose. Their specific definitions are found in the header files for each filesystem that uses them. Also add fields to struct buf as noted below. sys/kern/vfs_bio.c: It is only necessary to compute a check hash for a cylinder group when it is actually read from disk. When calling bread, you do not know whether the buffer was found in the cache or read. So a new flag (GB_CKHASH) and a pointer to a function to perform the hash has been added to breadn_flags to say that the function should be called to calculate a hash if the data has been read. The check hash is placed in b_ckhash and the B_CKHASH flag is set to indicate that a read was done and a check hash calculated. Though a rather elaborate mechanism, it should also work for check hashing other metadata in the future. A kernel internal API change was to change breada into a static fucntion and add flags and a function pointer to a check-hash function. sys/ufs/ffs/fs.h: Add flags for types of check hashes; stored in a new word in the superblock. Define corresponding BX_ flags for the different types of check hashes. Add a check hash word in the cylinder group. sys/ufs/ffs/ffs_alloc.c: In ffs_getcg do the dance with breadn_flags to get a check hash and if one is provided, check it. sys/ufs/ffs/ffs_vfsops.c: Copy across the BX_FFSTYPES flags in background writes. Update the check hash when writing out buffers that need them. sys/ufs/ffs/ffs_snapshot.c: Recompute check hash when updating snapshot cylinder groups. sys/libkern/crc32.c: lib/libufs/Makefile: lib/libufs/libufs.h: lib/libufs/cgroup.c: Include libkern/crc32.c in libufs and use it to compute check hashes when updating cylinder groups. Four utilities are affected: sbin/newfs/mkfs.c: Add the check hashes when building the cylinder groups. sbin/fsck_ffs/fsck.h: sbin/fsck_ffs/fsutil.c: Verify and update check hashes when checking and writing cylinder groups. sbin/fsck_ffs/pass5.c: Offer to add check hashes to existing filesystems. Precompute check hashes when rebuilding cylinder group (although this will be done when it is written in fsutil.c it is necessary to do it early before comparing with the old cylinder group) sbin/dumpfs/dumpfs.c Print out the new check hash flag(s) sbin/fsdb/Makefile: Needs to add libufs now used by pass5.c imported from fsck_ffs. Reviewed by: kib Tested by: Peter Holm (pho)	2017-09-22 12:45:15 +00:00
Scott Long	aeb9ac0df5	Clean up error messages related to device discovery Sponsored by: Netflix	2017-09-22 12:07:03 +00:00
Andriy Gapon	f94aa61c33	MFV r323917: 8648 Fix range locking in ZIL commit codepath illumos/illumos-gate@42b1411172 `42b1411172` https://www.illumos.org/issues/8648 I'm opening this bug to track integration of the following ZFS on Linux commit into illumos: commit `f763c3d1df` Author: LOLi <loli10K@users.noreply.github.com> Date: Mon Aug 21 17:59:48 2017 +0200 Fix range locking in ZIL commit codepath Since OpenZFS 7578 (`1b7c1e5`) if we have a ZVOL with logbias=throughput we will force WR_INDIRECT itxs in zvol_log_write() setting itx->itx_lr offset and length to the offset and length of the BIO from zvol_write()->zvol_log_write(): these offset and length are later used to take a range lock in zillog->zl_get_data function: zvol_get_data(). Now suppose we have a ZVOL with blocksize=8K and push 4K writes to offset 0: we will only be range-locking 0-4096. This means the ASSERTion we make in dbuf_unoverride() is no longer valid because now dmu_sync() is called from zilog's get_data functions holding a partial lock on the dbuf. Fix this by taking a range lock on the whole block in zvol_get_data(). Reviewed-by: Chunwei Chen <tuxoko@gmail.com> Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov> Signed-off-by: loli10K <ezomori.nozomu@gmail.com> Reviewed by: Igor Kozhukhov <igor@dilos.org> Reviewed by: Matt Ahrens <mahrens@delphix.com> Reviewed by: Andriy Gapon <avg@FreeBSD.org> Reviewed by: Alexander Motin <mav@FreeBSD.org> Approved by: Robert Mustacchi <rm@joyent.com> Author: LOLi <loli10K@users.noreply.github.com> MFC after: 10 days	2017-09-22 08:27:27 +00:00
Hans Petter Selasky	d384b1673f	Extend sysctl description for hw.usb.disable_enumeration . PR: 222505 Submitted by: Julian H. Stacey <jhs@berklix.com> MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-22 08:21:35 +00:00
Andriy Gapon	1c6ea90df5	MFV r323914: 8661 remove "zil-cw2" dtrace probe illumos/illumos-gate@bd9d3f9046 `bd9d3f9046` https://www.illumos.org/issues/8661 The "zil-cw1" dtrace probe was previously removed in 8558, and the "zil-cw2" probe should have been removed in that patch as well. Unfortunately, the "zil- cw2" was not removed in 8558, so this bug is to track it's removal. Reviewed by: Serapheim Dimitropoulos <serapheim@delphix.com> Reviewed by: Matthew Ahrens <mahrens@delphix.com> Reviewed by: Igor Kozhukhov <igor@dilos.org> Approved by: Robert Mustacchi <rm@joyent.com> Author: Prakash Surya <prakash.surya@delphix.com> MFC after: 1 week	2017-09-22 08:21:14 +00:00
Hans Petter Selasky	40f53a7cdc	Add support for 32-bit compatibility IOCTLs in the LinuxKPI. Bump the FreeBSD version to force recompilation of external kernel modules due to structure change. PR: 222504 Submitted by: Greg V <greg@unrelenting.technology> MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-22 08:12:08 +00:00
Toomas Soome	c57460d994	libefi: define EISA PNP constants Define EISA PNP constants and use them, also fix ID for 0x701	2017-09-22 07:44:36 +00:00
Toomas Soome	f43a98e6c2	libefi: efipart_hdinfo_add_filepath should check strtol result Use errno for error checking.	2017-09-22 07:40:05 +00:00
Toomas Soome	abf054b4c6	libefi: efipart.c cstyle fix for efipart_print_common() The else statement should have { }	2017-09-22 07:37:42 +00:00
Toomas Soome	5af584ffc1	libefi: efipart_strategy() should return ENXIO when there is no media We should return ENXIO to indicate the situation with device present, but no media.	2017-09-22 07:34:08 +00:00
Toomas Soome	b07d6aa2ae	libefi: pdinfo_t pd_unit and pd_open should be unsigned The device index, partition index and reference counter are all positive numbers. However, since our internal partition number may be negative to indicate GPT table, the compare expression need to take care when comparing pdinfo_t and partition data.	2017-09-22 07:29:26 +00:00
Michael Tuexen	d28a3a393b	Add missing locking. Found by Coverity while scanning the usrsctp library. MFC after: 1 week	2017-09-22 06:33:01 +00:00
Michael Tuexen	afb908dada	Add missing socket lock. MFC after: 1 week	2017-09-22 06:07:47 +00:00
Toomas Soome	832d45d219	efilib.h: typo in structure member description The link should be replaced by list.	2017-09-22 02:58:47 +00:00
Toomas Soome	024cc69661	r323885 did miss efilib.h update The efilib.h update was left out from r323885 by mistake.	2017-09-22 02:56:26 +00:00
Toomas Soome	cbc1b3de8f	libefi: efi_devpath_match local len should be unsigned DevicePathNodeLength() will always return unsigned value.	2017-09-22 02:53:01 +00:00
Warner Losh	78ed811e6c	cam iosched: Bettar account IOPS for smoother performance Prevent cam_iosched_iops_tick() from discarding 'unspent' ios unless it's a new accounting interval. Previously ios that weren't used between ticks were lost, as a result the iops limiter could enforce a limit below the configured maximum. Obtained from: ElectroBSD Submitted by: Fabian Keil PR: 221974	2017-09-22 02:36:36 +00:00
Warner Losh	f777123b83	cam iosched: Enforce iop limits below the quanta value Previously the iops limiter would always allow at least quanta ios per second as cam_iosched_iops_tick() never set ios->l_value1 below 1. Submitted by: Fabian Keil <fk@fabiankeil.de> Obtained from: ElectroBSD PR: 221974	2017-09-22 02:36:32 +00:00
John Baldwin	cc05c7d256	Support AEAD requests with non-GCM algorithms. In particular, support chaining an AES cipher with an HMAC for a request including AAD. This permits submitting requests from userland to encrypt objects like IPSec packets using these algorithms. In the non-GCM case, the authentication crypto descriptor covers both the AAD and the ciphertext. The GCM case remains unchanged. This matches the requests created internally in IPSec. For the non-GCM case, the COP_F_CIPHER_FIRST is also supported since the ordering matters. Note that while this can be used to simulate IPSec requests from userland, this ioctl cannot currently be used to perform TLS requests using AES-CBC and MAC-before-encrypt. Reviewed by: cem Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D11759	2017-09-22 00:34:46 +00:00
John Baldwin	2c907637bc	Add a new COP_F_CIPHER_FIRST flag for struct crypt_op. This requests that the cipher be performed before rather than after the HMAC when both are specified for a single operation. Reviewed by: cem Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D11757	2017-09-22 00:21:58 +00:00
John Baldwin	95f076384f	Place the AAD before the plaintext/ciphertext for CIOCRYPTAEAD. Software crypto implementations don't care how the buffer is laid out, but hardware implementations may assume that the AAD is always before the plain/cipher text and that the hash/tag is immediately after the end of the plain/cipher text. In particular, this arrangement matches the layout of both IPSec packets and TLS frames. Linux's crypto framework also assumes this layout for AEAD requests. Reviewed by: cem Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D11758	2017-09-22 00:15:54 +00:00
Stephen Hurd	bf227542f3	Fix undeclared identifier error introduced in r323879 It doesn't appear to be safe to use gtask->gt_name. Reported by: Mark Johnston, Jenkins Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12448	2017-09-21 23:27:35 +00:00
Toomas Soome	cfe103a2ac	libefi: efipart.c should use calloc() The device specific *_add functions are using malloc() + memset, should use calloc instead.	2017-09-21 23:22:18 +00:00
Toomas Soome	59fcc285f4	libefi: efi_devpath_match() should return bool The current implementation of efi_devpath_match() is returning values 0 or 1, so it should be updated to return bool.	2017-09-21 23:14:07 +00:00
John Baldwin	e1d15b892a	Only handle _PC_MAX_CANON, _PC_MAX_INPUT, and _PC_VDISABLE for TTY devices. Move handling of these three pathconf() variables out of vop_stdpathconf() and into devfs_pathconf() as TTY devices can only be devfs files. In addition, only return settings for these three variables for devfs devices whose device switch has the D_TTY flag set. Discussed with: bde, kib Sponsored by: Chelsio Communications	2017-09-21 23:05:32 +00:00
Mark Johnston	568aef2f6a	Simplify i915_gem_wire_page() and avoid unneeded page-busying. Reviewed by: alc, kib MFC after: 1 week	2017-09-21 22:15:45 +00:00
Stephen Hurd	326aacb0e3	Improved logging of gtaskqueue failues Check the return code of intr_setaffinity() and log any errors it returns. When a qid is not located, log an error before returning failure. Also, use __func__ rather than hardcoding the function name Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12436	2017-09-21 21:14:48 +00:00
Stephen Hurd	a0fcc37122	Fix M_GTASKQUEUE definition Previously had the same short and long description as taskqueues. This could cause problems with memguard(9) and vmstat -m which use the short description as a unique identifier. Reviewed by: sbruno Approved by: sbruno (mentor) MFC after: 3 days Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12438	2017-09-21 20:34:33 +00:00
Stephen Hurd	23e90483ec	bnxt: Fix driver when attached to a VF - Use HWRM_FUNC_VF_CFG instead of HWRM_FUNC_CFG on VFs - Fix NPAR/VF detection - Clean up flag definitions - Don't allow WoL on VFs Although the bnxt driver doesn't support SR-IOV so can create VFs yet, the PF could be running Linux or ESCi with a VF passed through to a FreeBSD guest. This fixes the driver for that use case. Submitted by: Siva Kallam <siva.kallam@@broadcom.com> Reviewed by: shurd, sbruno Approved by: sbruno (mentor) Sponsored by: Broadcom Limited Differential Revision: https://reviews.freebsd.org/D12410	2017-09-21 20:27:43 +00:00
Eugene Grosbein	10633c7e5a	Unprotected modification of ng_iface(4) private data leads to kernel panic. Fix a race with per-node read-mostly lock and refcounting for a hook. PR: 220076 Tested by: peixoto.cassiano Approved by: avg (mentor), mav (mentor) MFC after: 1 week Relnotes: yes Differential Revision: https://reviews.freebsd.org/D12435	2017-09-21 20:16:10 +00:00
Alan Cox	4aef95b3f0	Modernize calls to vm_page_unwire(). As of r288122, vm_page_unwire() accepts PQ_NONE as the specified queue and returns a Boolean indicating whether the page's wire count transitioned to zero. Use these features in dev/drm2. Reviewed by: kib, markj MFC after: 1 week	2017-09-21 15:32:41 +00:00
Toomas Soome	5370de88cc	libefi: devicename.c cleanups Remove duplicated free()+return statements, default unit to 0 and improve strtol error processing.	2017-09-21 15:30:20 +00:00
Kristof Provost	ed9de14d2f	bridge: Set module version This ensures that the loader will not load the module if it's also built in to the kernel. PR: 220860 Submitted by: Eugene Grosbein <eugen@freebsd.org> Reported by: Marie Helene Kvello-Aune <marieheleneka@gmail.com>	2017-09-21 14:14:01 +00:00
Michael Tuexen	cdd2d7d4a5	Code cleanup, no functional change. MFC after: 1 week	2017-09-21 11:56:31 +00:00
Mariusz Zaborski	ef0c8428f9	Plug memory leak in case when nvlist allocation succeeds, but nvpair allocation fails. Submitted by: pjd@ MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:28:22 +00:00
Mariusz Zaborski	b6960f00fa	Simplify the code by _not_ expecting success under 'fail'. Submitted by: pjd@ and oshogbo@ MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:18:02 +00:00
Mariusz Zaborski	56117a342f	IMHO it is possible that failure will be treated as success because we don't initialize nvp on every loop iteration and the code under 'fail'(!) label detects success by checking of nvp != NULL. Submitted by: pjd@ MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:16:44 +00:00
Mariusz Zaborski	0a5f83e3fa	Free 'value' only once we are done freeing all individual Submitted by: pjd@ MFC after: 1 month Found by: scan-build Sponsored by: Wheel Systems	2017-09-21 10:14:43 +00:00
Mariusz Zaborski	c696dd0687	Because nvp wasn't initialized on every loop iteration once we jumped to 'fail' on error it was treated as success, because nvp!=NULL. Fix this by not handling success under 'fail' label and by using separate variable for parent nvpair. If we succeeded to allocate nvlist, but failed to allocated nvpair we would leak nvls[ii] on return. Destroy it when we cannot allocate nvpair, before we goto fail. Submitted by: pjd@ and oshogbo@ (minor changes) Found by: scan-build MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:10:42 +00:00
Mariusz Zaborski	a3c485d38d	Make the code consistent by always using 'fail' label. Submitted by: pjd@ and oshogbo@ MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:06:00 +00:00
Mariusz Zaborski	1dacabe1ab	The 'while (array != NULL) { }' suggests scan-build that array may be initially NULL, which is not possible. Change the loop to 'do {} while (array != NULL)' to satisfy scan-build and assert that array really cannot be NULL just in case. Submitted by: pjd@ Found by: scan-build MFC after: 1 month Sponsored by: Wheel Systems	2017-09-21 10:03:14 +00:00
Mariusz Zaborski	08016b3185	Remove redundant initialization. Don't use variable - just return the value. Make scan-build happy by casting to 'void ' instead of 'void *'. Submitted by: pjd@ MFC after: 1 month Found by: scan-build and cppcheck Sponsored by: Wheel Systems	2017-09-21 10:00:16 +00:00
Michael Tuexen	53999485e0	Free the control structure after using is, not before. Found by Coverity while scanning the usrsctp library. MFC after: 1 week	2017-09-21 09:47:56 +00:00
Michael Tuexen	d0d8c7de19	No need to wakeup, since sctp_add_to_readq() does it. MFC after: 1 week	2017-09-21 09:18:05 +00:00
Rick Macklem	6b43e06029	Add a few definitions for Flex File Layout for pNFS. These definitions will be used by a future commit.	2017-09-21 00:41:12 +00:00
Jung-uk Kim	8c294161aa	Remove an ancient comment about the existence of READ(16) and WRITE(16). MFC after: 3 days	2017-09-21 00:03:59 +00:00
Andrey V. Elsukov	5df8171da3	Use in_localip() function instead of unlocked access to addresses hash to determine that an address is our local. PR: 220078 MFC after: 1 week	2017-09-20 22:35:28 +00:00
Andrey V. Elsukov	369bc48dc5	Do not acquire IPFW_WLOCK when a named object is created and destroyed. Acquiring of IPFW_WLOCK is requried for cases when we are going to change some data that can be accessed during processing of packets flow. When we create new named object, there are not yet any rules, that references it, thus holding IPFW_UH_WLOCK is enough to safely update needed structures. When we destroy an object, we do this only when its reference counter becomes zero. And it is safe to not acquire IPFW_WLOCK, because noone references it. The another case is when we failed to finish some action and thus we are doing rollback and destroying an object, in this case it is still not referenced by rules and no need to acquire IPFW_WLOCK. This also fixes panic with INVARIANTS due to recursive IPFW_WLOCK acquiring. MFC after: 1 week Sponsored by: Yandex LLC	2017-09-20 22:00:06 +00:00
Warner Losh	5fff95cc1d	Fix queue depth for nda. 1/4 of the number of queues times queue entries is too limiting. It works up to about 4k IOPS / 3.0GB/s for hardware that can do 4.4k/3.2GB/s with nvd. 3/4 works better, though it highlights issues in the fairness of nda's choice of TRIM vs READ. That will be fixed separately.	2017-09-20 21:42:25 +00:00
Michael Tuexen	2c62ba7377	Protect the address workqueue timer by a mutex. MFC after: 1 week	2017-09-20 21:29:54 +00:00
Warner Losh	89d26636f3	cam iosched: Call cam_iosched_limiter_init() after ios->current is set to the default Previously ios->current was set to 0 until the first cam_iosched_cl_maybe_steer() call. PR: 221954 Obtained from: ElectroBSD Submitted by: Fabian Keil Differential Revision: https://reviews.freebsd.org/D12349	2017-09-20 21:26:01 +00:00
Warner Losh	3028dd8dd5	cam iosched: Schedule cam_iosched_ticker() quanta times per second Previously callout_reset() was called with a "ticks" value that was off by one. As a result cam_iosched_ticker() was called a bit too frequently: On systems with hz=1000 a quanta value of 200 resulted in ~250 calls and a value of 100 in ~111 calls. For the "queue_depth" and "bandwidth" limiters the difference doesn't matter but the "iops" limiter depends on the scheduling to enforce the correct maximum. PR: 221956 Obtained from: ElectroBSD Submitted by: Fabian Keil Differential Revision: https://reviews.freebsd.org/D12350	2017-09-20 21:25:56 +00:00
Warner Losh	2d22619adc	cam iosched: Add a handler for the quanta sysctl to enforce valid values Invalid values can result in devision-by-zero panics or other undefined behaviour so lets not allow them. PR: 221957 Obtained from: ElectroBSD Submitted by: Fabian Keil Differential Revision: https://reviews.freebsd.org/D12351	2017-09-20 21:19:53 +00:00
Warner Losh	84c12dcdd0	cam iosched: Use the write queue for BIO_ZONE commands Use the write queue for BIO_ZONE commands so they can't get executed ahead of writes that were sent after them. More generally, since they introduce strong ordering into the list, they need to go to the write queue (which is the only queue that BIO_ORDERED is honored for at the moment). In fact, fix mismatch between queueing and dequeueing code by changing this to queue all non-reads (and non-trims) to the write queue. As a side effect this prevents the kernel message: kernel: Found bio_cmd = 0x9 which cam_iosched_next_bio() emits when finding commands other than BIO_READ in the read queue. PR: 221973 Obtained from: ElectroBSD Submitted by: Fabian Keil Differential Revision: https://reviews.freebsd.org/D12353	2017-09-20 21:13:20 +00:00
Stephen Hurd	d0d0ad0ae2	Fix iflib netmap RX RXQ setup for netmap was broken because netmap_rxq_init was getting called before IFDI_INIT - thus we ended up with ring tail pointer being reset to zero. Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12140	2017-09-20 20:40:49 +00:00
David C Somayajulu	203f9d1828	1. ql_hw.c: In ql_hw_send() return EINVAL when TSO framelength exceeds max supported length by HW.(davidcs) 2. ql_os.c: In qla_send() call bus_dmamap_unload before freeing mbuf or recreating dmmamap.(davidcs) In qla_fp_taskqueue() Add additional checks for IFF_DRV_RUNNING Fix qla_clear_tx_buf() call bus_dmamap_sync() before freeing mbuf. Submitted by:David.Bachu@netapp.com MFC after:5 days	2017-09-20 20:07:45 +00:00
Conrad Meyer	d616681cec	aesni(4): Fix another trivial typo (aensi -> aesni) Sponsored by: Dell EMC Isilon	2017-09-20 18:31:36 +00:00
Conrad Meyer	194446f9b7	x86: Decode AMD "Extended Feature Extensions ID EBX" bits In particular, this determines CPU support for the CLZERO instruction. (No, I am not making this name up.) Sponsored by: Dell EMC Isilon	2017-09-20 18:30:37 +00:00
Conrad Meyer	81326306dd	aesni(4): Fix trivial typo (AQUIRE -> ACQUIRE) Sponsored by: Dell EMC Isilon	2017-09-20 17:53:25 +00:00
Alan Somers	cd037f075c	MFV r323789: 8473 scrub does not detect errors on active spares illumos/illumos-gate@554675eee7 `554675eee7` https://www.illumos.org/issues/8473 Scrubbing is supposed to detect and repair all errors in the pool. However, it wrongly ignores active spare devices. The problem can easily be reproduced in OpenZFS at git rev 0ef125d with these commands: truncate -s 64m /tmp/a /tmp/b /tmp/c sudo zpool create testpool mirror /tmp/a /tmp/b spare /tmp/c sudo zpool replace testpool /tmp/a /tmp/c /bin/dd if=/dev/zero bs=1024k count=63 oseek=1 conv=notrunc of=/tmp/c sync sudo zpool scrub testpool zpool status testpool # Will show 0 errors, which is wrong sudo zpool offline testpool /tmp/a sudo zpool scrub testpool zpool status testpool # Will show errors on /tmp/c, # which should've already been fixed FreeBSD head is partially affected: the first scrub will detect some errors, but the second scrub will detect more. Reviewed by: Andy Stormont <astormont@racktopsystems.com> Reviewed by: Matt Ahrens <mahrens@delphix.com> Reviewed by: George Wilson <george.wilson@delphix.com> Approved by: Richard Lowe <richlowe@richlowe.net> MFC after: 1 week Sponsored by: Spectra Logic Corp	2017-09-20 16:31:00 +00:00
Andriy Gapon	aacd0b4bb2	add vfs_zfs.abd_chunk_size tunable It is reported that the default value of 4KB results in a substantial memory use overhead (at least, on some configurations). Using 1KB seems to reduce the overhead significantly. PR: 222377 Reported by: Sean Chittenden <sean@chittenden.org> MFC after: 1 week	2017-09-20 08:36:31 +00:00
Andriy Gapon	3d5487d981	fix memory leak in g_bio zone introduced in r320452, another ABD fallout I overlooked the fact that that ZIO_IOCTL_PIPELINE does not include ZIO_STAGE_VDEV_IO_DONE stage. We do allocate a struct bio for an ioctl zio (a disk cache flush), but we never freed it. This change splits bio handling into two groups, one for normal read/write i/o that passes data around and, thus, needs the abd data tranform; the other group is for "data-less" i/o such as trim and cache flush. PR: 222288 Reported by: Dan Nelson <dnelson@allantgroup.com> Tested by: Borja Marcos <borjam@sarenet.es> MFC after: 10 days	2017-09-20 08:27:21 +00:00
Andriy Gapon	8c9377cde7	MFV r323792: 8602 remove unused "dp_early_sync_tasks" field from "dsl_pool" structure illumos/illumos-gate@2bcb545854 `2bcb545854` https://www.illumos.org/issues/8602 When I landed the fix for 8558, I incorrectly added the "dp_early_sync_tasks" field to the "dsl_pool" structure. This field is used in DelphixOS, but not in illumos. It was incorrectly pulled into illumos, so this bug is to remove it from the structure. Reviewed by: Serapheim Dimitropoulos <serapheim@delphix.com> Reviewed by: Matthew Ahrens <mahrens@delphix.com> Approved by: Robert Mustacchi <rm@joyent.com> Author: Prakash Surya <prakash.surya@delphix.com> MFC after: 1 week	2017-09-20 07:26:52 +00:00
Alan Cox	e9bfbb02c5	In r288122, we changed vm_page_unwire() so that it returns a Boolean indicating whether the page's wire count transitioned to zero. Use that return value in zbuf_page_free() rather than checking the wire count. MFC after: 1 week	2017-09-20 04:59:52 +00:00
Alan Cox	2582d7a969	Sync with amd64/arm/arm64/i386/mips pmap change r288256: Exploit r288122 to address a cosmetic issue. Since PV chunk pages don't belong to a vm object, they can't be paged out. Since they can't be paged out, they are never enqueued in a paging queue. Nonetheless, passing PQ_INACTIVE to vm_page_unwire() creates the appearance that these pages are being enqueued in the inactive queue. As of r288122, we can avoid this false impression by passing PQ_NONE. MFC after: 1 week	2017-09-20 04:19:49 +00:00
Olivier Houchard	4583315a06	Define CPU_XSCALE_CORE3 when relevant. It was lost when cpuconf.h was deobirted.	2017-09-19 23:41:55 +00:00
Rick Macklem	0f29b8292d	Make the nfsrpc_layoutget() function a static. Make the NFSv4 pNFS client function nfsrpc_layoutget() a static, since it is only used in sys/fs/nfsclient/nfs_clrpcops.c. This prepares the code for future patches that add Flex File layout support.	2017-09-19 23:28:22 +00:00
David C Somayajulu	1faeac0fd5	Add sysctl "enable_minidump" to turn on/off automatic minidump retrieval MFC after:5 days	2017-09-19 23:26:27 +00:00
David C Somayajulu	9bc6b5ac37	Update minidump template for version 5.4.66 MFC after:5 days	2017-09-19 22:17:30 +00:00
Rick Macklem	2742a21091	Add a new function called nfsm_uiombuflist(), similar to nfsm_uiombuf(). This patch adds a new function called nfsm_uiombuflist(), which is similar to nfsm_uiombuf(), but doesn't not use the fields in struct nfsrv_descript. This new function will be used by the pNFS client for writing to mirrors using Flex Files layout. The function is not yet called anywhere. Also, get rid of #ifndef APPLE, which is ancient cruft left over from the Mac OSX port of the NFSv4 client.	2017-09-19 21:31:36 +00:00
Rick Macklem	b0932afacc	Simplify nfsrpc_layoutreturn() args. Simplify nfsrpc_layoutreturn() args. in preparation for the addition of Flex File layout support, since File layout uses a 0 length field. Flex Files does use a longer field, but that will be added in a subsequent commit.	2017-09-19 20:45:25 +00:00
Josh Paetzel	c77037f16f	Fix indentation for r323068 PR: 220170 Reported by: lidl MFC after: 3 days Pointyhat to: jpaetzel	2017-09-19 20:40:05 +00:00
Olivier Houchard	7ce16cd956	i81342 is little endian, not big endian.	2017-09-19 20:33:22 +00:00
Michael Tuexen	3ec509bcd3	Fix a warning. MFC after: 1 week	2017-09-19 20:24:13 +00:00
Rick Macklem	ab118d04be	Simplify nfsrpc_layoutcommit() args. Simplify nfsrpc_layoutcommit() args. in preparation for the addition of Flex File layout support, since it also uses a 0 length field.	2017-09-19 20:18:41 +00:00
Michael Tuexen	564a95f485	Avoid an overflow when computing the staleness. This issue was found by running libfuzz on the userland stack. MFC after: 1 week	2017-09-19 20:09:58 +00:00
Konstantin Belousov	3cabd93e26	Do not do torn writes to active LDTs. Care must be taken when updating the active LDT, since parallel threads might try to load a segment descriptor which is currently updated. Since the results are undefined, this cannot be ignored by claiming to be an application race. Reviewed by: jhb Sponsored by: The FreeBSD Foundation MFC after: 2 weeks Differential revision: https://reviews.freebsd.org/D12413	2017-09-19 17:57:04 +00:00
Konstantin Belousov	9770475ce7	Do not vrele() covered vnode under the mp mutex. If vrele() changes the hold count to zero, it needs to acquire the vnode lock. Sponsored by: The FreeBSD Foundation Discussed with: avg X-MFC with: r323578	2017-09-19 16:49:45 +00:00
Konstantin Belousov	5bf949377e	For unlinked files, do not msync(2) or sync on the vnode deactivation. One consequence of the patch is that msyncing unlinked file mappings no longer reduces the amount of the dirty memory in the system, but I do not think that there are users of msync(2) that utilize it for such side-effect. Reported and tested by: tjil PR: 222356 Reviewed by: alc Sponsored by: The FreeBSD Foundation MFC after: 2 weeks Differential revision: https://reviews.freebsd.org/D12411	2017-09-19 16:46:37 +00:00
Michael Tuexen	ad608f06ed	Remove a no longer used variable. Reported by: Felix Weinrank MFC after: 1 week	2017-09-19 15:00:19 +00:00
Sepherosa Ziehau	d74831eea2	hyperv/hn: Incease max supported MTU MFC after: 1 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12365	2017-09-19 06:46:00 +00:00
Sepherosa Ziehau	eb2fe04416	hyperv/hn: Fix MTU setting - Add size of an ethernet header to the value configured to NVS. This does not seem to have any effects if MTU is 1500, but fix hypervisor side's setting if MTU > 1500. - Override the MTU setting according to the view from the hypervisor side. MFC after: 1 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12352	2017-09-19 06:38:57 +00:00
Sepherosa Ziehau	642ec226bb	hyperv/hn: Apply VF's RSS setting Since in Azure SYN and SYN\|ACK go through the synthetic parts while the rest of the same TCP flow goes through the VF, apply VF's RSS settings to synthetic parts to have a consistent hash value/type for the same TCP flow. MFC after: 1 week Sponsored by: Microsoft Differential Revision: https://reviews.freebsd.org/D12333	2017-09-19 06:29:38 +00:00
John Baldwin	4fedff3ca1	Enable support for lookaside crypto operations by default. This permits ccr(4) to be used with the default firmware configuration file. Discussed with: np Sponsored by: Chelsio Communications	2017-09-18 23:50:34 +00:00
John Baldwin	4f45713ae2	Add UFS_LINK_MAX for the UFS-specific limit on link counts. ino64 expanded nlink_t to 64 bits, but the on-disk format for UFS is still limited to 16 bits. This is a nop currently but will matter if LINK_MAX is increased in the future. Reviewed by: kib Sponsored by: Chelsio Communications	2017-09-18 23:30:39 +00:00
Konstantin Belousov	5efe338f3d	Fix handling of the segment registers on i386. Suppose that userspace is executing with the non-standard segment descriptors. Then, until exception or interrupt handler executed SET_KERNEL_SEGS, kernel is still executing with user %ds, %es and %fs. If an interrupt occurs in this window, the interrupt handler is executed unsafely, relying on usability of the usermode registers. If the interrupt results in the context switch on return, the contamination of the kernel state spreads to the thread we switched to. As result, kernel data accesses might fault or, if only the base is changed, completely messed up. More, if the user segment was allocated in LDT, another thread might mark the descriptor as invalid before doreti code tried to reload them. In this case kernel panics. The issue exists for all exception entry points which use trap gate, and thus do not automatically disable interrupts on entry, and for lcall_handler. Fix is two-fold: first, we need to disable interrupts for all kernel entries, changing the IDT descriptor types from trap gate to interrupt gate. Interrupts are re-enabled not earlier than the kernel segments are loaded into the segment registers. Second, we only load the segment registers from the trap frame when returning to usermode. For the later, all interrupt return paths must happen through the doreti common code. There is no way to disable interrupts on call gate, which is the supposed mode of servicing for lcall $7,$0 syscalls. Change the LDT descriptor 0 into a code segment type and point it to the userspace trampoline which redirects the syscall to int $0x80. All the measures make the segment register handling similar to that of amd64. We do not apply amd64 optimizations of not reloading segment registers on return from the syscall. Reported by: Maxime Villard <max@m00nbsd.net> Tested by: pho (the non-lcall part) Reviewed by: jhb Sponsored by: The FreeBSD Foundation MFC after: 2 weeks Differential revision: https://reviews.freebsd.org/D12402	2017-09-18 20:22:42 +00:00
Ilya Bakulin	ffa317051b	Add kern.features flag for MMCCAM kern.features.mmcam will be present and equal to 1 if the kernel has been compiled with option MMCCAM. This will help sdio-related userland tools to fail-fast if running on the kernel without MMCCAM enabled. Approved by: imp (mentor) Differential Revision: https://reviews.freebsd.org/D12386	2017-09-18 20:17:08 +00:00
Cy Schubert	70394ab378	Don't use an apostrophe in a possesive pronoun. MFC after: 3 days	2017-09-18 19:16:41 +00:00
Ryan Libby	2e6418c0c5	linsysfs: quiet gcc -Wformat after r323692 Reviewed by: cem Sponsored by: Dell EMC Isilon	2017-09-18 19:09:40 +00:00
Scott Long	1ab9094c90	Hide a normal probe warning message under bootverbose, similar to atkbdc Sponsored by: Netflix	2017-09-18 18:42:28 +00:00
Conrad Meyer	4d367f2501	linsysfs(5): Fix two unrelated issues 1. Swap the order of device_get_ivars with device_get_devclass and devclass name validation. This bug was introduced in r323692. 2. Error check device_get_children and free the returned list. This bug was introduced in the original linsysfs commit. Reported by: Oleg V. Nauman <oleg AT theweb.org.ua>, hselasky (1); hselasky (2) Reviewed by: hselasky Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12407	2017-09-18 17:14:13 +00:00
Toomas Soome	3f247eab7a	loader: biosmem allocate heap just below 4GB The current biosmem code is walking bios smap entries and looking for smap entry just below 4GB line, if there is such entry, its base and size is set for heap base and size. Instead of entry base, we should use last HEAP_MIN (currently 64MB) bytes just below 4GB, to make maximum space for kernel and modules. The problem was revealed on ASUS B350M-A system board, an AMD Ryzen 3 1200 CPU memory map: SMAP type=01 base=0000000000000000 len=000000000009d400 attr=01 SMAP type=02 base=000000000009d400 len=0000000000002c00 attr=01 SMAP type=02 base=00000000000e0000 len=0000000000020000 attr=01 SMAP type=01 base=0000000000100000 len=0000000009c00000 attr=01 SMAP type=02 base=0000000009d00000 len=0000000000300000 attr=01 SMAP type=01 base=000000000a000000 len=00000000be69b000 attr=01 SMAP type=03 base=00000000c869b000 len=0000000000016000 attr=01 SMAP type=01 base=00000000c86b1000 len=00000000124e7000 attr=01 SMAP type=02 base=00000000dab98000 len=0000000000138000 attr=01 SMAP type=03 base=00000000dacd0000 len=0000000000008000 attr=01 SMAP type=01 base=00000000dacd8000 len=0000000000100000 attr=01 SMAP type=04 base=00000000dadd8000 len=00000000003b3000 attr=01 SMAP type=02 base=00000000db18b000 len=0000000000d42000 attr=01 SMAP type=01 base=00000000dbecd000 len=0000000002133000 attr=01 SMAP type=01 base=0000000100000000 len=000000011f380000 attr=01 SMAP type=02 base=00000000de000000 len=0000000002000000 attr=01 SMAP type=02 base=00000000f8000000 len=0000000004000000 attr=01 SMAP type=02 base=00000000fdf00000 len=0000000000100000 attr=01 SMAP type=02 base=00000000fea00000 len=0000000000010000 attr=01 SMAP type=02 base=00000000feb80000 len=0000000000082000 attr=01 SMAP type=02 base=00000000fec10000 len=0000000000001000 attr=01 SMAP type=02 base=00000000fec30000 len=0000000000001000 attr=01 SMAP type=02 base=00000000fed00000 len=0000000000001000 attr=01 SMAP type=02 base=00000000fed40000 len=0000000000005000 attr=01 SMAP type=02 base=00000000fed80000 len=0000000000010000 attr=01 SMAP type=02 base=00000000fedc2000 len=000000000000e000 attr=01 SMAP type=02 base=00000000fedd4000 len=0000000000002000 attr=01 SMAP type=02 base=00000000fee00000 len=0000000000100000 attr=01 SMAP type=02 base=00000000ff000000 len=0000000001000000 attr=01 Reviewed by: imp Differential Revision: https://reviews.freebsd.org/D12368	2017-09-18 15:17:01 +00:00
Hans Petter Selasky	e35e2f3ebe	Bump the __FreeBSD_version after recent LinuxKPI changes. MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-18 13:39:51 +00:00
Hans Petter Selasky	62bae5d421	The LinuxKPI atomics do not have acquire nor release semantics unless specified. Fix code to use READ_ONCE() and WRITE_ONCE() where appropriate. Suggested by: kib @ MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-18 13:37:14 +00:00
Hans Petter Selasky	1f7c7e1bec	Only wire pages in the LinuxKPI instead of holding and wiring them. This prevents the page daemon from regularly scanning the held pages. Suggested by: kib @ MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-18 13:23:59 +00:00
Hans Petter Selasky	c05238a681	Add support for shared memory functions to the LinuxKPI. Obtained from: kmacy @ MFC after: 1 week Sponsored by: Mellanox Technologies	2017-09-18 13:17:23 +00:00
Conrad Meyer	2d347b2ef8	linsysfs(5): Add support for recent libdrm Expose more information about PCI devices (and GPUs in particular) via linsysfs to libdrm. This allows unmodified modern 64-bit Linux libdrm to work, which allows modern Linux Mesa to work. The submitter reports that he tested the change with an Ubuntu 16.04 chroot + amdgpu from graphics/drm-next-kmod. PR: 222375 Submitted by: Greg V <greg AT unrelenting.technology>	2017-09-17 23:40:16 +00:00
Ian Lepore	d18915fadb	Give icee(4) a detach() method so it can be used as a module. Add a module makefile for it.	2017-09-17 22:58:13 +00:00
Conrad Meyer	c50df68a08	MCA: Expand AMD Thresholding support to cover all banks When it was added in r314636, AMD Thresholding was hardcoded to only bank 4 (Northbridge) for some reason. However, even on family 10h the MCAx_MISC register Valid/Present bits determine whether thresholding is supported on that bank. Expand thresholding support to monitor all monitorable banks. This simplifies some of the logic and makes it more consistent with our Intel CMCI support. Reviewed by: markj (earlier version) Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12321	2017-09-17 22:58:13 +00:00
Rick Macklem	ccf038250a	Fix bogus FREAD with NFSV4OPEN_ACCESSREAD. No functional change. The code in nfscl_doflayoutio() bogusly used FREAD instead of NFSV4OPEN_ACCESSREAD. Since both happen to be defined as "1", this worked and the patch doesn't result in a functional change. Found by inspection during development of Flex File Layout support. MFC after: 2 weeks	2017-09-17 22:18:01 +00:00
Justin Hibbits	d11e86549e	Don't use a non-zero argument for __builtin_frame_address __builtin_frame_address with a non-zero argument is unsafe and rejected by newer gcc. Since it doesn't seem to impact the stacktrace, don't bother with gymnastics to unwind to a different frame for starting. PR: kern/220118 MFC after: 2 weeks	2017-09-17 20:07:20 +00:00
Justin Hibbits	6b7530563b	Print the correct bitmask for the running Book-E CPU All the Book-E world is no longer e500v{1,2}. e500mc the 64-bit derivatives do not use the DOZE/NAP bits with MSR[WE], instead using the `wait' instruction to wait for interrupts, and SoC plane controls (via CCSR) for power management. MFC after: 1 week	2017-09-17 19:40:17 +00:00
Mark Johnston	b999e9c813	Implement mmu_page_init for AIM platforms. As of r323290 we cannot rely on the vm_page array being zero-initialized. Reported and tested by: andreast MFC after: 1 week	2017-09-17 15:40:12 +00:00
Michael Tuexen	72e23aba22	Fix an accounting bug and use sctp_timer_start to start a timer. MFC after: 1 week	2017-09-17 09:27:27 +00:00
Michael Tuexen	fe40f49bb3	Remove code not used on any platform currently supported. MFC after: 1 week	2017-09-16 21:26:06 +00:00
Alan Cox	ec371b57e8	Modify blst_leaf_alloc to take only the cursor argument. Modify blst_leaf_alloc to find allocations that cross the boundary between one leaf node and the next when those two leaves descend from the same meta node. Update the hint field for leaves so that it represents a bound on how large an allocation can begin in that leaf, where it currently represents a bound on how large an allocation can be found within the boundaries of the leaf. The first phase of blst_leaf_alloc currently shrinks sequences of consecutive 1-bits in mask until each has been shrunken by count-1 bits, so that any bits remaining show where an allocation can begin, or until all the bits have disappeared, in which case the allocation fails. This change amends that so that the high-order bit is copied, as if, when the last block was free, it was followed by an endless stream of free blocks. It also amends the early stopping condition, so that the shrinking of 1-sequences stops early when there are none, or there is only one unbounded one remaining. The search for the first set bit is unchanged, and the code path thereafter is mostly unchanged unless the first set bit is in a position that makes some of those copied sign bits matter. In that case, we look for a next leaf, and at what blocks it can provide, to see if a cross-boundary allocation is possible. The hint is updated on a successful allocation that clears the last bit, but it not updated on a failed allocation that leaves the last bit set. So, as long as the last block is free, the hint value for the leaf is large. As long as the last block is free, and there's a next leaf, a large allocation can begin here, perhaps. A stricter rule than this would mean that allocations and frees in one leaf could require hint updates to the preceding leaf, and this change seeks to leave the freeing code unmodified. Define BLIST_BMAP_MASK, and use it for bit masking in blst_leaf_free and blist_leaf_fill, as well as in blst_leaf_alloc. Correct a panic message in blst_leaf_free. Submitted by: Doug Moore <dougm@rice.edu> Reviewed by: markj (an earlier version) MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D11819	2017-09-16 18:12:15 +00:00
Ian Lepore	8dc710184a	Add a missing header file to SRCS to fix out-of-kernel builds. PR: 222354 Submitted by: eugen@ Pointy hat: ian@	2017-09-16 16:09:05 +00:00
Emmanuel Vadot	36dcd6a499	Allwinner usb phy: Rework resource allocation The usbphy node for allwinner have two kind of resources, one for the phy_ctrl and one per phy. Instead of blindy allocating resources, alloc the phy_ctrl and pmu ones separately. Also add a configuration struct for all different phy that hold the difference between them (number of phys, unknow needed register write etc ...). While here remove A83T code as upstream and FreeBSD dts don't have nodes for USB. This (plus 323640) re-enable OHCI on Pine64 on the bottom USB port. The top USB port is routed to the OHCI0/EHCI0 which is by default in OTG mode. While the phy code can handle the re-route to standard OHCI/EHCI we still need a driver for musb to probe and configure it in host mode. EHCI is still buggy on Pine64 (hang the board) so do not enable it for now. Tested On: Bananapi (A20), BananapiM2 (A31S), OrangePi One (H3) Pine64 (A64)	2017-09-16 15:58:20 +00:00
Emmanuel Vadot	489cba7d58	A64 CCUNG: Correct gate and reset for OHCI0/1 Reported by: jmcneill Pointy Hat: manu	2017-09-16 15:50:31 +00:00
Emmanuel Vadot	082f09757c	Allwinner: a10_gpio Fix panic on multiple lock r323392 introduce gpio_pin_get/gpio_pin_set for a10_gpio driver. When called via gpio method they must aquire the device lock while when they are called via gpio_pin_configure the lock is already aquire. Introduce a10_gpio_pin_{s,g}et_locked and call them in pin_gpio_configure instead. Tested On: BananaPi (A20) Reported by: Richard Puga richard@puga.net	2017-09-16 14:08:20 +00:00
Stephen Hurd	ab2e3f7958	Revert r323516 (iflib rollup) This was really too big of a commit even if everything worked, but there are multiple new issues introduced in the one huge commit, so it's not worth keeping this until it's fixed. I'll work on splitting this up into logical chunks and introduce them one at a time over the next week or two. Approved by: sbruno (mentor) Sponsored by: Limelight Networks	2017-09-16 02:41:38 +00:00
John Baldwin	91a65e2f6b	Avoid reusing the wrong buffer for a DDP AIO request. To optimize the case of ping-ponging between two buffers, the DDP code caches the last two buffers used keeping the pages wired and page pods stored in the NIC's RAM. If a new aio_read() request uses one of the same buffers, then the work of holding pages, etc. can be avoided. However, the starting virtual address of an aio buffer was not saved, only the page count, length, and initial page offset. Thus, an aio_read() request could match a different buffer in the address space. (Earlier during development vm_fault_hold_quick_pages() was always called and the vm_page_t values were compared, but that was eventually removed without being adequately replaced.) Fix by storing the starting virtual address and comparing that (along with other fields) to determine if a buffer can be reused. MFC after: 3 days Sponsored by: Chelsio Communications	2017-09-15 22:40:57 +00:00
Scott Long	7eed4c1853	Fix line wrap issues. Sponsored by: Netflix	2017-09-15 20:58:52 +00:00
Warner Losh	851063e16a	Allow multiple TRIMs to be done for nda Don't call cam_iosched_trim_done or cam_iosched_submit_trim for nda since its hardware can handle almost an arbitrary number of TRIMs and we don't have to be careful to only ever do one. Sponsored by: Netflix	2017-09-15 20:16:06 +00:00
Warner Losh	55c770b40a	Update comments on what the CAM_IOSCHED_FLAG_TRIM_ACTIVE means. It's intended only for those situations where the periph driver ones to limit the number of trims active to one and only one. Also update comments on associated functions. Sponsored by: Netflix	2017-09-15 20:15:55 +00:00
Landon J. Fuller	011e84e0a7	Add MIPS32/64 Rev2 CP0 intctl register definitions. Approved by: adrian (mentor) Differential Revision: https://reviews.freebsd.org/D12300	2017-09-15 19:56:21 +00:00
Ilya Bakulin	02c474b481	Miscellaneous fixes and improvements to MMCCAM stack * Demote the level of several debug messages to CAM_DEBUG_TRACE * Add detection for SDHC cards that can do 1.8V. No voltage switch sequence is issued yet; * Don't create a separate LUN for each SDIO function. We need just one to make pass(4) attach; * Remove obsolete mmc_sdio* files. SDIO functionality will be moved into the separate device that will manage a new sdio(4) bus; * Terminate probing if got no reply to CMD0; * Make bcm2835 SDHCI host controller driver compile with 'option MMCCAM'. Approved by: imp (mentor) Differential Revision: https://reviews.freebsd.org/D12109	2017-09-15 19:47:44 +00:00
Konstantin Belousov	bba52ecadd	Batch freeing of the pages in vm_object_page_remove() under the same free queue mutex lock owning session, same as it was done for the object termination in r323561. Reported and tested by: mjg Reviewed by: alc, markj Sponsored by: The FreeBSD Foundation MFC after: 1 week	2017-09-15 16:07:09 +00:00
Mark Johnston	e04223bf94	Include _bitset.h to get BITSET_DEFINE, used to define struct slabbits. MFC after: 1 week	2017-09-15 14:59:35 +00:00
Andriy Gapon	7103ac8ad6	gmirror: treat ENXIO as disk disconnect, not media error In theory, all data access errors mean that a member is out of sync at most. But they were treated as more serious errors to avoid the situation where a flaky disk gets repeatedly disconnected, re-synchronized, reconnected and then disconnected again. ENXIO is a special error that means that the member disk disappeared, so it should get the same handling as the GEOM orphaning event. There is a better chance that when the disk is reconnected, it will be a good member again. When ENXIO happens on a read we use the exisiting G_MIRROR_BUMP_SYNCID mechanism which means that the mirror's syncid is increased as soon as there is a write to the mirror. That's because no data has got out of sync yet, but the problematic memeber is disconnected, so the future write will make it stale. When ENXIO happens on a write we use a new G_MIRROR_BUMP_SYNCID_NOW mechanism which means that we update the mirror metadata as soon as possible because the problematic memeber is already behind. Reviewed by: markj, imp MFC after: 3 weeks Differential Revision: https://reviews.freebsd.org/D9463	2017-09-15 13:57:08 +00:00
Andrew Turner	ca289945b2	Add the ARMv8.3 ID register fields. These were found in the A-Profile exploration tools documentation: https://developer.arm.com/products/architecture/a-profile/exploration-tools Sponsored by: DARPA, AFRL	2017-09-15 12:57:34 +00:00
John Baldwin	2bd1e600e3	Fix some incorrect sysctl pointers for some error stats. The bad_session, sglist_error, and process_error sysctl nodes were returning the value of the pad_error node instead of the appropriate error counters. Sponsored by: Chelsio Communications	2017-09-14 21:06:08 +00:00
Gleb Smirnoff	584ab65a75	Fix locking in soisconnected(). When a newborn socket moves from incomplete queue to complete one, we need to obtain the listening socket lock after the child, which is a wrong order. The old code did that in potentially endless loop of mtx_trylock(). The new one does only one attempt of mtx_trylock(), and in case of failure references listening socket, unlocks child and locks everything in right order. In case if listening socket shuts down during that, just bail out. Reported & tested by: Jason Eggleston <jeggleston llnw.com> Reported & tested by: Jason Wolfe <jason llnw.com>	2017-09-14 18:05:54 +00:00
Andrew Turner	bcf2b954c3	Add support for handling undefined instructions in userspace and the kernel. We can register callbacks to perform the required operation on the saved registers before returning. This is initially used to work around a bug in old versions of QEMU that trigger such an exception when reading from an ID register when it should load z zero value. I expect this could be used with other exception types, e.g. to emulate special register access from userland. Sponsored by: DARPA, AFRL	2017-09-14 17:29:51 +00:00
Toomas Soome	8b448cf1d6	loader: biosmem.c cstyle cleanup No functional changes, just cleanup. Reviewed by: allanjude, imp Differential Revision: https://reviews.freebsd.org/D12370	2017-09-14 16:42:29 +00:00
Ed Maste	34cb0eb2ed	octeon sdk: initialize variable to quiet Clang warning Clang complains "variable 'dummy' is uninitialized when used here". Reported by: Clang MFC after: 1 week Sponsored by: The FreeBSD Foundation	2017-09-14 16:41:22 +00:00
Conrad Meyer	a64bf59c49	Add PNP metadata to a few drivers An eventual devd(8) or other component should be able to scan buses and automatically load drivers that match device ids described in this metadata. Reviewed by: imp Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12364	2017-09-14 15:34:45 +00:00
John Baldwin	8df419f2df	Add AT_EHDRFLAGS and AT_HWCAP on amd64. x86 has two separate (but identical) list of AT_* constants and the earlier commit to add AT_HWCAP only updated the i386 list.	2017-09-14 15:34:29 +00:00
John Baldwin	27efb0a242	Add a NT_ARM_VFP ELF core note to hold VFP registers for each thread. The core note matches the format and layout of NT_ARM_VFP on Linux. Debuggers use the AT_HWCAP flags to determine how many VFP registers are actually used and their format. Reviewed by: mmel (earlier version w/o gcore) MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12293	2017-09-14 15:07:48 +00:00
John Baldwin	ca2b367f5c	Export get/set_vfpcontext from machdep.c. Should have been part of the previous commit to add ptrace operations for VFP registers. MFC after: 1 month	2017-09-14 15:06:29 +00:00
John Baldwin	197e3ae5fc	Add ptrace operations to fetch and store VFP registers. Reviewed by: mmel, kib MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12294	2017-09-14 15:03:43 +00:00
John Baldwin	21994598e4	Only mess with VFP state on the CPU for curthread for get/set_vfpcontext. Future changes will use these functions to fetch and store VFP state for threads other than curthread. Reviewed by: andrew, stevek, Michal Meloun <meloun-miracle-cz> MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12292	2017-09-14 14:36:56 +00:00
John Baldwin	19e1bd0104	Add AT_HWCAP flags for VFP settings for FreeBSD/arm. These flags match the meaning and value of flags in Linux, though Linux has many more flags. Reviewed by: stevek, Michal Meloun <meloun-miracle-cz> (earlier version) MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12291	2017-09-14 14:30:43 +00:00
John Baldwin	c2f37b9245	Add AT_HWCAP and AT_EHDRFLAGS on all platforms. A new 'u_long sv_hwcap' field is added to 'struct sysentvec'. A process ABI can set this field to point to a value holding a mask of architecture-specific CPU feature flags. If an ABI does not wish to supply AT_HWCAP to processes the field can be left as NULL. The support code for AT_EHDRFLAGS was already present on all systems, just the #define was not present. This is a step towards unifying the AT_ constants across platforms. Reviewed by: kib MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D12290	2017-09-14 14:26:55 +00:00
Andriy Gapon	cbc785c293	dounmount: do not release the mount point's reference on the covered vnode As long as mnt_ref is not zero there can be a consumer that might try to access mnt_vnodecovered. For this reason the covered vnode must not be freed until mnt_ref goes to zero. So, move the release of the covered vnode to vfs_mount_destroy. Reviewed by: kib MFC after: 3 weeks Differential Revision: https://reviews.freebsd.org/D12329	2017-09-14 08:47:06 +00:00
Alexander Motin	83feae78cc	Add second entry to LUT on a link side in B2B mode. Each of two entries on a virtual side should have its counterpart on a peer's link side. MFC after: 1 week	2017-09-14 04:51:17 +00:00
Ryan Libby	4e51f184e6	gcc builds: reenable -Wstrict-overflow for kern.mk Reviewed by: emaste Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12284	2017-09-14 03:42:41 +00:00
Gleb Smirnoff	d37aa3ccce	Use soref() in sendfile(2) instead fhold() to reference a socket. The problem is that fdrop() requires syscall context, as it may enter sleep in some cases. The reason to use it in the original non-blocking sendfile implementation, was to avoid use of global ACCEPT_LOCK() on every I/O completion. Now in head sorele() no longer requires this lock.	2017-09-13 22:11:05 +00:00
Mark Johnston	2d54d4bb9f	Widen uk_pgoff, the slab header offset field. 16 bits is only wide enough for kegs with an item size of up to 64KB. At that size or larger, slab headers are typically offpage because the item size is a multiple of the page size, but there is no requirement that this be the case. We can widen the field without affecting the layout of struct uma_keg since the removal of uk_slabsize in r315077 left an adjacent hole. PR: 218911 MFC after: 2 weeks	2017-09-13 21:54:37 +00:00
Konstantin Belousov	e82e50e681	Remove inline specifier from vm_page_free_wakeup(), do not micro-manage compiler. Reviewed by: alc, markj Sponsored by: The FreeBSD Foundation MFC after: 1 week	2017-09-13 19:30:09 +00:00
Konstantin Belousov	2fcd1ff68f	Do not relock free queue mutex for each page, free whole terminating object' page queue under the single mutex lock. First, all pages on the queue are prepared for free by calls to vm_page_free_prep(), and pages which should not be returned to the physical allocator (e.g. wired or fictitious) are simply removed from the queue. On the second pass, vm_page_free_phys_pglist() inserts all pages from the queue without relocking the mutex. The change improves the object termination, e.g. on the process exit where large anonymous memory objects otherwise cause relocks the free queue mutex for each page. More, if several such processes are exiting or execing in parallel, the mutex was highly contended on the address space demolition. Diagnosed and tested by: mjg (previous version) Reviewed by: alc, markj Sponsored by: The FreeBSD Foundation MFC after: 1 week	2017-09-13 19:22:07 +00:00
Konstantin Belousov	540ac3b310	Split vm_page_free_toq() into two parts, preparation vm_page_free_prep() and insertion into the phys allocator free queues vm_page_free_phys(). Also provide a wrapper vm_page_free_phys_pglist() for batched free. Reviewed by: alc, markj Tested by: mjg (previous version) Sponsored by: The FreeBSD Foundation MFC after: 1 week	2017-09-13 19:11:52 +00:00
Konstantin Belousov	b9e8fb647e	Use existing tag name for the vm_object' memq. Reviewed by: alc, markj Sponsored by: The FreeBSD Foundation MFC after: 1 week	2017-09-13 19:03:59 +00:00
Navdeep Parhar	26cee56642	Retire the T3 iWARP and TOE drivers. This saves catch-up work when OFED or other kernel infrastructure changes. Note that this doesn't affect the base cxgb(4) NIC driver for T3 at all. MFC after: No MFC. Sponsored by: Chelsio Communications	2017-09-13 17:49:23 +00:00
Conrad Meyer	e5dc78af11	intpm(4): Decrease requested i/o port range width On some AMD FCH devices driven by intpm(4) (read: mine), the SMBus I/O port range is split in two and the low range is only 0x10 wide. intpm(4) does not access any registers above 0x0f, so there is no need for the wider range. Discussed with: avg Sponsored by: Dell EMC Isilon	2017-09-13 17:43:18 +00:00
Allan Jude	dbfcf648a3	Increase EFI boot file size frok 128k to 384k generate_fat.sh does the following: - create an 800kb zero-filled file - create an md device backed by this file - format the device fat12 - mount the filesystem - create the EFI ESP directory structure - create the EFI boot file (BOOTx64 for amd64, BOOTaa64 for aarch64, etc) - Adds a marker to the beginning of the file, and pad it to 384kb - 384kb was chosen as it is less than half of 800kb, thus allowing users to keep a backup of their older boot file in the small partition - Unmount the filesystem - Scan the image and find the offset where the marker was inserted - The process requires root, to make image generation easier, images for each architecture are pregenerated, compressed with xz, and checked into svn. The Makefile that generates boot1.efifat does the following: - Ensure the compiled boot1.efi file is no larger than the generated image - Decompress the template created by generate-fat.sh - dd the contents of boot1.efi into boot1.efifat starting at the offset where the marker is found. This allows any file less than the maximum size to be written into the fat filesystem without having to mount it, so no root privileges are required. Later work by imp and myself makes bsdinstall create a 200mb fat16 instead of using this process, but it is retained to make image generation easier. Submitted by: Eric McCorkle (original version) Reviewed by: emaste, tsoome, Eric McCorkle MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D9680	2017-09-13 17:00:02 +00:00
Ian Lepore	1e4042d44e	Defer attaching and probing iicbus and its children until interrupts are available, in i2c controller drivers that require interrupts for transfers. This is the result of auditing all 22 existing drivers that attach iicbus. These drivers were the only ones remaining that require interrupts and were not using config_intrhook to defer attachment. That has led, over the years, to various i2c slave device drivers needing to use config_intrhook themselves rather than performing bus transactions in their probe() and attach() methods, just in case they were attached too early.	2017-09-13 16:54:27 +00:00
Gleb Smirnoff	100db364eb	Fix two issues with not ready data in sockets (read: sendfile) in UNIX sockets. o Check that socket is still connected in uipc_ready(). If not we are responsible to free mbufs. o In uipc_send() if socket appears to be disconnected, but we are sending data with pending I/Os, don't free mbufs. Reported by: Kevin Bowling <kbowling llnw.com> Tested by: Kevin Bowling <kbowling llnw.com> PR: 222259 Reported by: Mark Martinec <Mark.Martinec ijs.si> MFC after: 3 days	2017-09-13 16:47:23 +00:00
Conrad Meyer	02e015aa38	intpm(4): While here, remove redundant 'res' check Reported by: avg Sponsored by: Dell EMC Isilon	2017-09-13 16:43:31 +00:00
Gordon Tetlow	4572fb3faf	Deorbit catman. The tradeoff of disk for performance has long since tipped in favor of just rendering the manpage instead of relying on pre-formatted catpages. Note, this does not impede the ability to use existing catpages, it just removes the utility to generate them. Reviewed by: imp, allanjude Approved by: emaste (mentor) Differential Revision: https://reviews.freebsd.org/D12317	2017-09-13 16:35:16 +00:00
Conrad Meyer	54d89ef114	intpm(4): Do not attach if io_res can not be allocated Attempts to use the driver without an io_res result in immediate panic. Sponsored by: Dell EMC Isilon	2017-09-13 16:23:59 +00:00
Mark Johnston	2934eb8a22	Fix a logic error in the item size calculation for internal UMA zones. Kegs for internal zones always keep the slab header in the slab itself. Therefore, when determining the allocation size, we need to take the slab header size into account. Reported and tested by: ae, rakuco Reviewed by: avg MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D12342	2017-09-13 15:44:54 +00:00
Sean Bruno	19ebd288fb	Don't (try to) build lio(4) if the SOURCELESS_UCODE is set. Submitted by: Fabien Keil <fk@fabiankeil.de>	2017-09-13 15:17:35 +00:00
Toomas Soome	0a0c72ff93	libefi: efipart_realstrategy rsize pointer may be NULL Need to check rsize before dereferencing it.	2017-09-13 14:27:13 +00:00
Andriy Gapon	ceb1a4fb2d	jedec_ts: add many more devices from various vendors The new IDs are taken from the hardware to which I have access and from open datasheets. Also, the hardware probing is moved to the device probe method. Reviewed by: rpokala MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D11730	2017-09-13 13:03:29 +00:00
Ed Maste	eadaf05db0	qlnx: exclude if WITHOUT_SOURCELESS_UCODE set PR: 222277 Submitted by: Fabian Keil Obtained from: ElectroBSD MFC after: 1 week	2017-09-13 12:16:27 +00:00
Ilya Bakulin	a9bfc8d2ae	Add MMCCAM-enabled kernel config for IMX6, reduce debug noice in MMCCAM kernels CAM_DEBUG_TRACE results in way too much debug output than needed now. When debugging, it's always possible to turn on trace level using camcontrol. Approved by: imp (mentor) Differential Revision: https://reviews.freebsd.org/D12110	2017-09-13 10:56:02 +00:00
Andriy Gapon	86261a95ed	slightly simplify zfs_vptocnp It's not necessary to look up the parent's ID to check if the node is the root node of the filesystem. MFC after: 2 weeks	2017-09-13 07:09:58 +00:00
Navdeep Parhar	efeb46889f	cxgbe(4): Ignore capabilities that depend on TOE when the firmware reports TOE is not available. MFC after: 1 week Sponsored by: Chelsio Communications	2017-09-13 06:07:02 +00:00
Sean Bruno	68467b1206	Jenkins i386 LINT build uses NOTES to generate its LINT kernel config. ixl(4) isn't in here either, so I'll remove lio(4) too.	2017-09-13 03:56:03 +00:00
Stephen Hurd	ea4c57fe0c	Fix GCC build failure caused by r323516 No need to declare cold when we #include <sys/systm.h> Reported by: Jenkins Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12347	2017-09-13 02:44:50 +00:00
Stephen Hurd	d300df0182	Roll up iflib commits from github. This pulls in most of the work done by Matt Macy as well as other changes which he has accepted via pull request to his github repo at https://github.com/mattmacy/networking/ This should bring -CURRENT and the github repo into close enough sync to allow small feature branches rather than a large chain of interdependant patches being developed out of tree. The reset of the synchronization should be able to be completed on github by splitting the remaining changes that are not yet ready into short feature branches for later review as smaller commits. Here is a summary of changes included in this patch: 1) More checks when INVARIANTS are enabled for eariler problem detection 2) Group Task Queue cleanups - Fix use of duplicate shortdesc for gtaskqueue malloc type. Some interfaces such as memguard(9) use the short description to identify malloc types, so duplicates should be avoided. 3) Allow gtaskqueues to use ithreads in addition to taskqueues - In some cases, this can improve performance 4) Better logging when taskqgroup_attach*() fails to set interrupt affinity. 5) Do not start gtaskqueues until they're needed 6) Have mp_ring enqueue function enter the ABDICATED rather than BUSY state. This moves the TX to the gtaskq and allows processing to continue faster as well as make TX batching more likely. 7) Add an ift_txd_errata function to struct if_txrx. This allows drivers to inspect/modify mbufs before transmission. 8) Add a new IFLIB_NEED_ZERO_CSUM for drivers to indicate they need checksums zeroed for checksum offload to work. This avoids modifying packet data in the TX path when possible. 9) Use ithreads for iflib I/O instead of taskqueues 10) Clean up ioctl and support async ioctl functions 11) Prefetch two cachlines from each mbuf instead of one up to 128B. We often need to parse packet header info beyond 64B. 12) Fix potential memory corruption due to fence post error in bit_nclear() usage. 13) Improved hang detection and handling 14) If the packet is smaller than MTU, disable the TSO flags. This avoids extra packet parsing when not needed. 15) Move TCP header parsing inside the IS_TSO?() test. This avoids extra packet parsing when not needed. 16) Pass chains of mbufs that are not consumed by lro to if_input() rather call if_input() for each mbuf. 17) Re-arrange packet header loads to get as much work as possible done before a cache stall. 18) Lock the context when calling IFDI_ATTACH_PRE()/IFDI_ATTACH_POST()/ IFDI_DETACH(); 19) Attempt to distribute RX/TX tasks across cores more sensibly, especially when RX and TX share an interrupt. RX will attempt to take the first threads on a core, and TX will attempt to take successive threads. 20) Allow iflib_softirq_alloc_generic() to request affinity to the same cpus an interrupt has affinity with. This allows TX queues to ensure they are serviced by the socket the device is on. 21) Add new iflib sysctls to net.iflib: - timer_int - interval at which to run per-queue timers in ticks - force_busdma 22) Add new per-device iflib sysctls to dev.X.Y.iflib - rx_budget allows tuning the batch size on the RX path - watchdog_events Count of watchdog events seen since load 23) Fix error where netmap_rxq_init() could get called before IFDI_INIT() 24) e1000: Fixed version of r323008: post-cold sleep instead of DELAY when waiting for firmware - After interrupts are enabled, convert all waits to sleeps - Eliminates e1000 software/firmware synchronization busy waits after startup 25) e1000: Remove special case for budget=1 in em_txrx.c - Premature optimization which may actually be incorrect with multi-segment packets 26) e1000: Split out TX interrupt rather than share an interrupt for RX and TX. - Allows better performance by keeping RX and TX paths separate 27) e1000: Separate igb from em code where suitable Much easier to understand separate functions and "if (is_igb)" than previous tests like "if (reg_icr & (E1000_ICR_RXSEQ \| E1000_ICR_LSC))" #blamebruno Reviewed by: sbruno Approved by: sbruno (mentor) Sponsored by: Limelight Networks Differential Revision: https://reviews.freebsd.org/D12235	2017-09-13 01:18:42 +00:00
Matt Joras	fdbf11746a	Allow vlan interfaces to rx through netmap(4). Normally after receiving a packet, a vlan(4) interface sends the packet back through its parent interface's rx routine so that it can be processed as an untagged frame. It does this by using the parent's ifp->if_input. This is incompatible with netmap(4), which replaces the vlan(4) interface's if_input with a netmap(4) hook. Fix this by using the vlan(4) interface's ifp instead of the parent's directly. Reported by: Harry Schmalzbauer <freebsd@omnilan.de> Reviewed by: rstone Approved by: rstone (mentor) MFC after: 3 days Sponsored by: Dell EMC Isilon Differential Revision: https://reviews.freebsd.org/D12191	2017-09-13 00:25:09 +00:00
Sean Bruno	be17336036	Leave the Cavium Liquid IO driver exist in files, not files.amd64 Submitted by: imp	2017-09-12 23:58:38 +00:00
Warner Losh	d7fa1ab02d	cam iosched: Limit the quanta default to hz if it's below 200 The cam_iosched_ticker() can't be scheduled more than once per tick. Some limiters depend on quanta matching the number of calls per second to enforce the proper limits. Limit the quanta to no faster than 1 per clock tick. This fixes some features when running in VMs where the default HZ is 100. PR: 221953 Obtained from: ElectroBSD Differential Revision: https://reviews.freebsd.org/D12337 Submitted by: Fabian Keil	2017-09-12 23:46:33 +00:00
Sean Bruno	e460f3adbb	Do not try to build the Cavium Liquidio driver on all architechtures. For now, limit to amd64 only.	2017-09-12 23:42:52 +00:00
Sean Bruno	f173c2b77e	The diff is the initial submission of Cavium Liquidio 2350/2360 10/25G Intelligent NIC driver. The submission conconsists of firmware binary file and driver sources. Submitted by: pkanneganti@cavium.com (Prasad V Kanneganti) Relnotes: Yes Sponsored by: Cavium Networks Differential Revision: https://reviews.freebsd.org/D11927	2017-09-12 23:36:58 +00:00
Michael Tuexen	292efb1bc0	Export the UDP encapsualation port and the path state.	2017-09-12 21:08:50 +00:00
Alan Somers	71cd87c66c	Remove spaces from CTL devices' default serial numbers It's awkward to have spaces in CAM device serial numbers. That leads to such things as device nodes named "/dev/diskid/MYSERIAL%20%20%201". Better to replace the spaces with "0"s. This change only affects the default serial numbers for users who don't provide their own. Reviewed by: ken, mav MFC after: Never Relnotes: Yes Sponsored by: Spectra Logic Corp Differential Revision: https://reviews.freebsd.org/D12263	2017-09-12 19:36:24 +00:00
John Baldwin	b4e9a36bf7	Handle relocations for newer non-PIC MIPS ABI. Newer binutils supports extensions to the MIPS ABI for non-PIC code that is used when compiling O32 binaries with clang 5 (but not used for N64 oddly enough). These extensions require support for R_MIPS_COPY relocations as well as a second PLT GOT using R_MIPS_JUMP_SLOT relocations. For R_MIPS_COPY, use the same approach as on other architectures where fixups are deferred to the MD do_copy_relocations. The additional PLT GOT for jump slots is located in a .got.plt section which is identified by a DT_MIPS_PLTGOT dynamic entry. This GOT also requires fixups for the first two GOT entries just as the normal GOT. However, the entry point for this second GOT uses a different calling convention. Rather than passing an offset into the GOT, it passes an offset into the .rel.plt section. This requires a second entry point (_rtld_pltbind_start) which calls the normal _rtld_bind() rather than _mips_rtld_bind(). This also means providing a real version of reloc_jmpslot() which is used by _rtld_bind(). In addition, add real implementions of reloc_plt() and reloc_jmpslots() which walk .rel.plt handling R_MIPS_JUMP_SLOT relocations. Reviewed by: kib Sponsored by: DARPA / AFRL Differential Revision: https://reviews.freebsd.org/D12326	2017-09-12 17:46:30 +00:00
Toomas Soome	c7847a364a	libefi: efipart_open should check the status from disk_open In case of error from disk_open(), we should clean up properly. Reviewed by: allanjude, imp Differential Revision: https://reviews.freebsd.org/D12340	2017-09-12 14:18:45 +00:00
Toomas Soome	b40aaca6dd	loader should support large_dnode The zfsonlinux feature large_dnode is not yet supported by the loader. Reviewed by: avg, allanjude Differential Revision: https://reviews.freebsd.org/D12288	2017-09-12 13:45:04 +00:00
Michael Tuexen	e5cccc35c3	Add support to print the TCP stack being used. Sponsored by: Netflix, Inc.	2017-09-12 13:34:43 +00:00
Andriy Gapon	1a2ddb2997	fix a fallout from the ZTOV tightening, r323479 MFC after: 13 days X-MFC with: r323479	2017-09-12 13:21:14 +00:00
Olivier Houchard	69d14913fc	Some devices come with the same name as TI devices, so we can't rely on the "probe" method of those drivers to mean we're on e TI SoC. Introduce a new function, ti_soc_is_supported(), and use it to be sure we're really a TI system. PR: 222250	2017-09-12 10:43:02 +00:00
Andriy Gapon	bcab65cab5	zfsctl_snapdir_lookup should be able to handle an uncovered vnode The uncovered vnode is possible because there is no guarantee that its hold count would go to zero (and it would be inactivated and reclaimed) immediately after a covering filesystem is unmounted. So, such a vnode should be expected and it is possible to re-use it without any trouble. MFC after: 3 weeks Sponsored by: Panzura	2017-09-12 06:06:58 +00:00
Andriy Gapon	c09d0da8d1	zfs_ctldir: remove obsolete / bogus ARGSUSED lint directives None of the tagged functions had unused parameters. MFC after: 1 week	2017-09-12 06:05:30 +00:00
Andriy Gapon	65b38f7311	zfsvfs_hold: assert that the busied filesystem can not be unmounted This is a FreeBSD specific feature. MFC after: 3 weeks Sponsored by: Panzura	2017-09-12 06:04:50 +00:00
Andriy Gapon	d092f79489	zfs_get_vfs: reference a requested filesystem instead of vfs_busy-ing it The only consumer of zfs_get_vfs, zfs_unmount_snap, does not need the filesystem to be busy, it just need a reference that it can pass to dounmount. Also, previously the code was racy as it unbusied the filesystem before taking a reference on it. Now the code should be simpler and safer. MFC after: 2 weeks Sponsored by: Panzura	2017-09-12 06:04:01 +00:00

... 3 4 5 6 7 ...

118790 Commits