freebsd-skq

Author	SHA1	Message	Date
Kristof Provost	52b83a0618	pf: do not remove kifs that are referenced by rules Even if a kif doesn't have an ifp or if_group pointer we still can't delete it if it's referenced by a rule. In other words: we must check rulerefs as well. While we're here also teach pfi_kif_unref() not to remove kifs with flags. Reported-by: syzbot+b31d1d7e12c5d4d42f28@syzkaller.appspotmail.com MFC after: 2 weeks	2020-10-13 11:04:00 +00:00
Andrew Turner	ed50d40834	Bump __FreeBSD_version for the fix to arm64 write-only mappings Sponsored by: Innovate UK	2020-10-13 10:31:12 +00:00
Andrew Turner	f56a08c810	Fix write only mappings on arm64 When trapping on a write access to a buffer the kernel has mapped as write only we should only pass the VM_PROT_WRITE flag. Previously the call to vm_fault_trap failed as the VM_PROT_READ flag was unexpected. Reported by: manu Sponsored by: Innovate UK	2020-10-13 10:26:15 +00:00
Warner Losh	9257c69b1c	Turn off zstd on aarch64 loader support for zstd and zfs doesn't work for aarch64. Disable it to unbreak the build.	2020-10-13 02:36:16 +00:00
Warner Losh	2fec3ae896	Add zstd support to the boot loader. Add support to the _STANDALONE environment enough bits of the kernel that we can compile it. We still have a small zstd_shim.c since there were 3 items that were a bit hard to nail down and may be cleaned up in the future. These go hand in hand with a number of commits to sys/sys in the past weeks, should this need be MFCd. Discussed with: mmacy (in review and on IRC/Slack) Reviewed by: freqlabs (on openzfs repo) Differential Revision: https://reviews.freebsd.org/D26218	2020-10-12 22:19:07 +00:00
Warner Losh	e59db46854	newbus: use ssize_t to match sb's len and size, fix ordering of space check Both s_len and s_size are ssize_t, so their difference is also more properly a ssize_t not a size_t. Also, assert that len is <= size when we enter. This should always be the case. Ensure that we have that one byte that we write to the end of the buffer before we do so, though the error should already be set on the buffer if not, and the only times we supply 'partial' buffers they should be plenty large. Reviewed by: cem, jhb (prior version, I did cem's suggestion) Differential Revision: https://reviews.freebsd.org/D26752	2020-10-12 22:07:44 +00:00
Ian Lepore	c88da3b94d	Bug fixes for the ads111x driver... make configurable gain and sample rate hints work on per-channel basis as documented, rather than chip-wide. Also, when configured via hints, return BUS_PROBE_NOWILDCARD on successful hints match, so that the hints don't bogusly match other types of i2c chips.	2020-10-12 18:02:51 +00:00
Kristof Provost	c9449e4fb8	pf: create a kif for flags If userspace tries to set flags (e.g. 'set skip on <ifspec>') and <ifspec> doesn't exist we should create a kif so that we apply the flags when the <ifspec> does turn up. Otherwise we'd end up in surprising situations where the rules say the interface should be skipped, but it's not until the rules get re-applied. Reviewed by: Lutz Donnerhacke <lutz_donnerhacke.de> MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D26742	2020-10-12 12:39:37 +00:00
Warner Losh	af928ad562	systm.h: forward declare ucred for _STANDALONE too There's a number of types we forward declare for the kernel. We need struct ucred for the ZSTD ZFS integration, so go ahead and forward declare it here too.	2020-10-12 05:56:29 +00:00
Emmanuel Vadot	7113afc84c	10Gigabit Ethernet driver for AMD SoC This patch has the driver for 10Gigabit Ethernet controller in AMD SoC. This driver is written compatible to the Iflib framework. The existing driver is for the old version of hardware. The submitted driver here is for the recent versions of the hardware where the Ethernet controller is PCI-E based. Submitted by: Rajesh Kumar <rajesh1.kumar@amd.com> MFC after: 1 month Relnotes: yes Differential Revision: https://reviews.freebsd.org/D25793	2020-10-11 16:01:16 +00:00
Bjoern A. Zeeb	506512b170	ip_mroute: fix the viftable export sysctl It seems that in r354857 I got more than one thing wrong. Convert the SYSCTL_OPAQUE to a SYSCTL_PROC to properly export the per-vnet viftable array, which these days is allocated and no longer static. This fixes a problem with netstat -g which would show bogus information for the IPv4 Virtual Interface Table. PR: 246626 Reported by: Ozkan KIRIK (ozkan.kirik gmail.com) MFC after: 3 days	2020-10-11 00:01:00 +00:00
Conrad Meyer	f8e8a06d23	random(4) FenestrasX: Push root seed version to arc4random(3) Push the root seed version to userspace through the VDSO page, if the RANDOM_FENESTRASX algorithm is enabled. Otherwise, there is no functional change. The mechanism can be disabled with debug.fxrng_vdso_enable=0. arc4random(3) obtains a pointer to the root seed version published by the kernel in the shared page at allocation time. Like arc4random(9), it maintains its own per-process copy of the seed version corresponding to the root seed version at the time it last rekeyed. On read requests, the process seed version is compared with the version published in the shared page; if they do not match, arc4random(3) reseeds from the kernel before providing generated output. This change does not implement the FenestrasX concept of PCPU userspace generators seeded from a per-process base generator. That change is left for future discussion/work. Reviewed by: kib (previous version) Approved by: csprng (me -- only touching FXRNG here) Differential Revision: https://reviews.freebsd.org/D22839	2020-10-10 21:52:00 +00:00
Conrad Meyer	10b1a17594	arc4random(9): Integrate with RANDOM_FENESTRASX push-reseed There is no functional change for the existing Fortuna random(4) implementation, which remains the default in GENERIC. In the FenestrasX model, when the root CSPRNG is reseeded from pools due to an (infrequent) timer, child CSPRNGs can cheaply detect this condition and reseed. To do so, they just need to track an additional 64-bit value in the associated state, and compare it against the root seed version (generation) on random reads. This revision integrates arc4random(9) into that model without substantially changing the design or implementation of arc4random(9). The motivation is that arc4random(9) is immediately reseeded when the backing random(4) implementation has additional entropy. This is arguably most important during boot, when fenestrasX is reseeding at 1, 3, 9, 27, etc., second intervals. Today, arc4random(9) has a hardcoded 300 second reseed window. Without this mechanism, if arc4random(9) gets weak entropy during initial seed (and arc4random(9) is used early in boot, so this is quite possible), it may continue to emit poorly seeded output for 5 minutes. The FenestrasX push-reseed scheme corrects consumers, like arc4random(9), as soon as possible. Reviewed by: markm Approved by: csprng (markm) Differential Revision: https://reviews.freebsd.org/D22838	2020-10-10 21:48:06 +00:00
Conrad Meyer	a3c41f8bfb	Add "Fenestras X" alternative /dev/random implementation Fortuna remains the default; no functional change to GENERIC. Big picture: - Scalable entropy generation with per-CPU, buffered local generators. - "Push" system for reseeding child generators when root PRNG is reseeded. (Design can be extended to arc4random(9) and userspace generators.) - Similar entropy pooling system to Fortuna, but starts with a single pool to quickly bootstrap as much entropy as possible early on. - Reseeding from pooled entropy based on time schedule. The time interval starts small and grows exponentially until reaching a cap. Again, the goal is to have the RNG state depend on as much entropy as possible quickly, but still periodically incorporate new entropy for the same reasons as Fortuna. Notable design choices in this implementation that differ from those specified in the whitepaper: - Blake2B instead of SHA-2 512 for entropy pooling - Chacha20 instead of AES-CTR DRBG - Initial seeding. We support more platforms and not all of them use loader(8). So we have to grab the initial entropy sources in kernel mode instead, as much as possible. Fortuna didn't have any mechanism for this aside from the special case of loader-provided previous-boot entropy, so most of these sources remain TODO after this commit. Reviewed by: markm Approved by: csprng (markm) Differential Revision: https://reviews.freebsd.org/D22837	2020-10-10 21:45:59 +00:00
Emmanuel Vadot	e63faa9ba8	arm: Check dtb version against the one we're expecting to find Reviewed by: imp, emaste, mmel Differential Revision: https://reviews.freebsd.org/D26725	2020-10-10 07:20:59 +00:00
Emmanuel Vadot	4a63c1c1ed	Brand our DTS with the Linux version it was imported from DTS must be synced with the kernel, add a freebsd,dts-version string in the root node of each DTS that we compile so that the kernel can later check that it contains a correct value. Reviewed by: imp, mmel Differential Revision: https://reviews.freebsd.org/D26724	2020-10-10 07:18:51 +00:00
Mateusz Guzik	a4bcd20486	zfs: use cache_rename	2020-10-10 04:18:49 +00:00
Mateusz Guzik	dd28b379cb	vfs: support lockless dirfd lookups	2020-10-10 03:48:17 +00:00
Ed Maste	49d48f45c8	modules/crypto: reenable assembly optimized skein implementation r366344 corrected the optimized amd64 skein assembly implementation, so we can now enable it again. Also add a dependency on this Makefile for the skein_block object, so that it will be rebuilt (similar to r366362). PR: 248221 Sponsored by: The FreeBSD Foundation	2020-10-10 01:13:14 +00:00
Bryan Drewery	c2c6fb90e0	Use unlocked page lookup for inmem() to avoid object lock contention Reviewed By: kib, markj Submitted by: mlaier Sponsored by: Dell EMC Differential Revision: https://reviews.freebsd.org/D26653	2020-10-09 23:49:42 +00:00
Navdeep Parhar	31deb3cc76	cxgbe(4): More fixes for the T6 FCS error counter. r365732 was the first attempt to get an accurate count but it was writing to some read-only registers to clear them and that obviously didn't work. Instead, note the counter's value when it is supposed to be cleared and subtract it from future readings. dev.<port>.stats.rx_fcs_error should not be serviced from the MPS register for T6. The stats.* sysctls should all use T5_PORT_REG for T5 and above. This must have been missed in the initial T5 support years ago. Fix it while here. MFC after: 3 days Sponsored by: Chelsio Communications	2020-10-09 22:23:39 +00:00
Mateusz Guzik	deb1339f3f	vfs: fix a panic when truncating coming from copy_file_range Truncating requires an exclusive lock, but it was not taken if the filesystem indicates support for shared writes. This only concerns ZFS. In particular fixes cp of files which have trailing holes. Reported by: bdrewery	2020-10-09 20:31:42 +00:00
John Baldwin	7e8bd70cff	Don't invoke semunload() if seminit() fails during MOD_LOAD. The module handler code invokes a MOD_UNLOAD event immediately if MOD_LOAD fails. The result was that if seminit() failed, semunload() was invoked twice. semunload() is not idempotent however and would try to remove its process_exit eventhandler twice resulting in a panic. Reviewed by: kib, markj Obtained from: CheriBSD MFC after: 1 month Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26696	2020-10-09 20:20:42 +00:00
Mateusz Guzik	eb88fed446	cache: fix vexec panic when racing against vgone Use of dead_vnodeops would result in a panic instead of returning the intended EOPNOTSUPP error. While here make sure to abort, not just try to return a partial result. The former allows the regular lookup to restart from scratch, while the latter makes it stuck with an unusable vnode. Reported by: kevans	2020-10-09 19:10:00 +00:00
Mitchell Horne	eff4c46e28	RISC-V LINT kernel config Create the RISC-V NOTES and LINT files. As of r366559, LINT configs are no longer generated but checked in to the tree. Reviewed by: imp Differential Revision: https://reviews.freebsd.org/D26502	2020-10-09 14:45:41 +00:00
Richard Scheffenegger	bfabdade5c	Add DSCP support for network QoS to iscsi initiator. Allow the DSCP codepoint also to be configurable for the traffic in the direction from the initiator to the target, such that writes and any requests are also treated in the appropriate QoS class. Reviewed by: mav MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D26714	2020-10-09 14:33:09 +00:00
Ruslan Bukin	72479f6bf9	Add iommu_get_dev_ctx() helper that allows to instantiate an iommu context for a given device_t. Submitted by: andrew Reviewed by: kib Sponsored by: DARPA, AFRL	2020-10-09 13:11:14 +00:00
Richard Scheffenegger	4b72ae16ed	Stop sending tiny new data segments during SACK recovery Consider the currently in-use TCP options when calculating the amount of new data to be injected during SACK loss recovery. That addresses the effect that very small (new) segments could be injected on partial ACKs while still performing a SACK loss recovery. Reported by: Liang Tian Reviewed by: tuexen, chengc_netapp.com MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D26446	2020-10-09 12:44:56 +00:00
Richard Scheffenegger	868aabb470	Add IP(V6)_VLAN_PCP to set 802.1 priority per-flow. This adds a new IP_PROTO / IPV6_PROTO setsockopt (getsockopt) option IP(V6)_VLAN_PCP, which can be set to -1 (interface default), or explicitly to any priority between 0 and 7. Note that for untagged traffic, explicitly adding a priority will insert a special 802.1Q vlan header with vlan ID = 0 to carry the priority setting. Reviewed by: gallatin, rrs MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D26409	2020-10-09 12:06:43 +00:00
Richard Scheffenegger	5432120028	Extend netstat to display TCP stack and detailed congestion state (2) Extend netstat to display TCP stack and detailed congestion state Adding the "-c" option used to show detailed per-connection congestion control state for TCP sessions. This is one summary patch, which adds the relevant variables into xtcpcb. As previous "spare" space is used, these changes are ABI compatible. Reviewed by: tuexen MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D26518	2020-10-09 10:55:19 +00:00
Jung-uk Kim	f15e9afb1f	MFV: r366539 Merge ACPICA 20200925.	2020-10-09 05:27:02 +00:00
Warner Losh	3ae49a035a	Remove now-unused files makeLINT.mk isn't needed or used anymore, remove it and all the files it uses. Reviewed by: kevans Differential Revision: https://reviews.freebsd.org/D26540	2020-10-09 01:48:21 +00:00
Warner Losh	7e46dafa58	Create in-tree LINT files Now that config(8) has supported include for 19 years, transition to including the NOTES files. include support didn't exist at the time, nor did the envvar stuff recently added. Now that it does, eliminate the building of LINT files by just including everything you need. Note: This may cause conflicts with updating in some cases. find sys -name LINT\* -rm is suggested across this commit to remove the generated LINT files. Reviewed by: kevans Differential Revision: https://reviews.freebsd.org/D26540	2020-10-09 01:48:14 +00:00
Rick Macklem	19fe23fa2b	Make vn_generic_copy_file_range() interruptible via a signal. Without this patch, when vn_generic_copy_file_range() is doing a large copy, it will remain in the function for a considerable amount of time, delaying handling of any outstanding signals until the copy completes. This patch adds checks for signals that need to be processed after each successful data copy cycle. When sig_intr() returns non-zero, vn_generic_copy_file_range() will return. The check "if (len < savlen)" ensures that some data has been copied, so that progress will be made. Note that, since copy_file_range(2) is allowed to return fewer bytes copied than requested, it will never return EINTR/ERESTART when sig_intr() returns non-zero. Reviewed by: kib, asomers Differential Revision: https://reviews.freebsd.org/D26620	2020-10-09 01:04:28 +00:00
Konstantin Belousov	42f96162c3	vm_page_dump_index_to_pa(): Add braces to the expression involving + and &. The precedence of the '&' operator is lower than that of '+'. Added braces do change the order of evaluation into the natural one, in my opinion. On the other hand, the value of the expression should not change since all elements should have page-aligned values. This fixes a gcc warning reported. Reported by: adrian Sponsored by: The FreeBSD Foundation MFC after: 1 week	2020-10-08 22:46:15 +00:00
Konstantin Belousov	e1ef4c29a3	Do not leak B_BARRIER. Normally when a buffer with B_BARRIER is written, the flag is cleared by g_vfs_strategy() when creating bio. But in some cases FFS buffer might not reach g_vfs_strategy(), for instance when copy-on-write reports an error like ENOSPC. In this case buffer is returned to dirty queue and might be written later by other means. Among them bdwrite() reasonably asserts that B_BARRIER is not set. In fact, the only current use of B_BARRIER is for lazy inode block initialization, where write of the new inode block is fenced against cylinder group write to mark inode as used. This could be seen as breaking the dependency by updating the cg without the inode having been written out. In practice, since CoW was not able to find space for a copy of the inode block, the cg block write should fail for the same reason. Reported by: pho Discussed with: chs, imp, mckusick Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D26511	2020-10-08 22:41:02 +00:00
Konstantin Belousov	203dda8a63	sig_intr(9): return early if AST is not scheduled. Check td_flags for relevant AST requests lock-less. This opens the race slightly wider where sig_intr() returns false negative, but it might be worth it. Requested by: mjg Sponsored by: The FreeBSD Foundation MFC after: 1 week	2020-10-08 22:34:34 +00:00
Konstantin Belousov	4ea4966009	Do not allow to use O_BENEATH as an oracle. Specifically, if lookup() returned any error and the topping directory was not latched, which means that (non-existent) path did not return to the topping location, give ENOTCAPABLE a priority over the lookup() error. PR: 249960 Reviewed by: emaste, ngie Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D26695	2020-10-08 22:31:11 +00:00
Alfredo Dal'Ava Junior	9470013f5a	[PowerPC] add machdep.uprintf_signal sysctl Add support for sysctl 'machdep.uprintf_signal' that prints debugging information on trap signal. Reviewed by: jhibbits, luporl, bdragon Sponsored by: Eldorado Research Institute (eldorado.org.br) Differential Revision: https://reviews.freebsd.org/D26004	2020-10-08 22:00:31 +00:00
Warner Losh	8e82f10172	timer_restore is now unused, remove it apm was the only consumer of timer_restore. Now that it's gone, this can be removed.	2020-10-08 20:56:11 +00:00
Warner Losh	8c576a279e	Remove APM BIOS support APM BIOS was relevant only to early laptops (approximately P166 or P200 and slower). These have not been relevant for a long time, and this code has been untested for a long time (as far as I can tell). The APM compat code in ACPI and the apm(8) command is not being retired. Both of these items are still in use (apm(8) is more scriptable than the replacement acpiconf, for the most part). This has been commented out of i386 GENERIC since 2002. This code is not relevant to any other port. Discussed on: arch@	2020-10-08 20:56:06 +00:00
Warner Losh	28942db891	Remove apm screen saver. APM BIOS support is about to be removed. Remove the apm screen saver and its module. They are about to be irrelevant.	2020-10-08 20:56:00 +00:00
Warner Losh	1a6947fb00	Remove apm module The apm code is about to be removed. Remove the module since it's about to be useless.	2020-10-08 20:55:55 +00:00
Mitchell Horne	841dad02e9	Fix a loop condition The correct way to identify the end of the metadata is two adjacent entries set to zero/MODINFO_END. I made a typo and this was checking the first entry twice. Reported by: rpokala Sponsored by: NetApp, Inc. Sponsored by: Klara, Inc.	2020-10-08 18:29:17 +00:00
Mitchell Horne	22e6a67086	Add a routine to dump boot metadata The boot metadata (also referred to as modinfo, or preload metadata) provides information about the size and location of the kernel, pre-loaded modules, and other metadata (e.g. the EFI framebuffer) to be consumed by the kernel during early boot. It is encoded as a series of type-length-value entries and is usually constructed by loader(8) and passed to the kernel. It is also faked on some architectures when booted by other means. Although much of the module information is available via kldstat(8), there is no easy way to debug the metadata in its entirety. Add some routines to parse this data and allow it to be printed to the console during early boot or output via a sysctl. Since the output can be lengthy, printing to the console is gated behind the debug.dump_modinfo_at_boot kenv variable as well as the BOOTVERBOSE flag. The sysctl to print the metadata is named debug.dump_modinfo. Reviewed by: tsoome Sponsored by: NetApp, Inc. Sponsored by: Klara, Inc. Differential Revision: https://reviews.freebsd.org/D26687	2020-10-08 18:02:05 +00:00
Hans Petter Selasky	c3a38d6c91	Try a bit harder to get the USB device descriptor in case the initial read fails. MFC after: 1 week Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-08 11:30:22 +00:00
Hans Petter Selasky	eccb214897	The ethernet header structure is read-only. Add const keyword. (This is a diff reduction towards D26254) MFC after: 1 week Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-08 11:25:19 +00:00
Edward Tomasz Napierala	5319fa1b3e	Remove yet another useless assignment, adding a KASSERT just in case. Reviewed by: kp Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26698	2020-10-08 11:04:32 +00:00
Hans Petter Selasky	d6dbbdea20	Allow evdev's rcpt_mask and sysmouse_t_axis parameters to be specified in /boot/loader.conf . MFC after: 1 week Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-08 10:59:49 +00:00
Navdeep Parhar	77af2b2c85	cxgbe(4): knobs to drop various kinds of undesirable frames on ingress. These kinds of drops come for free in the sense that they do not use the filter TCAM or any other resource that wouldn't normally be used during rx. Frames dropped by the hardware get counted in the MAC's rx stats but are not delivered to the driver. hw.cxgbe.attack_filter Set to 1 to enable the "attack filter". Default is 0. The attack filter will drop an incoming frame if any of these conditions is true: src ip/ip6 == dst ip/ip6; tcp and src/dst ip is not unicast; src/dst ip is loopback (127.x.y.z); src ip6 is not unicast; src/dst ip6 is loopback (::1/128) or unspecified (::/128); tcp and src/dst ip6 is mcast (ff00::/8). hw.cxgbe.drop_ip_fragments Set to 1 to drop all incoming IP fragments. Default is 0. Note that this drops valid frames. hw.cxgbe.drop_pkts_with_l2_errors Set to 1 to drop incoming frames with Layer 2 length or checksum errors. Default is 1. hw.cxgbe.drop_pkts_with_l3_errors Set to 1 to drop incoming frames with IP version, length, or checksum errors. Default is 0. hw.cxgbe.drop_pkts_with_l4_errors Set to 1 to drop incoming frames with Layer 4 length, checksum, or other errors. Default is 0. MFC after: 2 weeks Sponsored by: Chelsio Communications	2020-10-08 10:00:13 +00:00
Mitchell Horne	44c705cf15	Handle kmod local relocation failures gracefully It is possible for elf_reloc_local() to fail in the unlikely case of an unsupported relocation type. If this occurs, do not continue to process the file. Reviewed by: kib, markj (earlier version) MFC after: 1 week Sponsored by: NetApp, Inc. Sponsored by: Klara, Inc. Differential Revision: https://reviews.freebsd.org/D26701	2020-10-07 23:14:49 +00:00
Bjoern A. Zeeb	5152b4f74f	net80211: whitespace Fix indentation for the multi-line copies of ieee80211_add_channel_list_5ghz() for the 3 bands. MFC after: 1 week Sponsored by: The FreeBSD Foundation	2020-10-07 22:52:24 +00:00
Bjoern A. Zeeb	90707c4e44	LinuxKPI: add a bitfield.h implementation. This code was iteratively implemented during the work on various WiFi drivers -- from individual functions to macro-created implementations for the various bit sizes needed (and then extended to more for completeness). Some of the bit combinations do not seem to make sense so are left out. The __bf_shf(x) was obtained from D26681 [1]. Requested by: manu [1] Reviewed by: hselasky, manu MFC after: 1 week Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26708	2020-10-07 22:07:26 +00:00
Bjoern A. Zeeb	1e375f3aa7	80211: non-functional changes Sort a few VHT160 and 80+80 lines, update some comments, and remove a superfluous ','. No functional changes intended. MFC after: 1 week Sponsored by: The FreeBSD Foundation	2020-10-07 21:56:58 +00:00
Mitchell Horne	8481aab1ac	Print symbol index for unsupported relocation types It is unlikely, but possible, that an unrecognized or unsupported relocation type is encountered while trying to load a kernel module. If this occurs we should offer the symbol index as a hint to the user. While here, fix some small style issues. Reviewed by: markj, kib (amd64 part, in D26701) Sponsored by: NetApp, Inc. Sponsored by: Klara, Inc.	2020-10-07 18:48:10 +00:00
Hans Petter Selasky	194ddc011a	Properly cleanup driver during remove_one() in mlx5core. Cleanup all host resources, SYSCTLs, MSIX vectors and memory used by the host and only leave the device allocated memory behind, if any, because it may still be in use, when the PCI remove function is called. Else future probe calls may fail due to SYSCTLs already existing. MFC after: 1 week Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-07 17:46:49 +00:00
Michael Tuexen	e7a39b856a	Minor cleanups. MFC after: 3 days	2020-10-07 15:22:48 +00:00
Edward Tomasz Napierala	29c4e4b1af	Don't use critical section when calling intr_irq_handler() - that function enters critical section by itself anyway. Reviewed by: kp Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26694	2020-10-07 12:11:11 +00:00
Konstantin Belousov	cefdb89514	Fix typo. Sponsored by: Mellanox Technologies/NVIDIA Networking MFC after: 3 days	2020-10-07 10:58:56 +00:00
Warner Losh	bc683a89a3	Move kernel env global variables, etc to sys/kenv.h The kernel globals for kenv are confined to 2 files that need them and a few that likely shouldn't (but as written the code does). Move them from sys/systm.h to sys/kenv.h. This removed a XXX from systm.h and cleans it up a little bit...	2020-10-07 06:16:37 +00:00
Warner Losh	863f967f95	cam: Add quirk for Samsung MZ7* behind a SATA-to-SAS interposer Sometimes, this drive will be present in the system such that the firmware identification string doesn't start with ATA, such as when it's behind a SATA-to-SAS interposer. Add another quirk for that. Submitted by: github user mr44er Github PR: 423	2020-10-07 05:44:35 +00:00
Warner Losh	f8503fde31	nvme: Note where the CCB was released for passthrough command	2020-10-06 23:35:26 +00:00
Warner Losh	a1975719dd	cam: Assert we have a reference when freeing sim Before we decrement refcount to sleep on the sim, assert that the refcount >= 1. If it were 0 here, we'd never wake up.	2020-10-06 23:33:56 +00:00
Mitchell Horne	6debfd4b13	Remove unused function cpu_boot() The prototype was added with the creation of kern_shutdown.c in r17658, but it appears to have never been implemented. Remove it now. Reviewed by: cem, kib Differential Revision: https://reviews.freebsd.org/D26702	2020-10-06 23:16:56 +00:00
Pedro F. Giffuni	c2f0581e43	ext2fs: minor typo. Obtained from: Dragonfly MFC after: 3 days	2020-10-06 21:31:04 +00:00
Kristof Provost	4af1bd8157	bridge: call member interface ioctl() without NET_EPOCH We're not allowed to hold NET_EPOCH while sleeping, so when we call ioctl() handlers for member interfaces we cannot be in NET_EPOCH. We still need some protection of our CK_LISTs, so hold BRIDGE_LOCK instead. That requires changing BRIDGE_LOCK into a sleepable lock, and separating the BRIDGE_RT_LOCK, to protect bridge_rtnode lists. That lock is taken in the data path (while in NET_EPOCH), so it cannot be a sleepable lock. While here document the locking strategy. MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D26418	2020-10-06 19:19:56 +00:00
John Baldwin	f34702b76e	Don't permit DRM buffer mappings to be upgraded to executable. Reviewed by: kib MFC after: 1 month Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26697	2020-10-06 18:13:15 +00:00
John Baldwin	e0b155fe4a	Simplify swcr_authcompute() after removal of deprecated algorithms. - Just use sw->octx != NULL to handle the HMAC case when finalizing the MAC. - Explicitly zero the on-stack auth context. Reviewed by: markj Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D26688	2020-10-06 18:07:52 +00:00
John Baldwin	9aed26b906	Check if_capenable, not if_capabilities when enabling rate limiting. if_capabilities is a read-only mask of supported capabilities. if_capenable is a mask under administrative control via ifconfig(8). Reviewed by: gallatin Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D26690	2020-10-06 18:02:33 +00:00
John Baldwin	56fb710f1b	Store the send tag type in the common send tag header. Both cxgbe(4) and mlx5(4) wrapped the existing send tag header with their own identical headers that stored the type that the type-specific tag structures inherited from, so in practice it seems drivers need this in the tag anyway. This permits removing these extra header indirections (struct cxgbe_snd_tag and struct mlx5e_snd_tag). In addition, this permits driver-independent code to query the type of a tag, e.g. to know what type of tag is being queried via if_snd_query. Reviewed by: gallatin, hselasky, np, kib Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D26689	2020-10-06 17:58:56 +00:00
Michael Tuexen	6f155d690b	Reset delayed SACK state when restarting an SCTP association. MFC after: 3 days	2020-10-06 14:26:05 +00:00
Jessica Clarke	2152743f11	riscv: Remove outdated condition in page_fault_handler Since r366355 and r366284 we panic on access faults rather than treating them like page faults so this condition is never true. Reviewed by: jhb (mentor), markj, mhorne Approved by: jhb (mentor), markj, mhorne Differential Revision: https://reviews.freebsd.org/D26686	2020-10-06 13:03:31 +00:00
Jessica Clarke	105708ca1c	riscv: Handle supervisor instruction page faults We should never take instruction page faults when in the kernel, but by using the standard page fault code we should get a more-informative message about faulting on a NOFAULT page rather than branching to the default case here and printing an "Unknown kernel exception ..." message. Reviewed by: jhb (mentor), markj Approved by: jhb (mentor), markj Differential Revision: https://reviews.freebsd.org/D26685	2020-10-06 13:02:20 +00:00
Jessica Clarke	da8944d96d	riscv: De-Arm a few names These names were inherited from the arm64 port and should be changed to the RISC-V terminology. Reviewed by: jhb (mentor), kp, markj Approved by: jhb (mentor), kp, markj Differential Revision: https://reviews.freebsd.org/D26671	2020-10-06 12:56:29 +00:00
Michael Tuexen	b954d81662	Ensure variables are initialized before used. MFC after: 3 days	2020-10-06 11:29:08 +00:00
Michael Tuexen	6176f9d6df	Remove dead stores reported by clang static code analysis MFC after: 3 days	2020-10-06 11:08:52 +00:00
Michael Tuexen	11daa73adc	Cleanup, no functional change intended. MFC after: 3 days	2020-10-06 10:41:04 +00:00
Emmanuel Vadot	a113b1037f	linuxkpi: Add pagemap.h Add release_pages needed by drm which simply calls put_page for all the pages provided Reviewed by: bz Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26680	2020-10-06 10:41:00 +00:00
Emmanuel Vadot	b74986e7fa	linuxkpi: Add power_supply.h Add power_supply_is_system_supplied which is needed by drm. Reviewed by: bz Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26679	2020-10-06 10:39:40 +00:00
Emmanuel Vadot	49c85a33e5	linuxkpi: Add prefetch.h Only add prefetchw as it is the only function used by drm. Simply use __builtin_prefetch, which has been available in all compilers for a long time. Reviewed by: bz Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26677	2020-10-06 10:37:21 +00:00
Emmanuel Vadot	3ee75811a6	linuxkpi: Add numa.h Only contains NUMA_NO_NODE, needed by drm Reviewed by: bz Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26676	2020-10-06 10:36:16 +00:00
Emmanuel Vadot	2aa0ea94ea	linuxkpi: Add gcd function This computes the greatest common divisor. Taken from OpenBSD Reviewed by: bz, imp Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26674	2020-10-06 10:35:03 +00:00
Michael Tuexen	c8e55b3c0c	Whitespace changes. MFC after: 3 days	2020-10-06 09:51:40 +00:00
Navdeep Parhar	8741306b3b	cxgbe(4) sysctls do not need Giant. Sponsored by: Chelsio Communications	2020-10-05 22:18:04 +00:00
Ryan Moeller	92e17803cd	Enable iterating all sysctls, even ones with CTLFLAG_SKIP Add a "nextnoskip" sysctl that allows for listing of sysctls intended to be normally skipped for cost reasons. This makes it so the names/descriptions of those sysctls can be discovered with sysctl -aN/sysctl -ad/sysctl -at. It also makes it so children are visited when a node flagged with CTLFLAG_SKIP is explicitly requested. The intended use case is to mark the root "kstat" node with CTLFLAG_SKIP so that the extensive and expensive stats are skipped by default but may still be easily obtained without having to know them all (which may not even be possible) and request each one-by-one. Reviewed by: jhb MFC after: 2 weeks Relnotes: yes Sponsored by: iXsystems, Inc. Differential Revision: https://reviews.freebsd.org/D26560	2020-10-05 20:13:22 +00:00
Mark Johnston	ce3e137ca1	re(4): Add an 8168-compatible device ID This is described in RealTek's driver as a "RTL8168 Series add-on card." PR: 250037 Submitted by: Hiroshi HASEGAWA <hhase1973@gmail.com> MFC after: 1 week	2020-10-05 19:58:55 +00:00
Mateusz Guzik	4e2266100d	cache: fix pwd use-after-free in setting up fallback Since the code exits smr section prior to calling pwd_hold, the used pwd can be freed and a new one allocated with the same address, making the comparison erroneously true. Note it is very unlikely anyone ran into it.	2020-10-05 19:38:51 +00:00
Edward Tomasz Napierala	2622708419	Tweak arm64's cpu_fetch_syscall_args(). This should make it possible for the compiler to inline the memcpy(). Reviewed by: andrew Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26629	2020-10-05 18:46:14 +00:00
Edward Tomasz Napierala	f157761902	Drop useless assignment, and add a KASSERT to make sure it really was useless. Reviewed by: nick, jhb Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26649	2020-10-05 18:41:35 +00:00
Chuck Silvers	8b88330ed6	ufs: restore uniqueness of st_dev as returned by ufs_stat() switch ufs_stat() to use the same value for st_dev as was used by the previous ufs_getattr() stat path. Submitted by: gallatin Reviewed by: mjg, imp, kib, mckusick Sponsored by: Netflix Differential Revision: https://reviews.freebsd.org/D26596	2020-10-05 18:17:50 +00:00
Mark Johnston	780766eb52	Remove sysctl_kern_consmute() It is a trivial wrapper for sysctl_handle_int() since r184521. Also remove the NEEDGIANT flag, cn_mute is accessed locklessly. MFC after: 1 week	2020-10-05 15:54:19 +00:00
Ryan Moeller	3331a1d173	Explicit CTLFLAG_DYN not needed Dynamically created OIDs automatically get this flag set. Reviewed by: jhb MFC after: 1 week Sponsored by: iXsystems, Inc. Differential Revision: https://reviews.freebsd.org/D26561	2020-10-04 19:37:15 +00:00
Hans Petter Selasky	4c2dddd8a7	Populate the acquire context field of a ww_mutex in the LinuxKPI. Bump the FreeBSD version to force recompilation of external kernel modules. MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D26657 Submitted by: greg_unrelenting.technology (Greg V) Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-04 17:23:39 +00:00
Hans Petter Selasky	8853522919	Add support for Google Cr50 (GSC) Closed Case Debugging UART interfaces to the USB generic serial port driver, ugensa. MFC after: 1 week Differential Revision: https://reviews.freebsd.org/D21863 Submitted by: greg_unrelenting.technology (Greg V) Sponsored by: Mellanox Technologies // NVIDIA Networking	2020-10-04 17:17:16 +00:00
Konstantin Belousov	0400be45e9	Add sig_intr(9). It answers whether the thread would sleep according to the current state of signals and suspensions. Of course the answer is racy and allows for false negatives (no sleep when a signal is delivered after the process lock is dropped). Also the answer might change due to signal rescheduling among threads in a multi-threaded process. Still, it is the best approximation I can provide to answering the question of whether the thread was interrupted. Reviewed by: markj Tested by: pho, rmacklem Sponsored by: The FreeBSD Foundation MFC after: 2 weeks Differential revision: https://reviews.freebsd.org/D26628	2020-10-04 16:33:42 +00:00
Konstantin Belousov	0c82fb267b	Refactor sleepq_catch_signals(). - Extract suspension check into sig_ast_checksusp() helper. - Extract signal check and calculation of the interruption errno into sig_ast_needsigchk() helper. The helpers are moved to kern_sig.c which is the proper place for signal-related code. Improve control flow in sleepq_catch_signals(), to handle ret == 0 (can sleep) and ret != 0 (interrupted) only once, by separating checking code into sleepq_check_ast_sq_locked(), whose return value is interpreted at a single location. Reviewed by: markj Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 2 weeks Differential revision: https://reviews.freebsd.org/D26628	2020-10-04 16:30:05 +00:00
Michael Tuexen	9f2d6263bb	Use __func__ instead of __FUNCTION__ for consistency. MFC after: 3 days	2020-10-04 15:37:34 +00:00
Michael Tuexen	d0ed75b3b1	Cleanup, no functional change intended. MFC after: 3 days	2020-10-04 15:22:14 +00:00
Alexander V. Chernikov	1b95005e95	Fix route flags update during RTM_CHANGE. Nexthop lookup was not considering rt_flags when doing structure comparison, which led to an original nexthop selection when changing flags. Fix the case by adding rt_flags field into comparison and rearranging nhop_priv fields to allow for efficient matching. Fix `route change X/Y flags` case - recent changes disallowed specifying RTF_GATEWAY flag without actual gateway. It turns out, route(8) fills in RTF_GATEWAY by default, unless -interface flag is specified. Fix regression by clearing RTF_GATEWAY flag instead of failing. Fix route flag reporting in RTM_CHANGE messages by explicitly updating rtm_flags after operation completion. Add IPv4/IPv6 tests for flag-only route changes.	2020-10-04 13:24:58 +00:00
Konstantin Belousov	df01340989	amd64: Store full 64bit of FIP/FDP for 64bit processes when using XSAVE. If current process is 64bit, use rex-prefixed version of XSAVE (XSAVE64). If current process is 32bit and CPU supports saving segment registers cs/ds in the FPU save area, use non-prefixed variant of XSAVE. Reported and tested by: Michał Górny <mgorny@moritz.systems> PR: 250043 Reviewed by: emaste, markj Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D26643	2020-10-03 23:17:29 +00:00
Konstantin Belousov	9f2a3e3b0a	Fix pmap_pti_add_kva() call for doublefault stack page. After r354889 stack got struct nmi_pcpu at top, which makes IST top not page-aligned. Since pmap_pti_add_kva() truncates/rounds up addresses, it erroneously entered a page mapped before double fault stack into the pti page table. Sponsored by: The FreeBSD Foundation MFC after: 3 days	2020-10-03 23:11:20 +00:00
Konstantin Belousov	5e8ea68fd8	Move ctx_switch_xsave declaration to amd64 md_var.h. Sponsored by: The FreeBSD Foundation MFC after: 3 days	2020-10-03 23:07:09 +00:00
Alexander V. Chernikov	9c584fa4bc	Remove ROUTE_MPATH-related warnings introduced in r366390. Reported by: mjg	2020-10-03 14:37:54 +00:00
Emmanuel Vadot	04d672afa8	pwm_backlight: Add regnode_if.h to SRCS If the kernel config doesn't have this pseudo device it will not be generated and then the module will fail to compile. Reported by: mjg	2020-10-03 14:01:20 +00:00
Emmanuel Vadot	0d95c2e27a	pwm_backlight: Depend on ext_resources This driver cannot work without it.	2020-10-03 14:00:33 +00:00
Edward Tomasz Napierala	f726515758	Optimize riscv's cpu_fetch_syscall_args(), making it possible for the compiler to inline the memcpy. Reviewed by: arichardson, mhorne MFC after: 2 weeks Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26528	2020-10-03 13:01:07 +00:00
Edward Tomasz Napierala	4658877815	Move KTRUSERRET() from userret() to ast(). It's a really long detour - it writes ktrace entries to the filesystem - so the overhead of ast() won't make any difference. Reviewed by: kib Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26404	2020-10-03 12:03:08 +00:00
Alexander V. Chernikov	fedeb08b6a	Introduce scalable route multipath. This change is based on the nexthop objects landed in D24232. The change introduces the concept of nexthop groups. Each group contains the collection of nexthops with their relative weights and a dataplane-optimized structure to enable efficient nexthop selection. Similar to the nexthops, nexthop groups are immutable. Dataplane part gets compiled during group creation and is basically an array of nexthop pointers, compiled w.r.t their weights. With this change, `rt_nhop` field of `struct rtentry` contains either nexthop or nexthop group. They are distinguished by the presence of NHF_MULTIPATH flag. All dataplane lookup functions return pointer to the nexthop object, leaving nexthop groups details inside routing subsystem. User-visible changes: The change is intended to be backward-compatible: all non-mpath operations should work as before with ROUTE_MPATH and net.route.multipath=1. All routes now come with weight, default weight is 1, maximum is 2^24-1. Current maximum multipath group width is statically set to 64. This will become sysctl-tunable in the followup changes. Using functionality: * Recompile kernel with ROUTE_MPATH * set net.route.multipath to 1 route add -6 2001:db8::/32 2001:db8::2 -weight 10 route add -6 2001:db8::/32 2001:db8::3 -weight 20 netstat -6On Nexthop groups data Internet6: GrpIdx NhIdx Weight Slots Gateway Netif Refcnt 1 ------- ------- ------- --------------------------------------- --------- 1 13 10 1 2001:db8::2 vlan2 14 20 2 2001:db8::3 vlan2 Next steps: * Land outbound hashing for locally-originated routes ( D26523 ). * Fix net/bird multipath (net/frr seems to work fine) * Add ROUTE_MPATH to GENERIC * Set net.route.multipath=1 by default Tested by: olivier Reviewed by: glebius Relnotes: yes Differential Revision: https://reviews.freebsd.org/D26449	2020-10-03 10:47:17 +00:00
Vincenzo Maffione	adf41f0788	netmap: fix constness warnings generated by "-Wcast-qual" Submitted by: milosz.kaniewski@gmail.com MFC after: 3 days	2020-10-03 09:33:29 +00:00
Emmanuel Vadot	b48668250e	pwm_backlight: Fix 32 bits build Reported by: jenkins, mjg	2020-10-03 08:31:28 +00:00
Navdeep Parhar	73f6606b47	cxgbe(4): set up the firmware flowc for the tid before send_abort_rpl. MFC after: 3 days Sponsored by: Chelsio Communications	2020-10-02 23:48:57 +00:00
Emmanuel Vadot	90b8c0ea10	Fix LINT: Add backlight to NOTES	2020-10-02 20:52:09 +00:00
Emmanuel Vadot	4a84542103	pwm_backlight: Restrict module to armv7 and aarch64 Both powerpc64 and riscv use fdt but don't use EXT_RESOURCES. Reported by: jenkins	2020-10-02 19:56:54 +00:00
Mark Johnston	2913cc4637	vm_pageout: Avoid rounding down the inactive scan target With helper page daemon threads, enabled by default in r364786, we divide the inactive target by the number of threads, rounding down, and sum the total number of pages freed by the threads. This sum is compared with the original target, but by rounding down we might lose pages, causing the page daemon control loop to conclude that inactive queue scanning isn't keeping up with demand for free pages. Typically this results in excessive swapping. Fix the problem by accounting for the error in the main pagedaemon thread's target. Note that by default the problem will manifest only in systems with >16 CPUs in a NUMA domain. Reviewed by: cem Discussed with: dougm Reported and tested by: dhw, glebius Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26610	2020-10-02 19:16:06 +00:00
Mark Johnston	06d8bdcbf7	uma: Use the bucket cache for cross-domain allocations uma_zalloc_domain() allocates from the requested domain instead of following a first-touch policy (the default for most zones). Currently it is only used by malloc_domainset(), and consumers free returned items with free(9) since r363834. Previously uma_zalloc_domain() worked by always going to the keg for an item. As a result, the use of UMA zone caches was unbalanced: we free items to the caches, but always allocate from the keg, skipping the caches. Make some effort to allocate from the UMA caches when performing a cross-domain allocation. This avoids blowing up the caches when something is performing many transient allocations with malloc_domainset(). Reported and tested by: dhw, glebius Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26427	2020-10-02 19:04:29 +00:00
Mark Johnston	5afdf5c1ca	uma: Use LIFO for non-SMR bucket caches When SMR was introduced, zone_put_bucket() was changed to always place full buckets at the end of the queue. However, it is generally preferable to use recently used buckets since their items are more likely to be resident in cache. So, for buckets that have no constraint on item reuse, use a last-in-first-out ordering as we did before. Reviewed by: rlibby Tested by: dhw, glebius Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26426	2020-10-02 19:04:09 +00:00
Mark Johnston	952c8964ba	uma: Remove newlines from panic messages Sponsored by: The FreeBSD Foundation	2020-10-02 19:03:42 +00:00
Mark Johnston	c88285c54a	Fix the INVARIANTS build for 32-bit platforms Reported by: Jenkins MFC with: r366368	2020-10-02 18:54:37 +00:00
Emmanuel Vadot	1e145e73b8	Bump __FreeBSD_version after latest linuxkpi changes	2020-10-02 18:29:25 +00:00
Emmanuel Vadot	a91b408a36	linuxkpi: Add dmi_* functions The dmi functions are used to get smbios values. The DRM subsystem and drivers use them to enable (or not) quirks. Reviewed by: hselasky Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26046	2020-10-02 18:28:00 +00:00
Emmanuel Vadot	2b68c97337	linuxkpi: Add backlight support Add backlight function to linuxkpi. Graphics drivers expose the backlight of the panel directly so allow them to use the backlight subsystem so users can use backlight(8) to configure them. Reviewed by: hselasky Relnotes: yes Differential Revision: The FreeBSD Foundation	2020-10-02 18:26:41 +00:00
Emmanuel Vadot	38d94a4bc7	Add pwm_backlight Driver for pwm-backlight compatible device. Relnotes: yes Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26252	2020-10-02 18:23:27 +00:00
Emmanuel Vadot	675aae732d	Add backlight subsystem This is a simple subsystem that allows drivers to register as a backlight. Each backlight creates a device node under /dev/backlight/backlightX and an alias based on the name provided. Relnotes: yes Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26250	2020-10-02 18:18:01 +00:00
Mark Johnston	f31695cc64	Implement sparse core dumps Currently we allocate and map zero-filled anonymous pages when dumping core. This can result in lots of needless disk I/O and page allocations. This change tries to make the core dumper more clever and represent unbacked ranges of virtual memory by holes in the core dump file. Add a new page fault type, VM_FAULT_NOFILL, which causes vm_fault() to clean up and return an error when it would otherwise map a zero-filled page. Then, in the core dumper code, prefault all user pages and handle errors by simply extending the size of the core file. This also fixes a bug related to the fact that vn_io_fault1() does not attempt partial I/O in the face of errors from vm_fault_quick_hold_pages(): if a truncated file is mapped into a user process, an attempt to dump beyond the end of the file results in an error, but this means that valid pages immediately preceding the end of the file might not have been dumped either. The change reduces the core dump size of trivial programs by a factor of ten simply by excluding unaccessed libc.so pages. PR: 249067 Reviewed by: kib Tested by: pho MFC after: 1 month Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26590	2020-10-02 17:50:22 +00:00
Mark Johnston	fec41f0751	Simplify the check for non-dumpable VM object types OBJT_DEFAULT, _SWAP, _VNODE and _PHYS is exactly the set of non-fictitious object types, so just check for OBJ_FICTITIOUS. The check no longer excludes dead objects, but such objects have to be handled regardless. No functional change intended. Reviewed by: alc, dougm, kib Tested by: pho MFC after: 2 weeks Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26589	2020-10-02 17:49:13 +00:00
Nick O'Brien	3f59a7f97b	flash: Add support for SPI flash s25fl512s Reviewed by: kp Approved by: kp (mentor) Sponsored by: Axiado	2020-10-02 17:33:56 +00:00
Mateusz Guzik	aa34e791fa	cache: update the commentary for path parsing	2020-10-02 14:50:03 +00:00
Kristof Provost	75f022774f	riscv: handle access faults in user mode Access faults in user mode are treated like TLB misses, which leads to an endless loop of faults. It's less serious than the same fault in kernel mode, because we can just terminate the process, but that's not ideal. Treat user mode access faults as a bus error. Suggested by: jrtc27 Reviewed by: br, jhb Sponsored by: Axiado Differential Revision: https://reviews.freebsd.org/D26621	2020-10-02 07:30:11 +00:00
Navdeep Parhar	7676c62aa3	cxgbe(4): validate largest_rx_cluster and safest_rx_cluster. These tunables can only be set to a valid cluster size (2K, 4K, 9K, or 16K) as documented in the man page. Anything else could lead to a panic on interface up. Reported by: mav@ MFC after: 1 week Sponsored by: Chelsio Communications	2020-10-02 05:59:55 +00:00
Matt Macy	11322826a4	OpenZFS: don't call fpu_kern_thread on i386	2020-10-02 01:25:08 +00:00
Matt Macy	c40487d49b	OpenZFS: MFV 2.0-rc3-gfc5966 - Annotate FreeBSD sysctls with CTLFLAG_MPSAFE - Reduce stack usage of Lua - Don't save user FPU context in kernel threads - Add support for procfs_list - Code cleanup in zio_crypt - Add DB_RF_NOPREFETCH to dbuf_read()s in dnode.c - Drop references when skipping dmu_send due to EXDEV - Eliminate gratuitous bzeroing in dbuf_stats_hash_table_data - Fix legacy compat for platform IOCs	2020-10-01 23:28:21 +00:00
Mark Johnston	494955366a	Remove svn:executable from a couple of vmm(4) source files. MFC after: 3 days	2020-10-01 22:20:29 +00:00
Ed Maste	36972ee3e0	libmd: fix assembly optimized skein implementation The assembly implementation incorrectly used logical AND instead of bitwise AND. Fix, and re-enable in libmd. Submitted by: Yang Zhong <yzhong@freebsdfoundation.org> Reviewed by: cem (earlier) Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D26614	2020-10-01 21:05:50 +00:00
Bryan Drewery	9ceba22462	Revert r366340. CR wasn't finished and it breaks the build.	2020-10-01 20:08:27 +00:00
Bryan Drewery	2398cd1103	Use unlocked page lookup for inmem() to avoid object lock contention Reviewed By: kib, markj Sponsored by: Dell EMC Isilon Submitted by: mlaier Differential Revision: https://reviews.freebsd.org/D26597	2020-10-01 19:17:03 +00:00
Edward Tomasz Napierala	4c6f466cb4	Only clear TDP_NERRNO when needed, ie when it's previously been set. Reviewed by: kib Tested by: pho Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26612	2020-10-01 18:45:31 +00:00
Emmanuel Vadot	48c13e5270	ichsmb_pci: convert to pci_device_table / add PCI_PNP_INFO Submitted by: Greg V <greg@unrelenting.technology> Reviewed by: mav Differential Revision: https://reviews.freebsd.org/D25260	2020-10-01 16:55:01 +00:00
John Baldwin	a3f2a9c57e	Clear the upper 32-bits of registers in x86_emulate_cpuid(). Per the Intel manuals, CPUID is supposed to unconditionally zero the upper 32 bits of the involved (rax/rbx/rcx/rdx) registers. Previously, the emulation would cast pointers to the 64-bit register values down to `uint32_t`, which while properly manipulating the lower bits, would leave any garbage in the upper bits uncleared. While no existing guest OSes seem to stumble over this in practice, the bhyve emulation should match x86 expectations. This was discovered through alignment warnings emitted by gcc9, while testing it against SmartOS/bhyve. SmartOS bug: https://smartos.org/bugview/OS-8168 Submitted by: Patrick Mooney Reviewed by: rgrimes Differential Revision: https://reviews.freebsd.org/D24727	2020-10-01 16:45:11 +00:00
Kristof Provost	57712c0b76	riscv: Add memmmap so we can mmap /dev/mem Reviewed by: mhorne Sponsored by: Axiado Differential Revision: https://reviews.freebsd.org/D26622	2020-10-01 15:04:55 +00:00
Ed Maste	92d47dce78	Add cd device to arm64 GENERIC Big-iron arm64 machines might have a CD, possibly provided by some IPMI emulation. Reported by: scottph	2020-10-01 13:29:29 +00:00
Michal Meloun	c19440e350	Fix the inverted condition in mtx_asserts. Mutex should be owned in affected functions. Reported by: emaste MFC after: 4 weeks MFC with: r366161	2020-10-01 09:50:08 +00:00
Mateusz Guzik	b5ab177a99	cache: properly report ENOTDIR on foo/bar lookups where foo is a file Reported by: fernape	2020-10-01 08:46:21 +00:00
Kyle Evans	7cc42f6d25	Do a sweep and remove most WARNS=6 settings Repeating the default WARNS here makes it slightly more difficult to experiment with default WARNS changes, e.g. if we did something absolutely bananas and introduced a WARNS=7 and wanted to try lifting the default to that. Drop most of them; there is one in the blake2 kernel module, but I suspect it should be dropped -- the default WARNS in the rest of the build doesn't currently apply to kernel modules, and I haven't put too much thought into whether it makes sense to make it so.	2020-10-01 01:10:51 +00:00
Rick Macklem	9f669985b2	Modify the NFSv4.2 VOP_COPY_FILE_RANGE() client call to return after one successful RPC. Without this patch, the NFSv4.2 VOP_COPY_FILE_RANGE() client call would loop until the copy "len" was completed. The problem with doing this is that it might take a considerable time to complete for a large "len". By returning after a single successful Copy RPC that copied some of the data, the application that did the copy_file_range(2) syscall will be more responsive to signal delivery for large "len" copies.	2020-10-01 00:47:35 +00:00
Rick Macklem	961afe3c99	Clip the "len" argument to vn_generic_copy_file_range() at a hole size boundary. By clipping the len argument of vn_generic_copy_file_range() to end at an exact multiple of hole size, holes are more likely to be maintained during the copy. A hole can still straddle the boundary at the end of the copy range, resulting in a block being allocated in the output file as it is being grown in size, but this will reduce the likelihood of this happening. While here, also modify setting of blksize to better handle the case where _PC_MIN_HOLE_SIZE is returned as 1. Reviewed by: asomers Differential Revision: https://reviews.freebsd.org/D26570	2020-10-01 00:33:44 +00:00
John Baldwin	8128c65b4c	Avoid a dubious assignment to bio_data in aio_qbio(). A user pointer is not a suitable value for bio_data and the next block of code always overwrites bio_data anyway. Just use cb->aio_buf directly in the call to vm_fault_quick_hold_pages(). Reviewed by: kib Obtained from: CheriBSD MFC after: 1 month Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26595	2020-09-30 17:49:06 +00:00
Emmanuel Vadot	6b74091dd5	ahci_generic: add quirk for NXP0004 (NXP Layerscape LX2160A) This fixes this error : (aprobe3:ahcich3:0:15:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00 (aprobe3:ahcich3:0:15:0): CAM status: Command timeout (aprobe3:ahcich3:0:15:0): Error 5, Retries exhausted Submitted by: Greg V <greg@unrelenting.technology> Reviewed by: imp, mav MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D25157	2020-09-30 17:10:49 +00:00
Emmanuel Vadot	a52c8a6502	acpi_resource: support multiple IRQs Some DSDT entries have multiple interrupts for one device. Add support for it. This fixes ahci on NXP LS2160 and genet on RPi4 Submitted by: Greg V <greg@unrelenting.technology> Reviewed by: jhb MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D25145	2020-09-30 17:09:17 +00:00
Mateusz Guzik	4301a5a794	cache: push the lock into cache_purge_impl	2020-09-30 17:08:34 +00:00
Conrad Meyer	a91812f69f	gdb(4): Don't escape GDB special characters at application layer In r351368, we introduced this XML- and GDB-encoded data. The protocol 'offset' should reflect the logical XML data offset, but unfortunately we counted the GDB escapes as well. In fact, we cannot safely do GDB character escaping at this layer at all, because we don't know what will be flushed in a packet. It is bogus to send only the first character of a two-character escape sequence. This patch "corrects" the problem by squashing these characters in the transmitted XML document. It would be nice to transmit the characters faithfully, but that is a more complicated change. Thread names are a nice convenience feature for the GDB client, but one can always inspect td_name or p_comm directly to find the true name. Reported by: Ka Ho Ng <khng300 AT gmail.com> Tested by: Ka Ho Ng Reviewed by: emaste, markj, rlibby Differential Revision: https://reviews.freebsd.org/D26599	2020-09-30 14:55:54 +00:00
Cy Schubert	d9bc41a1c2	Continued ipfilter #ifdef cleanup. The r343701 log entry contains a complete description. MFC after: 1 week	2020-09-30 08:26:25 +00:00
Kristof Provost	0d3aa0fb64	riscv: Panic on PMP errors Load/store/fetch access exceptions always indicate a violation of a PMP rule. We can't treat those as page faults, because updating the page table and trying again will only result in exactly the same access exception recurring. This leaves us in an endless exception loop. We cannot recover from these exceptions, so panic instead. Reviewed by: jhb Sponsored by: Axiado Differential Revision: https://reviews.freebsd.org/D26544	2020-09-30 08:23:43 +00:00
Mateusz Guzik	d4cac59429	cache: use cache_has_entries where appropriate instead of opencoding it	2020-09-30 04:27:38 +00:00
Jessica Clarke	7de649170f	riscv: Define __PCI_REROUTE_INTERRUPT Every other architecture defines this and this is required for interrupts to work when using QEMU's PCI VirtIO devices (which all report an interrupt line of 0) for two reasons. Firstly, interrupt line 0 is wrong; they use one of 0x20-0x23 with the lines being cycled across devices like normal. Moreover, RISC-V uses INTRNG, whose IRQs are virtual as indices into its irq_map, so even if we have the right interrupt line we still need to try and route the interrupt in order to ultimately call into intr_map_irq and get back a unique index into the map for the given line, otherwise we will use whatever happens to be in irq_map[line] (which for QEMU where the line is initialised to 0 results in using the first allocated interrupt, namely the RTC on IRQ 11 at time of commit). Note that pci_assign_interrupt will still do the wrong thing for INTRNG when using a tunable, as it will bypass INTRNG entirely and use the tunable's value as the index into irq_map, when it should instead (indirectly) call intr_map_irq to allocate a new entry for the given IRQ and treat the tunable as stating the physical line in use, which is what one would expect. This, however, is a problem shared by all INTRNG architectures, and not exclusive to RISC-V. Reviewed by: kib Approved by: kib Differential Revision: https://reviews.freebsd.org/D26564	2020-09-30 02:21:38 +00:00
Rick Macklem	164aa1e941	Make copy_file_range(2) Linux compatible for overflow of offset + len. Without this patch, if a call to copy_file_range(2) specifies an input file offset + len that would wrap around, EINVAL is returned. I thought that was the Linux behaviour, but recent testing showed that Linux accepts this case and does the copy_file_range() to EOF. This patch changes the FreeBSD code to exhibit the same behaviour as Linux for this case. Reviewed by: asomers, kib Differential Revision: https://reviews.freebsd.org/D26569	2020-09-30 02:18:09 +00:00
Mitchell Horne	fe9602fbf8	arm64: set the correct HWCAP This appears to be a typo. The AdvSIMD field encodes support for half-precision floating point SIMD instructions, which corresponds to HWCAP_ASIMDHP, not HWCAP_ASIMDDP. MFC after: 3 days Sponsored by: The FreeBSD Foundation	2020-09-29 23:21:56 +00:00
John Baldwin	0e99339684	Fallback to software for more GCM and CCM requests. ccr(4) uses software to handle GCM and CCM requests not supported by the crypto engine (e.g. with only AAD and no payload). This change adds a fallback for a few more requests such as those with more SGL entries than can fit in a work request (this can happen for GCM when decrypting a TLS record split across 15 or more packets). Reported by: Chelsio QA Reviewed by: np Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D26582	2020-09-29 21:51:32 +00:00
Bjoern A. Zeeb	3917c9ba65	rtwn: narrow the epoch area Rather than placing the epoch around the entire receive loop which might call into rtwn_rx_frame() and USB and sleep, split the loop into two[1] and leave us with one unlock/lock cycle as well. PR: 249925 Reported by: thj, (rkoberman gmail.com) Tested by: thj Suggested by: adrian [1] Reviewed by: adrian MFC after: 3 days Sponsored by: The FreeBSD Foundation (initially, panicked my iwl lab host) Differential Revision: https://reviews.freebsd.org/D26554	2020-09-29 20:46:25 +00:00
Ruslan Bukin	6186bfbd18	Rename kernel option ACPI_DMAR to IOMMU. This is mostly needed for a common arm64/amd64 iommu code. Reviewed by: kib Sponsored by: DARPA, AFRL Differential Revision: https://reviews.freebsd.org/D26587	2020-09-29 20:29:07 +00:00
Warner Losh	dc761d84e2	Standalone SX shims Create a do-nothing version of SX locks. OpenZFS needs them. However, since the boot loader is single threaded, they can be nops.	2020-09-29 18:06:02 +00:00
Ruslan Bukin	025730aad6	o Rename acpi_iommu_get_dma_tag() -> iommu_get_dma_tag(). This function isn't ACPI dependent and we may use it on FDT systems as well. o Don't repeat the function declaration, include iommu.h instead. Reviewed by: andrew, kib Sponsored by: DARPA, AFRL Differential Revision: https://reviews.freebsd.org/D26584	2020-09-29 15:10:56 +00:00
Mark Johnston	752c1d14e3	ZFS: Fix a logic bug in the FreeBSD getpages VOP This was introduced when I merged r361287 to OpenZFS and has been fixed there already, commit 3f6bb6e43fd68e. Reported by: swills Reviewed by: allanjude, freqlabs, mmacy	2020-09-29 13:41:47 +00:00
Edward Tomasz Napierala	39e75a5a79	Build debug kernels with -O2. LLVM 11 changed the meaning of '-O' from '-O2' to '-O1', which resulted in debug kernels (with 'makeoptions DEBUG=-g') being built with inlining disabled, causing a severe performance hit. The -O2 was already being used for building amd64, powerpc, and powerpcspe. Discussed with: jrtc27, arichardson, bdragon, jhibbits Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26471	2020-09-29 11:48:22 +00:00
Edward Tomasz Napierala	3409864922	Use the 'traced' variable instead of comparing p->p_flag again. Reviewed by: kib Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26577	2020-09-29 11:18:48 +00:00
Michael Tuexen	b15f541113	Improve the input validation and processing of cookies. This avoids setting the association in an inconsistent state, which could result in a use-after-free situation. This can be triggered by a malicious peer, if the peer can modify the cookie without the local endpoint recognizing it. Thanks to Ned Williamson for reporting the issue. MFC after: 3 days	2020-09-29 09:36:06 +00:00
Navdeep Parhar	822967e7e5	cxgbe(4): Avoid unnecessary work in the firmware during netmap tx. Bind the netmap tx queues to a special '0xff' scheduling class which makes the firmware skip some processing related to rate limiting on the outgoing traffic. Future firmwares will do this automatically. MFC after: 1 week Sponsored by: Chelsio Communications	2020-09-29 09:25:52 +00:00
Navdeep Parhar	7efe256233	Remove duplicate line.	2020-09-29 09:11:51 +00:00
Navdeep Parhar	15ca0766ed	cxgbe(4): adjust the doorbell threshold for netmap freelists to match the maximum burst size used when fetching descriptors from the list. MFC after: 1 week Sponsored by: Chelsio Communications	2020-09-29 07:51:06 +00:00
Navdeep Parhar	f7b8615af5	cxgbe(4): display an error message when netmap cannot be enabled because the interface is down. MFC after: 1 week	2020-09-29 07:36:21 +00:00
Navdeep Parhar	a9f476580e	cxgbe(4): fixes for netmap operation with only some queues active. - Only active netmap receive queues should be in the RSS lookup table. - The RSS table should be restored for NIC operation when the last active netmap queue is switched off, not the first one. - Support repeated netmap ON/OFF on a subset of the queues. This works whether the queues being enabled and disabled are the only ones active or not. Some kring indexes have to be reset in the driver for the second case. MFC after: 1 week Sponsored by: Chelsio Communications	2020-09-29 05:08:45 +00:00
Kyle Evans	5f0601fd19	Address whitespace nits in subr_rtc.c These were separated out from a nearby patch from Andrew Gierth. MFC after: 3 days	2020-09-28 17:19:57 +00:00
Ed Maste	c1aedfcbd9	add SIOCGIFDATA ioctl For interfaces that do not support SIOCGIFMEDIA (for which there are quite a few) the only fallback is to query the interface for if_data->ifi_link_state. While it's possible to get at if_data for an interface via getifaddrs(3) or sysctl, both are heavy weight mechanisms. SIOCGIFDATA is a simple ioctl to retrieve this fast with very little resource use in comparison. This implementation mirrors that of other similar ioctls in FreeBSD. Submitted by: Roy Marples <roy@marples.name> Reviewed by: markj MFC after: 1 month Differential Revision: https://reviews.freebsd.org/D26538	2020-09-28 16:54:39 +00:00
Warner Losh	ab3f5b6ef2	For multicons boot, report it and which console is primary Until we can do proper /etc/rc output on both consoles in multicons boot (or all of them if we ever generalize), report when we are booting multicons. Also report the primary console. This will be a big hint why output stops after this line (though some slow USB discovery still happens after mountroot / init starts). Reviewed by: scottl@, tsoome@ Differential Revision: https://reviews.freebsd.org/D26574	2020-09-28 16:19:29 +00:00
Michael Tuexen	fbc6840bae	Minor cleanup. MFC after: 3 days	2020-09-28 14:11:53 +00:00
Michal Meloun	722779c7dd	Fix booting arm64 EFI with LINUX_BOOT_ABI enabled. Use address of the pointer passed to kernel to determine whether the pointer is a FDT block (physical address) or a module pointer (virtual kernel address). This fragment was supposed to be committed before r366196, but I accidentally skipped it in a patch series. Reported by: bz	2020-09-28 09:16:27 +00:00
Edward Tomasz Napierala	1e2521ffae	Get rid of sa->narg. It serves no purpose; use sa->callp->sy_narg instead. Reviewed by: kib Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26458	2020-09-27 18:47:06 +00:00
Cy Schubert	c4390e6da6	Remove extraneous bracket. MFC after: 3 days	2020-09-27 18:39:15 +00:00
Edward Tomasz Napierala	4abea760e7	Shrink struct sysent from 48 to 32 bytes (on LP64; on ILP32 it's probably from 32 to 28) by shrinking some entries and reordering them. Reviewed by: kib Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26508	2020-09-27 18:14:01 +00:00
Michal Meloun	ad86fd010c	Check the result of the function, not the pointer to it.	2020-09-27 16:15:03 +00:00
Michael Tuexen	1d1b4bce53	Cleanup, no functional change intended. MFC after: 3 days	2020-09-27 13:32:02 +00:00
Michael Tuexen	8f269b8242	Improve the handling of receiving unordered and unreliable user messages using DATA chunks. Don't use fsn_included when not being sure that it is set to an appropriate value. If the default is used, which is -1, this can result in SCTP associations not making any user visible progress. Thanks to Yutaka Takeda for reporting this issue for the userland stack in https://github.com/pion/sctp/issues/138. MFC after: 3 days	2020-09-27 13:24:01 +00:00
Michal Meloun	2e3294cd04	Don't send a signal with uninitialized 'sig' and 'code' fields. We have a few shortcuts in the arm trap code to speed up obvious "must fail" cases. In these situations, make sure that we fill in the "sig" and "code" fields of the generated signal. MFC after: 3 weeks	2020-09-27 11:37:17 +00:00
Michal Meloun	1b5a4fc401	Add LINUX_BOOT_ABI back to arm64 GENERIC kernel. It was removed in r355289 but forgot to return it back when new u-boot booti support was committed. Although booti is not the preferred method of booting the kernel, it is very useful for the initial phase of porting FreeBSD to a new platform or booting the kernel on various embedded boards in an industrial environment.	2020-09-27 10:15:03 +00:00
Michal Meloun	f10ab2d5a9	Reapply r366193 with proper commit log. Don't map same physical memory multiple times with different cache attributes. This is explicitly stated as architectural undefined behavior, leading to coherency issues sooner or later.	2020-09-27 09:27:39 +00:00
Michal Meloun	19fd4977f2	Revert r366193, it was committed with unsaved commit log.	2020-09-27 09:24:31 +00:00
Michal Meloun	7b34701e31	Don't map same physical memory multiple times with different cache attributes. This is explicitly stated as architectural undefined behavior, leadint to coherencz issues sonner or later.	2020-09-27 09:14:16 +00:00
Michal Meloun	0e417b55d5	Don't try to print EFI memory map if it doesn't exist. MFC after: 1 week	2020-09-27 09:12:36 +00:00
Rick Macklem	ff45b9fc1a	Bjorn reported a problem where the Linux NFSv4.1 client is using an open_to_lock_owner4 when that lock_owner4 has already been created by a previous open_to_lock_owner4. This caused the NFS server to reply NFSERR_INVAL. For NFSv4.0, this is an error, although the updated NFSv4.0 RFC7530 notes that the correct error reply is NFSERR_BADSEQID (RFC3530 did not specify what error to return). For NFSv4.1, it is not obvious whether or not this is allowed by RFC5661, but the NFSv4.1 server can handle this case without error. This patch changes the NFSv4.1 (and NFSv4.2) server to handle multiple uses of the same lock_owner in open_to_lock_owner so that it now correctly interoperates with the Linux NFS client. It also changes the error returned for NFSv4.0 to be NFSERR_BADSEQID. Thanks go to Bjorn for diagnosing this and testing the patch. He also provided a program that I could use to reproduce the problem. Tested by: bj@cebitec.uni-bielefeld.de (Bjorn Fischer) PR: 249567 Reported by: bj@cebitec.uni-bielefeld.de (Bjorn Fischer) MFC after: 3 days	2020-09-26 23:05:38 +00:00
Justin Hibbits	b2668f7b49	Check for the only 32-bit MIPS ABIs we support, rather than !n64 There may be additional 64-bit ABIs supported, so use a positive check rather than a negative check. Suggested by: imp MFC after: 1 week Sponsored by: Juniper Networks, Inc	2020-09-26 21:47:11 +00:00
John Baldwin	83a277830f	Revert most of r360179. I had failed to notice that sgsendccb() was using cam_periph_mapmem() and thus was not passing down user pointers directly to drivers. In practice this broke requests submitted from userland. PR: 249395 Reported by: Trenton Schulz <trueos@norwegianrockcat.com> Reviewed by: scottl MFC after: 3 days Differential Revision: https://reviews.freebsd.org/D26550	2020-09-25 21:19:56 +00:00
Warner Losh	728757f256	Adjustments to includes for openzfs in _STANDALONE Allow the necessary parts of systm.h to be visible in the _STANDALONE environment. Limit the reset to only being visible for _KERNEL builds. Map KASSERT, etc to printf on failure in the bootloader until we have more confidence things won't break and leave systems unbootable. Eventually, this should map to a full panic in the bootloader, but that also needs some enhancement to be more useful. Reviewed by: tsoome, jhb Differential Revision: https://reviews.freebsd.org/D26543	2020-09-25 20:51:07 +00:00
Justin Hibbits	af399c5bf7	Fix mips64 build Original patch was against FreeBSD 12, and a test compile wasn't run against head. md_tls_tcb_offset field was moved from mdthread to mdproc in the meantime. MFC after: 1 week Sponsored by: Juniper Networks, Inc.	2020-09-25 20:27:36 +00:00
Justin Hibbits	ebf7855dcd	mips: Fix compat32 library builds from r366162 Re-add the a_ptr and a_fcn fields to Elf32_Auxinfo. MFC after: 1 week Sponsored by: Juniper Networks, Inc.	2020-09-25 19:04:03 +00:00
Warner Losh	fcefa24551	Don't let kernel and standalone both be defined at the same time _KERNEL and _STANDALONE are different things. They cannot both be true at the same time. If things that are normally visible only to _KERNEL are needed for the _STANDALONE environment, you need to also make them visible to _STANDALONE. Often times, this will be just a subset of the required things for _KERNEL (eg global variables are but one example). sys/cdefs.h is included by pretty much everything in both the loader and the kernel, so is the ideal choke point.	2020-09-25 19:02:49 +00:00
Mark Johnston	e62e4b8594	ng_l2tp: Fix callout synchronization in the rexmit timeout handler A received control packet may cause the transmit queue to be flushed, in which case ng_l2tp_seq_recv_nr() cancels the transmit timeout handler. The handler checks to see if it was cancelled before doing anything, but did so before acquiring the node lock, so a small race window could cause ng_l2tp_seq_rack_timeout() to attempt to flush an empty queue, ultimately causing a null pointer dereference. PR: 241133 Reviewed by: bz, glebius, Lutz Donnerhacke MFC after: 3 days Sponsored by: Rubicon Communications, LLC (Netgate) Differential Revision: https://reviews.freebsd.org/D26548	2020-09-25 18:55:50 +00:00
Justin Hibbits	6d5ca5199c	Fix compat32 on mips64 Summary: Two bugs: * Elf32_Auxinfo is broken, using pointers in the union, which are 64-bits not 32. * freebsd32_sysarch() doesn't update the 'user local' register when handling MIPS_SET_TLS, leading to a NULL pointer dereference in the 32-bit application. Reviewed by: #mips, brooks MFC after: 1 week Sponsored by: Juniper Networks, Inc Differential Revision: https://reviews.freebsd.org/D26556	2020-09-25 17:13:45 +00:00
Michal Meloun	01d0f9c0e4	Refine locking inside of syscon driver. In some cases, the syscon driver may be used by consumer requiring better control about locking (i.e. it may be used as register file provider for clock driver which needs locked access to multiple registers). Add fine locking protocol methods together with bunch of helper functions in syscon driver and implement this functionality in syscon_generic driver. MFC after: 4 weeks	2020-09-25 16:44:01 +00:00
Michal Meloun	8dc348a479	Correctly handle nodes compatible with "syscon", "simple-bus". Syscon can also have child nodes that share a registration file with it. To do this correctly, follow these steps: - subclass syscon from simplebus and expose it if the node is also "simple-bus" compatible. - block simplebus probe for this compatible string, so its priority (bus pass) doesn't collide with syscon driver. While I'm in, also block "syscon", "simple-mfd" for the same reason. MFC after: 4 weeks	2020-09-25 13:52:31 +00:00
Richard Scheffenegger	e399566123	TCP: send full initial window when timestamps are in use The fastpath in tcp_output tries to send out full segments, and avoid sending partial segments by comparing against the static t_maxseg variable. That value does not consider tcp options like timestamps, while the initial window calculation is using the correct dynamic tcp_maxseg() function. Due to this interaction, the last, full size segment is considered too short and not sent out immediately. Reviewed by: tuexen MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D26478	2020-09-25 10:38:19 +00:00
Richard Scheffenegger	1567c937e2	TCP newreno: improve after_idle ssthresh Adjust ssthresh in after_idle to the maximum of the prior ssthresh, or 3/4 of the prior cwnd. See RFC2861 section 2 for an in depth explanation for the rationale around this. As newreno is the default "fall-through" reaction, most tcp variants will benefit from this. Reviewed by: tuexen MFC after: 2 weeks Sponsored by: NetApp, Inc. Differential Revision: https://reviews.freebsd.org/D22438	2020-09-25 10:23:14 +00:00
Edward Tomasz Napierala	0c5bd5f993	Regen after r366145. Sponsored by: DARPA	2020-09-25 10:05:38 +00:00
Michal Meloun	b95a8021ec	Make simplebus friendlier for subclassing. MFC after: 1 week	2020-09-25 09:56:50 +00:00
Edward Tomasz Napierala	586bd2de78	Make makesyscalls.lua initialize 'struct sysent' entries using c99 designated initializers. This makes it easier to modify 'struct sysent' layout. Reviewed by: kevans Sponsored by: DARPA Differential Revision: https://reviews.freebsd.org/D26530	2020-09-25 09:34:00 +00:00
Andriy Gapon	e08dc44162	aw_pwm: add a check and some comments related to long periods The hardware supports periods as long as 196 seconds[] when using the maximal prescaling of 72000 and maximum cycle count of 2^16. But the code becomes incorrect when the period length approaches 1 second. That's because of things like NS_PER_SEC / period. [] At the same time I must note that the KPI provides for maximum period of about 4 seconds (2^32 nanoseconds). MFC after: 2 weeks	2020-09-25 07:41:51 +00:00
Andriy Gapon	6957a14075	aw_pwm: ensure sane configuration, just in case Make sure that the hardware is configured to cycle mode and that the bypass is disabled. MFC after: 2 weeks	2020-09-25 07:40:56 +00:00
Andriy Gapon	fc1ec731c8	aw_pwm: fix programming of the period The programmed value is biased by one: 0 means 1 cycle, 1 means 2 cycles, etc. MFC after: 3 weeks	2020-09-25 07:40:26 +00:00
Andriy Gapon	1c2c602a17	aw_pwm: fix selection of the prescaler Prescaling divides the frequency, not multiplies it. MFC after: 2 weeks	2020-09-25 07:40:02 +00:00
Andriy Gapon	108d235ae6	aw_pwm: remove the busy bit check The bit seems to always be set on my hardware, H3. However, programming the hardware seems to work just fine. MFC after: 3 weeks	2020-09-25 07:39:41 +00:00
Andriy Gapon	b1dbb66d49	aw_pwm: trivially add H3 support MFC after: 2 weeks	2020-09-25 07:39:14 +00:00
Conrad Meyer	5b50517079	amdtemp(4), amdsmn(4): Attach to Ryzen 4000 APU (Zen 2, "Renoir") PR: 249864 Reported by: Florian Millet <florian.millet AT laposte.net> Tested by: Florian Millet	2020-09-25 04:16:28 +00:00
Alan Somers	a62772a78e	fusefs: fix mmap'd writes in direct_io mode If a FUSE server returns FOPEN_DIRECT_IO in response to FUSE_OPEN, that instructs the kernel to bypass the page cache for that file. This feature is also known by libfuse's name: "direct_io". However, when accessing a file via mmap, there is no possible way to bypass the cache completely. This change fixes a deadlock that would happen when an mmap'd write tried to invalidate a portion of the cache, wrongly assuming that a write couldn't possibly come from cache if direct_io were set. Arguably, we could instead disable mmap for files with FOPEN_DIRECT_IO set. But allowing it is less likely to cause user complaints, and is more in keeping with the spirit of open(2), where O_DIRECT instructs the kernel to "reduce", not "eliminate" cache effects. PR: 247276 Reported by: trapexit@spawn.link Reviewed by: cem MFC after: 3 days Differential Revision: https://reviews.freebsd.org/D26485	2020-09-24 16:27:53 +00:00
Alan Somers	5710395f4d	Fix some signed/unsigned comparison warnings in NFS Reviewed by: rmacklem MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D26533	2020-09-24 15:38:01 +00:00
Michael Tuexen	b6db274d1e	Whitespace changes. MFC after: 3 days	2020-09-24 12:26:06 +00:00
Konstantin Belousov	5dca94ee82	Remove pointless local variable. Reported by: alc Sponsored by: The FreeBSD Foundation MFC after: 6 days	2020-09-24 12:14:25 +00:00
Bjoern A. Zeeb	fe5ebb23cc	Provide MS() and SM() macros for 80211 and wireless drivers. We have (two versions) of MS() and SM() macros which we use throughout the wireless code. Change all but three places (ath_hal, rtwn, and rsu) to the newly provided _IEEE80211_MASKSHIFT() and _IEEE80211_SHIFTMASK() macros. Also change one internal case using both _S and _M instead of just _S away from _M (one of the reasons rtwn and rsu were not changed). This was done semi-mechanically. No functional changes intended. Requested by: gnn (D26091) Reviewed by: adrian (pre line wrap) MFC after: 2 weeks Sponsored by: Rubicon Communications, LLC (d/b/a "Netgate") Differential Revision: https://reviews.freebsd.org/D26539	2020-09-24 10:57:39 +00:00
Andrew Turner	122e47836e	Clean up the arm64 bus_dma_run_filter - We can exit the loop as soon as the filter check passes. - The alignment check has already passed so there is no need to also run it here. Sponsored by: Innovate UK	2020-09-24 10:42:28 +00:00
Andrew Turner	ec9d068513	Ensure arm64 DMA alignment is passed from parents to children This ensures the alignment check will take these alignments into account. Sponsored by: Innovate UK	2020-09-24 10:40:49 +00:00
Michal Meloun	88f7c52f31	Add missing declarations of 64-bit variants of bus_peek/bus_poke on amd64. It fixes GENERIC-KCSAN build. Reported by: rpokala MFC after: 1 month MFC with: r365899	2020-09-24 08:40:32 +00:00
Andrew Turner	2e3b7d8041	Bounce in more cases in the arm64 busdma We need to use a bounce buffer when the memory we are operating on is not aligned to a cacheline, and not aligned to the maps alignment. The former is to stop other threads from dirtying the cacheline while we are performing DMA operations with it. The latter is to check memory passed in by a driver is correctly aligned for the device. Reviewed by: mmel Sponsored by: Innovate UK Differential Revision: https://reviews.freebsd.org/D26496	2020-09-24 07:17:05 +00:00
Andrew Turner	f0e50a4416	Ensure we always align and size arm64 busdma allocations to a cacheline This will ensure nothing modifies the cacheline while DMA is in progress so we won't need to bounce the data. Reviewed by: mmel Sponsored by: Innovate UK Differential Revision: https://reviews.freebsd.org/D26495	2020-09-24 07:13:13 +00:00
Warner Losh	d9524c1232	Don't define _STANDALONE when building kernel modules. _STANDALONE is only for the bootloader, not kernel modules. Remove it from the build. This was harmless before, but sys/malloc.h now does different things for the standalone environment, triggering the issue.	2020-09-24 07:10:34 +00:00
Andrew Turner	0aaa66cc79	Add a coherent flag on the arm64 dma map struct Use it to decide if we can skip cache management. While here remove the DMAMAP_COULD_BOUNCE flag as it's unneeded. Reviewed by: mmel Sponsored by: Innovate UK Differential Revision: https://reviews.freebsd.org/D26494	2020-09-24 07:07:54 +00:00
Andrew Turner	66cbbb75b2	Add bounce helpers to the arm64 busdma Add helper functions to the arm64 busdma for common cases of checking if we may need to bounce, and if we must bounce for a given address. These will be expanded later as we handle cache-misaligned memory. Reported by: mmel Sponsored by: Innovate UK Differential Revision: https://reviews.freebsd.org/D26493	2020-09-24 07:03:26 +00:00
Warner Losh	0672da33f3	Create a standalone version of sys/malloc.h The ZSTD support for the boot loader will need to include files that use the kernel's malloc interface. Create a standalone stub version that's functional enough to allow this to work. There's some limitations in this interface, and it's not quite a perfect match. Specifically, M_WAITOK allocations can fail because there's nothing that can be done when no memory is available.	2020-09-24 06:40:35 +00:00
Mateusz Guzik	1b2edd6e2b	cache: eliminate cache_zap_locked_vnode It is only ever called for negative entries and for those it is just a wrapper around cache_zap_negative_locked_vnode_kl which always succeeds. This also fixes a bug where cache_lookup_fallback should have been calling cache_zap_locked_bucket instead. Note that in order to trigger the bug NOCACHE must not be set, which currently only happens when creating a new coredump (and then the coredump-to-be has to have a negative entry).	2020-09-24 03:38:32 +00:00
Mark Johnston	114484b7ec	Flag vm_reserv and vm_phys sysctls as MPSAFE. Nothing in these subsystems relies on Giant. MFC after: 1 week	2020-09-23 19:36:07 +00:00
Mark Johnston	78257765f2	Add a vmparam.h constant indicating pmap support for large pages. Enable SHM_LARGEPAGE support on arm64. Reviewed by: alc, kib Sponsored by: Juniper Networks, Inc., Klara, Inc. Differential Revision: https://reviews.freebsd.org/D26467	2020-09-23 19:34:21 +00:00
Mark Johnston	4168aedcde	Add largepage support to the arm64 pmap. Reviewed by: alc, kib Sponsored by: Juniper Networks, Inc., Klara, Inc. Differential Revision: https://reviews.freebsd.org/D26466	2020-09-23 19:33:47 +00:00
Warner Losh	f9ba2bbe3a	Use envvar rather than nonstandard hint. lines The NOTES files have a bunch of hint lines that are removed when generating LINT. However, we can achieve the same effect by prepending each of the lines with 'envvar' so the NOTES files become standard config(8) files. No functional changes as the sed script to generate the LINT files filters these either way. Suggested by: kevans	2020-09-23 19:18:53 +00:00
Nick O'Brien	e1c8f8f87d	riscv: Trap cleanup - use nitems() No functional changes, just cleanup. Reviewed by: kp Approved by: kp (mentor) Sponsored by: Axiado	2020-09-23 18:54:14 +00:00
Konstantin Belousov	aaf78c16f5	Do not leak oldvmspace if image activation failed and current address space is already destroyed, so kern_execve() terminates the process. While there, clean up some internals of post_execve() inlined in init_main. Reported by: Peter <pmc@citylink.dinoex.sub.org> Reviewed by: markj Sponsored by: The FreeBSD Foundation MFC after: 1 week Differential revision: https://reviews.freebsd.org/D26525	2020-09-23 18:03:07 +00:00
Ed Maste	64d33e9e60	remove reference to obsolete arm NOTES files We left these in the clean rule to avoid having stale files remain in working trees, but enough time has now passed that it's no longer relevant. Discussed with: imp	2020-09-23 14:52:43 +00:00
Mateusz Guzik	254c54c65a	Bump __FreeBSD_version after cache_purgevfs change	2020-09-23 11:02:23 +00:00
Mateusz Guzik	a3d9bf49b5	cache: drop the force flag from purgevfs The optional scan is wasteful, thus it is removed altogether from unmount. Callers which always want it anyway remain unaffected.	2020-09-23 10:46:07 +00:00
Mateusz Guzik	a952fefff2	cache: reimplement purgevfs to iterate vnodes instead of the entire hash The entire cache scan was a leftover from the old implementation. It is incredibly wasteful in presence of several mount points and does not win much even for single ones.	2020-09-23 10:44:49 +00:00
Mateusz Guzik	efeec5f0c6	cache: clean up atomic ops on numneg and numcache - use subtract instead of adding -1 - drop the useless _rel fence Note this should be converted to a scalable scheme.	2020-09-23 10:42:41 +00:00
Brandon Bergren	d20d17f6d4	[PowerPC64LE] Fix RTAS LE calls in pseries. Similar to OPAL calls, switch to big endian to do calls to RTAS. (Missed this one when I was doing the bulk commit of PowerPC64LE support.) Sponsored by: Tag1 Consulting, Inc.	2020-09-23 04:09:02 +00:00
Brandon Bergren	af22c7e495	__FreeBSD_version bump for introduction of the powerpc64le arch. Although this is technically not a breaking change, I believe it is best to have a fresh version to use to define where the starting point was here.	2020-09-23 03:19:20 +00:00
Brandon Bergren	93a5341930	[PowerPC64LE] Fix sleeping on POWER8. Due to enter_idle_powerx fabricating a MSR from scratch, it is necessary for it to care about the endianness, so we don't accidentally switch endian the first time we idle a thread. Took about five seconds to spot after seeing an unmangled backtrace. The hard bit was needing to temporarily set up a mutex to sort out the logjam that happens when every thread simultaneously wakes up in the wrong endian due to the panic IPI and panics, leaving what I can best describe as "alphabet soup" on the console. Luckily, I already had a patch sitting around to do that. This brings POWER8 up to equivalence with POWER9 on PPC64LE. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 02:28:19 +00:00
Brandon Bergren	0d356a5349	[PowerPC64LE] Fix AP spinup on powernv. OPAL unconditionally enters secondary CPUs with only HV and SF set. I tried writing a secondary entry point instead, but OPAL rejected it and I am unsure why, so I resorted to making the system reset interrupt endian-flexible. This means we take a slight performance hit on wakeup on LE, but it is a good stopgap until we can figure out a reliable way to make OPAL enter where we want it to. It probably makes sense to have it around anyway, because I can imagine scenarios where the cpu resets itself to BE and does a software reset. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:56:26 +00:00
Brandon Bergren	05c3051f86	[PowerPC64LE] Endian fix for opal_hmi.c Another boring one. We need to endian swap before checking flags. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:51:01 +00:00
Brandon Bergren	f9acb7a818	[PowerPC64LE] Get XIVE up and running. More endian conversion. * Install TCEs correctly (i.e. in big endian) * Convert to big endian and back when setting up queue pages and IRQs. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:49:37 +00:00
Brandon Bergren	bf933a83ec	[PowerPC64LE] Endian fix for opal_dev.c. Not much to say here, another missing be64toh() in memory that was written from OPAL. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:41:51 +00:00
Brandon Bergren	9cbcb6ffce	[PowerPC64LE] Endian fixes for opal_pci.c. Since OPAL runs in big endian, any data being passed back and forth via memory instead of registers needs to be byteswapped. From my notes during development: "A good way to find candidates is to look for vtophys() in opal_call() parameters. The memory being passed will be written into in BE." Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:37:01 +00:00
Brandon Bergren	d418d3f616	[PowerPC64LE] Implement endian-independent dword atomic PTE lock. It's much easier to implement this in an endian-independent way when we don't also have to worry about masking half of the dword off. Given that this code ran on a machine that ran a poudriere bulk with no kernel oddities, I am relatively certain it is correctly implemented. ;) This should be a minor performance boost on BE as well. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:33:54 +00:00
Brandon Bergren	f475e00fb3	[PowerPC64LE] Fix endian conversion bugs in moea64. For a body of code that had its endian conversion bits written blind without the ability to test, moea64 was VERY close to being correct. There were only four instances where the existing code was getting it wrong. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:29:33 +00:00
Brandon Bergren	6e5dbfb2bf	[PowerPC64LE] Initial GENERIC64LE kernel config. This is slightly stripped down from GENERIC64, as PowerMac G5 machines are incapable of running in LE mode (so we can skip the Mac drivers.) While technically POWER6 and POWER7 have the hardware capability of running in LE mode, they have a tendency to trap excessively when a load/store is misaligned. (an extremely common occurrence in LE code, and one of the main reasons I consider BE to be superior, as it turns potential security issues into immediately obvious mangled numbers.) Additionally, there was no mechanism to control what endian interrupts are delivered in, so supporting LE operation on POWER6 and POWER7 involves some really dirty tricks in the interrupt vectors that I would rather avoid. IBM drew the line in the sand at POWER8 some time around 2013, embracing full support for LE in the platform, and making a push across the board for LE code to target POWER8 as a minimum requirement. As such, usage of LE kernels on POWER6 and POWER7 is practically nil, despite it being technically possible to do. The so-called "TRUELE" feature bit which is the baseline requirement for PowerPC64LE was introduced in POWER8. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 01:07:55 +00:00
Brandon Bergren	c16359cf66	[PowerPC64LE] powernv ILE setup code. When running without a hypervisor, we need to set the ILE bit in the LPCR ourselves. For the boot processor, handle it in powernv_attach() like we do for other LPCR bits. No change for the APs, as they will use the lpcr global to set up their own LPCR when they do their own cpudep_ap_early_bootstrap() and pick up this automatically. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 00:32:50 +00:00
Brandon Bergren	dadfbc2e60	[PowerPC64LE] LE opal_call() implementation OPAL runs in big endian, so we need to rfid into it to switch endian atomically when branching to it, and we need to do the RETURN_TO_NATIVE_ENDIAN dance when it returns to us. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 00:28:47 +00:00
Brandon Bergren	c0290b3de8	[PowerPC64LE] Fix endianness issues in phyp_vscsi. Unlike virtio, which in legacy mode is guest endian, the hypervisor vscsi interface operates in big endian, so we must convert back and forth in several places. These changes are enough to attach a rootdisk. Sponsored by: Tag1 Consulting, Inc.	2020-09-23 00:13:58 +00:00

134541 Commits