freebsd-nq

Author	SHA1	Message	Date
Nathan Whitehorn	c884d31122	Add two fake properties ("fdtbootcpu" and "fdtmemreserv") to the device tree's /chosen node to provide out-of-band header fields of the FDT. This emulation is not perfect without corresponding changes to ofw_fdt_nextprop(), but is enough to enable lookup by memory-map-parsing code. MFC after: 1 week	2015-01-16 18:47:20 +00:00
Alan Cox	5268042bbd	Revamp the default page clustering strategy that is used by the page fault handler. For roughly twenty years, the page fault handler has used the same basic strategy: Fetch a fixed number of non-resident pages both ahead and behind the virtual page that was faulted on. Over the years, alternative strategies have been implemented for optimizing the handling of random and sequential access patterns, but the only change to the default strategy has been to increase the number of pages read ahead to 7 and behind to 8. The problem with the default page clustering strategy becomes apparent when you look at how it behaves on the code section of an executable or shared library. (To simplify the following explanation, I'm going to ignore the read that is performed to obtain the header and assume that no pages are resident at the start of execution.) Suppose that we have a code section consisting of 32 pages. Further, suppose that we access pages 4, 28, and 16 in that order. Under the default page clustering strategy, we page fault three times and perform three I/O operations, because the first and second page faults only read a truncated cluster of 12 pages. In contrast, if we access pages 8, 24, and 16 in that order, we only fault twice and perform two I/O operations, because the first and second page faults read a full cluster of 16 pages. In general, truncated clusters are more common than full clusters. To address this problem, this revision changes the default page clustering strategy to align the start of the cluster to a page offset within the vm object that is a multiple of the cluster size. This results in many fewer truncated clusters. Returning to our example, if we now access pages 4, 28, and 16 in that order, the cluster that is read to satisfy the page fault on page 28 will now include page 16. So, the access to page 16 will no longer page fault and perform an I/O operation. Since the revised default page clustering strategy is typically reading more pages at a time, we are likely to read a few more pages that are never accessed. However, for the various programs that we looked at, including clang, emacs, firefox, and openjdk, the reduction in the number of page faults and I/O operations far outweighed the increase in the number of pages that are never accessed. Moreover, the extra resident pages allowed for many more superpage mappings. For example, if we look at the execution of clang during a buildworld, the number of (hard) page faults on the code section drops by 26%, the number of superpage mappings increases by about 29,000, but the number of never accessed pages only increases from 30.38% to 33.66%. Finally, this leads to a small but measureable reduction in execution time. In collaboration with: Emily Pettigrew <ejp1@rice.edu> Differential Revision: https://reviews.freebsd.org/D1500 Reviewed by: jhb, kib MFC after: 6 weeks	2015-01-16 18:17:09 +00:00
Nathan Whitehorn	348a26e6c7	Make netbooting work again by disabling BOOTP_NFSV3. Investigate why this causes bad RPC errors later.	2015-01-16 17:41:21 +00:00
Alexander Motin	1b922b707e	Don't count status as sent until CTIO completes successfully. If we aggregated status sending with data move and got error, allow status to be updated and resent again separately. Without this command may stuck without status sent at all. MFC after: 2 weeks	2015-01-16 12:35:55 +00:00
Hans Petter Selasky	c8563d530c	Add more USB device IDs. Submitted by: max.n.boyarov@gmail.com PR: 196362 MFC after: 1 week	2015-01-16 12:16:21 +00:00
Alexander V. Chernikov	0b47e42b49	Use ipfw runtime lock only when real modification is required.	2015-01-16 10:49:27 +00:00
Alexander V. Chernikov	29e0d65d7a	Eliminate SIOCGIFADDR handling in bpf. Quoting 19 years bpf.4 manual from bpf-1.2a1: " (SIOCGIFADDR is obsolete under BSD systems. SIOCGIFCONF should be used to query link-level addresses.) " * SIOCGIFADDR was not imported in NetBSD (bpf.c 1.36) and OpenBSD. * Last bits (e.g. manpage claiming SIOCGIFADDR exists) was cleaned from NetBSD via kern/21513 5 years ago, from OpenBSD via documentation/6352 5 years ago.	2015-01-16 10:09:28 +00:00
Konstantin Belousov	271ab2406f	For sigaction(2), ignore possible garbage in sa_flags for sa_handler == SIG_DFL or SIG_IGN. Sloppy code does not fully initialize struct sigaction for such cases, and being too demanding in the case of default handler does not catch anything. Reported and tested by: Alex Tutubalin <lexa@lexa.ru> Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-01-16 07:06:58 +00:00
Warner Losh	966e729842	Always enable I/O, memory and dma cycles. Some BIOSes don't enable them, sometimes they are reset for power state transitions or during whatever happens while suspended. Also, it is good practice to always do this.	2015-01-16 06:19:52 +00:00
Warner Losh	53d673996b	Move the suspsned and resume functions to the bus attachment. They were accessing PCI config registers, which won't work for the ISA version.	2015-01-16 06:19:39 +00:00
Warner Losh	47a66ea835	Suspend and resume were the only two functions not to follow the brdev convention here, so fix that.	2015-01-16 06:19:24 +00:00
Warner Losh	b45c7d14d0	Back out the refactor. It turns out to cause interrupt storms on resume sometimes (but not others). On powerup, other wierd issues show up (sometimes the card comes up, but with really bogus pci config space stuff. There may be more, but given my experience of historical fussiness, stick to what works and make more minimal changes to that.	2015-01-16 06:19:08 +00:00
Warner Losh	1acffcc470	Add back a couple PC Card devices to amd64. There's only a couple of them that were popular enough, so this doesn't adversly affect build times.	2015-01-16 06:19:05 +00:00
Navdeep Parhar	b7d3aa1130	Build cxgbe(4) on powerpc64 too.	2015-01-16 03:39:38 +00:00
Navdeep Parhar	24e2fa2b4d	Use parentheses instead of close proximity to ensure layer + 1 is evaluated before the rest of the expression.	2015-01-16 02:20:24 +00:00
Adrian Chadd	3b48f36ef6	Check the right value correctly. Thanks to clang for pointing out this silliness.	2015-01-16 01:52:26 +00:00
Navdeep Parhar	ca7fe84a61	Plug cxgbe(4) back into !powerpc && !arm builds, instead of building it on amd64 only.	2015-01-16 01:39:24 +00:00
Navdeep Parhar	88d7f6bddf	Allow cxgbe(4) to be built on i386. Driver attach will succeed only on a subset of i386 systems.	2015-01-16 01:32:40 +00:00
Navdeep Parhar	cddd227c5f	Make cxgbe(4) buildable with the gcc in base.	2015-01-16 01:28:28 +00:00
Roger Pau Monné	ca49b3342d	loader: implement multiboot support for Xen Dom0 Implement a subset of the multiboot specification in order to boot Xen and a FreeBSD Dom0 from the FreeBSD bootloader. This multiboot implementation is tailored to boot Xen and FreeBSD Dom0, and it will most surely fail to boot any other multiboot compilant kernel. In order to detect and boot the Xen microkernel, two new file formats are added to the bootloader, multiboot and multiboot_obj. Multiboot support must be tested before regular ELF support, since Xen is a multiboot kernel that also uses ELF. After a multiboot kernel is detected, all the other loaded kernels/modules are parsed by the multiboot_obj format. The layout of the loaded objects in memory is the following; first the Xen kernel is loaded as a 32bit ELF into memory (Xen will switch to long mode by itself), after that the FreeBSD kernel is loaded as a RAW file (Xen will parse and load it using it's internal ELF loader), and finally the metadata and the modules are loaded using the native FreeBSD way. After everything is loaded we jump into Xen's entry point using a small trampoline. The order of the multiboot modules passed to Xen is the following, the first module is the RAW FreeBSD kernel, and the second module is the metadata and the FreeBSD modules. Since Xen will relocate the memory position of the second multiboot module (the one that contains the metadata and native FreeBSD modules), we need to stash the original modulep address inside of the metadata itself in order to recalculate its position once booted. This also means the metadata must come before the loaded modules, so after loading the FreeBSD kernel a portion of memory is reserved in order to place the metadata before booting. In order to tell the loader to boot Xen and then the FreeBSD kernel the following has to be added to the /boot/loader.conf file: xen_cmdline="dom0_mem=1024M dom0_max_vcpus=2 dom0pvh=1 console=com1,vga" xen_kernel="/boot/xen" The first argument contains the command line that will be passed to the Xen kernel, while the second argument is the path to the Xen kernel itself. This can also be done manually from the loader command line, by for example typing the following set of commands: OK unload OK load /boot/xen dom0_mem=1024M dom0_max_vcpus=2 dom0pvh=1 console=com1,vga OK load kernel OK load zfs OK load if_tap OK load ... OK boot Sponsored by: Citrix Systems R&D Reviewed by: jhb Differential Revision: https://reviews.freebsd.org/D517 For the Forth bits: Submitted by: Julien Grall <julien.grall AT citrix.com>	2015-01-15 16:27:20 +00:00
Baptiste Daroussin	6455cdfb29	Sound: fix typos in user visible messages etc. Submitted by: Sascha Wildner <saw@online.de> Obtained from: DragonFly MFC after: 3 days	2015-01-15 16:09:35 +00:00
Hans Petter Selasky	1a26c3c047	Major callout subsystem cleanup and rewrite: - Close a migration race where callout_reset() failed to set the CALLOUT_ACTIVE flag. - Callout callback functions are now allowed to be protected by spinlocks. - Switching the callout CPU number cannot always be done on a per-callout basis. See the updated timeout(9) manual page for more information. - The timeout(9) manual page has been updated to reflect how all the functions inside the callout API are working. The manual page has been made function oriented to make it easier to deduce how each of the functions making up the callout API are working without having to first read the whole manual page. Group all functions into a handful of sections which should give a quick top-level overview when the different functions should be used. - The CALLOUT_SHAREDLOCK flag and its functionality has been removed to reduce the complexity in the callout code and to avoid problems about atomically stopping callouts via callout_stop(). If someone needs it, it can be re-added. From my quick grep there are no CALLOUT_SHAREDLOCK clients in the kernel. - A new callout API function named "callout_drain_async()" has been added. See the updated timeout(9) manual page for a complete description. - Update the callout clients in the "kern/" folder to use the callout API properly, like cv_timedwait(). Previously there was some custom sleepqueue code in the callout subsystem, which has been removed, because we now allow callouts to be protected by spinlocks. This allows us to tear down the callout like done with regular mutexes, and a "td_slpmutex" has been added to "struct thread" to atomically teardown the "td_slpcallout". Further the "TDF_TIMOFAIL" and "SWT_SLEEPQTIMO" states can now be completely removed. Currently they are marked as available and will be cleaned up in a follow up commit. - Bump the __FreeBSD_version to indicate kernel modules need recompilation. - There has been several reports that this patch "seems to squash a serious bug leading to a callout timeout and panic". Kernel build testing: all architectures were built MFC after: 2 weeks Differential Revision: https://reviews.freebsd.org/D1438 Sponsored by: Mellanox Technologies Reviewed by: jhb, adrian, sbruno and emaste	2015-01-15 15:32:30 +00:00
Konstantin Belousov	b53fc49cd4	fcntl F_O{GET,SET}LK take pointer as the arg, handle them properly for compat32. Reported and tested by: Alex Tutubalin <lexa@lexa.ru> Sponsored by: The FreeBSD Foundation MFC after: 1 week	2015-01-15 10:43:58 +00:00
Bjoern A. Zeeb	6333a401ff	Fix cpsw(4) after r277203 which folded 'struct m_hdr' into 'struct mbuf'. While in theory this should have been a transparent change (and was for all other drivers), cpsw(4) never used the proper accessor macros in a few places but spelt the indirect m_hdr.mh_* out itself. Convert those to use m_len and m_data and unbreak the driver build.	2015-01-15 02:22:52 +00:00
Luiz Otavio O Souza	36bcb64f93	Fix the PMIC node name to match the PMIC i2c address. Remove an extra blank line. No functional changes. MFC after: 3 days	2015-01-15 01:20:56 +00:00
Luiz Otavio O Souza	e4b6eaf73a	Fix the C -> K temperature conversion for the dev.cpu.0.temperature sysctl. Previous code was discarding the last digit. Remove the unused temperature conversion macros.	2015-01-15 01:05:05 +00:00
Luiz Otavio O Souza	02a42e683f	Catch a few cases where we need to release memory resources on errors. Place parentheses around variables in macros. MFC after: 3 days	2015-01-15 00:48:53 +00:00
Warner Losh	b03747e983	Reserve and ignore the a new module metadata type MDT_PNP_INFO for associating an optional PNP hint table with this module. In the future, when these are added, these changes will silently ignore the new type they would otherwise warn about. It will always be safe to ignore this data. Get this into the builds today for some future proofing. MFC After: 3 days	2015-01-15 00:46:30 +00:00
Warner Losh	af8cf71035	New MINIMAL kernel config. The goal with this configuration is to only compile in those options in GENERIC that cannot be loaded as modules. ufs is still included because many of its options aren't present in the kernel module. There's some other exceptions documented in the file. This is part of some work to get more things automatically loading in the hopes of obsoleting GENERIC one day.	2015-01-15 00:42:06 +00:00
Robert Watson	3d1a9ed34e	In order to support ongoing work to implement variable-size mbufs, and more generally make it easier to extend 'struct mbuf in the future', make a number of changes to the data structure: - As we anticipate embedding mbufs headers within variable-size regions of memory in the future, change the definitions of byte arrays embedded in mbufs to be of size [0] rather than [MLEN] and [MHLEN]. In fact, the cxgbe driver already uses 'struct mbuf' on the front of other storage sizes, but we would like the global mbuf allocator do be able to do this as well. - Fold 'struct m_hdr' into 'struct mbuf' itself, eliminating a set of macros that aliased 'mh_foo' field names to 'm_foo' names such as 'm_next'. These present a particular problem as we would like to add new mbuf-header fields -- e.g., 'm_size' -- that, if similarly named via macros, would introduce collisions with many other variable names in the kernel. - Rename 'struct m_ext' to 'struct struct_m_ext' so that we can add compile-time assertions without bumping into the still-extant 'm_ext' macro. - Remove the MSIZE compile-time assertion for 'struct mbuf', but add new assertions for alignment of embedded data arrays (64-bit alignment even on 32-bit platforms), and for the sizes the mbuf header, packet header, and m_ext structure. - Document that these assertions exist in comments in mbuf.h. This change is not intended to cause (non-trivial) behavioural differences, but is a precursor to further mbuf-allocator work. Differential Revision: https://reviews.freebsd.org/D1483 Reviewed by: bz, gnn, np, glebius ("go ahead, I trust you") Sponsored by: EMC / Isilon Storage Division	2015-01-14 23:44:00 +00:00
Hans Petter Selasky	d2955419cd	Avoid race with "dev_rel()" when using the recently added "delist_dev()" function. Make sure the character device structure doesn't go away until the end of the "destroy_dev()" function due to concurrently running cleanup code inside "devfs_populate()". MFC after: 1 week Reported by: dchagin@	2015-01-14 22:07:13 +00:00
Alexander Motin	38feff972b	Fix overflow bug from r248577, turning 30s TRIM timeout into ~4s. MFC after: 2 weeks	2015-01-14 16:22:00 +00:00
Hans Petter Selasky	07dbde6777	Add a kernel function to delist our kernel character devices, so that the device name can be re-used right away in case we are destroying the character devices in the background. MFC after: 4 days Reported by: dchagin@	2015-01-14 14:04:29 +00:00
Ed Schouten	c24341d2e3	Make sure CAP_BINDAT and CAP_CONNECTAT are part of CAP_ALL0. This makes sure that file descriptors of opened directories will actually get these capabilities. Without this change, bindat() and connectat() don't seem to work for me. MFC after: 2 weeks Reviewed by: rwatson, pjd	2015-01-14 13:03:03 +00:00
Randall Stewart	d95b3509e1	Update the hwpmc driver to have the new type HASWELL_XEON. Also go back through HASWELL, IVY_BRIDGE, IVY_BRIDGE_XEON and SANDY_BRIDGE to straighten out all the missing PMCs. We also add a new pmc tool pmcstudy, this allows one to run the various formulas from the documents "Using Intel Vtune Amplifier XE on XXX Generation platforms" for IB/SB and Haswell. The tool also allows one to postulate your own formulas with any of the various PMC's. At some point I will enahance this to work with Brendan Gregg's flame-graphs so we can flamegraph various PMC interactions. Note the manual page also needs some work (lots of work) but gnn has committed to help me with that ;-) Reviewed by: gnn MFC after:1 month Sponsored by: Netflix Inc.	2015-01-14 12:46:58 +00:00
Gleb Smirnoff	501b391d56	Clean some dead code.	2015-01-14 12:46:38 +00:00
Edward Tomasz Napierala	01de1a0650	Add devd(8) notifications for creation and destruction of GEOM devices. Differential Revision: https://reviews.freebsd.org/D1211 MFC after: 1 month Sponsored by: The FreeBSD Foundation	2015-01-14 11:15:57 +00:00
Alexander Motin	d4f46a775d	Reimplement TRIM throttling added in r248577. Previous throttling implementation approached problem from the wrong side. It significantly limited useful delaying of TRIM requests and aggregation potential, while not so much controlled TRIM burstiness under heavy load. With this change random 4K write benchmarks (probably the worst case for TRIM) show me IOPS increase by 20%, average latency reduction by 30%, peak TRIM bursts reduction by 3 times and same peak TRIM map size (memory usage). Also the new logic does not force map size down so heavily, really allowing to keep deleted data for 32 TXG or 30 seconds under moderate load. It was practically impossible with old throttling logic, which pushed map down to only 64 segments. Reviewed by: smh MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2015-01-14 09:39:57 +00:00
Neel Natu	07820b4b4c	Fix typo (missing comma). MFC after: 3 days	2015-01-14 07:18:51 +00:00
Justin Hibbits	64b83a0576	Add Altivec/VMX register support to ptrace. MFC after: 2 weeks Relnotes: yes	2015-01-14 07:01:21 +00:00
Warner Losh	8b91d5b008	Various interrelated fixes to make suspend / resume work better. We now can suspend / resume and unload / load cbb and cardbus without errors on my Lenovo T400, which wasn't possible before. Cards suspending and resuming in the CardBus slot not yet tested. o Enable memory cycles to the bridge early (as part of the new cbb_pci_bridge_init). This fixes the Bad VCC errors which were caused by the code accessing the device registers with this cleared. The suspend / resume process clears it. o Refactor suspend / resume into bus specific code (though the ISA code is just stubbed). This isn't strictly necessary, but makes the initializaiton code more uniform and should be more bullet proof in the face of variant behavior among cardbus bridges. o Fixup comments in the power-up sequence to reflect reality. These comments were written for one regime of power-up, but not updated as things were revised. o Add a paranoid small delay (100ms) to cover noisy cards powering down. o Fix some debugging prints to be easier to grep from dmesg. Sponsored by: Netflix	2015-01-14 05:41:33 +00:00
Warner Losh	8bf007e1e6	Add a rather obnoxious warning if you don't have NEW_PCIB defined since it's a total crap shoot if things will work.	2015-01-14 05:41:31 +00:00
Warner Losh	5355ab2008	On x86 force NEW_PCIB, since that's the default. While this option would be picked up for kernel builds, it isn't picked up for old-fashioned builds. Without this option, PCI bus numbers are busted for modules build iteratively.	2015-01-14 05:41:28 +00:00
Jamie Gritton	6a3f277901	Remove the prison flags PR_IP4_DISABLE and PR_IP6_DISABLE, which have been write-only for as long as they've existed.	2015-01-14 04:50:28 +00:00
Jamie Gritton	0e5e396ede	Don't set prison's pr_ip4s or pr_ip6s to -1. PR: 196474 MFC after: 3 days	2015-01-14 03:52:41 +00:00
Nathan Whitehorn	bf27800837	Do not remap Open Firmware mappings covered by the direct map. It's pointless and wastes resources. MFC after: 1 week	2015-01-14 02:18:29 +00:00
Ganbold Tsagaankhuu	d84d6e7716	Correct cpu type, it was rather Cortex A12 R0. Approved by: stas (mentor)	2015-01-14 01:23:50 +00:00
Jack F Vogel	df1d7a71d4	Cleanup some bogus code in the RSS config, and add the include for the rss option file. And bump the version. MFC after: 1 week	2015-01-13 22:13:30 +00:00
Neel Natu	c9c75df48c	'struct vm_exception' was intended to be used only as the collateral for the VM_INJECT_EXCEPTION ioctl. However it morphed into other uses like keeping track pending exceptions for a vcpu. This in turn causes confusion because some fields in 'struct vm_exception' like 'vcpuid' make sense only in the ioctl context. It also makes it harder to add or remove structure fields. Fix this by using 'struct vm_exception' only to communicate information from userspace to vmm.ko when injecting an exception. Also, add a field 'restart_instruction' to 'struct vm_exception'. This field is set to '1' for exceptions where the faulting instruction is restarted after the exception is handled. MFC after: 1 week	2015-01-13 22:00:47 +00:00
Jack F Vogel	6b30e6ae6c	Complete the RX side RSS code: parse the encoded portion of the RX descriptor to determine the correct hash type. MFC after:1 week	2015-01-13 18:56:29 +00:00

1 2 3 4 5 ...

101969 Commits