really belong to it. Such vnodes, which carry pointers to other
vnodes' v_objects, are typically instantiated by bypass filesystems.
Invalidating the other vnode's page mappings and freeing those pages
is wrong, since reclamation of the upper vnode does not imply that
the lower vnode is reclaimed too.
One consequence of the improper reclamation was the destruction of
wired mappings of the lower vnode's pages, triggering miscellaneous
assertions in the VM system.
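A minimal sketch of the ownership check this implies (the function
and the exact test are illustrative, not the committed diff): only
invalidate pages when the vnode actually owns its v_object.

    /*
     * Hypothetical guard: skip page cleanup when the v_object is
     * borrowed from another (lower) vnode, as bypass filesystems do.
     */
    static void
    vnode_release_pages(struct vnode *vp)
    {
        vm_object_t obj = vp->v_object;

        if (obj == NULL)
            return;
        /* The object is ours only if its handle points back at us. */
        if (obj->handle != vp)
            return;    /* lower vnode's object; leave it alone */
        vm_object_page_remove(obj, 0, 0, 0);
    }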
Reported by: John Marshall <john.marshall@riverwillow.com.au>
Tested by: John Marshall <john.marshall@riverwillow.com.au>, pho
Sponsored by: The FreeBSD Foundation
MFC after: 1 week
Approved by: re (gjb)
allocated, but the old table is kept around to handle the case of
threads still performing unlocked accesses to it.
Grow the table exponentially instead of increasing its size in
sizeof(long) * 8 chunks on overflow. This significantly reduces the
total memory use for processes that consume large numbers of file
descriptors by opening them one at a time.
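A minimal sketch of the sizing policy (the helper name is
hypothetical; NDFILE stands in for the initial table size): doubling
means N single opens trigger O(log N) reallocations instead of O(N).

    /* Hypothetical: round the requested size up to a power of two. */
    static int
    fdtable_newsize(int wanted)
    {
        int n;

        for (n = NDFILE; n < wanted; n <<= 1)
            continue;    /* double until the request fits */
        return (n);
    }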
Reported and tested by: pho
Sponsored by: The FreeBSD Foundation
MFC after: 1 week
Approved by: re (marius)
- This version adds support for the new Intel Avoton systems,
including 2.5Gb support, and it now has IPv6/TSO6 support as well.
Shared code has been updated where necessary. Thanks to my new
assistant Eric Joyner for doing the transmit path changes to bring
in the IPv6/TSO6 support, and thanks to Gleb for catching the one
bug and change needed in NETMAP.
Approved by: re
registered simultaneously. Because the topology lock is dropped
between the ID allocation and the bus registration, there is a
chance that two buses may get the same ID. That is the suspected
cause of the lock assertion panic in CAM during initial bus scanning
after the new iscsid initiates two sessions at the same time.
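A minimal sketch of the race and its fix (lock and function names
are purely illustrative, not the CAM code): allocating the ID and
publishing the bus under a single lock hold makes the pair atomic.

    static void
    bus_attach_fixed(struct bus *bus)
    {
        mtx_lock(&topo_lock);       /* hypothetical lock */
        bus->id = next_bus_id++;    /* allocate ... */
        register_bus(bus);          /* ... and publish atomically */
        mtx_unlock(&topo_lock);
        /*
         * The buggy pattern dropped topo_lock between the two
         * steps, letting a second bus pick the same ID.
         */
    }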
Reported by: trasz
Approved by: re (glebius, marius)
MFC after: 2 weeks
sbdrop_locked() to cut acked mbufs from the socket buffer. Free this
chain in a batched manner after the socket buffer lock is dropped.
This measurably reduces contention on the socket buffer.
Sponsored by: Netflix
Sponsored by: Nginx, Inc.
Approved by: re (marius)
mbufs, but return the chain of free mbufs to the caller. The caller
can either reuse them or return them to the allocator in a batch.
- Implement sbdrop()/sbdrop_locked() as wrappers around
  sbcut_internal() (see the sketch below).
- Expose sbcut_locked() for outside usage.
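A minimal sketch of the wrapper pattern, assuming the signature
struct mbuf *sbcut_internal(struct sockbuf *, int); the bodies are
illustrative:

    /* Drop data and free the chain immediately, as before. */
    void
    sbdrop_locked(struct sockbuf *sb, int len)
    {
        SOCKBUF_LOCK_ASSERT(sb);
        m_freem(sbcut_internal(sb, len));
    }

    /* Hand the cut chain back so the caller can free it unlocked. */
    struct mbuf *
    sbcut_locked(struct sockbuf *sb, int len)
    {
        SOCKBUF_LOCK_ASSERT(sb);
        return (sbcut_internal(sb, len));
    }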
Sponsored by: Netflix
Sponsored by: Nginx, Inc.
Approved by: re (marius)
* the mips74k cores only need EHB (which is 'sll $0, $0, 3')
here; NOPs don't actually work.
* add EHB as the last NOP for the default barriers/hazards;
that is "better" behaviour and should work on a wider
variety of processors.
This allows the existing (icky) TLB code to work, allowing
the AR9344 SoC (mips74k) to actually get through kernel startup.
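A minimal sketch of such a barrier (macro names are illustrative,
not the committed ones); 'sll $0, $0, 3' encodes EHB on mips32r2
and degrades to a plain NOP on earlier cores:

    /* Hypothetical hazard-barrier macros. */
    #define MIPS_NOP()  __asm __volatile("nop")
    #define MIPS_EHB()  __asm __volatile("sll $0, $0, 3")

    #define MIPS_HAZARD_BARRIER() do {       \
        MIPS_NOP();                          \
        MIPS_NOP();                          \
        MIPS_EHB();  /* last slot is EHB */  \
    } while (0)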
Tested:
* AR9344 SoC - (mips74k)
* AR9331 SoC - (mips24k)
TODO:
* test on mips4k CPUs, just to be sure.
* document that sll $0, $0, 3 is actually "EHB" and that it
falls back to being a NOP on pre-mips32r2 cores.
* mips24k has an erratum that we currently don't explicitly
state - ie, that after DERET/ERET, the only valid instruction
is a NOP.
Reviewed by: imp@
Approved by: re@ (gjb)
They are two different cores:
* mips24k is an 8-stage pipeline, mips32r1 ABI, non-superscalar core.
* mips74k is a dual-issue 15-stage superscalar design, mips32r2 ABI.
They have different sets of quirks and bugs; these #define entries
will be used to work around them.
Now, strictly speaking, we should have CPU ABI families (mips32r1,
mips32r2, etc.) and CPU core types (mips4k, mips24k, mips74k, etc.),
but this is the starting point of that particular tidy-up.
Reviewed by: imp@
Approved by: re@ (gjb)
extensions, we can change the .byte directives in sys/dev/random/ivy.c
to plain 'rdrand' mnemonics. This already worked for clang users, but
now it will also work for gcc users.
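For illustration (the operand choice is arbitrary, and ivy.c's real
code differs): the same instruction spelled as raw bytes for old
assemblers and as the plain mnemonic newer binutils accepts.

    #include <stdint.h>

    /* Old form: hand-encoded 'rdrand %rax' as raw bytes. */
    static inline int
    rdrand_bytes(uint64_t *out)
    {
        uint8_t ok;

        __asm __volatile(".byte 0x48, 0x0f, 0xc7, 0xf0; setc %1"
            : "=a" (*out), "=qm" (ok));
        return (ok);    /* CF=1 means a random value was returned */
    }

    /* New form: the mnemonic, now accepted by clang and gcc alike. */
    static inline int
    rdrand_mnemonic(uint64_t *out)
    {
        uint8_t ok;

        __asm __volatile("rdrand %0; setc %1"
            : "=r" (*out), "=qm" (ok));
        return (ok);
    }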
Approved by: re (kib)
Approved by: so (des)
MFC after: 1 week
they occur.
This prevents repeated notifications of the same event.
The status of these events may be checked at any time by viewing the
SMART/Health Info Page using nvmecontrol, whether or not asynchronous
event notifications for those events are enabled. This log page can
be viewed using:
nvmecontrol logpage -p 2 <ctrlr id>
Future enhancements may re-enable these notifications on a periodic basis
so that if the notified condition persists, it will continue to be logged.
Sponsored by: Intel
Reviewed by: carl
Approved by: re (hrs)
MFC after: 1 week
when calculating stats in nvmecontrol perftest.
Sponsored by: Intel
Reported by: Joe Golio <joseph.golio@emc.com>
Reviewed by: carl
Approved by: re (hrs)
MFC after: 1 week
handlers rather than in the dtrace device open/close methods. The
current approach can cause a panic if the device is closed while the
taskqueue thread is active, or if a kernel module containing a
provider is unloaded while retained enablings are present and the
dtrace device isn't open.
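A minimal sketch of draining from a module event handler instead
(the task name is hypothetical); taskqueue_drain() blocks until a
queued task has finished running:

    #include <sys/param.h>
    #include <sys/kernel.h>
    #include <sys/module.h>
    #include <sys/systm.h>
    #include <sys/taskqueue.h>

    static struct task dtrace_retained_task;    /* hypothetical */

    static int
    dtrace_modevent(module_t mod, int type, void *arg)
    {
        switch (type) {
        case MOD_UNLOAD:
            /* Wait for any in-flight task before teardown. */
            taskqueue_drain(taskqueue_thread, &dtrace_retained_task);
            return (0);
        default:
            return (EOPNOTSUPP);
        }
    }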
Submitted by: gibbs (original version)
Reviewed by: gibbs
Approved by: re (glebius)
MFC after: 2 weeks
The AR5212 series of MACs implements the same channel counters as the
later 11n chips - except, of course, the 11n-specific counter
(extension channel busy).
This allows users of these NICs to use 'athsurvey' to see how busy their
current channel is.
Tested:
* AR5212, AR2413 NICs, STA mode
Approved by: re@ (gleb)
field. Perform vcpu enumeration for Xen PV and HVM environments
and convert all Xen drivers to use vcpu_id instead of a hard-coded
assumption about the mapping algorithm (ACPI or APIC ID) in use.
Submitted by: Roger Pau Monné
Sponsored by: Citrix Systems R&D
Reviewed by: gibbs
Approved by: re (blanket Xen)
amd64/include/pcpu.h:
i386/include/pcpu.h:
    Add vcpu_id to the amd64 and i386 pcpu structures.
dev/xen/timer/timer.c:
x86/xen/xen_intr.c:
    Use the new vcpu_id instead of assuming acpi_id == vcpu_id.
i386/xen/mp_machdep.c:
i386/xen/mptable.c:
x86/xen/hvm.c:
    Perform Xen HVM and Xen full PV vcpu_id mapping.
x86/xen/hvm.c:
x86/acpica/madt.c:
    Change the SYSINIT ordering of acpi CPU enumeration so that it
    is guaranteed to be available at the time of Xen HVM vcpu id
    mapping.
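A minimal sketch of the consumer-side change (the function is
illustrative): drivers read the enumerated ID from the pcpu field
instead of deriving it from ACPI or APIC IDs.

    #include <sys/param.h>
    #include <sys/pcpu.h>

    /* Before: assumed the hypervisor vcpu ID equals the ACPI ID. */
    /*     vcpu = PCPU_GET(acpi_id);    */

    /* After: use the explicitly enumerated Xen vcpu ID. */
    static int
    xen_current_vcpu(void)
    {
        return (PCPU_GET(vcpu_id));
    }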
Make the amd64/pmap code aware of nested page table mappings used by bhyve
guests. This allows bhyve to associate each guest with its own vmspace and
deal with nested page faults in the context of that vmspace. This also
enables features like accessed/dirty bit tracking, swapping to disk and
transparent superpage promotions of guest memory.
Guest vmspace:
Each bhyve guest has a unique vmspace to represent the physical memory
allocated to the guest. Each memory segment allocated by the guest is
mapped into the guest's address space via the 'vmspace->vm_map' and is
backed by an object of type OBJT_DEFAULT.
pmap types:
The amd64/pmap now understands two types of pmaps: PT_X86 and PT_EPT.
The PT_X86 pmap type is used by the vmspace associated with the host kernel
as well as user processes executing on the host. The PT_EPT pmap is used by
the vmspace associated with a bhyve guest.
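A minimal sketch of how the pmap can carry its flavor (the enum
follows the commit's naming; the struct is a stand-in for the real
struct pmap):

    enum pmap_type {
        PT_X86,    /* regular x86 page tables */
        PT_EPT,    /* Intel EPT, used for bhyve guests */
    };

    struct pmap_sketch {
        enum pmap_type  pm_type;    /* selects the PTE bit layout */
        int             pm_flags;   /* e.g. superpage capability */
        /* ... the real struct pmap carries much more ... */
    };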
Page Table Entries:
The EPT page table entries are mostly similar in functionality to
regular page table entries, although there are some differences in
which bits are used to express that functionality. For example, the
dirty bit is represented by bit 9 in the nested PTE as opposed to
bit 6 in the regular x86 PTE. Therefore the bitmask representing the
dirty bit is now computed at runtime based on the type of the pmap.
Thus PG_M, which was previously a macro, now becomes a local
variable that is initialized at runtime using
'pmap_modified_bit(pmap)'.
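A minimal sketch of that selector (the mask names are assumptions;
the A/D-emulation case described below would add another branch):

    #define X86_PG_M  (1UL << 6)    /* dirty bit, regular x86 PTE */
    #define EPT_PG_M  (1UL << 9)    /* dirty bit, nested PTE */

    static uint64_t
    pmap_modified_bit(pmap_t pmap)
    {
        switch (pmap->pm_type) {
        case PT_X86:
            return (X86_PG_M);
        case PT_EPT:
            return (EPT_PG_M);
        default:
            panic("pmap_modified_bit: invalid pm_type %d",
                pmap->pm_type);
        }
    }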
An additional wrinkle associated with EPT mappings is that older Intel
processors don't have hardware support for tracking accessed/dirty bits in
the PTE. This means that the amd64/pmap code needs to emulate these bits to
provide proper accounting to the VM subsystem. This is achieved by using
the following mapping for EPT entries that need emulation of A/D bits:
Bit     Position   Interpreted By
PG_V    52         software (accessed bit emulation handler)
PG_RW   53         software (dirty bit emulation handler)
PG_A    0          hardware (aka EPT_PG_RD)
PG_M    1          hardware (aka EPT_PG_WR)
The idea to use the mapping listed above for A/D bit emulation came from
Alan Cox (alc@).
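Expressed as masks (names follow the 'aka' column above; the EMUL
names are assumptions), the table reads:

    #define EPT_PG_RD      (1UL << 0)   /* hw read; doubles as PG_A */
    #define EPT_PG_WR      (1UL << 1)   /* hw write; doubles as PG_M */
    #define EPT_PG_EMUL_V  (1UL << 52)  /* sw valid; faults as access */
    #define EPT_PG_EMUL_RW (1UL << 53)  /* sw rw; faults as dirty */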
The final difference with respect to x86 PTEs is that some EPT implementations
do not support superpage mappings. This is recorded in the 'pm_flags' field
of the pmap.
TLB invalidation:
The amd64/pmap code has a number of ways to do invalidation of mappings
that may be cached in the TLB: single page, multiple pages in a range or the
entire TLB. All of these funnel into a single EPT invalidation routine called
'pmap_invalidate_ept()'. This routine bumps up the EPT generation number and
sends an IPI to the host cpus that are executing the guest's vcpus. On a
subsequent entry into the guest it will detect that the EPT has changed and
invalidate the mappings from the TLB.
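A minimal sketch of that scheme (pm_eptgen and pm_active follow the
description above; the IPI choice is illustrative):

    /* Illustrative: invalidate guest TLBs via a generation bump. */
    static void
    pmap_invalidate_ept(pmap_t pmap)
    {
        /* New generation: vcpus notice a stale count on VM entry. */
        atomic_add_long(&pmap->pm_eptgen, 1);
        /* Kick host cpus currently running this guest's vcpus. */
        ipi_selected(pmap->pm_active, IPI_AST);
    }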
Guest memory access:
Since the guest memory is no longer wired we need to hold the host physical
page that backs the guest physical page before we can access it. The helper
functions 'vm_gpa_hold()/vm_gpa_release()' are available for this purpose.
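A hypothetical use of that pair; the exact signatures are not shown
in the log, so the shapes below are assumptions:

    /*
     * Assumed shape: vm_gpa_hold() holds the backing host page and
     * returns a host mapping; vm_gpa_release() undoes the hold.
     */
    static int
    read_guest_u64(struct vm *vm, vm_paddr_t gpa, uint64_t *val)
    {
        void *cookie, *p;

        p = vm_gpa_hold(vm, gpa, sizeof(*val), VM_PROT_READ, &cookie);
        if (p == NULL)
            return (EFAULT);
        *val = *(uint64_t *)p;     /* safe while the hold is in place */
        vm_gpa_release(cookie);    /* drop the hold */
        return (0);
    }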
PCI passthru:
Guests with PCI passthru devices will wire the entire guest physical
address space. The MMIO BAR associated with the passthru device is
backed by a vm_object of type OBJT_SG. An IOMMU domain is created
only for guests that have one or more PCI passthru devices attached
to them.
Limitations:
There isn't a way to map a guest physical page without execute permissions.
This is because the amd64/pmap code interprets the guest physical mappings as
user mappings, since they are numerically below VM_MAXUSER_ADDRESS.
Since PG_U shares the same bit position as EPT_PG_EXECUTE, all guest
mappings become automatically executable.
Thanks to Alan Cox and Konstantin Belousov for their rigorous code reviews
as well as their support and encouragement.
Thanks to John Baldwin for reviewing the use of OBJT_SG as the
backing object for PCI passthru MMIO regions.
Special thanks to Peter Holm for testing the patch on short notice.
Approved by: re
Discussed with: grehan
Reviewed by: alc, kib
Tested by: pho
aware drivers on Xen hypervisors that advertise support for some
HyperV features.
x86/xen/hvm.c:
    When running in HVM mode on a Xen hypervisor, set vm_guest
    to VM_GUEST_XEN so other virtualization-aware components in
    the FreeBSD kernel can detect this mode is active.
dev/hyperv/vmbus/hv_hv.c:
    Use vm_guest to ignore Xen's HyperV emulation when Xen is
    detected and Xen PV drivers are active.
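A minimal sketch of the detection guard (vm_guest and VM_GUEST_XEN
are the kernel's own symbols; the function is illustrative):

    #include <sys/param.h>
    #include <sys/systm.h>

    static int
    hyperv_identify(void)
    {
        /* Xen emulates some HyperV leaves; prefer native Xen PV. */
        if (vm_guest == VM_GUEST_XEN)
            return (ENXIO);
        /* ... continue with genuine HyperV detection ... */
        return (0);
    }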
Reported by: Shanker Balan
Submitted by: Roger Pau Monné
Sponsored by: Citrix Systems R&D
Reviewed by: gibbs
Approved by: re (Xen blanket)
Device-level sysctls are already exposed as dev.ix.<device>.
This fixes the case where the number of queues for igb is auto-tuned
and hw.igb.num_queues does not return the current/updated value.
Reviewed by: jfv
Approved by: re (delphij)
MFC after: 2 weeks
kmeminit() runs, so it contributes nothing to 'vm_kmem_size'; update a
comment to reflect that r254025 replaced the kmem submap with the kmem
arena.
Reviewed by: kib
Approved by: re (gjb)
Sponsored by: EMC / Isilon Storage Division