freebsd-nq

Author	SHA1	Message	Date
Mark Johnston	38e6967f04	Ensure that all eight syscall arguments are available to dtrace_probe(), rather than just the first five. This is done by calling dtrace_probe() through a function pointer, as in illumos. MFC after: 3 weeks	2014-04-14 00:23:18 +00:00
Mark Johnston	0626f3e435	DTrace's pid provider works by inserting breakpoint instructions at probe sites and installing a hook at the kernel's trap handler. The fasttrap code will emulate the overwritten instruction in some common cases, but otherwise copies it out into some scratch space in the traced process' address space and ensures that it's executed after returning from the trap. In Solaris and illumos, this (per-thread) scratch space comes from some reserved space in TLS, accessible via the fs segment register. This approach is somewhat unappealing on FreeBSD since it would require some modifications to rtld and jemalloc (for static TLS) to ensure that TLS is executable, and would thus introduce dependencies on their implementation details. I think it would also be impossible to safely trace static binaries compiled without these modifications. This change implements the functionality in a different way, by having fasttrap map pages into the target process' address space on demand. Each page is divided into 64-byte chunks for use by individual threads, and fasttrap's process descriptor struct has been extended to keep track of any scratch space allocated for the corresponding process. With this change it's possible to trace all libc functions in a program, e.g. with pid$target:libc.so.*::entry {@[probefunc] = count();} Previously this would generally cause the victim process to crash, as tracing memcpy on amd64 requires the functionality described above. Tested by: Prashanth Kumar <pra_udupi@yahoo.co.in> (earlier version) MFC after: 6 weeks	2014-04-14 00:22:42 +00:00
Davide Italiano	2f9e29745c	Fix a panic in zfs_rename(). this is due to a wrong dereference of a vnode when it's not locked and can be (potentially) recycled. 'sdvp' cannot be locked on zfs_rename() entry point because the VFS can't be sure that this scenario is LOR-free (it might violate the parent->child lock acquisition rule). Dereference 'tdvp' instead, which is already locked on entry, and access 'sdvp' fields only when it's safe, i.e. under ZFS_ENTER scope. While at it, remove the usage of VOP_REALVP, as long as this is a NOP on FreeBSD. Discussed with: avg Reviewed by: pjd	2014-04-13 01:15:37 +00:00
Alexander Motin	f6e1dc83c3	Create zvol devices on zfs clone. While big and shiny patch is not ready, it is better to have something. PR: kern/178999 MFC after: 1 week	2014-04-11 11:56:16 +00:00
Alexander Motin	a96fefe042	In addition to r264077, tell GEOM that we do support BIO_DELETE now.	2014-04-06 16:31:28 +00:00
Alexander Motin	537650f54d	Add property and sysctl to control how ZVOLs are exposed to OS. New ZFS property volmode and sysctl vfs.zfs.vol.mode allow switching ZVOL between three modes: geom -- existing fully functional behavior (default); dev -- exposing volumes only as raw disk device file in devfs; none -- not exposing volumes outside ZFS. The "dev" mode is less functional (can't be partitioned, mounted, etc), but it is faster, and in some scenarios with untrusted consumers safer. It can be useful for NAS, VM block storages, etc. The "none" mode may be convenient for backup servers, etc. that don't need direct data access. Due to the way ZVOL is integrated with main ZFS code, those property and sysctl are checked only during pool import and volume creation. MFC after: 1 month Sponsored by: iXsystems, Inc.	2014-04-05 13:01:44 +00:00
Alexander Motin	89e84aead6	MFV r258922: 3580 Want zvols to return volblocksize when queried for physical block size illumos/illumos-gate@a0b60564df It is irrelevant for FreeBSD, just reducing diff.	2014-04-03 20:18:55 +00:00
Alexander Motin	4a03e8b64d	Add BIO_DELETE support to ZVOL. It is an adapted merge from the vendor branch of: 701 UNMAP support for COMSTAR (in part related to ZFS) 2130 zvol DKIOCFREE uses nested DMU transactions	2014-04-03 15:04:32 +00:00
Pedro F. Giffuni	23e4da439c	MFV r258379; 4248 dtrace(1M) should never create DOF with empty probes section 4249 Only probes from the first DTrace object file will be included Illumos Revision: 4a20ab41aadcb81c53e72fc65886e964e9add59 Reference: https://www.illumos.org/issues/4248 https://www.illumos.org/issues/4249 Obtained from: Illumos MFC after: 1 month	2014-04-02 15:32:44 +00:00
Bryan Drewery	44f1c91610	Rename global cnt to vm_cnt to avoid shadowing. To reduce the diff struct pcu.cnt field was not renamed, so PCPU_OP(cnt.field) is still used. pc_cnt and pcpu are also used in kvm(3) and vmstat(8). The goal was to not affect externally used KPI. Bump __FreeBSD_version_ in case some out-of-tree module/code relies on the the global cnt variable. Exp-run revealed no ports using it directly. No objection from: arch@ Sponsored by: EMC / Isilon Storage Division	2014-03-22 10:26:09 +00:00
Robert Watson	4a14441044	Update kernel inclusions of capability.h to use capsicum.h instead; some further refinement is required as some device drivers intended to be portable over FreeBSD versions rely on __FreeBSD_version to decide whether to include capability.h. MFC after: 3 weeks	2014-03-16 10:55:57 +00:00
Alexander Motin	68d17718e0	Report ZVOL block size as GEOM stripesize. MFC after: 2 weeks	2014-03-13 19:26:26 +00:00
Xin LI	8e41e26f65	MFV r262983: 4638 Panic in ZFS via rfs3_setattr()/rfs3_write(): dirtying snapshot! illumos/illumos-gate@2144b121c0	2014-03-11 00:23:50 +00:00
Xin LI	ba680558a0	All callers of static method load_nvlist() in spa.c handles error case, so there is no reason to assert that we won't hit an error. Instead, just return that error to caller and have the upper layer handle it. Obtained from: FreeNAS Reported by: rodrigc Reviewed by: Matthew Ahrens MFC after: 2 weeks	2014-03-02 02:41:33 +00:00
Mark Johnston	b53bfbba65	Expose a few DTrace parameters as sysctls under kern.dtrace and add descriptions for several existing sysctls. PR: 187027 Submitted by: Fedor Indutny <fedor@indutny.com> (original version) MFC after: 2 weeks	2014-03-01 19:06:43 +00:00
Mark Johnston	ae520d3dc4	Fix emulation of call and jmp instructions on i386 and for 32-bit processes on amd64. Submitted by: Prashanth Kumar <pra_udupi@yahoo.co.in> MFC after: 2 weeks	2014-03-01 17:55:20 +00:00
Mark Johnston	ae9f1a185c	4478 dtrace_dof_maxsize is far too small illumos/illumos-gate@d339a29bb4 PR: 187027 MFC after: 1 week	2014-02-28 02:04:41 +00:00
Mark Johnston	c0c943de72	Fix the struct reg mappings for i386 and amd64, which differ between illumos and FreeBSD. Submitted by: Prashanth Kumar <pra_udupi@yahoo.co.in> MFC after: 2 weeks	2014-02-27 01:24:47 +00:00
Mark Johnston	0339a1c2b4	Move some files that are identical on i386 and amd64 to an x86 subdirectory rather than keeping duplicate copies. Discussed with: avg MFC after: 1 week	2014-02-27 01:04:35 +00:00
Mark Johnston	5bcd30f3b1	Revert r262466, as it does not compile on PowerPC. Reported by: jhibbits	2014-02-26 01:00:00 +00:00
Mark Johnston	68ac8d05d3	Make all 8 syscall arguments available to syscall probes in the same way that this is done for SDT probes. This fixes the syscall/tst.args.d test, which was failing because mmap(2)'s sixth argument wasn't available to the probe. MFC after: 2 weeks	2014-02-25 02:58:11 +00:00
Mark Johnston	33db01542c	1452 DTrace buffer autoscaling should be less violent illumos/illumos-gate@6fb4854bed This fixes the tst.resize1.d and tst.resize2.d DTrace tests, which have been failing since r261122 since they were causing dtrace(1) to attempt to allocate and use large amounts of memory, and get killed by the OOM killer as a result. MFC after: 1 month	2014-02-22 05:18:55 +00:00
Mark Johnston	dc0f030e51	Define the KM_NORMALPRI flag for kmem_alloc(), as it is used in some upstream DTrace code. It indicates that the kernel memory allocator need not attempt to satisfy non-blocking allocations in low-memory conditions. This has no direct equivalent in the malloc(9) flags, so it is just defined to 0 for now.	2014-02-22 05:13:35 +00:00
Xin LI	5f62f8cdcb	MFV r261619: 4574 get_clones_stat does not call zap_count in non-debug kernel zap_count(...) is never called in non-DEBUG kernel. As result "count" variable is always 0, and "goto fail" is always reached. This means get_clones_stat function never makes up list of clones for "clones" properties. MFC after: 2 weeks	2014-02-08 05:35:36 +00:00
Xin LI	bea6313e6b	MFV r260834: Fix memory leak of compressed buffers in l2arc_write_done (Illumos #3995).	2014-01-18 01:45:39 +00:00
Andriy Gapon	6d03ca5789	traverse_visitbp: visit DMU_GROUPUSED_OBJECT before DMU_USERUSED_OBJECT This is done to ensure that visited object IDs are always increasing. Also, pass correct object ID to prefetch_dnode_metadata for os_groupused_dnode. Without this change we would hit an assert if traversal was paused on a GROUPUSED object, which is unlikely but possible. Apparently the same change was independently developed by Deplhix. Reviewed by: Matthew Ahrens <mahrens@delphix.com> MFC after: 10 days Sponsored by: HybridCluster	2014-01-17 10:23:46 +00:00
Andriy Gapon	fec721bc43	fix a build problem with INVARIANTS enabled introduced in r260704 Reported by: glebius MFC after: 5 days X-MFC with: r260704	2014-01-16 13:44:37 +00:00
Andriy Gapon	876fa2c17b	fix a bug in ZFS mirror code for handling multiple DVAa The bug was introduced in r256956 "Improve ZFS N-way mirror read performance". The code in vdev_mirror_dva_select erroneously considers already tried DVAs for the next attempt. Thus, it is possible that a failing DVA would be retried forever. As a secondary effect, if the attempts fail with checksum error, then checksum error reports are accumulated until the original request ultimately fails or succeeds. But because retrying is going on indefinitely the cheksum reports accumulation will effectively be a memory leak. Reviewed by: gibbs MFC after: 13 days Sponsored by: HybridCluster	2014-01-16 13:24:10 +00:00
Andriy Gapon	00126789e6	Revert r260705: wrong patch committed by accident An earlier, less efficient version was committed by accident.	2014-01-16 13:20:20 +00:00
Andriy Gapon	19f5e9076b	zfs_deleteextattr: name buffer from namei is needed by zfs_rename If we prematurely free the name buffer and it gets quickly recycled, then zfs_rename may see data from another lookup or even unmapped memory via cn_nameptr. MFC after: 6 days Sponsored by: HybridCluster	2014-01-16 12:31:27 +00:00
Andriy Gapon	2f9a31944f	fix a bug in ZFS mirror code for handling multiple DVAa The bug was introduced in r256956 "Improve ZFS N-way mirror read performance". The code in vdev_mirror_dva_select erroneously considers already tried DVAs for the next attempt. Thus, it is possible that a failing DVA would be retried forever. As a secondary effect, if the attempts fail with checksum error, then checksum error reports are accumulated until the original request ultimately fails or succeeds. But because retrying is going on indefinitely the cheksum reports accumulation will effectively be a memory leak. Reviewed by: gibbs MFC after: 13 days Sponsored by: HybridCluster	2014-01-16 12:26:54 +00:00
Andriy Gapon	b8ca4667ed	zfs: getnewvnode_reserve must be called outside of a zfs transaction Otherwise we could run into the following deadlock. A thread has a transaction open and assigned to a transaction group. That would prevent the transaction group from be quiesced and synced. The thread is blocked in getnewvnode_reserve waiting for a vnode to a be reclaimed. vnlru thread is blocked trying to enter ZFS VOP because a filesystem is suspended by an ongoing rollback or receive operation. In its turn the operation is waiting for the current transaction group to be synced. zfs_zget is always used outside of active transactions, but zfs_mknode is always used in a transaction context. Thus, we hoist getnewvnode_reserve from zfs_mknode to its callers. While there, assert that ZFS always calls getnewvnode while having a vnode reserved. Reported by: adrian Tested by: adrian MFC after: 17 days Sponsored by: HybridCluster	2014-01-16 12:22:46 +00:00
Marcel Moolenaar	642ebd6a18	In atomic_or_8_nv() load 1 and not 8 bytes from the address given. Note that atomic_or_8_nv() is not used at this time.	2014-01-06 05:00:58 +00:00
Alexander Motin	77e2eaf5b8	Fix build after r260234 by converting ddi_get_lbolt64() from inline into a macro. Otherwise compiler complains that hz variable used there either undefined or defined twice, thanks to header mess caused by compat shims.	2014-01-05 19:07:42 +00:00
Alexander Motin	ce05e707c4	In dmu_zfetch_stream_reclaim() replace division with multiplication and move it out of the loop and lock.	2014-01-03 18:44:37 +00:00
Alexander Motin	99e2428636	Remove extra conversion to nanoseconds from ddi_get_lbolt64(). As result this uses one multiplication and shifts instead of one division and two multiplications.	2014-01-03 18:08:31 +00:00
Xin LI	7c88e58f46	MFV r260155: When we encounter an I/O error on a piece of metadata while deleting a file system or zvol, we don't update the bptree_entry_phys_t's bookmark. This would lead to double free of bp's which will lead to space map corruption. Instead of tolerating and allowing the corruption, panic immediately. See Illumos #4390 for more details. 4391 panic system rather than corrupting pool if we hit bug 4390 Illumos/illumos-gate@8b36997aa2 MFC after: 2 weeks	2014-01-02 08:10:35 +00:00
Xin LI	ab0b9f6b30	MFV r260154 + 260182: 4369 implement zfs bookmarks 4368 zfs send filesystems from readonly pools Illumos/illumos-gate@78f1710053 MFC after: 2 weeks	2014-01-02 07:34:36 +00:00
Xin LI	6f2791f53a	Fix build on platforms where atomic_swap_64 is not available.	2014-01-02 03:24:44 +00:00
Xin LI	647795d181	MFV r260153: 4121 vdev_label_init should treat request as succeeded when pool is read only Illumos/illumos-gate@973c78e94b MFC after: 2 weeks	2014-01-01 01:26:39 +00:00
Xin LI	f4c8ba8370	MFV r259170: 4370 avoid transmitting holes during zfs send 4371 DMU code clean up illumos/illumos-gate@43466aae47 NOTE: Make sure the boot code is updated if a zpool upgrade is done on boot zpool. MFC after: 2 weeks	2014-01-01 00:45:28 +00:00
Xin LI	cca1e7c623	MFV r258385: (Note: this change is not applicable to FreeBSD and the file is not included in build. It's integrated for completeness). 4128 disks in zpools never go away when pulled illumos/illumos-gate@39cddb10a3 MFC after: 2 weeks	2013-12-31 21:24:00 +00:00
Xin LI	db2aff5f8b	MFV r242733: 3306 zdb should be able to issue reads in parallel 3321 'zpool reopen' command should be documented in the man page and help message illumos/illumos-gate@31d7e8fa33 FreeBSD porting notes: the kernel part of this changeset depends on Solaris buf(9S) interfaces and are not really applicable for our use. vdev_disk.c is patched as-is to reduce diverge from upstream, but vdev_file.c is left intact. MFC after: 2 weeks	2013-12-31 19:39:15 +00:00
Mark Johnston	b69b2ff588	Allocate the probe ID unrhdr before the DTrace kld_* event handlers are registered. Otherwise there is a small window during which probe IDs may be allocated before the unrhdr is allocated. MFC after: 2 weeks	2013-12-31 15:41:16 +00:00
Mark Johnston	a333376bba	Revert r260091. The vmem calls seem to be slower than the *_unr() calls that they replaced, which is important considering that probe IDs are allocated during process startup for USDT probes.	2013-12-31 15:37:51 +00:00
Mark Johnston	b9c04b396a	Now that vmem(9) is available, use vmem arenas to allocate probe and aggregation IDs, as is done in the upstream illumos code. This still requires some FreeBSD-specific code, as our vmem API is not identical to the one in illumos. Submitted by: Mike Ma <mikemandarine@gmail.com>	2013-12-30 17:37:32 +00:00
Xin LI	1aaa945f67	MFV r258374: 4171 clean up spa_feature_*() interfaces 4172 implement extensible_dataset feature for use by other zpool features illumos/illumos-gate@2acef22db7 MFC after: 2 weeks	2013-12-24 07:14:25 +00:00
Xin LI	ec097c1634	MFV r258373: 4168 ztest assertion failure in dbuf_undirty 4169 verbatim import causes zdb to segfa 4170 zhack leaves pool in ACTIVE state illumos/illumos-gate@7fdd916c47 MFC after: 2 weeks	2013-12-24 06:56:17 +00:00
Justin Hibbits	ad5017b513	Fix a brain-o. I had misread the limit as a size, but it's a pointer. Submitted by: Howard Su MFC after: 2 weeks X-MFC-with: r259668	2013-12-21 00:37:32 +00:00
Justin Hibbits	a76f5d59f4	Fix a couple bugs in FBT PowerPC. Clamp the size to a 'instruction size' not 'byte size', and fix a typo. MFC after: 2 weeks	2013-12-20 23:18:14 +00:00

1 2 3 4 5 ...

989 Commits