freebsd-dev

Author	SHA1	Message	Date
Pawel Jakub Dawidek	7746b6461d	Work-around READDIRPLUS problem with .zfs/ and .zfs/snapshot/ directories by just returning EOPNOTSUPP. This will allow NFS server to fall back to regular READDIR. Note that converting inode number to snapshot's vnode is expensive operation. Snapshots are stored in AVL tree, but based on their names, not inode numbers, so to convert inode to snapshot vnode we have to interate over all snalshots. This is not a problem in OpenSolaris, because in their READDIRPLUS implementation they use VOP_LOOKUP() on d_name, instead of VFS_VGET() on d_fileno as we do. PR: kern/125149 Reported by: Weldon Godfrey <wgodfrey@ena.com> Analysis by: Jaakko Heinonen <jh@saunalahti.fi> MFC after: 3 days	2009-09-13 16:05:20 +00:00
Pawel Jakub Dawidek	7b4a12379b	When zfs.ko is compiled with debug, make sure that znode and vnode point at each other. MFC after: 3 days	2009-09-13 10:33:51 +00:00
Pawel Jakub Dawidek	33a0ef82f2	Extend scope of the z_teardown_lock lock for consistency and "just in case". MFC after: 3 days	2009-09-13 10:29:51 +00:00
Pawel Jakub Dawidek	7dae3c4faf	Be sure not to overflow struct fid. MFC after: 3 days	2009-09-13 10:25:33 +00:00
Pawel Jakub Dawidek	f53901193d	There is a bug where mze_insert() can trigger an assert() of inserting the same entry twice. This bug is not fixed yet, but leads to situation where when try to access corrupted directory the kernel will panic. Until the bug is properly fixed, try to recover from it and log that it happened. Reported by: marck OpenSolaris bug: 6709336 MFC after: 3 days	2009-09-13 10:12:29 +00:00
Pawel Jakub Dawidek	f5516e3d1d	- Protect reclaim with z_teardown_inactive_lock. - Be prepared for dbuf to disappear in zfs_reclaim_complete() and check if z_dbuf field is NULL - this might happen in case of rollback or forced unmount between zfs_freebsd_reclaim() and zfs_reclaim_complete(). - On forced unmount wait for all znodes to be destroyed - destruction can be done asynchronously via zfs_reclaim_complete(). MFC after: 1 week	2009-09-12 19:53:31 +00:00
Pawel Jakub Dawidek	2a8e7dad33	Tighten up the check for race in zfs_zget() - ZTOV(zp) can not only contain NULL, but also can point to dead vnode, take that into account. PR: kern/132068 Reported by: Edward Fisk" <7ogcg7g02@sneakemail.com>, kris Fix based on patch from: Jaakko Heinonen <jh@saunalahti.fi> MFC after: 1 week	2009-09-12 19:27:54 +00:00
Pawel Jakub Dawidek	3770996142	Only log successful commands! Without this fix we log even unsuccessful commands executed by unprivileged users. Action is not really taken, but it is logged to pool history, which might be confusing. Reported by: Denis Ahrens <denis@h3q.com> MFC after: 3 days	2009-09-08 16:40:08 +00:00
Pawel Jakub Dawidek	d6b8039292	We don't export individual snapshots, so mnt_export field in snapshot's mount point is NULL. That's why when we try to access snapshots over NFS use mnt_export field from the parent file system. MFC after: 1 week	2009-09-08 15:57:03 +00:00
Pawel Jakub Dawidek	f148fd9a4a	When we automatically mount snapshot we want to return vnode of the mount point from the lookup and not covered vnode. This is one of the fixes for using .zfs/ over NFS. MFC after: 1 week	2009-09-08 15:51:40 +00:00
Pawel Jakub Dawidek	2391003912	On FreeBSD we don't have to look for snapshot's mount point, because fhtovp method is already called with proper mount point. MFC after: 1 week	2009-09-08 15:42:55 +00:00
Pawel Jakub Dawidek	6f8e88e1da	Call ZFS_EXIT() after locking the vnode. MFC after: 1 week	2009-09-08 15:37:01 +00:00
Pawel Jakub Dawidek	1ea3566294	Fix reference count leak for a case where snapshot's mount point is updated. Such situation is not supported. This problem was triggered by something like this: # zpool create tank da0 # zfs snapshot tank@snap # cd /tank/.zfs/snapshot/snap (this will mount the snapshot) # cd # mount -u nosuid /tank/.zfs/snapshot/snap (refcount leak) # zpool export tank cannot export 'tank': pool is busy MFC after: 1 week	2009-09-08 08:54:15 +00:00
Pawel Jakub Dawidek	28e449adf2	If we have to use avl_find(), optimize a bit and use avl_insert() instead of avl_add() (the latter is actually a wrapper around avl_find() + avl_insert()). Fix similar case in the code that is currently commented out.	2009-09-07 21:58:54 +00:00
Pawel Jakub Dawidek	3f6043a57d	When snapshot mount point is busy (for example we are still in it) we will fail to unmount it, but it won't be removed from the tree, so in that case there is no need to reinsert it. This fixes a panic reproducable in the following steps: # zfs create tank/foo # zfs snapshot tank/foo@snap # cd /tank/foo/.zfs/snapshot/snap # umount /tank/foo panic: avl_find() succeeded inside avl_add() Reported by: trasz MFC after: 3 days	2009-09-07 21:46:51 +00:00
Edward Tomasz Napierala	343775c0b4	Enable NFSv4 ACL support in ZFS. Reviewed by: pjd	2009-09-07 19:43:13 +00:00
Pawel Jakub Dawidek	c739b7b22b	Don't recheck ownership on update mount. This will eliminate LOR between vfs_busy() and mount mutex. We check ownership in vfs_domount() anyway. Noticed by: kib Reviewed by: kib MFC after: 1 week	2009-09-07 18:54:55 +00:00
Edward Tomasz Napierala	900b1670c4	Prevent the line from wrapping.	2009-09-07 16:56:41 +00:00
Pawel Jakub Dawidek	841bcfea21	Changing provider size is not really supported by GEOM, but doing so when provider is closed should be ok. When administrator requests to change ZVOL size do it immediately if ZVOL is closed or do it on last ZVOL close. PR: kern/136942 Requested by: Bernard Buri <bsd@ask-us.at> MFC after: 1 week	2009-09-07 14:16:50 +00:00
Pawel Jakub Dawidek	5e65224daf	bzero() on-stack argument, so mutex_init() won't misinterpret that the lock is already initialized if we have some garbage on the stack. PR: kern/135480 Reported by: Emil Mikulic <emikulic@gmail.com> MFC after: 3 days	2009-09-07 11:38:43 +00:00
Edward Tomasz Napierala	a41422a93e	Improve wording. Discussed with: pjd, cperciva, rink, wkoszek and des, in order of appearance.	2009-09-05 15:08:58 +00:00
Pawel Jakub Dawidek	26d0605727	Backport the 'dirtying dbuf' panic fix from newer ZFS version. Reported by: Thomas Backman <serenity@exscape.org> MFC after: 1 week	2009-08-31 16:27:00 +00:00
Pawel Jakub Dawidek	575c1d371c	Add missing mountpoint vnode locking. This fixes panic on assertion with DEBUG_VFS_LOCKS and vfs.usermount=1 when regular user tries to mount dataset owned by him. MFC after: 1 week	2009-08-30 21:03:40 +00:00
Pawel Jakub Dawidek	5d5535163a	- Hide ZFS kernel threads under zfskern process. - Use better (shorter) threads names: 'zvol:worker zvol/tank/vol00' -> 'zvol tank/vol00' 'vdev:worker da0' -> 'vdev da0'	2009-08-23 11:33:46 +00:00
Pawel Jakub Dawidek	4ec2b0e7ce	Set priority of vdev_geom threads and zvol threads to PRIBIO.	2009-08-23 11:27:08 +00:00
Pawel Jakub Dawidek	8e9fd65fbf	getcwd() (when __getcwd() fails) works by stating current directory, going up (..), calling readdir and looking for previous directory inode. In case of .zfs/ directory this doesn't work, because .zfs/ is hidden by default, so it won't be visible in readdir output. Fix this by implementing VPTOCNP for snapshot directories, so __getcwd() doesn't fail and getcwd() doesn't have to use readdir method. This fixes /bin/pwd from within .zfs/snapshot/<name>/. Suggested by: kib Approved by: re (rwatson)	2009-08-17 10:00:18 +00:00
Pawel Jakub Dawidek	8461b0f043	Manage asynchronous vnode release just like Solaris. Discussed with: kmacy Approved by: re (kib)	2009-08-17 09:48:34 +00:00
Pawel Jakub Dawidek	e35eb914f4	- Reduce z_teardown_lock lock scope a bit. - The error variable is int, not bool. - Convert spaces to tabs where needed. Approved by: re (kib)	2009-08-17 09:28:15 +00:00
Pawel Jakub Dawidek	0330a5dc10	If z_buf is NULL, we should free znode immediately. Noticed by: avg Approved by: re (kib)	2009-08-17 09:25:37 +00:00
Pawel Jakub Dawidek	d83cfc37a4	- We need to recycle vnode instead of freeing znode. Submitted by: avg - Add missing vnode interlock unlock. - Remove redundant znode locking. Approved by: re (kib)	2009-08-17 09:21:39 +00:00
Pawel Jakub Dawidek	f820bc079f	Fix panic in zfs recv code. The last vnode (mountpoint's vnode) can have 0 usecount. Reported by: Thomas Backman <serenity@exscape.org> Approved by: re (kib)	2009-08-17 09:13:22 +00:00
Pawel Jakub Dawidek	159ef108e1	Remove OpenSolaris taskq port (it performs very poorly in our kernel) and replace it with wrappers around our taskqueue(9). To make it possible implement taskqueue_member() function which returns 1 if the given thread was created by the given taskqueue. Approved by: re (kib)	2009-08-17 09:01:20 +00:00
Pawel Jakub Dawidek	fddc954016	- Fix a race where /dev/zfs control device is created before ZFS is fully initialized. Also destroy /dev/zfs before doing other deinitializations. - Initialization through taskq is no longer needed and there is a race where one of the zpool/zfs command loads zfs.ko and tries to do some work immediately, but /dev/zfs is not there yet. Reported by: pav Approved by: re (kib)	2009-08-17 08:36:41 +00:00
Pawel Jakub Dawidek	830940567b	Remove files that are no longer used. Discussed with: kmacy Approved by: re (kib)	2009-08-17 08:03:02 +00:00
Marcel Moolenaar	538e86713d	Fix misalignment in nvpair_native_embedded() caused by the compiler replacing the bzero(). See also revision 195627, which fixed the misalignment in nvpair_native_embedded_array(). Approved by: re (kensmith)	2009-08-16 01:48:46 +00:00
Pawel Jakub Dawidek	abd8353f5d	We don't support ephemeral IDs in FreeBSD and without this fix ZFS can panic when in zfs_fuid_create_cred() when userid is negative. It is converted to unsigned value which makes IS_EPHEMERAL() macro to incorrectly report that this is ephemeral ID. The most reasonable solution for now is to always report that the given ID is not ephemeral. PR: kern/132337 Submitted by: Matthew West <freebsd@r.zeeb.org> Tested by: Thomas Backman <serenity@exscape.org>, Michael Reifenberger <mike@reifenberger.com> Approved by: re (kib) MFC after: 2 weeks	2009-07-27 14:52:34 +00:00
Edward Tomasz Napierala	d2ceff236a	Fix extattr_list_file(2) on ZFS in case the attribute directory doesn't exist and user doesn't have write access to the file. Without this fix, it returns bogus value instead of 0. For some reason this didn't manifest on my kernel compiled with -O0. PR: kern/136601 Submitted by: Jaakko Heinonen <jh at saunalahti dot fi> Approved by: re (kib)	2009-07-22 15:15:58 +00:00
Edward Tomasz Napierala	65588fd503	Fix permission handling for extended attributes in ZFS. Without this change, ZFS uses SunOS Alternate Data Streams semantics - each EA has its own permissions, which are set at EA creation time and - unlike SunOS - invisible to the user and impossible to change. From the user point of view, it's just broken: sometimes access is granted when it shouldn't be, sometimes it's denied when it shouldn't be. This patch makes it behave just like UFS, i.e. depend on current file permissions. Also, it fixes returned error codes (ENOATTR instead of ENOENT) and makes listextattr(2) return 0 instead of EPERM where there is no EA directory (i.e. the file never had any EA). Reviewed by: pjd (idea, not actual code) Approved by: re (kib)	2009-07-20 19:16:42 +00:00
Marcel Moolenaar	dd8a461e83	In nvpair_native_embedded_array(), meaningless pointers are zeroed. The programmer was aware that alignment was not guaranteed in the packed structure and used bzero() to NULL out the pointers. However, on ia64, the compiler is quite agressive in finding ILP and calls to bzero() are often replaced by simple assignments (i.e. stores). Especially when the width or size in question corresponds with a store instruction (i.e. st1, st2, st4 or st8). The problem here is not a compiler bug. The address of the memory to zero-out was given by '&packed->nvl_priv' and given the type of the 'packed' pointer the compiler could assume proper alignment for the replacement of bzero() with an 8-byte wide store to be valid. The problem is with the programmer. The programmer knew that the address did not have the alignment guarantees needed for a regular assignment, but failed to inform the compiler of that fact. In fact, the programmer told the compiler the opposite: alignment is guaranteed. The fix is to avoid using a pointer of type "nvlist_t " and instead use a "char " pointer as the basis for calculating the address. This tells the compiler that only 1-byte alignment can be assumed and the compiler will either keep the bzero() call or instead replace it with a sequence of byte-wise stores. Both are valid. Approved by: re (kib)	2009-07-11 22:43:20 +00:00
Konstantin Belousov	e0c161b89c	Add another flags argument to vn_open_cred. Use it to specify that some vn_open_cred invocations shall not audit namei path. In particular, specify VN_OPEN_NOAUDIT for dotdot lookup performed by default implementation of vop_vptocnp, and for the open done for core file. vn_fullpath is called from the audit code, and vn_open there need to disable audit to avoid infinite recursion. Core file is created on return to user mode, that, in particular, happens during syscall return. The creation of the core file is audited by direct calls, and we do not want to overwrite audit information for syscall. Reported, reviewed and tested by: rwatson	2009-06-21 13:41:32 +00:00
Jamie Gritton	c1f192193d	Rename the host-related prison fields to be the same as the host.* parameters they represent, and the variables they replaced, instead of abbreviated versions of them. Approved by: bz (mentor)	2009-06-13 15:39:12 +00:00
Kip Macy	f0c6b798a3	pjd has requested that I keep the tunable as zfs_prefetch_disable to minimize gratuitous differences with Opensolaris' ZFS Sorry for the churn	2009-06-11 22:24:08 +00:00
Kip Macy	e4e5e663e0	check against prefetch_enable	2009-06-11 09:51:21 +00:00
Kip Macy	3fa5485637	use default policy for enabling prefetching unless the TUNABLE is set	2009-06-10 21:05:37 +00:00
Kip Macy	107b659450	As far as I can tell systems that have less than 4GB are more often hurt by prefetched than helped. On i386 systems and systems with less than 4GB, prefetch is now disabled by default. I've added a prefetch enable tunable, to enable prefetching for those systems. The prefetch disable tunable will continue to unconditionally disable prefetching.	2009-06-10 01:21:32 +00:00
Paul Saab	a6d545d8ed	Support shared vnode locks for write operations when the offset is provided on filesystems that support it. This really improves mysql + innodb performance on ZFS. Reviewed by: jhb, kmacy, jeffr	2009-06-04 16:18:07 +00:00
Doug Rabson	8be608b58c	Allow the bootfs property to be set for raidz pools on FreeBSD. Reviewed by: pjd	2009-05-31 11:59:32 +00:00
Kip Macy	762169b50a	fix xdrmem_control to be safe in an if statement fix zfs to depend on krpc remove xdr from zfs makefile Submitted by: dchagin@freebsd.org	2009-05-30 22:23:58 +00:00
Kip Macy	139ccddec0	work around snapshot shutdown race reported by Henri Hennebert	2009-05-30 19:26:35 +00:00
Kip Macy	c334d2d544	MFdevbranch 192944 - add FreeBSD implementation of xdrmem_control needed by zfs - have zfs define xdr_ops using FreeBSD's definition - remove solaris xdr files from zfs compile	2009-05-28 08:18:12 +00:00

1 2 3 4 5

215 Commits