freebsd-nq

Author	SHA1	Message	Date
Alan Somers	427d205cb5	fusefs: remove superfluous counter_u64_zero Reported by: glebius Sponsored by: The FreeBSD Foundation	2019-08-06 00:50:25 +00:00
Konstantin Belousov	30d49d536b	Try to decrease the number of bugs in unionfs after the VV_TEXT flag removal. - Provide unionfs_add_writecount() which passes the writecount to the lower or upper vnode as appropriate. - In unionfs VOP_RECLAIM() implementation, annul unionfs writecounts from upper or lower vnode. It is not clear that it is always correct to remove all the references from either lower or upper vnode, but we currently do not track which vnode gets how many refs anyway. Reported and tested by: t_uemura@macome.co.jp MFC after: 1 week Sponsored by: The FreeBSD Foundation	2019-08-01 14:40:37 +00:00
Alan Somers	508abc9494	fusefs: fix the build after r350446 fuse needs to include an additional header after r350446 Sponsored by: The FreeBSD Foundation	2019-07-31 21:48:35 +00:00
Alan Somers	58df81b339	MFHead @350426 Sponsored by: The FreeBSD Foundation	2019-07-30 04:17:36 +00:00
Mark Johnston	918988576c	Avoid relying on header pollution from sys/refcount.h. MFC after: 3 days Sponsored by: The FreeBSD Foundation	2019-07-29 20:26:01 +00:00
Alan Somers	669a092af1	fusefs: fix panic when writing with O_DIRECT and using writeback cache When a fusefs file system is mounted using the writeback cache, the cache may still be bypassed by opening a file with O_DIRECT. When writing with O_DIRECT, the cache must be invalidated for the affected portion of the file. Fix some panics caused by inadvertently invalidating too much. Sponsored by: The FreeBSD Foundation	2019-07-28 15:17:32 +00:00
Alan Somers	a63915c2d7	MFHead @r350386 Sponsored by: The FreeBSD Foundation	2019-07-28 04:02:22 +00:00
Alan Somers	ed74f781c9	fusefs: add an intr/nointr mount option FUSE file systems can optionally support interrupting outstanding operations. However, the file system does not identify to the kernel at mount time whether it's capable of doing that. Instead it signals its noncapability by returning ENOSYS to the first FUSE_INTERRUPT operation it receives. That's a problem for reliable signal delivery, because the kernel must choose which thread should get a signal before it knows whether the FUSE server can handle interrupts. The problem is even worse because the FUSE protocol allows a file system to simply ignore all FUSE_INTERRUPT operations. Fix the signal delivery logic by making interruptibility an opt-in mount option. This will require a corresponding change to libfuse, but not to most file systems that link to libfuse. Bump __FreeBSD_version due to the new mount option. Sponsored by: The FreeBSD Foundation	2019-07-18 17:55:13 +00:00
Alan Somers	f05962453e	fusefs: fix another semi-infinite loop bug regarding signal handling fticket_wait_answer would spin if it received an unhandled signal whose default disposition is to terminate. The reason is because msleep(9) would return EINTR even for a masked signal. One reason is when the thread is stopped, which happens for example during sigexit(). Fix this bug by returning immediately if fticket_wait_answer ever gets interrupted a second time, for any reason. Sponsored by: The FreeBSD Foundation	2019-07-18 15:30:00 +00:00
Alan Somers	d26d63a4af	fusefs: multiple interruptibility improvements 1) Don't explicitly not mask SIGKILL. kern_sigprocmask won't allow it to be masked, anyway. 2) Fix an infinite loop bug. If a process received both a maskable signal lower than 9 (like SIGINT) and then received SIGKILL, fticket_wait_answer would spin. msleep would immediately return EINTR, but cursig would return SIGINT, so the sleep would get retried. Fix it by explicitly checking whether SIGKILL has been received. 3) Abandon the sig_isfatal optimization introduced by r346357. That optimization would cause fticket_wait_answer to return immediately, without waiting for a response from the server, if the process were going to exit anyway. However, it's vulnerable to a race: 1) fatal signal is received while fticket_wait_answer is sleeping. 2) fticket_wait_answer sends the FUSE_INTERRUPT operation. 3) fticket_wait_answer determines that the signal was fatal and returns without waiting for a response. 4) Another thread changes the signal to non-fatal. 5) The first thread returns to userspace. Instead of exiting, the process continues. 6) The application receives EINTR, wrongly believes that the operation was successfully interrupted, and restarts it. This could cause problems for non-idempotent operations like FUSE_RENAME. Reported by: kib (the race part) Sponsored by: The FreeBSD Foundation	2019-07-17 22:45:43 +00:00
Alan Somers	07e86257e6	fusefs: fix the build with some NODEBUG kernels systm.h needs to be included before counter.h Sponsored by: The FreeBSD Foundation	2019-07-13 21:41:12 +00:00
Alan Somers	97b0512b23	projects/fuse2: build fixes * Fix the kernel build with gcc by removing a redundant extern declaration * In the tests, fix a printf format specifier that assumed LP64 Sponsored by: The FreeBSD Foundation	2019-07-13 14:42:09 +00:00
Fedor Uporov	6ce04e595a	Add additional check for 'blocks per group' and 'fragments per group' superblock fields. These fields differ only if the bigalloc filesystem feature is turned on. This feature is not supported for now. Reported by: Christopher Krah, Thomas Barabosch, and Jan-Niclas Hilgert of Fraunhofer FKIE Reported as: FS-27-EXT2-12: Denial of Service in openat-0 (vm_fault_hold/ext2_clusteracct) MFC after: 2 weeks	2019-07-07 08:58:02 +00:00
Fedor Uporov	c008656263	Remove ufs fragments logic. The ext2fs fragments are different from ufs fragments. For ext2fs, the fragment size should be equal to or greater than the block size. Values greater than the block size are used only with the bigalloc feature, which is not supported for now. Reported by: Christopher Krah, Thomas Barabosch, and Jan-Niclas Hilgert of Fraunhofer FKIE Reported as: FS-22-EXT2-9: Denial of service in ftruncate-0 (ext2_balloc) MFC after: 2 weeks	2019-07-07 08:56:13 +00:00
Fedor Uporov	590517d05a	Remove unneeded mount point unlock call. Reported by: Christopher Krah, Thomas Barabosch, and Jan-Niclas Hilgert of Fraunhofer FKIE Reported as: FS-11-EXT2-6: Denial Of Service in write-1 (ext2_balloc) MFC after: 2 weeks	2019-07-07 08:53:52 +00:00
Alan Somers	7e1f5432f4	fusefs: don't leak memory of unsent operations on unmount Sponsored by: The FreeBSD Foundation	2019-06-28 18:48:02 +00:00
Alan Somers	8aafc8c389	[skip ci] update copyright headers in fusefs files Sponsored by: The FreeBSD Foundation	2019-06-28 04:18:10 +00:00
Alan Somers	7f49ce7a0b	MFHead @349476 Sponsored by: The FreeBSD Foundation	2019-06-27 23:50:54 +00:00
Alan Somers	c1afff113c	fusefs: fix a memory leak regarding FUSE_INTERRUPT We were leaking the fuse ticket if the original operation completed before the daemon received the INTERRUPT operation. Fixing this was easier than I expected. Sponsored by: The FreeBSD Foundation	2019-06-27 22:24:56 +00:00
Alan Somers	435ecf40bb	fusefs: recycle vnodes after their last unlink Previously fusefs would never recycle vnodes. After VOP_INACTIVE, they'd linger around until unmount or the vnlru reclaimed them. This commit essentially activates and inlines the old reclaim_revoked sysctl, and fixes some issues dealing with the attribute cache and multiply linked files. Sponsored by: The FreeBSD Foundation	2019-06-27 20:18:12 +00:00
Alan Somers	38c8634635	fusefs: counter(9) variables should not be statically initialized Reported by: rpokala Sponsored by: The FreeBSD Foundation	2019-06-27 17:59:15 +00:00
Alan Somers	560a55d094	fusefs: convert statistical sysctls to use counter(9) counter(9) is more performant than using atomic instructions to update sysctls that just report statistics to userland. Sponsored by: The FreeBSD Foundation	2019-06-27 16:30:25 +00:00
Alan Somers	caeea8b4cc	fusefs: fix some memory leaks Fix memory leaks relating to FUSE_BMAP and FUSE_CREATE. There are still leaks relating to FUSE_INTERRUPT, but they'll be harder to fix since the server is legally allowed to never respond to a FUSE_INTERRUPT operation. Sponsored by: The FreeBSD Foundation	2019-06-27 00:00:48 +00:00
Alan Somers	f8ebf1cd7e	fusefs: implement protocol 7.23's FUSE_WRITEBACK_CACHE option As of protocol 7.23, fuse file systems can specify their cache behavior on a per-mountpoint basis. If they set FUSE_WRITEBACK_CACHE in fuse_init_out.flags, then they'll get the writeback cache. If not, then they'll get the writethrough cache. If they set FOPEN_DIRECT_IO in every FUSE_OPEN response, then they'll get no cache at all. The old vfs.fusefs.data_cache_mode sysctl is ignored for servers that use protocol 7.23 or later. However, it's retained for older servers, especially for those running in jails that lack access to the new protocol. This commit also fixes two other minor test bugs: * WriteCluster:SetUp was using an uninitialized variable. * Read.direct_io_pread wasn't verifying that the cache was actually bypassed. Sponsored by: The FreeBSD Foundation	2019-06-26 17:32:31 +00:00
Alan Somers	205696a17d	fusefs: delete some unused mount options The fusefs kernel module allegedly supported no_attrcache, no_readahed, no_datacache, no_namecache, and no_mmap mount options, but the mount_fusefs binary never did. So there was no way to ever activate these options. Delete them. Some of them have alternatives: no_attrcache: set the attr_valid time to 0 in FUSE_LOOKUP and FUSE_GETATTR responses. no_readahed: set max_readahead to 0 in the FUSE_INIT response. no_datacache: set the vfs.fusefs.data_cache_mode sysctl to 0, or (coming soon) set the attr_valid time to 0 and set FUSE_AUTO_INVAL_DATA in the FUSE_INIT response. no_namecache: set entry_valid time to 0 in FUSE_LOOKUP and FUSE_GETATTR responses. Sponsored by: The FreeBSD Foundation	2019-06-26 15:15:24 +00:00
Alan Somers	fef464546c	fusefs: implement the "time_gran" feature. If a server supports a timestamp granularity other than 1ns, it can tell the client this as of protocol 7.23. The client will use that granularity when updating its cached timestamps during write. This way the timestamps won't appear to change following flush. Sponsored by: The FreeBSD Foundation	2019-06-26 02:09:22 +00:00
Alan Somers	0a8fe2d369	fusefs: set ctime during FUSE_SETATTR following a write As of r349396 the kernel will internally update the mtime and ctime of files on write. It will also flush the mtime should a SETATTR happen before the data cache gets flushed. Now it will flush the ctime too, if the server is using protocol 7.23 or higher. This is the only case in which the kernel will explicitly set a file's ctime, since neither utimensat(2) nor any other user interfaces allow it. Sponsored by: The FreeBSD Foundation	2019-06-26 00:03:37 +00:00
Alan Somers	788af9538a	fusefs: automatically update mtime and ctime on write Writing should implicitly update a file's mtime and ctime. For fuse, the server is supposed to do that. But the client needs to do it too, because the FUSE_WRITE response does not include time attributes, and it's not desirable to issue a GETATTR after every WRITE. When using the writeback cache, there's another hitch: the kernel should ignore the mtime and ctime fields in any GETATTR response for files with a dirty write cache. Sponsored by: The FreeBSD Foundation	2019-06-25 23:40:18 +00:00
Alan Somers	0d3a88d76c	fusefs: writes should update the file size, even when data_cache_mode=0 Writes that extend a file should update the file's size. r344185 restricted that behavior for fusefs to only happen when the data cache was enabled. That probably made sense at the time because the attribute cache wasn't fully baked yet. Now that it is, we should always update the cached file size during write. Sponsored by: The FreeBSD Foundation	2019-06-25 18:36:11 +00:00
Alan Somers	b9e2019755	fusefs: rewrite vop_getpages and vop_putpages Use the standard facilities for getpages and putpages instead of bespoke implementations that don't work well with the writeback cache. This has several corollaries: * Change the way we handle short reads _again_. vfs_bio_getpages doesn't provide any way to handle unexpected short reads. Plus, I found some more lock-order problems. So now when the short read is detected we'll just clear the vnode's attribute cache, forcing the file size to be requeried the next time it's needed. VOP_GETPAGES doesn't have any way to indicate a short read to the "caller", so we just bzero the rest of the page whenever a short read happens. * Change the way we decide when to set the FUSE_WRITE_CACHE bit. We now set it for clustered writes even when the writeback cache is not in use. Sponsored by: The FreeBSD Foundation	2019-06-25 17:24:43 +00:00
Hans Petter Selasky	43a9329e1b	Free all allocated unit IDs in cuse(3) after the client character devices have been destroyed to avoid creating character devices with identical name. MFC after: 1 week Sponsored by: Mellanox Technologies	2019-06-25 11:46:01 +00:00
Hans Petter Selasky	c7ffaed92e	Fix for deadlock situation in cuse(3) The final server unref should be done by the server thread to prevent deadlock in the client cdevpriv destructor, which cannot destroy itself. MFC after: 1 week Sponsored by: Mellanox Technologies	2019-06-25 11:42:53 +00:00
Warner Losh	e5500f1efa	Replay r349334 by markj accidentally reverted by r349352 Remove a lingering use of splbio(). The buffer must be locked by the caller. No functional change intended. Reviewed by: kib MFC after: 1 week Sponsored by: The FreeBSD Foundation	2019-06-25 06:14:00 +00:00
Warner Losh	f5a95d9a07	Remove NAND and NANDFS support NANDFS has been broken for years. Remove it. The NAND drivers that remain are for ancient parts that are no longer relevant. They are polled, have terrible performance and just for ancient arm hardware. NAND parts have evolved significantly from this early work and little to none of it would be relevant should someone need to update to support raw nand. This code has been off by default for years and has violated the vnode protocol leading to panics since it was committed. Numerous posts to arch@ and other locations have found no actual users for this software. Relnotes: Yes No Objection From: arch@ Differential Revision: https://reviews.freebsd.org/D20745	2019-06-25 04:50:09 +00:00
Alan Somers	1734e205f3	fusefs: refine the short read fix from r349332 b_fsprivate1 needs to be initialized even for write operations, probably because a buffer can be used to read, write, and read again with the final read serviced by cache. Sponsored by: The FreeBSD Foundation	2019-06-24 20:08:28 +00:00
Mark Johnston	673c1c2944	Remove a lingering use of splbio(). The buffer must be locked by the caller. No functional change intended. Reviewed by: kib MFC after: 1 week Sponsored by: The FreeBSD Foundation	2019-06-24 19:19:37 +00:00
Alan Somers	17575bad85	fusefs: improve the short read fix from r349279 VOP_GETPAGES intentionally tries to read beyond EOF, so fuse_read_biobackend can't rely on bp->b_resid > 0 indicating a short read. And adjusting bp->b_count after a short read seems to cause some sort of resource leak. Instead, store the shortfall in the bp->b_fsprivate1 field. Sponsored by: The FreeBSD Foundation	2019-06-24 17:05:31 +00:00
Alan Somers	44f654fdc5	fusefs: fix corruption on short reads caused by r349279 Even if a short read is caused by EOF, it's still necessary to bzero the remaining buffer, because that buffer could become valid as a result of a future ftruncate or pwrite operation. Reported by: fsx Sponsored by: The FreeBSD Foundation	2019-06-21 23:29:29 +00:00
Alan Somers	aef22f2d75	fusefs: correctly handle short reads A fuse server may return a short read for three reasons: * The file is opened with FOPEN_DIRECT_IO. In this case, the short read should be returned directly to userland. We already handled this case correctly. * The file was truncated server-side, and the read hit EOF. In this case, the kernel should update the file size. Fixed in the case of VOP_READ. Fixing this for VOP_GETPAGES is TODO. * The file is opened in writeback mode, there are dirty buffers past what the server thinks is the file's EOF, and the read hit what the server thinks is the file's EOF. In this case, the client is trying to read a hole, and should zero-fill it. We already handled this case, and I added a test for it. Sponsored by: The FreeBSD Foundation	2019-06-21 21:44:31 +00:00
Alan Somers	87ff949a7b	fusefs: raise protocol level to 7.23 None of the new features are implemented yet. This commit just adds the new protocol definitions and adds backwards-compatibility code for pre 7.23 servers. Sponsored by: The FreeBSD Foundation	2019-06-21 04:57:23 +00:00
Alan Somers	8f9b3ba718	fusefs: use standard integer types in fuse_kernel.h This is a merge of Linux revision 4c82456eeb4da081dd63dc69e91aa6deabd29e03. No functional change. Sponsored by: The FreeBSD Foundation	2019-06-21 03:17:27 +00:00
Alan Somers	b160acd1c0	fusefs: raise the protocol level to 7.21 Jumping from protocol 7.15 to 7.21 adds several new features. While they're all potentially useful, they're also all optional, and I'm not implementing any right now because my highest priority lies in a later version. Sponsored by: The FreeBSD Foundation	2019-06-21 03:04:56 +00:00
Alan Somers	ecb489158c	fusefs: diff reduction of fuse_kernel.h vs the upstream version fuse_kernel.h is based on Linux's fuse.h. In r349250 I modified fuse_kernel.h by generating a diff of two versions of Linux's fuse.h and applying it to our tree. patch succeeded, but it put one chunk in the wrong location. This commit fixes that. No functional changes. Sponsored by: The FreeBSD Foundation	2019-06-21 02:55:43 +00:00
Alan Somers	7cbb8e8a06	fusefs: raise protocol level to 7.15 This protocol level adds two new features: the ability for the server to store or retrieve data into/from the client's cache. But the messages aren't defined soundly since they identify the file only by its inode, without the generation number. So it's possible for them to modify the wrong file's cache. Also, I don't know of any file systems in ports that use these messages. So I'm not implementing them. I did add a (disabled) test for the store message, however. Sponsored by: The FreeBSD Foundation	2019-06-20 23:32:25 +00:00
Alan Somers	bb23d43901	fusefs: trivially raise protocol level to 7.14 The only new feature is splice(2) support on /dev/fuse, which FreeBSD can't support. Sponsored by: The FreeBSD Foundation	2019-06-20 23:12:19 +00:00
Alan Somers	38b06f8ac4	fcntl: fix overflow when setting F_READAHEAD VOP_READ and VOP_WRITE take the seqcount in blocks in a 16-bit field. However, fcntl allows you to set the seqcount in bytes to any nonnegative 31-bit value. The result can be a 16-bit overflow, which will be sign-extended in functions like ffs_read. Fix this by sanitizing the argument in kern_fcntl. As a matter of policy, limit to IO_SEQMAX rather than INT16_MAX. Also, fifos have overloaded the f_seqcount field for a completely different purpose ever since r238936. Formalize that by using a union type. Reviewed by: cem MFC after: 2 weeks Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D20710	2019-06-20 23:07:20 +00:00
Alan Somers	192a918194	fusefs: attempt to support servers as old as protocol 7.4 Previously we allowed servers as old as 7.1 to connect (there never was a 7.0). However, we wrongly assumed a few things about protocols older than 7.8. This commit attempts to support servers as old as 7.4 but no older. I added no new tests because I'm not sure there actually _are_ any servers this old in the wild. Sponsored by: The FreeBSD Foundation	2019-06-20 22:21:42 +00:00
Alan Somers	2ffddc5ee9	fusefs: raise protocol level to 7.13 This protocol version adds one new feature: the ability for the server to set the maximum number of background requests and a "congestion threshold" with ill-defined properties. I don't know of any fuse file systems in ports that use this feature, so I'm not implementing it. Sponsored by: The FreeBSD Foundation	2019-06-20 21:29:28 +00:00
Alan Somers	a1c9f4ad0d	fusefs: implement VOP_BMAP If the fuse daemon supports FUSE_BMAP, then use that for the block mapping. Otherwise, use the same technique used by vop_stdbmap. Report large values for runp and runb in order to maximize read clustering and minimize upcalls, even if we don't know the true layout. The major result of this change is that sequential reads to FUSE files will now usually happen 128KB at a time instead of 64KB. Sponsored by: The FreeBSD Foundation	2019-06-20 17:08:21 +00:00
Alan Somers	e532a99901	MFHead @349234 Sponsored by: The FreeBSD Foundation	2019-06-20 15:56:08 +00:00
Alan Somers	84879e46c2	fusefs: multiple fixes related to the write cache * Don't always write the last page synchronously. That's not actually required. It was probably just masking another bug that I fixed later, possibly in r349021. * Enable the NotifyWriteback tests now that Writeback cache is working. * Add a test to ensure that the write cache isn't flushed synchronously when in writeback mode. Sponsored by: The FreeBSD Foundation	2019-06-17 23:34:11 +00:00
Alan Somers	402b609c80	fusefs: use cluster_read for more readahead fusefs will now use cluster_read. This allows readahead of more than one cache block. However, it won't yet actually cluster the reads because that requires VOP_BMAP, which fusefs does not yet implement. Sponsored by: The FreeBSD Foundation	2019-06-17 22:01:23 +00:00
Xin LI	f89d207279	Separate kernel crc32() implementation to its own header (gsb_crc32.h) and rename the source to gsb_crc32.c. This is a prerequisite of unifying kernel zlib instances. PR: 229763 Submitted by: Yoshihiro Ota <ota at j.email.ne.jp> Differential Revision: https://reviews.freebsd.org/D20193	2019-06-17 19:49:08 +00:00
Alan Somers	d569012f45	fusefs: implement non-clustered readahead fusefs will now read ahead at most one cache block at a time (usually 64 KB). Clustered reads are still TODO. Individual file systems may disable read ahead by setting fuse_init_out.max_readahead=0 during initialization. Sponsored by: The FreeBSD Foundation	2019-06-17 16:56:51 +00:00
Alan Somers	b5aaf286ea	fusefs: fix the "write-through" of write-through caching Our fusefs(5) module supports three cache modes: uncached, write-through, and write-back. However, the write-through mode (which is the default) has never actually worked as its name suggests. Rather, it's always been more like "write-around". It wrote directly, bypassing the cache. The cache would only be populated by a subsequent read of the same data. This commit fixes that problem. Now the write-through mode works as one would expect: write(2) immediately adds data to the cache and then blocks while the daemon processes the write operation. A side effect of this change is that non-cache-block-aligned writes will now incur a read-modify-write cycle of the cache block. The old behavior (bypassing write cache entirely) can still be achieved by opening a file with O_DIRECT. PR: 237588 Sponsored by: The FreeBSD Foundation	2019-06-14 19:47:48 +00:00
Alan Somers	8eecd9ce05	fusefs: enable write clustering Enable write clustering in fusefs whenever cache mode is set to writeback and the "async" mount option is used. With default values for MAXPHYS, DFLTPHYS, and the fuse max_write mount parameter, that means sequential writes will now be written 128KB at a time instead of 64KB. Also, add a regression test for PR 238565, a panic during unmount that probably affects UFS, ext2, and msdosfs as well as fusefs. PR: 238565 Sponsored by: The FreeBSD Foundation	2019-06-14 18:14:51 +00:00
Alan Somers	dff3a6b410	fusefs: fix a bug with WriteBack caching An errant vfs_bio_clrbuf snuck in in r348931. Surprisingly, it doesn't have any effect most of the time. But under some circumstances it causes the buffer to behave in a write-only fashion. Sponsored by: The FreeBSD Foundation	2019-06-13 19:07:03 +00:00
Alan Somers	93c0c1d4ce	fusefs: fix a page fault with writeback caching When truncating a file downward through a dirty buffer, it's necessary to update the buffer's b->dirtyend. Sponsored by: The FreeBSD Foundation	2019-06-11 23:46:31 +00:00
Alan Somers	a87e0831ab	fusefs: WIP fixing writeback caching The current "writeback" cache mode, selected by the vfs.fusefs.data_cache_mode sysctl, doesn't do writeback caching at all. It merely goes through the motions of using buf(9), but then writes every buffer synchronously. This commit: * Enables delayed writes when the sysctl is set to writeback caching * Fixes a cache-coherency problem when extending a file whose last page has just been written. * Removes the "sync" mount option, which had been set unconditionally. * Adjusts some SDT probes * Adds several new tests that mimic what fsx does but with more control and without a real file system. As I discover failures with fsx, I add regression tests to this file. * Adds a test that ensures we can append to a file without reading any data from it. This change is still incomplete. Clustered writing is not yet supported, and there are frequent "panic: vm_fault_hold: fault on nofault entry" panics that I need to fix. Sponsored by: The FreeBSD Foundation	2019-06-11 16:32:33 +00:00
Alan Somers	ddc51e453e	fusefs: remove some stuff that was copy/pasted from nfsclient fusefs's I/O methods were originally copy/pasted from nfsclient. This commit removes some irrelevant parts, like stuff involving B_NEEDCOMMIT. Sponsored by: The FreeBSD Foundation	2019-06-06 20:35:41 +00:00
Alan Somers	0269ae4c19	MFHead @348740 Sponsored by: The FreeBSD Foundation	2019-06-06 16:20:50 +00:00
Alan Somers	011bca9948	fusefs: simplify fuse_write_biobackend. No functional change. Sponsored by: The FreeBSD Foundation	2019-06-05 20:18:56 +00:00
Konstantin Belousov	3c93d22758	Manually clear text references on reclaim for nullfs and tmpfs. Both filesystems do not use vnode_pager_dealloc() which would handle this case otherwise. Nullfs because vnode vm_object handle never points to nullfs vnode. Tmpfs because its vm_object is never vnode object at all. Tested by: pho Sponsored by: The FreeBSD Foundation MFC after: 1 week	2019-06-05 20:16:25 +00:00
Alan Somers	a639731ba9	fusefs: respect RLIMIT_FSIZE Sponsored by: The FreeBSD Foundation	2019-06-03 23:24:07 +00:00
Alan Somers	6ff7f297f8	fusefs: don't require FUSE_EXPORT_SUPPORT for async invalidation In r348560 I thought that FUSE_EXPORT_SUPPORT was required for cases where the node to be invalidated (or the parent of the entry to be invalidated) wasn't cached. But I realize now that that's not the case. During entry invalidation, if the parent isn't in the vfs hash table, then it must've been reclaimed. And since fuse_vnop_reclaim does a cache_purge, that means the entry to be invalidated has already been removed from the namecache. And during inode invalidation, if the inode to be invalidated isn't in the vfs hash table, then it too must've been reclaimed. In that case it will have no buffer cache to invalidate. Sponsored by: The FreeBSD Foundation	2019-06-03 20:45:32 +00:00
Alan Somers	eae1ae132c	fusefs: support asynchronous cache invalidation Protocol 7.12 adds a way for the server to notify the client that it should invalidate an inode's data cache and/or attributes. This commit implements that mechanism. Unlike Linux's implementation, ours requires that the file system also supports FUSE_EXPORT_SUPPORT (NFS-style lookups). Otherwise the invalidation operation will return EINVAL. Sponsored by: The FreeBSD Foundation	2019-06-03 17:34:01 +00:00
Alan Somers	c2d70d6e6f	fusefs: support name cache invalidation Protocol 7.12 adds a way for the server to notify the client that it should invalidate an entry from its name cache. This commit implements that mechanism. Sponsored by: The FreeBSD Foundation	2019-06-01 00:11:19 +00:00
Alan Somers	0d2bf48996	fusefs: check the vnode cache when looking up files for the NFS server FUSE allows entries to be cached for a limited amount of time. fusefs's vnop_lookup method already implements that using the timeout functionality of cache_lookup/cache_enter_time. However, lookups for the NFS server go through a separate path: vfs_vget. That path can't use the same timeout functionality because cache_lookup/cache_enter_time only work on pathnames, whereas vfs_vget works by inode number. This commit adds entry timeout information to the fuse vnode structure, and checks it during vfs_vget. This allows the NFS server to take advantage of cached entries. It's also the same path that FUSE's asynchronous cache invalidation operations will use. Sponsored by: The FreeBSD Foundation	2019-05-31 21:22:58 +00:00
Rick Macklem	6aab442af9	Get rid of extraneous initialization. Get rid of an extraneous initialization, mainly to keep a static analyser happy. No semantic change. PR: 238167 Submitted by: Alexey Dokuchaev	2019-05-31 03:13:09 +00:00
Rick Macklem	26fd36b29d	Clean up silly code case. This silly code segment has existed in the sources since it was brought into FreeBSD 10 years ago. I honestly have no idea why this was done. It was possible that I thought that it might have been better to not set B_ASYNC for the "else" case, but I can't remember. Anyhow, this patch gets rid of the if/else that does the same thing either way, since it looks silly and upsets a static analyser. This will have no semantic effect on the NFS client. PR: 238167	2019-05-31 00:56:31 +00:00
Alan Somers	a4856c96d0	fusefs: raise protocol level to 7.12 This commit raises the protocol level and adds backwards-compatibility code to handle structure size changes. It doesn't implement any new features. The new features added in protocol 7.12 are: * server-side umask processing (which FreeBSD won't do) * asynchronous inode and directory entry invalidation (which I'll do next) Sponsored by: The FreeBSD Foundation	2019-05-29 16:39:52 +00:00
Alan Somers	e039bafa87	fusefs: add comments explaining why 7.11 features aren't implemented Protocol 7.11 adds two new features, but neither of them were defined correctly. FUSE_IOCTL messages don't work for 32-bit daemons on a 64-bit host (fixed in protocol 7.16). FUSE_POLL is basically unusable until 7.21. Before 7.21, the client can't choose which events to register for; the client registers for "something" and the server replies to say which events the client is registered for. Also, before 7.21 there was no way for a client to deregister a file handle. Sponsored by: The FreeBSD Foundation	2019-05-29 02:03:08 +00:00
Alan Somers	9c62bc7045	fusefs: raise protocol level to 7.11 This commit adds the definitions for protocol 7.11 but doesn't yet implement the new features. The new features are optional, so they can come later. Sponsored by: The FreeBSD Foundation	2019-05-29 00:54:49 +00:00
Alan Somers	3f105d16a0	fusefs: raise protocol level to 7.10 Protocol version 7.10 has only one new feature, and I'm choosing not to implement it, so this commit is basically a noop. The sole new feature is the FOPEN_NONSEEKABLE flag, which a fuse file system can return to indicate that a certain file handle cannot be seeked. However, I'm unaware of any file system in ports that uses this flag. Sponsored by: The FreeBSD Foundation	2019-05-29 00:01:36 +00:00
Johannes Lundberg	1e363d64a5	pseudofs: Ignore unsupported commands in vop_setattr. Users of pseudofs (e.g. lindebugfs), should be able to receive input from command line via commands like "echo 1 > /path/to/file". Currently this fails because sh tries to truncate the file first and vop_setattr returns not supported error for this. This patch simply ignores the error and returns 0 instead. Reviewed by: imp (mentor), asomers Approved by: imp (mentor), asomers MFC after: 1 week Differential Revision: D20451	2019-05-28 20:54:59 +00:00
Alan Somers	d4fd0c8148	fusefs: set the flags fields of fuse_write_in and fuse_read_in These fields are supposed to contain the file descriptor flags as supplied to open(2) or set by fcntl(2). The feature is kind of useless on FreeBSD since we don't supply all of these flags to fuse (because of the weak relationship between struct file and struct vnode). But we should at least set the access mode flags (O_RDONLY, etc). This is the last fusefs change needed to get full protocol 7.9 support. There are still a few options we don't support for good reason (mandatory file locking is dumb, flock support is broken in the protocol until 7.17, etc), but there's nothing else to do at this protocol level. Sponsored by: The FreeBSD Foundation	2019-05-28 01:09:19 +00:00
Alan Somers	8aa24ed381	fusefs: flock(2) locks must be implemented in-kernel If a FUSE file system sets the FUSE_POSIX_LOCKS flag then it can support fcntl(2)-style locks directly. However, the protocol does not adequately support flock(2)-style locks until revision 7.17. They must be implemented locally in-kernel instead. This unfortunately breaks the interoperability of fcntl(2) and flock(2) locks for file systems that support the former. C'est la vie. Prior to this commit flock(2) would get sent to the server as a fcntl(2)-style lock with the lock owner field set to stack garbage. Sponsored by: The FreeBSD Foundation	2019-05-28 00:03:46 +00:00
Alan Somers	9bcecf0fd7	fusefs: clear fuse_getattr_in.getattr_flags Protocol 7.9 adds this field. We could use it to store the file handle of the file whose attributes we're requesting. However, that requires extra work at runtime to look up a file handle, and I'm not aware of any file systems that care. So it's easiest just to clear it. Sponsored by: The FreeBSD Foundation	2019-05-27 22:25:39 +00:00
Alan Somers	bda39894c5	fusefs: set FUSE_WRITE_CACHE when writing from cache This bit tells the server that we're not sure which uid, gid, and/or pid originated the write. I don't know of a single file system that cares, but it's part of the protocol. Sponsored by: The FreeBSD Foundation	2019-05-27 21:36:28 +00:00
Alan Somers	93fecd02a1	fusefs: misc build fixes * Only build the tests on platforms with C++14 support * Fix an undefined symbol error on lint builds * Remove an unused function: fiov_clear Sponsored by: The FreeBSD Foundation	2019-05-25 21:40:27 +00:00
Alan Somers	65417f5e27	Remove "struct ucred" argument from vtruncbuf vtruncbuf takes a "struct ucred" argument. AFAICT, it's been unused ever since that function was first added in r34611. Remove it. Also, remove some "struct ucred" arguments from fuse and nfs functions that were only used by vtruncbuf. Reviewed by: cem MFC after: 2 weeks Sponsored by: The FreeBSD Foundation Differential Revision: https://reviews.freebsd.org/D20377	2019-05-24 20:27:50 +00:00
Alan Somers	e97ae4ad2d	fusefs: implement FUSE_ASYNC_READ If a daemon sets the FUSE_ASYNC_READ flag during initialization, then the client is allowed to issue multiple concurrent reads for the same file handle. Otherwise concurrent reads are not allowed. This commit implements it. Previously we unconditionally disallowed concurrent reads. Sponsored by: The FreeBSD Foundation	2019-05-24 05:12:43 +00:00
Alan Somers	ad587bc5df	fusefs: fix some garbage left behind by r348209 Sponsored by: The FreeBSD Foundation	2019-05-24 00:56:50 +00:00
Alan Somers	e76986fde0	fusefs: fix exporting fuse filesystems with nfsd A previous commit made fuse exportable via userland NFS servers. Compatibility with the in-kernel nfsd required two more changes: * During read and write operations, implicitly do a FUSE_OPEN if there isn't already a valid file handle. That's because nfsd never calls VOP_OPEN. * During VOP_READDIR, if an implicit open was necessary, directory offsets from a previous VOP_READDIR may not be valid, so VOP_READDIR may have to start from the beginning and read until it encounters the requested offset. I've done only limited testing over NFS, so there are probably still some more bugs. Thanks to rmacklem for all of the readdir changes, which he had made for his pnfs work. Sponsored by: The FreeBSD Foundation	2019-05-23 23:06:26 +00:00
Alan Somers	db7b0e747f	fusefs: assume the mountpoint's generation is 0 This seems to be libfuse's behavior (its documentation notwithstanding). Sponsored by: The FreeBSD Foundation	2019-05-23 22:57:57 +00:00
Alan Somers	e5b50fe736	fusefs: Make fuse file systems NFS-exportable This commit adds the VOPs needed by userspace NFS servers (tested with net/unfs3). More work is needed to make the in-kernel nfsd work, because of its stateless nature. It doesn't open files prior to doing I/O. Also, the NFS-related VOPs currently ignore the entry cache. Sponsored by: The FreeBSD Foundation	2019-05-23 00:44:01 +00:00
Alan Somers	2013b723d3	fusefs: improve attribute caching Consolidate all calls to fuse_vnode_setsize as a result of a file attribute change to one location in fuse_internal_setattr. There are still a few calls elsewhere that happen as a result of a write. Sponsored by: The FreeBSD Foundation	2019-05-23 00:22:03 +00:00
Alan Somers	18a2264e27	fusefs: fix "recursing on non recursive lockmgr" panic When mounted with -o default_permissions and when vfs.fusefs.data_cache_mode=2, fuse_io_strategy would try to clear the suid bit after a successful write by a non-owner. When combined with a not-yet-committed attribute-caching patch I'm working on, and if the FUSE_SETATTR response indicates an unexpected filesize (legal, if the file system has other clients), this would end up calling vtruncbuf. That would panic, because the buffer lock was already held by bufwrite or bufstrategy or something else upstack from fuse_vnop_strategy. Sponsored by: The FreeBSD Foundation	2019-05-22 23:30:51 +00:00
Alan Somers	b6b7fe7c7d	fusefs: remove the vfs.fusefs.sync_resize sysctl, correctly this time In r347547 I intended to remove the vfs.fusefs.sync_resize sysctl, leaving fusefs's behavior as though sync_resize had its default value. But I forgot that I had already turned off sync_resize in my development system's /etc/sysctl.conf. This commit completely removes the optional behavior that was formerly controlled by sync_resize. There's no need for explicitly calling FUSE_SETATTR after every FUSE_WRITE that extends a file. The daemon can infer that the file is being extended. If this sysctl was added as a workaround for a buggy daemon, there's no clue as to what that daemon may have been. Sponsored by: The FreeBSD Foundation	2019-05-22 19:49:25 +00:00
Conrad Meyer	daec92844e	Include ktr.h in more compilation units Similar to r348026, exhaustive search for uses of CTRn() and cross reference ktr.h includes. Where it was obvious that an OS compat header of some kind included ktr.h indirectly, .c files were left alone. Some of these files clearly got ktr.h via header pollution in some scenarios, or tinderbox would not be passing prior to this revision, but go ahead and explicitly include it in files using it anyway. Like r348026, these CUs did not show up in tinderbox as missing the include. Reported by: peterj (arm64/mp_machdep.c) X-MFC-With: r347984 Sponsored by: Dell EMC Isilon	2019-05-21 20:38:48 +00:00
Alan Somers	a6fac00c53	fusefs: Allow update mounts Allow "mount -u" to change some mount options for fusefs. Sponsored by: The FreeBSD Foundation	2019-05-21 19:34:39 +00:00
Alan Somers	d311d6c467	fusefs: eliminate a superfluous fuse_node_setparent Sponsored by: The FreeBSD Foundation	2019-05-20 20:55:01 +00:00
Alan Somers	6f8114adcc	fusefs: unset MNT_LOCAL The kernel can't tell whether or not a fuse file system is truly local. But what really matters is two things: 1) Can I/O to a file system block indefinitely? 2) Can the file system bypass the O_BENEATH restriction during lookup? For fuse, the answer to both of those questions is yes. So as far as the kernel is concerned, it's a non-local file system. Sponsored by: The FreeBSD Foundation	2019-05-20 20:54:09 +00:00
Conrad Meyer	e2e050c8ef	Extract eventfilter declarations to sys/_eventfilter.h This allows replacing "sys/eventfilter.h" includes with "sys/_eventfilter.h" in other header files (e.g., sys/{bus,conf,cpu}.h) and reduces header pollution substantially. EVENTHANDLER_DECLARE and EVENTHANDLER_LIST_DECLAREs were moved out of .c files into appropriate headers (e.g., sys/proc.h, powernv/opal.h). As a side effect of reduced header pollution, many .c files and headers no longer contain needed definitions. The remainder of the patch addresses adding appropriate includes to fix those files. LOCK_DEBUG and LOCK_FILE_LINE_ARG are moved to sys/_lock.h, as required by sys/mutex.h since r326106 (but silently protected by header pollution prior to this change). No functional change (intended). Of course, any out of tree modules that relied on header pollution for sys/eventhandler.h, sys/lock.h, or sys/mutex.h inclusion need to be fixed. __FreeBSD_version has been bumped.	2019-05-20 00:38:23 +00:00
Alan Somers	fe221e0177	fusefs: forward UTIME_NOW to the server If a user sets both atime and mtime to UTIME_NOW when calling a syscall like utimensat(2), allow the server to choose what "now" means. Due to the design of FreeBSD's VFS, it's not possible to do this for just one of atime or mtime; it's all or none. PR: 237181 Sponsored by: The FreeBSD Foundation	2019-05-16 23:17:39 +00:00
Alan Somers	e7f73af118	fusefs: allow the server to specify st_blksize If the server sets fuse_attr.blksize to a nonzero value in the response to FUSE_GETATTR, then the client should use that as the value for stat.st_blksize . Sponsored by: The FreeBSD Foundation	2019-05-16 22:50:04 +00:00
Alan Somers	16bd2d47c7	fusefs: Upgrade FUSE protocol to version 7.9. This commit upgrades the FUSE API to protocol 7.9 and adds unit tests for backwards compatibility with servers built for version 7.8. It doesn't implement any of 7.9's new features yet. Sponsored by: The FreeBSD Foundation	2019-05-16 17:24:11 +00:00
Alan Somers	96192dfce0	fusefs: diff reduction vs the upstream sources fuse_kernel.h defines the structures used by the FUSE protocol. Originally it came from libfuse, but the current source of truth is the Linux kernel. This commit minimizes the diffs between our version and the Linux version as of 21f3da95d (protocol version 7.8). The flags field of struct fuse_listxattr_out and fuse_listxattr_in was an error in our header. Those fields don't exist in Linux or libfuse, and they've never been used in FreeBSD. In fact, those structs don't even exist in Linux and libfuse; those projects confusingly overload the identical fuse_getxattr_in and fuse_getxattr_out structs. Sponsored by: The FreeBSD Foundation	2019-05-15 22:51:25 +00:00
Alan Somers	3d15b234a4	fusefs: don't track a file's size in two places fuse_vnode_data.filesize was mostly redundant with fuse_vnode_data.cached_attrs.st_size, but didn't have exactly the same meaning. It was very confusing. This commit eliminates the former. It also eliminates fuse_vnode_refreshsize, which ignored the cache timeout value. Sponsored by: The FreeBSD Foundation	2019-05-15 00:38:52 +00:00
Alan Somers	96658124d7	fusefs: eliminate superfluous FUSE_GETATTR when filesize=0 fuse_vnode_refreshsize was using 0 as a flag value for filesize meaning "uninitialized" (thanks to the malloc(...M_ZERO) in fuse_vnode_alloc). But this led to unnecessary getattr operations when the filesize legitimately happened to be zero. Fix by adding a distinct flag value. Sponsored by: The FreeBSD Foundation	2019-05-13 23:30:06 +00:00
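
The fix described in commit 96658124d7 (a distinct flag value for "size not yet known" instead of overloading 0) can be sketched roughly as below. This is a minimal user-space illustration, not the fusefs code: the `sketch_*` names, the `SKETCH_FILESIZE_UNKNOWN` constant, and the choice of `UINT64_MAX` as the marker are all assumptions made for the example, and the "server" is simulated by a plain argument.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical marker: a value no real file size is expected to take. */
#define SKETCH_FILESIZE_UNKNOWN UINT64_MAX

/* Hypothetical stand-in for per-vnode data; not the real fusefs struct. */
struct sketch_node {
	uint64_t filesize;      /* cached size, or SKETCH_FILESIZE_UNKNOWN */
	unsigned getattr_calls; /* counts simulated FUSE_GETATTR round trips */
};

/* Initialize with the distinct marker rather than relying on zero-fill. */
static void
sketch_node_init(struct sketch_node *n)
{
	assert(n != NULL);
	n->filesize = SKETCH_FILESIZE_UNKNOWN;
	n->getattr_calls = 0;
}

/* Simulated round trip to the server; server_size is what it would report. */
static uint64_t
sketch_getattr(struct sketch_node *n, uint64_t server_size)
{
	n->getattr_calls++;
	return (server_size);
}

/*
 * Query the server only while the size is unknown. A legitimately empty
 * file (size 0) is now a valid cached value and triggers no extra query.
 */
static uint64_t
sketch_refresh_size(struct sketch_node *n, uint64_t server_size)
{
	if (n->filesize == SKETCH_FILESIZE_UNKNOWN)
		n->filesize = sketch_getattr(n, server_size);
	return (n->filesize);
}
```

With this layout, calling `sketch_refresh_size` twice on a freshly initialized node for an empty file performs one simulated getattr rather than two. Had 0 been the marker, every call on an empty file would have queried the server again.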

4175 Commits