freebsd-nq

Author	SHA1	Message	Date
Pawel Jakub Dawidek	97a669a3b2	Warn when the user uses a sectorsize bigger than the page size, which will lead to problems when the geli device is used with a file system or as swap. Hopefully this will prevent problems like kern/98742 in the future. MFC after: 1 week	2007-03-05 12:41:44 +00:00
Pawel Jakub Dawidek	b942093961	Fix geli after last commit for UP systems that are running SMP kernel. Submitted by: Hyo geol, Lee <hyogeollee@gmail.com> MFC after: 1 week	2007-03-02 09:38:16 +00:00
John Baldwin	4d70511ac3	Use pause() rather than tsleep() on stack variables and function pointers.	2007-02-27 17:23:29 +00:00
Matt Jacob	e770bc6bf5	First cut at GEOM based multipath. This is an active/passive{/passive...} arrangement that has no intrinsic internal knowledge of whether devices it is given are truly multipath devices. As such, this is a simplistic approach, but still a useful one. The basic approach is to (at present- this will change soon) use camcontrol to find likely identical devices and label the trailing sector of the first one. This label contains both a full UUID and a name. The name is what is presented in /dev/multipath, but the UUID is used as a true distinguisher at g_taste time, thus making sure we don't have chaos on a shared SAN where everyone names their data multipath as "Fred". The first of N identical devices (and N may be 1!) becomes the active path until a BIO request is failed with EIO or ENXIO. When this occurs, the active disk is ripped away and the next in a list is picked to (retry and) continue with. During g_taste events new disks that meet the match criteria for existing multipath geoms get added to the tail end of the list. Thus, this active/passive setup actually does work for devices which go away and come back, as do (now) mpt(4) and isp(4) SAN based disks. There is still a lot to do to improve this- like about 5 of the 12 recommendations I've received about it, but it's been functional enough for a while that it deserves a broader test base. Reviewed by: pjd Sponsored by: IronPort Systems MFC: 2 months	2007-02-27 04:01:58 +00:00
John Baldwin	6e50e38fcc	Use tsleep() rather than msleep() with a NULL mtx parameter.	2007-02-23 23:06:10 +00:00
Nick Hibma	55fe33a350	Reduce the noise when plugging in (USB) mass storage devices, like a 4 port flash card reader. Also remove an 'Opened da0 -> <random number>' which is not needed on a daily basis (available through bootverbose). Reviewed by: phk, ken MFC after: 1 week	2007-02-21 07:45:02 +00:00
Craig Rodrigues	898b5f434b	#include <sys/systm.h> before <sys/geom.h> to get KASSERT(), and fix LINT build.	2007-02-08 04:02:56 +00:00
Marcel Moolenaar	1d3aed33e8	Evolve the ctlreq interface added to geom_gpt into a generic partitioning class that supports multiple schemes. Current schemes supported are APM (Apple Partition Map) and GPT. Change all GEOM_APPLE and GEOM_GPT options into GEOM_PART_APM and GEOM_PART_GPT (resp). The ctlreq interface supports verbs to create and destroy partitioning schemes on a disk; to add, delete and modify partitions; and to commit or undo changes made.	2007-02-07 18:55:31 +00:00
Pawel Jakub Dawidek	1ded77b222	We expect 'bio_data != NULL' for BIO_{READ,WRITE,GETATTR}, but for BIO_{DELETE,FLUSH} we expect 'bio_data == NULL'. Reviewed by: phk	2007-01-28 23:36:07 +00:00
Pawel Jakub Dawidek	a1ea1a22e9	It is possible that GEOM tastes a provider before SMP is started. We can't bind to a CPU which is not yet on-line, so add code that waits for CPUs to go on-line before binding to them. Reported by: Alin-Adrian Anton <aanton@spintech.ro> MFC after: 2 weeks	2007-01-28 20:29:12 +00:00
Konstantin Belousov	2cc7d26f7f	Cylinder group bitmaps and blocks containing inode for a snapshot file are after snaplock, while other ffs device buffers are before snaplock in global lock order. By itself, this could cause deadlock when bdwrite() tries to flush dirty buffers on snapshotted ffs. If, during the flush, COW activity for snapshot needs to allocate block and ffs_alloccg() selects the cylinder group that is being written by bdwrite(), then kernel would panic due to recursive buffer lock acquisition. Avoid dealing with buffers in bdwrite() that are from other side of snaplock divisor in the lock order than the buffer being written. Add new BOP, bop_bdwrite(), to do dirty buffer flushing for same vnode in the bdwrite(). Default implementation, bufbdflush(), refactors the code from bdwrite(). For ffs device buffers, specialized implementation is used. Reviewed by: tegge, jeff, Russell Cattelan (cattelan xfs org, xfs changes) Tested by: Peter Holm X-MFC after: 3 weeks (if ever: it changes ABI)	2007-01-23 10:01:19 +00:00
Pawel Jakub Dawidek	9cb9930ea6	Softc may be NULL in g_journal_orphan(), so don't be surprised.	2006-12-02 09:10:29 +00:00
Pawel Jakub Dawidek	95de128d55	Fix ia64 build breakage.	2006-11-02 16:24:18 +00:00
Pawel Jakub Dawidek	41517ab2e9	- Use g_duplicate_bio() instead of g_clone_bio(), so that memory is allocated with M_WAITOK flag. - Check 'buf' instead of 'error' so Prevent is not confused. CID: 1562, 1563 Found by: Coverity Prevent analysis tool	2006-11-02 09:14:18 +00:00
Pawel Jakub Dawidek	1506db2163	I want CPU number here. Noticed by: ru	2006-11-02 09:01:34 +00:00
Pawel Jakub Dawidek	3398f41fc0	Grr, fix one more build breakage.	2006-11-02 00:37:39 +00:00
Pawel Jakub Dawidek	501250ba60	Now, that we have gjournal in the tree add possibility to configure gmirror and graid3 in a way that it is not resynchronized after a power failure or system crash. It is safe when gjournal is running on top of gmirror/graid3.	2006-11-01 22:51:49 +00:00
Pawel Jakub Dawidek	f187490a2d	Change spaces to tabs where needed.	2006-11-01 22:16:53 +00:00
Pawel Jakub Dawidek	eba8f13797	Skip disabled CPU, because after we sched_bind() to a disabled CPU, we won't be able to exit from the thread. Function g_eli_cpu_is_disabled() stolen from kern_pmc.c. PR: 104669 Reported by: Nikolay Mirin <nik@optim.com.ru> MFC after: 1 week	2006-11-01 16:05:06 +00:00
Pawel Jakub Dawidek	8e570b35f7	Forgot to remove this line. Reported by: maxim	2006-11-01 14:09:59 +00:00
Pawel Jakub Dawidek	118c814ee8	Add BIO_FLUSH support to GSHSEC class.	2006-11-01 12:30:51 +00:00
Pawel Jakub Dawidek	0c554c7884	Add BIO_FLUSH support to GPT class.	2006-11-01 12:29:49 +00:00
Pawel Jakub Dawidek	3911a26e74	Update the code to the current sync(2) version: - Do not modify mnt_flag without mount interlock held. - Do not touch MNT_ASYNC flag, as this can lead to a race with nmount(2). Pointed out by: tegge Reviewed by: tegge	2006-11-01 09:37:11 +00:00
Pawel Jakub Dawidek	ff4ff59d64	Remove debugging code I accidentally committed.	2006-11-01 01:19:13 +00:00
Pawel Jakub Dawidek	a23d879f34	Add gjournal GEOM class (kernel side), which implements block level journaling and can be taught about marking file system as clean before doing journal switch, which easily allows to add journaling to file systems that don't have this feature. Sponsored by: home.pl	2006-10-31 21:31:00 +00:00
Pawel Jakub Dawidek	42461fba65	Implement BIO_FLUSH handling by simply passing it down to the components. Sponsored by: home.pl	2006-10-31 21:23:51 +00:00
Pawel Jakub Dawidek	1d2aee20b8	Add a new disk flag - DISKFLAG_CANFLUSHCACHE, which indicates that the disk can handle BIO_FLUSH requests. Sponsored by: home.pl	2006-10-31 21:12:43 +00:00
Pawel Jakub Dawidek	c3618c657a	Add a new I/O request - BIO_FLUSH, which basically tells providers below to flush their caches. For now will mostly be used by disks to flush their write cache. Sponsored by: home.pl	2006-10-31 21:11:21 +00:00
Pawel Jakub Dawidek	11b2174f58	Guard against invalid metadata. MFC after: 1 week	2006-10-10 15:01:47 +00:00
Ruslan Ermilov	04c7da702f	A GEOM cache can speed up read performance by sending fixed size read requests to its consumer. It has been developed to address the problem of a horrible read performance of a 64k blocksize FS residing on a RAID3 array with 8 data components, where a single disk component would only get 8k read requests, thus effectively killing disk performance under high load. Documentation will be provided later. I'd like to thank Vsevolod Lobko for his bright ideas, and Pawel Jakub Dawidek for helping me fix the nasty bug.	2006-10-06 08:27:07 +00:00
Pawel Jakub Dawidek	b7beab8d22	One more white space fix.	2006-09-30 08:23:06 +00:00
Pawel Jakub Dawidek	469e952070	Remove trailing spaces.	2006-09-30 08:16:49 +00:00
Pawel Jakub Dawidek	1517bdc897	Remove trailing spaces.	2006-09-30 08:01:11 +00:00
Pawel Jakub Dawidek	f8aa16c66c	Fix detecting of UFS1 label when mediasize%fragsize != 0. Submitted by: Stanislav Sedov PR: kern/84637 MFC after: 1 week	2006-09-16 11:24:41 +00:00
Pawel Jakub Dawidek	8abd1ad101	Add 'configure' subcommand which for now only allows setting and removing of the BOOT flag. It can be performed on both attached and detached providers. Requested by: Matthias Lederhofer <matled@gmx.net> MFC after: 1 week	2006-09-16 10:43:17 +00:00
Pawel Jakub Dawidek	5e165262f1	Add __printflike() to gctl_error(). Approved by: phk MFC after: 1 week	2006-09-16 10:39:07 +00:00
Pawel Jakub Dawidek	5a20446db8	Small fixes after adding __printflike() to gctl_error(). Approved by: phk MFC after: 3 days	2006-09-16 09:48:29 +00:00
Pawel Jakub Dawidek	dec53cdd32	Remove extra arguments. MFC after: 3 days	2006-09-16 07:47:57 +00:00
Pawel Jakub Dawidek	679f8b7e7a	Add 'show geom [addr]' ddb(4) command, which prints entire GEOM topology if no additional argument is given or details about the given GEOM object (class, geom, provider or consumer). Approved by: phk	2006-09-15 16:36:45 +00:00
Pawel Jakub Dawidek	8e007c52fd	Fix synchronization in gmirror and graid3 which I broke. Synchronization request can still have bio_to set to sc_provider (this is READ part of a synchronization request) and in this case g_{mirror,raid3}_sync() wasn't called as it should be. MFC after: 1 week	2006-09-13 15:46:49 +00:00
Pawel Jakub Dawidek	d6b910d295	Delay an orphan event if provider has still in-flight I/O requests. This way GEOM classes can safely detach from provider when an orphan event is received. This fixes 'detach with active requests' panic for gstripe/gconcat under load. PR: kern/102766 Submitted by: mjacob OK'ed by: phk MFC after: 1 week	2006-09-10 09:11:54 +00:00
John-Mark Gurney	0cca572e64	move created/detected/activated under debug level 1 to quiet the common case.. add count of active and total components to the launched line so you can see at a glance if your mirror/raid3 is complete... now: GEOM_MIRROR: Device mirror/sam launched (2/2). Reviewed by: pjd	2006-09-09 21:45:37 +00:00
Pawel Jakub Dawidek	46ee0837c2	Fix format character. Reported by: andre	2006-09-08 13:46:18 +00:00
Pawel Jakub Dawidek	fc024f7a45	Bump copyright year.	2006-09-08 10:20:44 +00:00
Pawel Jakub Dawidek	c076790223	Use __FBSDID in .c files.	2006-09-08 10:19:24 +00:00
Pawel Jakub Dawidek	6a146a1989	- Split failure probability configuration into read failure probability and write failure probability. - Allow to specify an error number to return on failure. MFC after: 3 days	2006-09-08 09:21:21 +00:00
Pawel Jakub Dawidek	7ffb6e0f6a	Fix problems with destroy and forcible destroy functionality: - hold/release device in start/done routines, this will probably slow down things a bit, but previous code was racy; - only release device if g_gate_destroy() failed - if it succeeded device is dead and there is nothing to release; - various other changes which makes forcible destruction reliable. MFC after: 3 days	2006-09-05 21:56:00 +00:00
Warner Losh	1a3c917f9d	while (0); -> while (0) in multi-line macros	2006-08-17 22:50:33 +00:00
Pawel Jakub Dawidek	1894472106	Handle MSDOS file systems properly. Before the change file systems created on Windows XP (and others maybe) were not detected. We detected only those created with newfs_msdos(8). Submitted by: Tobias Reifenberger <treif@mayn.de> style(9)ified by: pjd	2006-08-12 15:34:15 +00:00
Pawel Jakub Dawidek	d88fe2bfc7	Verify if a label doesn't point to the parent directory.	2006-08-12 15:30:24 +00:00
Pawel Jakub Dawidek	2bd4ade694	Before using byte offset for IV creation, convert it to little endian. This way one will be able to use provider encrypted on eg. i386 on eg. sparc64. This doesn't really buy us much today, because UFS isn't endian agnostic. We retain backward compatibility by setting G_ELI_FLAG_NATIVE_BYTE_ORDER flag on devices with version number less than 2 and not converting the offset.	2006-08-11 19:09:12 +00:00
Pawel Jakub Dawidek	d04c304ddf	Forgot to bump version number after G_ELI_FLAG_READONLY flag addition.	2006-08-11 18:39:58 +00:00
Marcel Moolenaar	f56b5a43dc	Strengthen the check for a PMBR: o PMBR partitions count to the number of partitions on the disk, which means that if a PMBR entry is invalid we will not treat the MBR as a PMBR by virtue of it not describing any partitions. Previously the checks were inconsistent in that an invalid PMBR entry would be harmless when no other partitions exist (we would treat the MBR as a PMBR by virtue of it being empty), but it would be fatal when there is at least one other partition. o The partition size of a PMBR partition is one less than the media size because the GPT starts at the second sector (LBA 1) and extends to the end of the media. For backward bug-compatibility we accept a size that's exactly the media size (FreeBSD bug). Also, when the partition size can not be represented in a 32-bit integral, the partition size in the MBR is to be set to 0xFFFFFFFF. Accept this as a valid size, even if the size can be represented.	2006-08-09 20:53:01 +00:00
Pawel Jakub Dawidek	850590166f	Allow geli to operate on read-only providers. Initial patch from: vd MFC after: 2 weeks	2006-08-09 18:11:14 +00:00
Pawel Jakub Dawidek	de6f1c7c6c	Not only a request from us can be passed to g_{mirror,raid3}_worker() function, but also a request to us, in which case checking bio_cflags is wrong, because the class above us is controlling it, not us. MFC after: 1 week	2006-08-09 09:41:53 +00:00
Marcel Moolenaar	d1a5c5275c	Fix a phase-ordering bug: check the mediasize and sectorsize after we obtained access. It is possible that GPT gets to taste a disk first, which means the disk has not been opened before and it will not get opened until after we checked the mediasize and sectorsize. However, since the mediasize and sectorsize are determined at open and that happens when access is obtained, checking the mediasize and sectorsize before obtaining access may result in GPT rejecting the disk.	2006-08-08 21:33:26 +00:00
Yaroslav Tykhiy	776fc0e90e	Commit the results of the typo hunt by Darren Pilgrim. This change affects documentation and comments only, no real code involved. PR: misc/101245 Submitted by: Darren Pilgrim <darren pilgrim bitfreak org> Tested by: md5(1) MFC after: 1 week	2006-08-04 07:56:35 +00:00
Pawel Jakub Dawidek	3c57a41d7b	Don't use f-word in comments. We are gentlemans. Pointed out by: Maciej Sobczak	2006-08-01 23:17:33 +00:00
Yaroslav Tykhiy	f6829a059f	Fix what looks like a typo: MODULE_DEPEND() takes module names, not KLD file names; and GELI module's name is g_eli, not geom_eli. Approved by: pjd (silence) MFC after: 5 days	2006-07-27 11:52:12 +00:00
Pawel Jakub Dawidek	8cfab1debb	Don't forget to initialize crp_olen field, which is used to calculate bio_completed value.	2006-07-22 10:05:55 +00:00
Pawel Jakub Dawidek	86ed3c25e1	Always allow to specify components with /dev/ prefix. MFC after: 3 days	2006-07-13 20:37:59 +00:00
Pawel Jakub Dawidek	e8c85a50ae	Only check if we're freeing a valid object if we hold the topology lock. This prevents panic under heavy load with DIAGNOSTIC compiled in.	2006-07-12 15:44:00 +00:00
Pawel Jakub Dawidek	3525bb6b98	Use proper defines instead of magic values. MFC after: 1 week	2006-07-10 21:18:00 +00:00
Pawel Jakub Dawidek	ed940a828d	When kern.geom.raid3.use_malloc tunable is set to 1, malloc(9) instead of uma(9) will be used for memory allocation. In case of problems or tracking bugs, there are more useful tools for malloc(9) debugging than for uma(9) debugging, like memguard(9) and redzone(9). MFC after: 1 week	2006-07-09 12:25:56 +00:00
Pawel Jakub Dawidek	a3cdde5564	Remove bogus assertion. Reported by: Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 3 days	2006-07-07 14:32:27 +00:00
Pawel Jakub Dawidek	1f7fec3cb5	Allow to close access even if device is already destroyed. Reported by: Ulrich Spoerlein <uspoerlein@gmail.com> PR: kern/98093 MFC after: 1 week	2006-07-03 10:32:38 +00:00
Maxim Sobolev	d5046da865	Improve check for protective MBR. Instead of assuming that protective MBR should have only one entry of type 0xEE, consider protective MBR to be one, that has at least one entry of type 0xEE covering the whole unit. This makes GEOM_GPT compatible with disks partitioned by the Apple's BootCamp. Approved in principle by: marcel MFC After: 1 month	2006-06-26 00:32:54 +00:00
Simon L. B. Nielsen	274ede62a8	In g_dev_strategy(), when failing an IO request with EINVAL due to offset or request size which is not a multiple of the sector size, make sure that the bio is set to indicate that no data has actually been transferred. The result of this is that the file offset is no longer incremented for these requests. The fact that the file offset was incremented broke fdisk(8)'s probing of sector size for non-512 byte sector sizes. Reviewed by: phk, cperciva Submitted by: mdodd MFC after: 2 weeks	2006-06-18 22:01:15 +00:00
Pawel Jakub Dawidek	c84efdca04	Allow to use the old -a option to specify an encryption algorithm to use (for backward compatibility), but print a warning to inform about the change.	2006-06-06 22:06:24 +00:00
Pawel Jakub Dawidek	15d6ee8de5	- Unbreak the build when geli is compiled into the kernel (not as module), by silencing unfounded compiler warning. Reported by:	2006-06-06 14:48:19 +00:00
Pawel Jakub Dawidek	eaa3b91996	Implement data integrity verification (data authentication) for geli(8). Supported by: Wheel Sp. z o.o. (http://www.wheel.pl)	2006-06-05 21:38:54 +00:00
Pawel Jakub Dawidek	05bf5e8a0a	Make kern.geom.eli.overwrites sysctl a tunable as well.	2006-06-05 21:25:19 +00:00
Pawel Jakub Dawidek	4bec0ff1c4	Add g_duplicate_bio() function which does the same thing as g_clone_bio(), but g_duplicate_bio() allocates new bio with M_WAITOK flag.	2006-06-05 21:13:22 +00:00
Marcel Moolenaar	ae04949bff	Fix unaligned memory accesses on Alpha and possibly other platforms. By using a pointer to struct dos_partition, we implicitly tell the compiler that the pointer is 4-bytes aligned, even though we know that's not the case. The fact that we only dereference the pointer to access a byte-wide field (field dp_ptyp) is not a guarantee that the compiler will in fact use a byte-wide load. On some platforms it's more efficient to use long word or quad word loads and use bit-shifting and bit-masking to get the intended byte. On those platforms a misaligned load will be the result. The fix is to use byte-wide pointer arithmetic based on sizeof() and offsetof() to avoid invalid casts which avoids that the compiler makes invalid assumptions. Backtrace provided by: wilko@ MFC after: 1 week	2006-06-04 20:26:13 +00:00
Ceri Davies	fccfbec9f2	Remove the trailing half of a sentence which was clearly superseded by the preceding one some time during editing.	2006-05-24 11:02:32 +00:00
Pawel Jakub Dawidek	ee40c7aa76	Use G_RAID3_FOREACH_SAFE_BIO() macro instead of G_RAID3_FOREACH_BIO() in two places where g_io_request() is called. g_io_request() can free bio structure so we can't reference it after and G_RAID3_FOREACH_BIO() macro was doing this. Found by: Coverity Prevent analysis tool (with my new models) MFC after: 1 day	2006-05-04 13:01:16 +00:00
Pawel Jakub Dawidek	ffd106f5a3	We shouldn't lock the topology here - we will panic on assertion inside g_raid3_bump_syncid(). Reported by: Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 1 day	2006-04-30 22:14:17 +00:00
Pawel Jakub Dawidek	84edb86df6	- Don't hold the device sx lock when going to sleep. - Prevent possible live-lock in case of memory problems by freeing already completed requests first. Reported and tested by: markus, Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 1 day	2006-04-28 12:18:03 +00:00
Pawel Jakub Dawidek	a2fe5c6676	- Remove dead code. - Comment possible event miss, which isn't critical, but probably can be fixed by replacing the event lock usage with the queue lock. MFC after: 2 weeks	2006-04-28 12:13:49 +00:00
Pawel Jakub Dawidek	18486a5ee3	Be sure to not destroy device twice. This is not possible in theory, but with this change there is even no theoretical race. MFC after: 2 weeks	2006-04-28 11:52:45 +00:00
Pawel Jakub Dawidek	a063667622	Be sure to not destroy device twice. This is not possible in theory, but with this change there is even no theoretical race. MFC after: 2 weeks	2006-04-28 11:47:28 +00:00
Pawel Jakub Dawidek	5af2ae28f6	geli(8) provides keys on newsession time, so remove CRD_F_KEY_EXPLICIT flag as HW crypto drivers don't support it.	2006-04-20 06:33:46 +00:00
Pawel Jakub Dawidek	c082905bb6	Fix storing offset of already synchronized data. Offset in entire array was stored in metadata instead of an offset in single disk. After reboot/crash synchronization process started from a wrong offset skipping (not synchronizing) part of the component which can lead to data corruption (when synchronization process was interrupted on initial synchronization) or other strange situations like 'graid3 status' showing value more than 100%. Reported, reviewed and tested by: ru Reported by: Dmitry Morozovsky <marck@rinet.ru> MFC after: 1 day	2006-04-18 13:52:11 +00:00
Pawel Jakub Dawidek	cd0d707eb7	Correct debug: we are sending child bio here, not parent bio. MFC after: 1 week	2006-04-15 18:30:42 +00:00
Martin Cracauer	3f4f4a1465	Make CCD be able to read and write Linux software raids. Supported for raid-0 with <n> disks, raid-1 with 2 disks. Manpages have examples, warnings etc. Test scripts on http://www.cons.org/cracauer/ccdconfig-linux/ Reviewed by: alfred	2006-04-13 20:35:31 +00:00
Pawel Jakub Dawidek	d3a1be900a	Pass BIO_GETATTR requests down. MFC after: 1 week	2006-04-12 12:18:44 +00:00
Pawel Jakub Dawidek	712fe9bd7a	Introduce and use delayed-destruction functionality from a pre-sync hook, which means that devices will be destroyed on last close. This fixes destruction order problems when, eg. RAID3 array is build on top of RAID1 arrays. Requested, reviewed and tested by: ru MFC after: 2 weeks	2006-04-10 10:32:22 +00:00
Marcel Moolenaar	ec0889a069	MFp4: o Implement the remove verb to remove a partition entry. o Improve error reporting by first checking that the verb is valid. o Add an entry parameter to the add verb. This parameter can be both read-only as well as read-write and specifies the entry number of the newly added partition. o Make sure that the provider is alive when passed to us. It may be withering away. o When adding a new partition entry, test for overlaps with existing partitions.	2006-04-10 04:03:14 +00:00
Marcel Moolenaar	d99c155975	Add g_wither_provider() to abstract the details of destroying a particular provider. Use this function where g_orphan_provider() is being called so that the flags are updated correctly and g_orphan_provider() is called only when allowed.	2006-04-10 03:55:13 +00:00
Marcel Moolenaar	41063f9380	Change gctl_set_param() to return an error instead of setting an error on the request. Add a wrapper, gctl_set_param_err(), that sets the error on the request from the error returned by gctl_set_param() and update current callers of gctl_set_param() to call gctl_set_param_err() instead. This makes gctl_set_param() much more usable in situations where the caller knows better what to do with certain (apparent) error conditions and setting an error on the request is not one of the things that need to be done.	2006-04-07 16:19:48 +00:00
Pawel Jakub Dawidek	39d92f5fa3	Typos.	2006-04-05 22:07:31 +00:00
Pawel Jakub Dawidek	700e04d9b6	Revert previous change, as I fixed MD5(9).	2006-03-30 18:50:00 +00:00
Pawel Jakub Dawidek	8e88808915	md_hash field in g_eli_metadata structure is not 4 byte aligned, which causes a panic on sparc64. The problem is in MD5(9) implementation. The Encode() function takes 'unsigned char output' as its first argument, which is then assigned to 'u_int32_t op'. If the 'output' argument is not 4 byte aligned (and in geli(8) case it is not), sparc64 machine will panic. I don't know how to fix MD5(9) in a clean way, so I'm implementing a work-around in geli(8). Reported by: brueffer MFC after: 3 days	2006-03-30 14:41:13 +00:00
Lukas Ertl	ff91880e5d	Protect from creating striped and RAID5 plexes with unequally sized subdisks.	2006-03-30 14:01:25 +00:00
Pawel Jakub Dawidek	2e128ca835	- 'ndisks' variable is not boolean, so compare it with a value. - Keep conditions order consistent with the comment above. MFC after: 3 days	2006-03-30 12:15:41 +00:00
Pawel Jakub Dawidek	0d14fae5f3	Preserve previous behaviour of kern.geom.raid3.n{64,16,4}k tunables where 0 means unlimited. Reported by: ru MFC after: 3 days	2006-03-28 18:34:36 +00:00
Pawel Jakub Dawidek	d7fad9f651	Increase debug level for "Thread exiting." message. It's not that important and is 0 by accident. MFC after: 3 days	2006-03-25 23:30:36 +00:00
Lukas Ertl	5c391fb60c	Fix whitespace.	2006-03-23 20:01:13 +00:00
Lukas Ertl	7b5264faa1	Implement the 'resetconfig' command. PR: kern/94835 Submitted by: Ulf Lilleengen <lulf@stud.ntnu.no>	2006-03-23 19:58:43 +00:00
Pawel Jakub Dawidek	9bfdf5987d	Update copyright for 2006.	2006-03-19 12:55:51 +00:00
Pawel Jakub Dawidek	e675705966	kern.geom.raid3.sync_requests=2 seems to be a better default - it still keeps disks very busy, but makes system much more responsive. While here, kill extra space.	2006-03-19 11:18:33 +00:00
Pawel Jakub Dawidek	18d370acae	kern.geom.mirror.sync_requests=2 seems to be a better default - it still keeps disks very busy, but makes system much more responsive. While here, kill extra space.	2006-03-19 10:49:05 +00:00
Ruslan Ermilov	ad5722357f	Fix a typo.	2006-03-13 14:59:57 +00:00
Ruslan Ermilov	ef25813de6	Fix build on 64-bit platforms.	2006-03-13 14:48:45 +00:00
Pawel Jakub Dawidek	3650be51e2	- Reimplement I/O data allocation to prevent deadlocks. Submitted by: green - Speed up synchronization process by using configurable number of I/O requests in parallel. + Add kern.geom.raid3.sync_requests tunable which defines how many parallel I/O requests should be used. + Retire kern.geom.raid3.reqs_per_sync and kern.geom.raid3.syncs_per_sec sysctls. - Fix race between regular and synchronization requests. - Reimplement raid3's data synchronization - do not use the topology lock for this purpose, as it may cause deadlocks. - Stop synchronization from pre-sync hook. - Fix some other minor issues. Tested by: Mike Tancsa <mike@sentex.net> MFC after: 3 days	2006-03-13 01:03:18 +00:00
Pawel Jakub Dawidek	855761d5db	- Speed up synchronization process by using configurable number of I/O requests in parallel. + Add kern.geom.mirror.sync_requests tunable which defines how many parallel I/O requests should be used. + Retire kern.geom.mirror.reqs_per_sync and kern.geom.mirror.syncs_per_sec sysctls. - Fix race between regular and synchronization requests. - Reimplement mirror's data synchronization - do not use the topology lock for this purpose, as it may cause deadlocks. - Stop synchronization from pre-sync hook. - Fix some other minor issues. MFC after: 3 days	2006-03-13 00:58:41 +00:00
Pawel Jakub Dawidek	9d793bdd46	When inserting a new component md_provsize metadata field wasn't set, which means that old problem was triggered (when two providers end at the same offset, eg. ad0 and ad0s1 and the wrong one is picked up by gmirror/graid3). Reported by: Michal Suszko <dry@dry.pl> MFC after: 3 days	2006-03-10 07:41:31 +00:00
Pawel Jakub Dawidek	4686187543	Allow to dump kernel to gmirror providers. Some conditions have to be met to make it work properly. This will be described in the manual page. MFC after: 3 days	2006-03-08 08:27:33 +00:00
Pawel Jakub Dawidek	99c889fc7d	We need to check if file system size is equal to provider's size, because sysinstall(8) still bogusly puts first partition at offset 0 instead of 16, so glabel/ufs will find file system on slice instead of partition. Before sysinstall is fixed, we must keep this code, which means that we won't be able to detect UFS file systems created with 'newfs -s ...'. PS. bsdlabel(8) creates partitions properly. MFC after: 3 days	2006-03-04 19:41:54 +00:00
Jeff Roberson	420239c773	- Lock Giant if needed around the call to vnode_create_vobject(). This is only important if devfs is not mpsafe. Sponsored by: Isilon Systems, Inc. Found by: kris	2006-03-02 05:37:44 +00:00
Pawel Jakub Dawidek	92ee312dd4	Assert proper use of bio_caller1, bio_caller2, bio_cflags, bio_driver1, bio_driver2 and bio_pflags fields. Reviewed by: phk	2006-03-01 19:01:58 +00:00
Pawel Jakub Dawidek	290c616103	Do not use bio structure after g_io_deliver(), it may no longer be valid. Found and fixed by: Vsevolod Lobko <seva@ip.net.ua> MFC after: 3 days	2006-02-22 10:21:05 +00:00
Pawel Jakub Dawidek	3d48264f02	Inform when label disappears. MFC after: 3 days	2006-02-18 11:24:00 +00:00
Pawel Jakub Dawidek	bdf2e45a5c	Allow to use g_slice_orphan() from outside. MFC after: 3 days	2006-02-18 11:21:17 +00:00
Pawel Jakub Dawidek	c058f51257	- Do not depend on fact that file system covers entire provider. It won't work for file systems created with -s option. Use better file system verification. - Add myself to the copyright. MFC after: 3 days	2006-02-18 10:59:47 +00:00
Pawel Jakub Dawidek	17fb8ae78f	This function returns nothing.	2006-02-18 03:04:26 +00:00
Pawel Jakub Dawidek	33361bb5db	If provider's sector size prevents reading SBLOCKSIZE bytes return immediately.	2006-02-18 03:00:49 +00:00
Pawel Jakub Dawidek	bf31327cca	On component state change to ACTIVE don't forget to update metadata. MFC after: 3 days	2006-02-12 17:38:09 +00:00
Pawel Jakub Dawidek	01f1f41c25	Use time_uptime instead of time_second, as the latter may go backwards. Suggested by: ru MFC after: 3 days	2006-02-12 17:36:09 +00:00
Pawel Jakub Dawidek	67cae8aab8	Allow to set kern.geom.raid3.disconnect_on_failure from loader.conf. MFC after: 3 days	2006-02-12 02:01:38 +00:00
Pawel Jakub Dawidek	3aae74ec02	- Add kern.geom.raid3.disconnect_on_failure sysctl/tunable (default to 1 to preserve current behaviour). When set to 0, components are not disconnected - graid3 will try to still use them (only first error will be logged). This is helpful when we have two broken components, but in different places, so actually all data is available. Such buggy component will be visible in 'graid3 list' output with flag BROKEN. - Never disconnect the last valid component. If we detect errors there we will just pass them up. This wasn't reasonable to deny access to the whole provider because of one broken sector. Prodded by: ru MFC after: 3 days	2006-02-11 17:42:31 +00:00
Pawel Jakub Dawidek	d4b0268a24	- Add kern.geom.mirror.disconnect_on_failure sysctl/tunable (default to 1 to preserve current behaviour). When set to 0, components are not disconnected - gmirror will try to still use them (only first error will be logged). This is helpful when we have two broken components, but in different places, so actually all data is available. Such buggy component will be visible in 'gmirror list' output with flag BROKEN. - Never disconnect the last valid component. If we detect errors there we will just pass them up. This wasn't reasonable to deny access to the whole provider because of one broken sector. Prodded by: ru MFC after: 3 days	2006-02-11 17:39:29 +00:00
Pawel Jakub Dawidek	17fec17e77	Correct typo. 'fbp' is NULL here so this will result in a panic. MFC after: 3 days	2006-02-11 17:29:06 +00:00
Pawel Jakub Dawidek	0962f94295	Mark array as CLEAN when there are no write requests in kern.geom.raid3.idletime seconds. Write, not any requests. Mark array as clean immediately on last write close. Prodded by: ru MFC after: 3 days	2006-02-11 14:42:58 +00:00
Pawel Jakub Dawidek	fe6f94ea84	Mark array as CLEAN when there are no write requests in kern.geom.mirror.idletime seconds. Write, not any requests. Mark array as clean immediately on last write close. Prodded by: ru MFC after: 3 days	2006-02-11 14:42:23 +00:00
Pawel Jakub Dawidek	9af2131b78	Teach geli how to load keyfiles before root file system is mounted. An example entries for loader.conf to make it possible: geli_da0_keyfile0_load="YES" geli_da0_keyfile0_type="da0:geli_keyfile0" geli_da0_keyfile0_name="/boot/keys/da0.key0" geli_da0_keyfile1_load="YES" geli_da0_keyfile1_type="da0:geli_keyfile1" geli_da0_keyfile1_name="/boot/keys/da0.key1" geli_da0_keyfile2_load="YES" geli_da0_keyfile2_type="da0:geli_keyfile2" geli_da0_keyfile2_name="/boot/keys/da0.key2" geli_da1s3a_keyfile0_load="YES" geli_da1s3a_keyfile0_type="da1s3a:geli_keyfile0" geli_da1s3a_keyfile0_name="/boot/keys/da1s3a.key" Thanks for jhb and kan who showed me the right direction. MFC after: 3 days	2006-02-11 13:08:24 +00:00
Pawel Jakub Dawidek	a80f82a4a3	Check rootvnode variable to see if we still want to ask for passphrase on boot. Other methods just don't work properly. MFC after: 3 days	2006-02-11 12:45:01 +00:00
Lukas Ertl	d9a7dc858a	Catch the case when a subdisk has no provider or no consumer attached to it.	2006-02-08 21:32:45 +00:00
Christian Brueffer	9864500624	Clean up some sysctl descriptions, debug messages etc. Approved by: pjd MFC after: 3 days	2006-02-07 17:23:22 +00:00
Pawel Jakub Dawidek	38ea96ac99	Remove trailing spaces.	2006-02-01 12:06:01 +00:00
Pawel Jakub Dawidek	aaf8e1867b	Allow to specify only one disk. This is helpful when we want to extend our concatenated device later. MFC after: 1 week	2006-01-30 22:47:07 +00:00
Pawel Jakub Dawidek	87e9d284dc	Fix typo which caused the 64kB elements limit to be set improperly and the 16kB elements limit not to be set at all. Submitted by: Vsevolod Lobko <seva@ip.net.ua> MFC after: 3 days	2006-01-30 22:45:43 +00:00
Max Khon	3795fc308f	Rename geom_uzip class to g_uzip in order to be consistent with the naming of other GEOM modules. PR: 89998	2006-01-22 15:35:14 +00:00
Pawel Jakub Dawidek	4f9bcb9f4f	Fix bio leak in case of malloc(9) failure. Found by: Coverity Prevent(tm) Coverity ID: CID794 MFC after: 3 days	2006-01-18 21:44:57 +00:00
Pawel Jakub Dawidek	e9b936c73c	Remove dead code. Found by: Coverity Prevent(tm) Coverity ID: CID105 MFC after: 3 days	2006-01-18 21:43:27 +00:00
Pawel Jakub Dawidek	a49c0bd40a	Remove dead code. Found by: Coverity Prevent(tm) Coverity ID: CID104 MFC after: 3 days	2006-01-18 21:42:19 +00:00
Pawel Jakub Dawidek	481b55b1e3	Style cleanups. X-MFC-after: Already MFCed to RELENG_6 by accident.	2006-01-18 11:03:20 +00:00
Pawel Jakub Dawidek	a1c10dcb4c	Move $FreeBSD$ from comment to __FBSDID().	2006-01-17 11:48:16 +00:00
Pawel Jakub Dawidek	7d54b385a6	- Use better types. - Log problems at level 0 when killing providers. MFC after: 3 days	2006-01-17 07:32:43 +00:00
Pawel Jakub Dawidek	b5f30223fc	Check return value. Found by: Coverity Prevent(tm) MFC after: 3 days	2006-01-17 07:30:34 +00:00
Pawel Jakub Dawidek	7192f621d0	Remove dead code. Found by: Coverity Prevent(tm) MFC after: 3 days	2006-01-17 07:27:46 +00:00
Pawel Jakub Dawidek	4ec0490779	Remove unused value. Found by: Coverity Prevent(tm) MFC after: 3 days	2006-01-17 07:26:48 +00:00
Pawel Jakub Dawidek	58d85f544f	Log situation when EIO is returned.	2006-01-17 07:23:36 +00:00
Pawel Jakub Dawidek	54df0743c7	Remove bio leak when EIO error is emulated. Found by: Coverity Prevent(tm) MFC after: 3 days	2006-01-17 07:22:44 +00:00
Lukas Ertl	d5817a5009	Get rid of the gv_bioq hack in most parts of the I/O path and use the standard bioq structures.	2006-01-06 18:03:17 +00:00
Pawel Jakub Dawidek	b91df0e29e	MFp4: Typo fix (without it the XML GEOM tree wasn't consistent). Reported by: Eric Anderson <anderson@centtech.com>	2005-12-19 06:05:40 +00:00
Pawel Jakub Dawidek	64806a739b	Fix build breakage by fixing typo. Reported by: glebius	2005-12-09 11:38:02 +00:00
Pawel Jakub Dawidek	24e1fdcd1a	- Allow to specify the byte which will be used for filling read buffer. - Improve sysctl description a bit. Submitted by: Ivan Voras <ivoras@gmail.com>	2005-12-08 23:06:59 +00:00
Pawel Jakub Dawidek	df3d5a19fc	Teach NOP GEOM class how to gather the following statistics: - number of read I/O requests, - number of write I/O requests, - number of read bytes, - number of written bytes. Add 'reset' subcommand for resetting statistics.	2005-12-08 23:00:31 +00:00
Maxim Sobolev	6023800194	It is unclear who is wrong and who is right, but when operating on plain file bsdlabel(8) always writes label at a fixed offset from its beginning (512 bytes), regardless of the sector size. At the same time, bsdlabel geom class expects label to be available at the very beginning of the second sector. As a result, images prepared in userland for media with sector size different from 512 bytes (i.e. 2k for cdroms) are not recognized by the tasting mechanism. Solve the problem by always looking for the label at 512-byte offset if we can't find it at the beginning of the second sector and sector size is not 512 bytes.	2005-11-30 22:54:41 +00:00
Maxim Sobolev	b53a1cf306	Don't pass error value pointer to g_read_data(9) at all if we don't have any use of it. Suggested by: pjd	2005-11-30 22:15:00 +00:00
Maxim Sobolev	8a4a44b5aa	Check for g_read_data(9) errors properly: o The only indication of error condition is NULL value returned by the function; o value pointed to by error argument is undefined in the case when operation completes successfully. Discussed with: phk	2005-11-30 19:24:51 +00:00
Maxim Sobolev	6dfb88de83	Kill leading whitespace.	2005-11-30 19:07:28 +00:00
Pawel Jakub Dawidek	88d172946f	We do nothing with returned error value, so just remove it.	2005-11-29 12:07:10 +00:00
Maxim Sobolev	5c8a6f63b2	Check value returned by g_read_data(9), otherwise we can end in panic(9) if read error happens. MFC after: 1 week	2005-11-29 03:03:34 +00:00
Lukas Ertl	8c957640aa	Add sysctl descriptions.	2005-11-25 10:09:30 +00:00
Lukas Ertl	e30534d50b	Since we want a vinum geom created anytime the module loads, move the geom creation to a separate init function and ignore the tasting. The config is now parsed only in the vinumdrive geom, which hopefully fixes the problem, that the drive class tasted before the vinum class had a chance, for good. Also restore the behaviour that the module can be loaded at boot time and on a running system.	2005-11-24 15:11:41 +00:00
Lukas Ertl	8eb116b96e	Whitespace.	2005-11-20 12:14:18 +00:00
Lukas Ertl	2e42895a97	Always declare variables at the start of the function. Don't allocate potentially large variables on the stack. Check strsep() return values when the string comes from userland. Shorten variable names for lucidity's sake. most of the stuff: Pointed out by: njl@	2005-11-20 12:12:31 +00:00
Lukas Ertl	47517eab34	Fix whitespace issue. Pointed out by: joel@	2005-11-20 10:40:06 +00:00
Lukas Ertl	57335408d4	Finally bring in what was produced during Google SoC 2005: Add functions to rename objects and to move a subdisk from one drive to another. Obtained from: Chris Jones <chris.jones@ualberta.ca> Sponsored by: Google Summer of Code 2005 MFC in: 1 week	2005-11-19 20:25:18 +00:00
John Polstra	a7e69e8b7d	Fix a bug that caused some /dev entries to continue to exist after the underlying drive had been hot-unplugged from the system. Here is a specific example. Filesystem code had opened /dev/da1s1e. Subsequently, the drive was hot-unplugged. This (correctly) caused all of the associated /dev/da1* entries to be deleted. When the filesystem later realized that the drive was gone it closed the device, reducing the write-access counts to 0 on the geom providers for da1s1e, da1s1, and da1. This caused geom to re-taste the providers, resulting in the devices being created again. When the drive was hot-plugged back in, it resulted in duplicate /dev entries for da1s1e, da1s1, and da1. This fix adds a new disk_gone() function which is called by CAM when a drive goes away. It orphans all of the providers associated with the drive, setting an error condition of ENXIO in each one. In addition, we prevent a re-taste on last close for writing if an error condition has been set in the provider. Sponsored by: Isilon Systems Reviewed by: phk MFC after: 1 week	2005-11-18 02:43:49 +00:00
Marcel Moolenaar	ceba44f8d4	o Slightly refactor the ctlreq code to maximize code sharing between verbs. Only the create verb operates on a provider. All other verbs operate on a GPT geom. Also, the GPT entry oriented verbs require a non-downgraded GPT. o Have all verbs take an optional flags parameter. The flags parameter is a string of single-letter flags. The typical use of these flags is to enable certain behaviour in support of the gpt(8) tool. o Add dummy implementations for the destroy and recover verbs. This change causes test 2 of the GPT regression test suite to fail. The presence of a geom parameter is now required even for unknown verbs.	2005-11-13 21:53:55 +00:00
Marcel Moolenaar	d40f7f8b3e	Make the kern.geom.conftxt sysctl more usable by also dumping the MD class. Previously only the DISK class was dumped. The only consumer of this sysctl is libdisk (i.e. sysinstall) and it tests explicitly for instances of the DISK class. Dumping other classes is therefore harmless. By also dumping the MD class regression tests can be written that use the MD class for operations that would normally be done on the DISK class. The sysctl can now be used to test if those operations took an effect. An example is partitioning.	2005-11-12 20:02:02 +00:00
Robert Watson	5bb84bc84b	Normalize a significant number of kernel malloc type names: - Prefer '_' to ' ', as it results in more easily parsed results in memory monitoring tools such as vmstat. - Remove punctuation that is incompatible with using memory type names as file names, such as '/' characters. - Disambiguate some collisions by adding subsystem prefixes to some memory types. - Generally prefer lower case to upper case. - If the same type is defined in multiple architecture directories, attempt to use the same name in additional cases. Not all instances were caught in this change, so more work is required to finish this conversion. Similar changes are required for UMA zone names.	2005-10-31 15:41:29 +00:00
Pawel Jakub Dawidek	a65a0da2f9	Fix possible live-lock under heavy load where we can't allocate more memory for request. I was sure graid3 should handle such situations well, but green@ reported it is not and we want to fix it before 6.0. Submitted by: green	2005-10-28 20:25:02 +00:00
Takanori Watanabe	f83da457dc	Add checking for File record magic.	2005-10-26 03:24:28 +00:00
Marcel Moolenaar	ada6a4d2b7	Rough implementation of the create and add verbs. The verbs cause in-memory changes only and as such are only useful for prototyping and regression testing purposes.	2005-10-09 17:10:35 +00:00
Tor Egge	2e93e9099e	Move some devstat collection to below where large IO operations are chopped up. This make iostat report operations passed down to the device driver instead of operations passed down to GEOM disk. The transfer size limit imposed by the device driver is no longer hidden, improving the correlation between iostat output and device driver workload.	2005-09-30 17:32:08 +00:00
Max Khon	f3b5092061	- Fix "end_blk out of range" panic when INVARIANTS. - Do not allow rw access. Submitted by: Dario Freni <saturnero at freesbie dot org> MFC after: 3 days	2005-09-29 22:45:16 +00:00
Marcel Moolenaar	40fcaded53	o Don't cause a panic when the control request lacks a verb. o Don't set the error twice when the named class does not exist. It causes ioctl(2) to return with error EEXIST.	2005-09-18 23:54:40 +00:00
Marcel Moolenaar	233b044b9b	Complete rewrite in preparation of adding support for control requests. The following features have been added: 1. Extensive checking and validation of both the primary and secondary headers to protect against corrupted data and to take advantage of the redundancy to allow the GPT to be used in the face of recoverable corruption. 2. Dynamic data-structures to avoid hardcoding gratuitous table limits so as to support the creation of GPT tables of (as of yet) unspecified size. 3. Only allow kernel dumps to swap partitions to provide the necessary anti-footshooting measures. Linux swap partitions are allowed. 4. Complete dump of the GPT configuration, including labels. 5. Supports Byte Order Mark (U+FEFF) handling for big-endian, little-endian and mixed-endian partition names.	2005-09-17 07:05:17 +00:00
John Baldwin	51460da87f	- Add a new simple facility for marking the current thread as being in a state where sleeping on a sleep queue is not allowed. The facility doesn't support recursion but uses a simple private per-thread flag (TDP_NOSLEEPING). The sleepq_add() function will panic if the flag is set and INVARIANTS is enabled. - Use this new facility to replace the g_xup and g_xdown mutexes that were (ab)used to achieve similar behavior. - Disallow sleeping in interrupt threads when invoking interrupt handlers. MFC after: 1 week Reviewed by: phk	2005-09-15 19:05:37 +00:00
Craig Rodrigues	318c3a55f0	Fix so that when a slice or a partition is removed through g_slice_config(), it is destroyed in GEOM, in addition to being removed from /dev. Before this patch, if you applied a new MBR which deleted a slice, the deleted slice would not be in /dev, but it would still appear in kern.geom.conftxt and kern.geom.confxml, which would confuse the diskPartitionEditor in sysinstall. Submitted by: pjd Tested by: pjd, rodrigc MFC after: 1 week	2005-09-14 21:38:35 +00:00
Pawel Jakub Dawidek	71270ca60b	Fix copy&paste typo. MFC after: 3 days	2005-09-10 07:46:47 +00:00
Pawel Jakub Dawidek	cf47954083	Don't forget to initialize crp_etype field. Reported by: Nick Evans <nevans@syphen.net> MFC after: 3 days	2005-09-10 07:45:10 +00:00
Lukas Ertl	fcac1be89a	Set the G_PF_WITHER flag on the subdisk provider that is about to be destroyed. That way the GEOM system handles all deallocations and we don't have to do it ourselves.	2005-09-08 20:08:46 +00:00
Poul-Henning Kamp	e4da09c03f	Remove a race condition that could result in processes being stuck waiting for geom events to happen: Instead of maintaining a count of outstanding events, simply look if the queue is empty. Make sure to not remove events from the queue until they are executed in order to not open a new race. Much work by: pjd Tested by: kris MT6: yes, should be.	2005-09-04 19:14:19 +00:00
Poul-Henning Kamp	ac4b76b7ff	Typo.	2005-09-03 11:03:10 +00:00
Pawel Jakub Dawidek	3b37814794	Use KTR to log allocations and destructions of bios. This should hopefully allow to track down "duplicate free of g_bio" panics.	2005-08-29 11:39:24 +00:00
Lukas Ertl	1f710312a2	Prevent that sync operations can be started when they are already in progress, and be a bit more user friendly in terms of error messages returned from the kernel.	2005-08-28 18:16:31 +00:00
Pawel Jakub Dawidek	3ae0e7d8ae	Verify length of the data to read as well.	2005-08-28 00:14:21 +00:00
Lukas Ertl	df5175af0f	Shuffle around the order in which the components are compiled. This way, the VINUMDRIVE class is loaded before the VINUM class, but since geom does the tasting for newly arrived classes last-in-first-out, the VINUM class tastes first. This removes the need to call gv_parse_config() in the drive taste path.	2005-08-26 14:40:32 +00:00
Pawel Jakub Dawidek	9d34e94d14	Verify offset before reading. MFC after: 2 days	2005-08-26 12:50:08 +00:00
Takanori Watanabe	7ba4d2eaeb	Add NTFS labeling function. Reviewed by:pjd	2005-08-26 11:35:10 +00:00
Pawel Jakub Dawidek	a180109fa3	Verify if we can actually read the data at given offset. Reported by: Martin <nakal@nurfuerspam.de>	2005-08-23 18:55:38 +00:00
Lukas Ertl	fdb9eda84f	Correct the check if a plex is accessible in case it is not up. This makes degraded RAID5 plexes actually work.	2005-08-22 23:24:26 +00:00
Pawel Jakub Dawidek	dd549194ae	By default, when doing crypto work in software, start as many threads as we have active CPUs and bind each thread to its own CPU. MFC after: 3 days	2005-08-21 18:12:51 +00:00
Pawel Jakub Dawidek	b8db9f58da	Remove stale comment (we now always start worker thread). MFC after: 3 days	2005-08-21 18:06:35 +00:00
Pawel Jakub Dawidek	b866c830d9	Back-out the change from revision 1.14 and allow for '/' in labels again. Convinced by: green, Gavin Atkinson, dougb, gordon MFC after: 1 day	2005-08-20 17:05:47 +00:00
Pawel Jakub Dawidek	efd9ac0dfc	Add a __packed keyword to g_eli_metadata struct definition, so sizeof(struct g_eli_metadata) will return the exact number of bytes needed for storing it on the disk. Without this change GELI was unusable on amd64 (and probably other 64-bit archs), because sizeof(struct g_eli_metadata) was greater than 512 bytes and geli(8) was failing on assertion. Reported by: Michael Reifenberger <mike@Reifenberger.com> MFC after: 3 days	2005-08-20 10:43:03 +00:00
Pawel Jakub Dawidek	7a5c26fcbd	Allow to change number of iterations for PKCS#5v2. It can only be used when there is only one key set. MFC after: 3 days	2005-08-19 22:19:25 +00:00
Pawel Jakub Dawidek	fcd46203c5	- Add a missing period. - Fix number of spaces. MFC after: 3 days	2005-08-19 22:16:26 +00:00
Pawel Jakub Dawidek	a95452ee8d	Avoid code duplication and implement bitcount32() function in systm.h only. Reviewed by: cperciva MFC after: 3 days	2005-08-19 22:10:19 +00:00
Pawel Jakub Dawidek	dddd1d537a	Always run dedicated kernel thread (even when we have hardware support). There is no performance impact, but it allows memory to be allocated with the M_WAITOK flag. As a side effect this simplifies the code a bit. MFC after: 3 days	2005-08-17 15:25:57 +00:00
Pawel Jakub Dawidek	bf71eaacf1	We should now return 0.	2005-08-17 15:12:34 +00:00
Pawel Jakub Dawidek	d1dca8a818	Even if crypto_dispatch() returns an error, the request is not canceled and our callback will still be called, just to tell us that the request failed... Reported by: Mike Tancsa <mike@sentex.net> MFC after: 3 days	2005-08-17 14:34:52 +00:00
Pawel Jakub Dawidek	2be2b2eab5	We don't need to clear allocated memory. This will speed-up things a bit. MFC after: 3 days	2005-08-17 14:08:50 +00:00
Poul-Henning Kamp	52d71e1a85	remove stale comments	2005-08-16 20:03:29 +00:00
Lukas Ertl	664a97517f	Make it possible to remove stale, left-over subdisks.	2005-08-16 15:12:44 +00:00
Lukas Ertl	8cc5eb98ad	Fix a stupid logic bug introduced in geom_vinum_drive.c rev 1.18: When a drive is newly created, its state is initially set to 'down', so it won't allow saving the config to it (thus it will never know of itself being created). Work around this by adding a new flag, that's also checked when saving the config to a drive.	2005-08-15 17:07:47 +00:00
Pawel Jakub Dawidek	bb30fea667	Because code paths for I/O requests are quite complex, add comments above the functions which participate in I/O paths. MFC after: 1 day	2005-08-13 17:45:37 +00:00
Pawel Jakub Dawidek	ac445fbab5	Provide more complete "How to add a new file system to glabel." list. MFC after: 1 week	2005-08-12 00:34:45 +00:00
Pawel Jakub Dawidek	9417a618d1	Add code for Ext2FS and ReiserFS labels recognition. Submitted by: Stanislav Sedov <stas@310.ru> PR: kern/84638 MFC after: 1 week	2005-08-12 00:27:45 +00:00
Pawel Jakub Dawidek	055c32a1bc	Avoid creating directories in devfs by changing all '/' in labels to '_'. Idea from: Stanislav Sedov <stas@310.ru> MFC after: 3 days	2005-08-12 00:05:09 +00:00
Pawel Jakub Dawidek	6985decf3c	GELI doesn't need cryptodev. MFC after: 3 days	2005-08-11 14:52:27 +00:00
Pawel Jakub Dawidek	6eb1d21f14	Be case-insensitive when dealing with algorithm names. PR: kern/84659 Submitted by: Benjamin Lutz <benlutz@datacomm.ch>	2005-08-08 19:40:38 +00:00
Pawel Jakub Dawidek	ea35a2ec3a	MFp4: Export more information about encrypted providers. MFC after: 1 week	2005-07-27 22:31:57 +00:00
Pawel Jakub Dawidek	7625429883	Reduce default debug level to 0. MFC after: 1 week	2005-07-27 21:48:47 +00:00
Pawel Jakub Dawidek	c58794debd	Add GEOM_ELI class which provides GEOM providers encryption. For features list and usage see manual page: geli(8). Sponsored by: Wheel Sp. z o.o. http://www.wheel.pl MFC after: 1 week	2005-07-27 21:43:37 +00:00
Pawel Jakub Dawidek	4ed854e8d4	Use root_mount KPI for RAID3 to delay root file system mount. Actually, one cannot setup root file system on RAID3 device, but when other file system exist in /etc/fstab which are placed on RAID3 device, boot process will be interrupted when these devices are missing. MFC after: 3 days X-MFC-note: MFC only to RELENG_6, as RELENG_5 doesn't have root_mount KPI.	2005-07-27 09:03:51 +00:00
Poul-Henning Kamp	8827c821de	By design I left a tiny race in updating the I/O statistics based on the assumption that performance was more important than beancounter quality statistics. As it transpires the microoptimization is not measurable in the real world and the inconsistent statistics confuse users, so revert the decision. MT6 candidate: possibly MT5 candidate: possibly	2005-07-25 21:12:54 +00:00
Pawel Jakub Dawidek	565bc10111	Add a very simple and small GEOM class - ZERO. It creates very huge provider (41PB) /dev/gzero. On BIO_READ request it zero-fills bio_data and on BIO_WRITE it does nothing. You can also set kern.geom.zero.clear sysctl to 0 to do nothing even for BIO_READ. I'm using it for performance testing where it is very helpful. MFC after: 3 days	2005-07-25 10:03:16 +00:00
Poul-Henning Kamp	0322f8dc8d	Comment typo	2005-07-20 18:08:16 +00:00
Pawel Jakub Dawidek	0499edf459	Before calling g_orphan_provider(), add G_PF_WITHER flag, so GEOM will know to destroy it. PR: kern/81758 Submitted by: trasz <trasz@buziaczek.pl> MFC after: 3 days	2005-07-17 13:15:02 +00:00
Yoshihiro Takahashi	0bf2708b8c	Merged from geom_mbr.c revisions 1.62 and 1.66. - Implement a gctl handler and the verb "write MBR".	2005-07-15 15:29:45 +00:00
Lukas Ertl	7ad68986b8	) Implement round-robin reads for multiplex volumes. ) Plug a possible memory leak. [1] [1] obtained from: pjd@.	2005-07-15 13:38:06 +00:00
Poul-Henning Kamp	1c3cf26412	Implement a gctl handler and the verb "write MBR" which can be used to update metadata and bootcode while the MBR is in use. MFC candidate	2005-07-15 08:00:44 +00:00
Pawel Jakub Dawidek	84436f14c4	Add CANCEL command which allows to remove one request from the queue or all requests from the queue if request number is not given. Bump version number. Approved by: re (scottl)	2005-07-08 21:08:53 +00:00
Pawel Jakub Dawidek	59ddf345d5	After provider creation!!	2005-05-25 15:54:17 +00:00
Pawel Jakub Dawidek	0f2bbe5ba4	- Call root_mount_rel() when provider IS created, not earlier. This should close the race observed by Daniel Eriksson. - Remove redundant wakeup().	2005-05-25 13:10:04 +00:00
Pawel Jakub Dawidek	4eafb037f6	Add some debug code to diagnose root-on-mirror problems with recent -current. Reported by: Daniel Eriksson	2005-05-23 13:05:07 +00:00
Pawel Jakub Dawidek	d246aa55e7	Correct typo.	2005-05-18 21:53:08 +00:00
Lukas Ertl	0164489c96	When a drive dies, don't call g_wither_geom() directly, but instead post an event to the geom event queue that will take care of it, letting outstanding bios finish, and closing the consumers. Plus some cosmetic clean ups.	2005-05-17 16:38:30 +00:00
Pawel Jakub Dawidek	3ac6c13bd4	cp can't be NULL. Noticed by: Coverity Prevent analysis tool	2005-05-11 19:36:56 +00:00
Pawel Jakub Dawidek	b957751627	gp can't be NULL. Noticed by: Coverity Prevent analysis tool	2005-05-11 19:35:43 +00:00
Pawel Jakub Dawidek	862f5624ea	Add KASSERT() to be sure there is an active component. Suggested by: Coverity Prevent analysis tool	2005-05-11 18:13:51 +00:00
Pawel Jakub Dawidek	0a3384a8f8	Check return value. Found by: Coverity Prevent analysis tool	2005-05-11 18:07:39 +00:00
Yoshihiro Takahashi	16da54931e	Fix signed vs unsigned warning.	2005-05-01 09:44:50 +00:00
Lukas Ertl	bc2d4d6784	Only allow RAID5 plexes to be parity checked. PR: kern/80427 Submitted by: Stijn Hoop <stijn@win.tue.nl>	2005-04-28 13:09:00 +00:00
Pawel Jakub Dawidek	3865ca2e13	Fix provider's size check for 'insert' command. Before this fix one was able to insert one sector too small provider. MFC after: 3 days	2005-04-25 10:41:26 +00:00
Garrett Wollman	d5e3d722df	The size of a filesystem may be less than the size of the provider it resides on. Fix the special case of the filesystem fragment size not evenly dividing the size of the provider. Fixing the general case probably requires better superblock validation (left as an exercise to the reader).	2005-04-19 21:55:28 +00:00
Pawel Jakub Dawidek	7979b3683c	Remove the hack which allowed to use gmirror for root file system, use root_mount KPI instead.	2005-04-19 21:47:25 +00:00
Poul-Henning Kamp	d1c712ede2	Call g_waitidle() instead of GEOM using the root_mount_hold() KPI. GEOM could (and will) get events as a result of drivers coming in late so a one-shot method is not good enough for GEOM.	2005-04-19 06:23:59 +00:00
Poul-Henning Kamp	73fbaa74e5	Add a named reference-count KPI to hold off mounting of the root filesystem. While we wait for holds to be released, print a list of who holds us back once per second. Use the new KPI from GEOM instead of vfs_mount.c calling g_waitidle(). Use the new KPI also from ata. With ATAmkIII's newbusification, ata could narrowly miss the window and ad0 would not exist when we tried to mount root.	2005-04-18 21:21:26 +00:00
Pawel Jakub Dawidek	811787079b	Protect against recursive labels creation in similar way as it is done in BSD and MBR classes, ie. if provider below us uses the same metadata, don't create labels based on the metadata. This allows to create labels on geoms with rank != 1 without hacks. Tested by: Chris Elsworth <chris@shagged.org> on sparc64 OK'ed by: phk MFC after: 2 weeks	2005-04-12 08:14:15 +00:00
Pawel Jakub Dawidek	cdae843174	Fix a long-standing bug. Error string has to be copied from the user process context. Approved by: phk MFC after: 3 days	2005-04-08 09:28:08 +00:00
Pawel Jakub Dawidek	7e0b3120e7	- Add a missing g_io_deliver() in case of allocation failure - we didn't complete I/O requests here. - First allocate all needed bios, so if any of allocations fail, we can free memory before sending any I/O requests down. Reported by: Pawel Malachowski MFC after: 3 days	2005-04-03 14:55:49 +00:00
Yoshihiro Takahashi	612f970e46	Remove geometry translations here.	2005-03-30 12:59:54 +00:00
Joerg Wunsch	3328bbeef2	Support VTOC volume names. This can be useful to distinguish multiple disks in a system. Solaris' format(1m) displays the volume names in the disk overview. MFC after: 1 month	2005-03-30 09:33:10 +00:00
Poul-Henning Kamp	9bb329f4e5	fix a "modify after free" bug which is practically impossible to experience. Found by: Coverity (id #540 #541)	2005-03-26 21:07:35 +00:00
Pawel Jakub Dawidek	34cb151796	If an error occurs, clean up before returning from g_raid3_connect_disk().	2005-03-26 17:24:19 +00:00
Pawel Jakub Dawidek	c2ca10933d	Make the code more obvious - when an error occurs in g_mirror_connect_disk(), detach and destroy consumer before returning.	2005-03-26 17:23:01 +00:00
Pawel Jakub Dawidek	cc6aa917b9	Check for return values. Submitted by: sam Found by: Coverity Prevent analysis tool	2005-03-26 16:51:19 +00:00
Poul-Henning Kamp	cb7ff8b71d	g_read_data() can return NULL, check for it. Found by: Coverity (ID#258)	2005-03-18 07:03:56 +00:00
Poul-Henning Kamp	b3fd9b46bb	After rejecting the bio request early, return instead of panicking. Found by: Coverity (ID#450)	2005-03-18 07:01:31 +00:00
Poul-Henning Kamp	b3b21113a5	Avoid null pointer dereference.	2005-03-18 06:57:58 +00:00
Pawel Jakub Dawidek	42cfb5bada	Plug memory leak. Submitted by: Ted Unangst Found by: Coverity Prevent analysis tool Approved by: phk MFC after: 3 days	2005-03-16 20:48:13 +00:00
Poul-Henning Kamp	20b3501394	forward declare struct disk.	2005-03-15 10:47:38 +00:00
Poul-Henning Kamp	03c02e5cb1	Do not attach MBR on top of an MBR. This removes some confusing slice names on disks with extended partitions. Spotted on: Mother-in-law's computer.	2005-03-14 15:22:18 +00:00
Hajimu UMEMOTO	68527b3aad	stop including rijndael-api-fst.h from rijndael.h. this is required to integrate opencrypto into crypto.	2005-03-11 15:42:51 +00:00
Lukas Ertl	cf01c54cda	Remove test for zero sectorsize when tasting. This check doesn't seem to be necessary anymore, and it prevents tasting a valid drive when booting with geom_vinum already loaded, since SCSI disks set their sectorsize not until first opening them.	2005-03-07 19:58:58 +00:00
Poul-Henning Kamp	3b3f38ed7d	Add placeholder mutex argument to new_unrhdr().	2005-03-07 11:05:47 +00:00
Lukas Ertl	9954331c23	Don't allow to synchronize a plex that is already synchronizing. Reset the 'syncing' flag in case of errors, too. Some cosmetics.	2005-03-04 16:43:40 +00:00
Pawel Jakub Dawidek	e68909854c	- Add md_provsize field to metadata, which will help with shared-last-sector problem. After this change, even if there is more than one provider with the same last sector, the proper one will be chosen based on its size. It still doesn't fix the 'c' partition problem (when da0s1 can be confused with da0s1c) and situation when 'a' partition starts at offset 0 (then da0s1a can be confused with da0s1 and da0s1c). One can use '-h' option there, when creating device or avoid sharing last sector. Actually, when providers share the same last sector and their size is equal, they provide exactly the same data, so the name (da0s1, da0s1a, da0s1c) isn't important at all. - Provide backward compatibility. - Update copyright's year. MFC after: 1 week	2005-02-27 23:07:47 +00:00
Lukas Ertl	d8688e1117	Correctly calculate what to do and how to retry a request to a plex when the previous one failed and there are more than one plex in the volume. This could have led to a flood of error messages on the console and probably a deadlock in certain situations.	2005-02-23 14:59:14 +00:00
Poul-Henning Kamp	dfd4be14bd	Try to unbreak the vnode locking around vop_reclaim() (based mostly on patch from kan@). Pull bufobj_invalbuf() out of vinvalbuf() and make g_vfs call it on close. This is not yet a generally safe function, but for this very specific use it is safe. This solves the problem with buffers not being flushed by unmount or after failed mount attempts.	2005-02-19 11:44:57 +00:00
Lukas Ertl	3608f72533	In case of drive errors, don't close the associated consumer and detach it, but instead let the geom wither away. Bump copyright year.	2005-02-17 16:08:36 +00:00
Pawel Jakub Dawidek	07b9f1becd	Fix year in copyrights.	2005-02-16 22:26:34 +00:00
Pawel Jakub Dawidek	0218292cdf	Update copyright in files changed this year.	2005-02-16 22:14:52 +00:00
Pawel Jakub Dawidek	99394c59ae	Fix year in copyrights.	2005-02-16 22:13:22 +00:00
Pawel Jakub Dawidek	ccbef85dd0	Remove mutex assertion from g_gate_find(). We don't want g_gate_list_mtx mutex to be held here, because we want speed here.	2005-02-16 16:13:56 +00:00
Pawel Jakub Dawidek	f906581296	Remove TDP_GEOM flag from thread after ggate device creation. This flag means "wait for all pending requests before returning to userland". There are pending events for sure, because we just created new provider and other classes want to taste it, but we cannot answer on I/O requests until we're here.	2005-02-16 16:12:28 +00:00
Pawel Jakub Dawidek	35f855d9f9	Fix typo. We want to unlock mutex here. Submitted by: Andreas Kohn <andreas.kohn@gmail.com> MFC after: 1 week	2005-02-12 16:19:03 +00:00
Poul-Henning Kamp	07e95ed633	Make various random things static	2005-02-10 12:10:35 +00:00
Pawel Jakub Dawidek	e35d3a7828	- Remove g_gate_hold()/g_gate_release() from start/done paths. It saves 4 mutex operations per I/O requests. - Use only one mutex to protect both (incoming and outgoing) queue. As MUTEX_PROFILING(9) shows, there is no big contention for this lock. - Protect sc_queue_count with queue mutex, instead of doing atomic operations on it. - Remove DROP_GIANT()/PICKUP_GIANT() - ggate is marked as MPSAFE and no Giant there.	2005-02-09 08:29:39 +00:00
Dag-Erling Smørgrav	04550802d8	merge from geom_vol_ffs.c rev 1.14 (avoid unaligned I/O requests)	2005-02-08 12:34:11 +00:00
Dag-Erling Smørgrav	363de7f683	Take care not to issue unaligned I/O requests while tasting a provider.	2005-02-08 08:04:23 +00:00
Pawel Jakub Dawidek	662a4e5878	- Use bioq_insert_tail()/bioq_insert_head() instead of bioq_disksort(). - Improve mediasize checking. MFC after: 1 week	2005-02-05 00:30:08 +00:00
Poul-Henning Kamp	3ad9f7c2c5	When dumping to an unpartitioned disk, make sure to chop the length of the dump area accordingly. Run into by: scottl	2005-01-29 16:49:43 +00:00
Jeff Roberson	1907e62037	- If mpsafevfs is off, acquire giant around all calls to bufdone(). Sponsored by: Isilon Systems, Inc.	2005-01-28 16:04:44 +00:00
Poul-Henning Kamp	84a6975215	Introduce and use g_vfs_close().	2005-01-25 15:52:04 +00:00
Poul-Henning Kamp	bc0fc6fcc3	Create a correctly sized vnode objects for disk devices.	2005-01-24 22:41:21 +00:00
Jeff Roberson	e9f3e3f8ca	- Don't acquire giant around calls to bufdone(). Sponsored By: Isilon Systems, Inc.	2005-01-24 10:47:46 +00:00
Lukas Ertl	f9b7569c09	Only report state changes of subdisks and plexes when there's really a state change. Reword the info a bit.	2005-01-21 18:27:23 +00:00
Lukas Ertl	0d93122102	Don't initialize error with ENXIO as we might end up here when the plex has no more consumers (e.g. orphaning).	2005-01-21 18:24:20 +00:00
Pawel Jakub Dawidek	857d14cbc9	Protect against recursive slices creation in similar way as it is done in BSD class, ie. if provider below us uses the same metadata, don't create slices based on the metadata. This allows to create slices on geoms with rank != 1 without hacks. Discussed with: phk Approved by: phk MFC after: 2 weeks	2005-01-20 22:14:05 +00:00
Lukas Ertl	eba5b9dfce	Rename synchronization and initialization threads and prefix them with 'gv_' for consistency.	2005-01-19 14:49:26 +00:00
Lukas Ertl	f11c507c45	Although an object may already be known in the configuration, its worker thread may have been destroyed (e.g. during orphaning). Make sure that objects get back their worker threads when they get a new geom.	2005-01-19 14:08:16 +00:00
Lukas Ertl	3b6cdf438a	Reset object flags after killing off an object's worker thread.	2005-01-19 13:57:09 +00:00
Poul-Henning Kamp	e8cde1ac6f	Discontinue zero-length g_ctl arguments as "just give him this pointer" transfers. The necessary context for calling copyin() isn't available anyway and automatic code-validation chokes on this.	2005-01-17 07:14:24 +00:00
Poul-Henning Kamp	032bc81d4d	CAM will sometimes remove a disk again even before it finished being initialized. We already cancel the pending events but we need to not dereference the geom pointer which never got set different from NULL.	2005-01-14 21:05:35 +00:00
Pawel Jakub Dawidek	080361d6b8	Introduce a new GEOM class - SHSEC. It provides sharing secret between the given providers. Without even one of the configured components there should be no way to get the secret. Supported by: WHEEL Sp. z o.o. http://www.wheel.pl	2005-01-11 18:06:44 +00:00
Poul-Henning Kamp	6ef8480a88	Add BO_SYNC() and add a default which uses the secret vnode pointer and VOP_FSYNC() for now.	2005-01-11 10:43:08 +00:00
Pawel Jakub Dawidek	437566858a	Increase default synchronization speed. MFC after: 3 days	2005-01-09 14:43:39 +00:00
Warner Losh	fa521b0366	/* -> /*- for copyright notices, minor format tweaks as necessary	2005-01-06 18:27:30 +00:00
Pawel Jakub Dawidek	ea973705b3	- Fix 'rebuild' command - it can no longer rely on retaste event (we ignore it). - Remove code used for handling spoil events, as spoiling is not possible anymore, because we keep consumers open for writing all the time. MFC after: 4 days	2005-01-04 12:15:21 +00:00
Pawel Jakub Dawidek	da84416791	Spoiling is now not possible, because we keep consumers open for writing all the time. Remove unused code then. MFC after: 4 days	2005-01-04 12:11:49 +00:00
Pawel Jakub Dawidek	fd6d312082	Fix 'rebuild' command (we ignore retaste event now, so don't rely on it).	2005-01-03 19:42:37 +00:00
Pawel Jakub Dawidek	cdca9c06d9	Remove unused #include.	2005-01-03 12:53:10 +00:00
John Baldwin	63710c4d35	Stop explicitly touching td_base_pri outside of the scheduler and simply set a thread's priority via sched_prio() when that is the desired action. The schedulers will start managing td_base_pri internally shortly.	2004-12-30 20:29:58 +00:00
Pawel Jakub Dawidek	7f456a7d61	Remove debug code.	2004-12-28 21:52:45 +00:00
Pawel Jakub Dawidek	a245a5483c	- Add genid field to the metadata which will allow to improve reliability a bit. After this change, when component is disconnected because of an I/O error, it will not be connected and synchronized automatically, it will be logged as broken and skipped. Autosynchronization can occur, when component is disconnected (on orphan event) and connected again - there were no I/O error, so there is no need to not connected the component, but when there were writes while it wasn't connected, it will be synchronized. This fix cases, when component is disconnected because of I/O error and can be connected again and again. - Bump version number. - Implement backward compatibility mechanism. After this change when metadata in old version is detected, it is automatically upgraded to the new (current) version.	2004-12-25 19:17:47 +00:00
Pawel Jakub Dawidek	538ff5ee7a	Update disk->d_genid field when increasing sc->sc_genid.	2004-12-23 21:15:15 +00:00
Pawel Jakub Dawidek	9a9f504132	- Add genid field to the metadata which will allow to improve reliability a bit. After this change, when component is disconnected because of an I/O error, it will not be connected and synchronized automatically, it will be logged as broken and skipped. Autosynchronization can occur, when component is disconnected (on orphan event) and connected again - there were no I/O error, so there is no need to not connected the component, but when there were writes while it wasn't connected, it will be synchronized. This fix cases, when component is disconnected because of I/O error and can be connected again and again. - Bump version number. - Add version change history. - Implement backward compatibility mechanism. After this change when metadata in old version is detected, it is automatically upgraded to the new (current) version.	2004-12-22 23:09:32 +00:00
Pawel Jakub Dawidek	4485f00081	Now, when force device destruction is done on shutdown, hide warning, that device cannot be destroyed immediately, under debug=1. Suggested by: simon	2004-12-21 19:50:18 +00:00
Pawel Jakub Dawidek	d97d5ee931	Improve reliability and clean up code a bit. For more details check src/sys/geom/mirror/g_mirror.c rev.1.47,1.48,1.49,1.50.	2004-12-21 19:30:59 +00:00
Pawel Jakub Dawidek	f663832b75	This should not be permitted, but some GEOM classes held the topology lock while doing g_(read|write)_data() (e.g. BSD). This can cause a deadlock in MIRROR class. Not sure if this is safe to drop the topology lock in BSD class, so change the code in MIRROR class to avoid this deadlock.	2004-12-21 18:42:51 +00:00
Pawel Jakub Dawidek	54bab03f04	Implement g_topology_try_lock(). No objection from: phk	2004-12-21 18:32:46 +00:00
Pawel Jakub Dawidek	dc7d54e7b3	Remove unused variables.	2004-12-19 23:55:49 +00:00
Pawel Jakub Dawidek	a2a7b44de0	- Argument 'flags' in g_mirror_destroy_consumer() function is unused - mark it as such. - Before closing consumer check if it is open. It can be closed here when g_mirror_connect_disk() fails on g_access().	2004-12-19 23:33:59 +00:00
Pawel Jakub Dawidek	9eec299fab	Some major cleanups. Keeping consumers open when device is closed is very hard. We need to open consumers sometimes to update metadata, etc. Many hacks were introduced in the past to make it possible. You cannot be sure that you can open consumer for writing always, even if you think it should be allowed. If one of the mirror components is for example da0 and you try to open it, you can get EPERM when da0s1 is opened for reading (because BSD class opens consumers (da0) with an extra 'e' bit set). Waiting for the events queue to be empty may do the trick, but it makes code much uglier (as you cannot always call g_waitidle()), it doesn't solve all edge cases and it can introduce deadlocks if there are events in the queue that wait for gmirror. I removed those hacks. Now all consumers are open r1w1e1 always, even if device is closed. Maybe it is less clean from GEOM perspective, but simplify code a lot and make it much more reliable. The only issue was retaste event which is sent when we close consumers opened for writing. I ignore retaste event by not detaching consumer immediately (so retaste event is not sent to my class) and sending event right after it to detach and destroy consumer.	2004-12-19 23:12:00 +00:00
Pawel Jakub Dawidek	c37e2f9bbf	Don't quit on first failure, just skip failures.	2004-12-19 22:58:25 +00:00
Christian Brueffer	44d086bde6	Fix typo in a comment. MFC after: 3 days	2004-12-15 12:18:41 +00:00
Pawel Jakub Dawidek	89dd8e5326	bioq_insert_head() function is already in subr_disk.c.	2004-12-13 13:02:06 +00:00
Poul-Henning Kamp	2221dbebce	Pass the file->flags down to geom ioctl handlers. Reject certain ioctls if write permission is not indicated. Bump geom API version. Reported by: Ruben de Groot <mail25@bzerk.org>	2004-12-12 10:09:05 +00:00
Pawel Jakub Dawidek	53ed4e0d54	- Turn off 'fast' mode by default and increase maximum memory to consume when this mode is used. - Manual page update.	2004-12-09 12:26:47 +00:00
Marcel Moolenaar	9055ed836a	o Don't limit GPT as a rank 2 provider. Allow it to be connected anywhere in the DAG. This includes configurations that are not allowed by the EFI specification. o Reject a GPT partition table if it's not preceded by a PMBR. There's no need to preserve the MBR partitioning anymore as GPT is mature and with the first bullet extending the applicability of GPT, it's better to be a bit more strict.	2004-12-05 06:02:21 +00:00
Pawel Jakub Dawidek	afd05d741f	When initializing device, set d_softc and d_no fields for all components, because we know it then and we need it when inserting a component which wasn't destroyed while device was running. Reported by: Michael Handler <handler@grendel.net> MFC after: 1 week	2004-12-04 21:20:59 +00:00
Warner Losh	3bc18cb767	Add observations of the Linux98 and Grub/98 boot loaders. These observations lead me to believe that the convention for pc98 boot loaders is to have a jump instruction, followed by a string, followed by code. The jump usually doesn't have a nop after it and usually the string is NUL terminated, but Grub/98 breaks both of these rules. # I looked for, but failed to find the Minux boot blocks for PC-9801 port.	2004-11-30 09:40:11 +00:00
Warner Losh	696ac86f2c	Reject tasting of this provider if the sector size isn't a multiple of 512. If I had an audio cdrom in my cd player when I booted my system, I'd get a panic from geom because you can't read 8192 bytes from an audio cdrom. Remove XXX comment about IPL1 and replace it with some information from my soon to be published web page on the pc98 disk layout. The IPL1 test was the result of an observation of a disk with FreeBSD's boot0 program. It was testing part of an area what appears to be reserved for a boot loader name, which comes after a jump over this area. I don't yet know if it is required to be any specific jump instruction, or if the destination has to be location 11. [1] [1] FreeBSD Press No. 13, page 115, poorly translated by myself. The picture there shows offset 8 as the destination of the jump, but FreeBSD's boot0 program has three padding NULs after the IPL1 name and uses a 16-bit 'jmp' instruction.	2004-11-30 08:00:14 +00:00
Poul-Henning Kamp	d4dbba5f83	Fix a long standing bug in geom_mbr which is only now exposed by the correct open/close behaviour of filesystems: When an ioctl to modify the MBR arrives, we cannot take for granted that we have the consumer open. The symptom is that one cannot run 'boot0cfg -s2 /dev/ad0' in single-user mode because / is the only open partition in only open r1w0e1. If it is not, we attempt to increase the write count by one and decrease it again afterwards. Presumably most if not all other slices suffer from the same problem.	2004-11-28 20:57:25 +00:00
Lukas Ertl	997337fd20	Implement 'setstate' to allow setting the state of drives and subdisks for debugging and emergency purposes.	2004-11-26 12:31:36 +00:00
Lukas Ertl	fb5885af37	Implement checkparity/rebuildparity.	2004-11-26 12:01:00 +00:00
Pawel Jakub Dawidek	a17dd95f14	- Add missing Giant drop before acquiring the topology lock. - Move DROP_GIANT()/PICKUP_GIANT() to g_gate_ioctl().	2004-11-23 11:18:26 +00:00
Max Khon	9595dba40d	Use M_ZERO to not panic in mtx_init when INVARIANTS enabled. Submitted by: simokawa MFC after: 1 week	2004-11-20 13:10:04 +00:00
Lukas Ertl	fb4e65d035	Move RAID5 offset calculation into a separate function to avoid code duplication.	2004-11-15 13:04:55 +00:00
Lukas Ertl	94175098f1	Share gv_roughlength() between kernel and userland, as we will need it there later.	2004-11-15 12:30:59 +00:00
Pawel Jakub Dawidek	085f43afae	Before trying to update metadata (so open consumer for writing), be sure that the events queue is empty. In other case we're able to hit the race where for example da0s1 is tasted by some other class, which means that da0 is open with exclusive bit set, which means that we can't open da0 for writing if it is our component. Reported by: Attila Nagy <bra@fsn.hu> (and somebody else sometime ago, but I cannot find who it was)	2004-11-09 23:27:21 +00:00
Pawel Jakub Dawidek	b8005b9b24	Introduce g_waitidlelock() function which is similar to g_waitidle(), but should be called with the topology lock held and returns with the topology lock held and empty event queue. Approved by: phk (sometime ago)	2004-11-09 23:20:50 +00:00
Pawel Jakub Dawidek	b36b4bfb55	Don't rely on DIRTY flag to be sure that consumer is open, because DIRTY flag can be removed in idle process. Use consumer's acw field instead to avoid opening consumer twice.	2004-11-09 23:15:40 +00:00
Pawel Jakub Dawidek	9c6a3f03c6	For BIO_READ check if provider is open for reading and for BIO_WRITE, check if provider is open for writing. This fixes panic when device is open only for writing and we send write request.	2004-11-09 23:04:45 +00:00
Pawel Jakub Dawidek	fdc3c6ce23	Drop Giant lock before grabbing the topology lock.	2004-11-09 00:35:08 +00:00
Pawel Jakub Dawidek	463674f7e0	If device is marked as being destroyed, deny all access requests.	2004-11-08 20:23:53 +00:00
Pawel Jakub Dawidek	9bb09163fc	Don't forget to make sure that there are no not-finished requests before marking components as clean. Pointed out by: scottl	2004-11-05 17:18:39 +00:00
Pawel Jakub Dawidek	4d006a98d1	- Mark all raid3 components as clean after kern.geom.raid3.idletime seconds. - Make kern.geom.raid3.timeout variable tunable.	2004-11-05 13:12:58 +00:00
Pawel Jakub Dawidek	9da3072cae	Mark raid3 devices as clean on shutdown (after all file systems are unmounted). Suggested by: scottl	2004-11-05 13:01:25 +00:00
Pawel Jakub Dawidek	79e614937e	- Use ->index consumer's field to track number of in-flight requests. - Remove unused #include.	2004-11-05 12:42:16 +00:00
Pawel Jakub Dawidek	6349471be3	Use shutdown hooks to mark mirrors as clean after all file systems are unmounted. Suggested by: scottl	2004-11-05 12:35:21 +00:00
Pawel Jakub Dawidek	127cf38ee4	Remove unused #include.	2004-11-05 12:31:32 +00:00
Pawel Jakub Dawidek	14089dae44	- Add a sysctl kern.geom.mirror.idletime, so one can specify after how many seconds of idling, DIRTY flags are removed. - If mirror is in idle state or is not open for writing, sleep without timeout when waiting for I/O requests. - Don't use atomic operations, for now sysctls are protected by Giant. - Update debugs.	2004-11-05 10:55:04 +00:00
Pawel Jakub Dawidek	2fdf5be172	MFp4: - Fix for good (I hope) force-stopping mirrors and some failure cases (e.g. the last good component dies when synchronization is in progress). Don't use ->nstart/->nend consumer's fields, as this could be racy, because those fields are used in g_down/g_up, use ->index consumer's field instead for tracking number of not finished requests. Reported by: marcel - After 5 seconds of idle time (this should be configurable) mark all dirty providers as clean, so when mirror is not used in 5 seconds and there will be power failure, no synchronization on boot is needed. Idea from: sorry, I can't find who suggested this - When there are no ACTIVE components and no NEW components destroy whole mirror, not only provider. - Fix one debug to show information about I/O request, before we change its command.	2004-11-05 09:05:15 +00:00
Poul-Henning Kamp	f9eeb89522	Finish cut&paste adjustments. Spotted by: tegge	2004-11-04 07:17:08 +00:00
Poul-Henning Kamp	e93a5ce092	Stop dumping the MBR entries under bootverbose	2004-11-03 09:08:33 +00:00
Poul-Henning Kamp	2859a695dc	Stop wasting a bootverbose line on all geom slices.	2004-11-03 09:08:10 +00:00
Poul-Henning Kamp	55f499a94f	Don't set si_bsize_phys, nobody cares.	2004-10-29 11:11:44 +00:00
Poul-Henning Kamp	4d13ab3da2	Add GEOM class "VFS" for filesystems and other buffer cache users of GEOM devices. There is nothing magic about this, it just gives a bufobj interface to GEOM.	2004-10-29 09:56:56 +00:00
Poul-Henning Kamp	725419af56	Add g_wither_geom_close() function.	2004-10-29 09:19:03 +00:00
Poul-Henning Kamp	6afb3b1c37	Give dev_strategy() an explicit cdev argument in preparation for removing buf->b-dev. Put a bio between the buf passed to dev_strategy() and the device driver strategy routine in order to not clobber fields in the buf. Assert copyright on vfs_bio.c and update copyright message to canonical text. There is no legal difference between John Dysons two-clause abbreviated BSD license and the canonical text.	2004-10-29 07:16:37 +00:00
Lukas Ertl	6c39d46363	Give each plex a separate queue where held back bios are put on. This lowers the CPU usage of the worker thread and prevents a possible live lock on non-SMP machines. MFC candidate.	2004-10-26 21:01:42 +00:00
Poul-Henning Kamp	8c24ef5f78	Use unit number allocation functions for GEOM minor numbers.	2004-10-25 12:28:28 +00:00
Poul-Henning Kamp	f8fe7a735c	Retire si_stripesize and si_stripeoffset they will not be needed in cdev in the future.	2004-10-25 07:40:54 +00:00
Poul-Henning Kamp	85986ce002	Don't call g_waitidle(), it happens automagically now.	2004-10-23 20:52:15 +00:00
Poul-Henning Kamp	9197ce2ee5	Add a new per-thread private flag: TDP_GEOM. This flag gets set whenever the thread posts an event on the GEOM event queue, and if the flag is set when the thread is prepared to return to userland from the kernel, g_waitidle() will be called to make sure that the posted events have completed. This can replace an insufficient number of g_waitidle() calls in various other places, and has the advantage of being failsafe: Any system call which does a VOP_OPEN()/VOP_CLOSE will now correctly wait for any geom events it posted as part of spoils or tastes. Assert that topology and Giant is not held in g_waitidle().	2004-10-23 20:49:17 +00:00
Poul-Henning Kamp	a11021f362	Move the prototype for g_waitidle() to a more visible place.	2004-10-23 20:22:02 +00:00
Andrew R. Reiter	f96c8ef18a	- Turn KASSERT()s into warning printf()'s in the g_class_load() routine. This removes a panic that will occur if you build with GENERIC and attempt to kldload a GEOM module that is already in the kernel. Reviewed by: phk	2004-10-22 22:16:24 +00:00
Robert Watson	49dbb61dfc	Add KTR_GEOM, which allows tracing of basic GEOM I/O events occurring in the g_up and g_down threads. Each time a bio is propelled up and down the stack, an event is generated showing the provider, offset, and length, as well as thread wakeup and work status information.	2004-10-21 18:35:24 +00:00
Pawel Jakub Dawidek	06697d4f59	Ehh. Introduce a hack: Wait for 3 seconds, so GEOM is able to give us providers for tasting. Before this hack, race below is possible: SI_SUB_RAID (no not-fully-configured geoms, so don't block) GEOM tasting (now geoms are created) SI_SUB_MOUNT_ROOT (if root file system is placed on a mirror, it is possible that this mirror is not fully configured yet) There is a lot of work to do to avoid such hacks and I need a working solution before 5.3, sorry. Reported by: John Hay <jhay@icomtek.csir.co.za>	2004-10-14 07:55:29 +00:00
Pawel Jakub Dawidek	268111a210	Only allow for unloading when there are no geoms in LABEL GEOM class. We have to use our own destroy_geom method, because default one, which is a part of geom_slice is broken. MT5 candidate. PR: kern/72467 Submitted by: Vladimir Novoseltsev	2004-10-14 07:46:13 +00:00
Brian Feldman	6f299fa373	When loading GEOM modules, we expect the actual load process to be done by the time that kldload(8) returns. Satisfy that by making the GEOM module load event -- only when the kernel is !cold -- wait until the GEOM module init function has finished instead of returning immediately. This is the other half of fixing md(8) (actually, "mfs" in fstab(5)) that is similar to r1.128 of src/sys/dev/md/md.c. This bug would be why RAM disks would often fail on boot and the first call to mdconfig(8) would probably fail. pjd has ideas for not requiring kldload(8) to work synchronously for control devices that could make this obsolete. Silence on: -arch	2004-10-12 04:44:54 +00:00

1495 Commits