freebsd-skq

Author	SHA1	Message	Date
Attilio Rao	83b3bdbc8a	Improve VFS locking: - Implement real draining for vfs consumers by not relying on the mnt_lock and using instead a refcount in order to keep track of lock requesters. - Due to the change above, remove the mnt_lock lockmgr because it is now useless. - Due to the change above, vfs_busy() is no more linked to a lockmgr. Change so its KPI by removing the interlock argument and defining 2 new flags for it: MBF_NOWAIT which basically replaces the LK_NOWAIT of the old version (which was unlinked from the lockmgr alredy) and MBF_MNTLSTLOCK which provides the ability to drop the mountlist_mtx once the mnt interlock is held (ability still desired by most consumers). - The stub used into vfs_mount_destroy(), that allows to override the mnt_ref if running for more than 3 seconds, make it totally useless. Remove it as it was thought to work into older versions. If a problem of "refcount held never going away" should appear, we will need to fix properly instead than trust on such hackish solution. - Fix a bug where returning (with an error) from dounmount() was still leaving the MNTK_MWAIT flag on even if it the waiters were actually woken up. Just a place in vfs_mount_destroy() is left because it is going to recycle the structure in any case, so it doesn't matter. - Remove the markercnt refcount as it is useless. This patch modifies VFS ABI and breaks KPI for vfs_busy() so manpages and __FreeBSD_version will be modified accordingly. Discussed with: kib Tested by: pho	2008-11-02 10:15:42 +00:00
Warner Losh	0952268ecd	Add support for reading Tivo Series 1 partitioning. This likely needs a little refinement, but is good enough to commit as is. # Should look to see if I should move swab(3) into the kernel or just # provide the unoptimized routine here. Reviewed by: marcel@	2008-11-02 03:02:56 +00:00
Konstantin Belousov	f5dfdb519f	Revert r184136. Instead, push the check for crashdumpmap overflow into the MD i386 and amd64 dump code. Requested by: jhb Retested by: pho MFC after: 3 days (+ 176304 + 184136)	2008-10-31 10:11:35 +00:00
Ulf Lilleengen	86b3c6f5bc	- Import macros used in gmirror for printing gvinum debug messages and making the output more standardized. - Add a sysctl to set the verbosity of the debug messages. - While there, fixup typos and wording in the messages.	2008-10-26 17:20:37 +00:00
Marcel Moolenaar	38a2db2eb0	Invalid BSD disklabels have been created by sysinstall and are possibly still being created. The d_secperunit field contains the number of sectors of the disk and not of the slice/partition to which the disklabel applies. Rather than reject the disklabel, we now silently adjust the field. Existing code, like bslabel(8), does not seem to check the label that extensively and seems to adjust fields as a side-effect as well. In other words, it's not that important apparently, so gpart should not be too strict about it. Reported by: nyan@ Reported by: Andriy Gapon <avg@icyb.net.ua>	2008-10-25 17:21:46 +00:00
Marcel Moolenaar	66fedfca7b	Allow dumps to partitions with a tag of 0. The legacy sunlabel implementation in FreeBSD does not use VTOC information and as such as no partition types.	2008-10-22 02:08:54 +00:00
Konstantin Belousov	7a882637fe	Do not overflow crashdumpmap. Reported and tested by: pho Reviewed by: jhb MFC after: 1 week	2008-10-21 18:52:38 +00:00
Marcel Moolenaar	e1d7111a53	The active and bootable flags are not part of the type. Export the active and bootable flags as attributes in the configuration XML and allow them to be manipulated with the set/unset commands. Since libdisk treats the flags as part of the partition type, preserve behavior by keeping them included in the configuration text.	2008-10-20 04:50:47 +00:00
Attilio Rao	0d7935fd01	Remove the struct thread unuseful argument from bufobj interface. In particular following functions KPI results modified: - bufobj_invalbuf() - bufsync() and BO_SYNC() "virtual method" of the buffer objects set. Main consumers of bufobj functions are affected by this change too and, in particular, functions which changed their KPI are: - vinvalbuf() - g_vfs_close() Due to the KPI breakage, __FreeBSD_version will be bumped in a later commit. As a side note, please consider just temporary the 'curthread' argument passing to VOP_SYNC() (in bufsync()) as it will be axed out ASAP Reviewed by: kib Tested by: Giovanni Trematerra <giovanni dot trematerra at gmail dot com>	2008-10-10 21:23:50 +00:00
Ulf Lilleengen	64d62b289e	- Use the new gv_write_header function to write out the header when removing a drive to make sure that the header is in the correct format.	2008-10-02 10:01:05 +00:00
Ulf Lilleengen	d4b28d5b0f	- Remove unneeded macro since the config_length field in the header was changed to 64 bit in the new format.	2008-10-02 09:35:47 +00:00
Ulf Lilleengen	46ceb66ad3	- Make gvinum header on-disk structure consistent on all platforms by storing the gvinum header in fields of fixed size and in a big endian byte order rather than the size and byte order of the actual platform. Note that the change is backwards compatible with the old gvinum configuration format, but will save the configuration in the new format when the 'saveconfig' command is executed. Submitted by: Rick C. Petty <rick-freebsd -at- kiwi-computer.com>	2008-10-01 14:50:36 +00:00
Marcel Moolenaar	a87faebb8c	Return G_PART_PROBE_PRI_HIGH instead of G_PART_PROBE_PRI_NORM if the probe succeeds. This guarantees that the BSD scheme wins over the MBR scheme when MBR gets to probe first. Build- or link-time conditions can cause schemes to end up in the linker set in a different order. Normally BSD is before MBR in the linker set and as such get to probe first. But typically when the kernel gets rebuild or relinked, this can change.	2008-09-29 02:48:22 +00:00
Marcel Moolenaar	1e2fbcfa44	Insert the null scheme at the head. This does not change any functionality, but creates an invariant: the first element on the list is always the null scheme.	2008-09-29 02:39:02 +00:00
Marcel Moolenaar	4bf0267894	Export the partition name in the conftxt and confxml output. The conftxt output is used by libdisk, and the confxml output is used by gpart itself (gpart show -l). Submitted by: nyan@	2008-09-27 19:58:11 +00:00
Marcel Moolenaar	a455de093b	Hold the root mount while we're tasting. It is possible that a nested partition (typically the BSD disklabel) is not done tasting while the root file system is being mounted. While this is rare, it's still possible.	2008-09-27 19:29:52 +00:00
Marcel Moolenaar	404cfb5e20	Allow 255 sectors/track for the BSD disklabel. The previous limit of 63 sectors/track is too PC BIOS specific. On pc98, where the BSD disklabel is used as well, 255 sectors/track is not uncommon. Submitted by: nyan@	2008-09-27 15:28:15 +00:00
Ed Schouten	d3ce832719	Remove unit2minor() use from kernel code. When I changed kern_conf.c three months ago I made device unit numbers equal to (unneeded) device minor numbers. We used to require bitshifting, because there were eight bits in the middle that were reserved for a device major number. Not very long after I turned dev2unit(), minor(), unit2minor() and minor2unit() into macro's. The unit2minor() and minor2unit() macro's were no-ops. We'd better not remove these four macro's from the kernel, because there is a lot of (external) code that may still depend on them. For now it's harmless to remove all invocations of unit2minor() and minor2unit(). Reviewed by: kib	2008-09-26 14:19:52 +00:00
Sean Bruno	c4901b6798	Just a fixup for a KTRACE message I stumbled upon many moons ago. Reviewed by: Scott Long MFC after: 2 days	2008-09-18 15:02:19 +00:00
Ulf Lilleengen	f805f204b6	- Add a new ioctl for getting the provider name of a geom provider. - Add a routine for looking up a device and checking if it is a valid geom provider given a partial or full path to its device node. Reviewed by: phk Approved by: pjd (mentor)	2008-09-07 13:54:57 +00:00
Rui Paulo	89ea496520	Fix build.	2008-09-05 18:11:18 +00:00
Rui Paulo	87662ab391	Keep entries sorted.	2008-09-05 18:09:49 +00:00
Rui Paulo	35fa2f1bcd	Include the vendor in the partition name.	2008-09-05 16:54:07 +00:00
Rui Paulo	d7255ff42e	Detect Apple HFS GPT slices.	2008-09-05 12:49:14 +00:00
Attilio Rao	59d4932531	Decontextualize vfs_busy(), vfs_unbusy() and vfs_mount_alloc() functions. Manpages are updated accordingly. Tested by: Diego Sardina <siarodx at gmail dot com>	2008-08-31 14:26:08 +00:00
Bjoern A. Zeeb	603724d3ab	Commit step 1 of the vimage project, (network stack) virtualization work done by Marko Zec (zec@). This is the first in a series of commits over the course of the next few weeks. Mark all uses of global variables to be virtualized with a V_ prefix. Use macros to map them back to their global names for now, so this is a NOP change only. We hope to have caught at least 85-90% of what is needed so we do not invalidate a lot of outstanding patches again. Obtained from: //depot/projects/vimage-commit2/... Reviewed by: brooks, des, ed, mav, julian, jamie, kris, rwatson, zec, ... (various people I forgot, different versions) md5 (with a bit of help) Sponsored by: NLnet Foundation, The FreeBSD Foundation X-MFC after: never V_Commit_Message_Reviewed_By: more people than the patch	2008-08-17 23:27:27 +00:00
Pawel Jakub Dawidek	ed6c3e478f	Style(9).	2008-08-12 20:19:08 +00:00
Dag-Erling Smørgrav	2616144e43	Add sbuf_new_auto as a shortcut for the very common case of creating a completely dynamic sbuf. Obtained from: Varnish MFC after: 2 weeks	2008-08-09 11:14:05 +00:00
Peter Wemm	fbbc785240	Trivial commit to attempt to diagnose a svn problem. Add comment that Tivo disks are APM, but do not have a DDR record.	2008-07-22 18:05:50 +00:00
Pawel Jakub Dawidek	5527ecd9a5	Clear passphrase buffer after use. Submitted by: Fabian Keil <fk@fabiankeil.de> (a bit different version)	2008-07-20 19:56:13 +00:00
Ulf Lilleengen	14e96b45e8	- When renaming a drive, also set the drive name in the gvinum header. PR: kern/125632 Approved by: pjd (mentor) MFC after: 3 days	2008-07-19 13:53:11 +00:00
Ulf Lilleengen	56af4c6141	- Fix a logic error when updating plex configuration. Approved by: pjd (mentor)	2008-07-11 16:46:29 +00:00
Robert Watson	4f7d1876d5	Introduce a new lock, hostname_mtx, and use it to synchronize access to global hostname and domainname variables. Where necessary, copy to or from a stack-local buffer before performing copyin() or copyout(). A few uses, such as in cd9660 and daemon_saver, remain under-synchronized and will require further updates. Correct a bug in which a failed copyin() of domainname would leave domainname potentially corrupted. MFC after: 3 weeks	2008-07-05 13:10:10 +00:00
Xin LI	6c97c325ff	Avoid NULL deference. Reviewed by: ivoras	2008-06-30 15:21:42 +00:00
Ulf Lilleengen	7e7a4e1d18	- Fix spelling errors. Approved by: kib (mentor) PR: kern/124788 Submitted by: Hywel Mallett <Hywel -at- hmallett.co.uk>	2008-06-20 19:48:18 +00:00
Marcel Moolenaar	f6aa3fccce	Add the set and unset verbs used to set and clear attributes for partition entries. Implement the setunset method for the MBR scheme to control the active flag.	2008-06-18 01:13:34 +00:00
Marcel Moolenaar	d3532631de	Finish the support for partition labels and add it to the XML.	2008-06-12 19:34:07 +00:00
Marcel Moolenaar	9a764aac3f	Add the raw partition type to the XML.	2008-06-12 06:34:14 +00:00
Marcel Moolenaar	eab484f822	Add the raw partition type to the XML.	2008-06-12 06:26:36 +00:00
Marcel Moolenaar	a3354bb4a7	Add the raw partition type to the XML.	2008-06-12 05:56:03 +00:00
Marcel Moolenaar	0c132595dd	Add the raw partiton type to the XML.	2008-06-12 05:28:47 +00:00
Marcel Moolenaar	40b075d366	Add the raw partition type to the XML.	2008-06-12 05:27:23 +00:00
Marcel Moolenaar	ab1e8f04c8	Add the partition label and the raw partition type to the XML.	2008-06-12 04:43:34 +00:00
Ed Schouten	06d425f92e	Remove the distinction between device minor and unit numbers. Even though we got rid of device major numbers some time ago, device drivers still need to provide unique device minor numbers to make_dev(). These numbers are only used inside the kernel. They are not related to device major and minor numbers which are visible in devfs. These are actually based on the inode number of the device. It would eventually be nice to remove minor numbers entirely, but we don't want to be too agressive here. Because the 8-15 bits of the device number field (si_drv0) are still reserved for the major number, there is no 1:1 mapping of the device minor and unit numbers. Because this is now unused, remove the restrictions on these numbers. The MAXMAJOR definition was actually used for two purposes. It was used to convert both the userspace and kernelspace device numbers to their major/minor pair, which is why it is now named UMINORMASK. minor2unit() and unit2minor() have now become useless. Both minor() and dev2unit() now serve the same purpose. We should eventually remove some of them, at least turning them into macro's. If devfs would become completely minor number unaware, we could consider using si_drv0 directly, just like si_drv1 and si_drv2. Approved by: philip (mentor)	2008-05-29 12:50:46 +00:00
Ulf Lilleengen	4e70f1decf	- Recognize the 'volume' parameter when creating a plex. PR: kern/75632 Approved by: pjd (mentor) MFC after: 1 day	2008-05-22 10:27:03 +00:00
Pawel Jakub Dawidek	9097a8e66e	- Assert that we don't send new provider event for a provider which has G_PF_WITHER flag set. - Fix typo in assertion condition (sorry, but I forgot who report that).	2008-05-18 22:50:50 +00:00
Pawel Jakub Dawidek	f02642d79e	Play nice with DDB pager. Educated by: jhb's BSDCan presentation	2008-05-18 21:13:10 +00:00
Marcel Moolenaar	5db670520f	Implement the G_PART_DUMPCONF method for all 6 schemes. Also call the method for the (indent == NULL) case (i.e. the kern.geom.conftxt sysctl). The purpose is to extend the conftxt output with scheme- specific fields which can be used by libdisk. In particular, have the schemes dump the xs and xt fields, which contain the backward compatible values for class type and partition type. This allows libdisk to work with the legacy slicers as well as with gpart and helps/promotes migration.	2008-04-23 20:13:05 +00:00
Marcel Moolenaar	4d32fcb42b	Add the bootcode verb for installing boot code. Boot code is supported for the MBR, GPT and PC98 schemes, where GPT installs boot code into the PMBR.	2008-04-13 19:54:54 +00:00
Marcel Moolenaar	e0fbffe617	Change the order from SI_ORDER_FIRST to SI_ORDER_ANY (within SI_SUB_DRIVERS) to avoid loading schemes before all the GEOM classes have been loaded and initialized. Otherwise we may end up using mutexes that haven't been initialized (due to g_retaste() posting an event).	2008-03-29 17:33:29 +00:00
Marcel Moolenaar	b03fab128b	Add support for PC-9800 partition tables.	2008-03-28 17:58:55 +00:00
Marcel Moolenaar	856744ba93	When retasting, wither any existing GEOMs of the same class. This allows the class to create a different GEOM for the same provider as well as avoid that we end up with multiple GEOMs of the same class with the same name. For example, when a disk contains a PC98 partition table but only MBR is supported, then the partition table can be treated as a MBR. If support for PC98 is later loaded as a module, the MBR scheme is pre-empted for the PC98 scheme as expected.	2008-03-28 06:31:12 +00:00
Marcel Moolenaar	4ffca444a5	Redefine G_PART_SCHEME_DECLARE() from populating a private linker set to declaring a proper module. The module event handler is part of the gpart core and will add the scheme to an internal list on module load and will remove the scheme from the internal list on module unload. This makes it possible to dynamically load and unload partitioning schemes.	2008-03-23 01:31:59 +00:00
Marcel Moolenaar	8a8fcb0089	Add g_retaste(), which given a class will present all non-open providers to it for tasting. This is useful when the class, through means outside the scope of GEOM, can claim providers previously unclaimed. The g_retaste() function posts an event which is handled by the g_retaste_event(). Event suggested by: phk	2008-03-23 01:23:35 +00:00
Ulf Lilleengen	1cf9b83c6d	- Fix a memory leak when re-discovering a gvinum configuration. Approved by: pjd (mentor) MFC after: 1 week	2008-03-18 08:48:51 +00:00
Marcel Moolenaar	909f20c80d	Add support for VTOC8 labels (aka sun disk labels). When a label does not have VTOC information about the partitions, it will be created. This is because the VTOC information is used for the partition type and FreeBSD's sunlabel(8) does not create nor use VTOC information. For this purpose, new tags have been added to support FreeBSD's partition types.	2008-03-02 00:52:49 +00:00
Marcel Moolenaar	028de8786a	Follow-up improvements to the handling of false positives: If the partition table is empty, check to see if we have something that looks sufficiently like a BPB. On non-i386 machines, the boot sector typically doesn't contain boot code; the end of the boot sector is all zeroes. This is also where the partition table is for MBRs. We only check the sector size and cluster size, as that seems to be the most reliable across implementations, BPB versions and platforms.	2008-02-29 22:41:36 +00:00
Marcel Moolenaar	6291ef2d80	Better handle false positives. The MBR differs from the boot sector only because there's a partition table where the boot sector has boot code. Boot sectors without boot code look like a MBR for all practical purposes. This change adds a check for the partition table and fails the probe when it's obvously invalid. The assumption being that the sector contains a boot sector and not a MBR. More checks are needed to distinguish a boot secto without boot code from a (empty) MBR.	2008-02-28 22:30:41 +00:00
Andrew Thompson	764fa86761	geom_lvm(4) is now known as geom_linux_lvm(4).	2008-02-20 07:52:43 +00:00
Andrew Thompson	1332875338	Add a geom class to map Linux LVM logical volumes. The logical disks will appear as /dev/lvm/<vol group>-<logical vol>, for instance /dev/lvm/vg0-home. G_LINUX_LVM currently supports linear stripes with segments on multiple physical disks. The metadata is read only, logical volumes can not be allocated or resized. Reviewed by: Ivan Voras Previously known as geom_lvm(4), rename requested by des, phk.	2008-02-20 07:45:36 +00:00
Scott Long	7bbd40c57e	Teach the dump and minidump code to respect the maxioszie attribute of the disk; the hard-coded assumption of 64K doesn't work in all cases.	2008-02-15 06:26:25 +00:00
Andrew Thompson	15df4265ef	Unbreak build, size_t is larger on 64bit platforms.	2008-02-11 09:20:01 +00:00
Andrew Thompson	77b65eef19	Add a geom class to map Linux LVM logical volumes. The logical disks will appear as /dev/lvm/<vol group>-<logical vol>, for instance /dev/lvm/vg0-home. GLVM currently supports linear stripes with segments on multiple physical disks. The metadata is read only, logical volumes can not be allocated or resized. Reviewed by: Ivan Voras	2008-02-11 03:05:11 +00:00
Marcel Moolenaar	392ffade03	Various fixes: o BSD disklabels have relative offsets. Even for the BSD in MBR slice setup, except when the mbroffset ioctl is supported. Since we don't support that ioctl, bsdlabel(8) expects relative offsets. So, when reading an existing disklabel, correct for disklabels that mistakenly have the mbroffset offsets. o Don't take the geometry seriously, because it's untrustworthy. We do expect the numbers to be within range. This means that the secperunit field will not be computed from secpercyl and ncyls, but simply is the mediasize in sectors. o Don't enforce partitions to be aligned to track boundaries. The default label, constructed by bsdlabel(8), puts partition a at offset BBSIZE bytes, which commonly means sector 16.	2007-12-24 01:01:59 +00:00
Poul-Henning Kamp	015a11e695	Chop DIOCGDELETE from userland up in 1024 sector chunks to give geom_disk or any other bio chopping geom a reasonable size of work. Check for delivered signals between chunks, because the request size and service time is unbounded.	2007-12-16 19:38:26 +00:00
Poul-Henning Kamp	eed6cda966	Don't limit BIO_DELETE requests to MAXPHYS, they perform no data transfers, so they are not subject to the VM system limitation.	2007-12-16 18:03:31 +00:00
Marcel Moolenaar	3959198cc5	Decode as many or as few partition entries as the label claims there are. We have already checked it against the caller provided maxpart.	2007-12-09 22:44:22 +00:00
Marcel Moolenaar	4275d83ab5	Fix a bug in the add verb, where we failed to keep the list of partitions in index-order. This is assumed by the APM, MBR and BSD partitioning schemes.	2007-12-09 22:26:42 +00:00
Marcel Moolenaar	04a814ef90	Internal partitions can not be deleted or modified.	2007-12-08 23:08:42 +00:00
Marcel Moolenaar	d6bbbeebd9	Skip internal partitions in the check for (user) partitions for the destroy command. Previously a freshly created BSD disklabel could not be destroyed because of the internal partition.	2007-12-08 22:06:17 +00:00
Marcel Moolenaar	ddba264187	Add support for FS_ZFS.	2007-12-08 07:01:10 +00:00
John Baldwin	f97a705a99	Only attach to a GPT partition if it has the GPT_ENT_TYPE_FREEBSD type. XXX: This only works currently with GEOM_GPT which only exists in 6.x. XXX: I didn't add 'mbroffset' support for a GPT partition holding a BSD label as I'm not sure if they use relative or absolute offsets. MFC after: 3 days	2007-12-06 09:20:27 +00:00
Marcel Moolenaar	5aaa8fefdf	Add a BSD disklabel backend to g_part: o Disklabels can have between 8 and 20 partitions (inclusive). o No device special file is created for the raw partition. o Switch ia64 to use this backend. o No support for boot code yet.	2007-12-06 02:32:42 +00:00
John Birrell	18b0b6d137	On some arches, openssl is built with OPENSSL_NO_CAMELLIA, so the code here needs to depend on that too.	2007-11-19 08:59:32 +00:00
Maxim Konovalov	e70553c775	o s/resiserfs_sb/reiserfs_sb/. Submitted by: Ighighi	2007-11-16 19:43:26 +00:00
Pawel Jakub Dawidek	b656c1b836	Save stack only when KTR_GEOM is both compiled into the kernel and enabled in debug.ktr.mask. Because saving stack is very expensive, it's better only to do it when one really wants to. Reported by: Dan Nelson	2007-10-26 06:55:00 +00:00
John Baldwin	f352a0d45f	First cut at support for booting a GPT labeled disk via the BIOS bootstrap on i386 and amd64 machines. The overall process is that /boot/pmbr lives in the PMBR (similar to /boot/mbr for MBR disks) and is responsible for locating and loading /boot/gptboot. /boot/gptboot is similar to /boot/boot except that it groks GPT rather than MBR + bsdlabel. Unlike /boot/boot, /boot/gptboot lives in its own dedicated GPT partition with a new "FreeBSD boot" type. This partition does not have a fixed size in that /boot/pmbr will load the entire partition into the lower 640k. However, it is limited in that it can only be 545k. That's still a lot better than the current 7.5k limit for boot2 on MBR. gptboot mostly acts just like boot2 in that it reads /boot.config and loads up /boot/loader. Some more details: - Include uuid_equal() and uuid_is_nil() in libstand. - Add a new 'boot' command to gpt(8) which makes a GPT disk bootable using /boot/pmbr and /boot/gptboot. Note that the disk must have some free space for the boot partition. - This required exposing the backend of the 'add' function as a gpt_add_part() function to the rest of gpt(8). 'boot' uses this to create a boot partition if needed. - Don't cripple cgbase() in the UFS boot code for /boot/gptboot so that it can handle a filesystem > 1.5 TB. - /boot/gptboot has a simple loader (gptldr) that doesn't do any I/O unlike boot1 since /boot/pmbr loads all of gptboot up front. The C portion of gptboot (gptboot.c) has been repocopied from boot2.c. The primary changes are to parse the GPT to find a root filesystem and to use 64-bit disk addresses. Currently gptboot assumes that the first UFS partition on the disk is the / filesystem, but this algorithm will likely be improved in the future. - Teach the biosdisk driver in /boot/loader to understand GPT tables. GPT partitions are identified as 'disk0pX:' (e.g. disk0p2:) which is similar to the /dev names the kernel uses (e.g. /dev/ad0p2). - Add a new "freebsd-boot" alias to g_part() for the new boot UUID. MFC after: 1 month Discussed with: marcel (some things might still change, but am committing what I have so far)	2007-10-24 21:33:00 +00:00
Marcel Moolenaar	a1fedf914f	Add the freebsd-zfs alias. Both APM and GPT have ZFS partition types.	2007-10-21 20:02:57 +00:00
Julian Elischer	3745c395ec	Rename the kthread_xxx (e.g. kthread_create()) calls to kproc_xxx as they actually make whole processes. Thos makes way for us to add REAL kthread_create() and friends that actually make theads. it turns out that most of these calls actually end up being moved back to the thread version when it's added. but we need to make this cosmetic change first. I'd LOVE to do this rename in 7.0 so that we can eventually MFC the new kthread_xxx() calls.	2007-10-20 23:23:23 +00:00
Pawel Jakub Dawidek	3ea5d7ec24	When orphaning a provider, cancel events related to it. Without this change the following situation was possible: 1. Provider is orphaned from within class' access() method on last write close - orphan provider event is send. 2. GEOM detects last write close on a provider and sends new provider event. 3. g_orphan_register() is called, and calls all orphan methods of attached consumers. 4. New provider event is executed on orphaned provider, all classes can taste already orphaned provider, and some may attach consumers to it. Those consumers will never go away, because the g_orphan_register() was already called. We end up with a zombie provider. With this change, at step 3, we will cancel new provider event. How to repeat this problem: # mdconfig -a -t malloc -s 10m # geli init -i 0 md0 # geli attach md0 # newfs -L test /dev/md0.eli # mount /dev/ufs/test /mnt/tmp # geli detach -l md0.eli # umount /mnt/tmp # glabel status Name Status Components ufs/test N/A N/A Reviewed by: phk Approved by: re (kensmith)	2007-09-27 20:18:34 +00:00
Pawel Jakub Dawidek	17a0c19020	LINT compiled just fine for me, but it seems it breaks tinerbox way of compiling LINT. Approved by: re (implicitly)	2007-09-23 15:10:48 +00:00
Pawel Jakub Dawidek	f854db0bf5	Bring in the GEOM Virtualisation class, which allows to create huge GEOM providers with limited physical storage and add physical storage as needed. Submitted by: Ivan Voras Sponsored by: Google Summer of Code 2006 Approved by: re (kensmith)	2007-09-23 07:34:23 +00:00
Pawel Jakub Dawidek	864cba9669	Add support for Camellia encryption algorithm. PR: kern/113790 Submitted by: Yoshisato YANAGISAWA <yanagisawa@csg.is.titech.ac.jp> Approved by: re (bmah)	2007-09-01 06:33:02 +00:00
Marcel Moolenaar	0081f96ecd	Have gpart synthesize a disk geometry if the underlying provider don't have it. Some partitioning schemes, as well as file systems, operate on the geometry and without it such schemes (e.g. MBR) and file systems (e.g. FAT) can't be created. This is useful for memory disks.	2007-06-17 22:19:19 +00:00
Marcel Moolenaar	6bc5044561	Add the MBR partitioning scheme to g_part. This does not yet support the ability to install boot code.	2007-06-13 04:27:36 +00:00
Marcel Moolenaar	cf23147053	Prefix unknown (i.e. un-aliased) partition types with '!'. This is how they had to be given with ctlreq.	2007-06-06 05:06:14 +00:00
Marcel Moolenaar	33a558c7e9	Call sbuf_finish() before sbuf_data() and sbuf_len().	2007-06-06 05:01:41 +00:00
Jeff Roberson	982d11f836	Commit 14/14 of sched_lock decomposition. - Use thread_lock() rather than sched_lock for per-thread scheduling sychronization. - Use the per-process spinlock rather than the sched_lock for per-process scheduling synchronization. Tested by: kris, current@ Tested on: i386, amd64, ULE, 4BSD, libthr, libkse, PREEMPTION, etc. Discussed with: kris, attilio, kmacy, jhb, julian, bde (small parts each)	2007-06-05 00:00:57 +00:00
David Malone	041b706b2f	Despite several examples in the kernel, the third argument of sysctl_handle_int is not sizeof the int type you want to export. The type must always be an int or an unsigned int. Remove the instances where a sizeof(variable) is passed to stop people accidently cut and pasting these examples. In a few places this was sysctl_handle_int was being used on 64 bit types, which would truncate the value to be exported. In these cases use sysctl_handle_quad to export them and change the format to Q so that sysctl(1) can still print them.	2007-06-04 18:25:08 +00:00
Marcel Moolenaar	ce3498bd83	Fix a dereference in KASSERT.	2007-05-15 23:29:57 +00:00
Marcel Moolenaar	35fe9df032	o Implement automatic commit. It's enabled when the flags parameter exists and contains the 'C' flag. o The partition label can be the empty string. It's how labels are cleared. o When an action fails, lower permissions when they were raised in order to allow the action. A failed action will not result in any uncommitted changes. o Allow the flags paremeter to be present but empty. It's the equivalent of not being present.	2007-05-15 20:14:55 +00:00
Marcel Moolenaar	5100f9e95b	Write the output parameter (if present) for the add, create, delete destroy and modify verbs.	2007-05-09 05:37:53 +00:00
Marcel Moolenaar	c8dffc524a	When reverting the creation of a partitioning scheme on a provider, the failure to probe an existing partitioning scheme means that no previous partitioning scheme existed. Don't error. Just destroy the geom.	2007-05-09 01:46:42 +00:00
Marcel Moolenaar	d287f59062	MFp4: 119373: o Remove the query verb, along with the request and response parameters. o Add the version and output parameters. 119390: [APM,GPT] Properly clear deleted entries. 119394: o Make the alias the standard and use the '!' to prefix literal partition types. o Treat schemes and partition types as case insensitive. 119462: [GPT] Fix a page fault caused when modifying a partition entry without a new partition type.	2007-05-08 20:18:17 +00:00
Pawel Jakub Dawidek	f0256e71f1	When deleting key, flush write cache after each overwrite, so we don't overwrite data N times in cache and only once on disk.	2007-05-06 14:56:03 +00:00
Pawel Jakub Dawidek	4887800305	Allow to use ':' in d_ident, which is quite handy character.	2007-05-05 18:09:17 +00:00
Pawel Jakub Dawidek	a04c28bdd9	Handle GEOM::ident attribute by attaching 'sX' string at the end of ident received from the underlying provider, where X is pp->index value. OK'ed by: phk	2007-05-05 17:52:22 +00:00
Pawel Jakub Dawidek	5e16a4866f	Because there are many strange hardware out there, allow to use only [a-zA-Z0-9-_@#%.] characters in d_ident field.	2007-05-05 17:47:20 +00:00
Pawel Jakub Dawidek	d0c11f9eb7	- Extend disk structure to allow to store disk's serial number, which can be retrieved via GEOM::ident attribute. - Bump disk(9) ABI version. OK'ed by: phk	2007-05-05 17:12:15 +00:00
Pawel Jakub Dawidek	0589353ac7	Implement three new ioctls that can be used with GEOM provider: DIOCGFLUSH - Flush write cache (sends BIO_FLUSH). DIOCGDELETE - Delete data (mark as unused) (sends BIO_DELETE). DIOCGIDENT - Get provider's uniqe and fixed identifier (asks for GEOM::ident attribute). First two are self-explanatory, but the last one might not be. Here are properties of provider's ident: - ident value is preserved between reboots, - provider can be detached/attached and ident is preserved, - provider's name can change - ident can't, - ident value should not be based on on-disk metadata; in other words copying whole data from one disk to another should not yield the same ident for the other disk, - there could be more than one provider with the same ident, but only if they point at exactly the same physical storage, this is the case for multipathing for example, - GEOM classes that consumes single providers and provide single providers, like geli, gbde, should just attach class name to the ident of the underlying provider, - ident is an ASCII string (is printable), - ident is optional and applications can't relay on its presence. The main purpose for this is that application and remember provider's ident and once it tries to open provider by its name again, it may compare idents to be sure this is the right provider. If it is not (idents don't match), then it can open provider by its ident. OK'ed by: phk	2007-05-05 17:02:19 +00:00
Pawel Jakub Dawidek	2b17fb9514	Implement g_delete_data() similar to g_read_data() and g_write_data(). OK'ed by: phk	2007-05-05 16:35:22 +00:00
Pawel Jakub Dawidek	d19dbf4a23	- Implement helper g_handleattr_str() function for string attributes handling. - Extend g_handleattr() to treat attribute as string when len=0. OK'ed by: phk	2007-05-05 16:33:44 +00:00
Marcel Moolenaar	e8e1f54462	Put the scheme (APM, GPT, etc) in the XML.	2007-04-27 05:58:10 +00:00
Hidetoshi Shimokawa	33018fbdff	If compressed length is zero, return a zero-filled block. MFC after: 1 week	2007-04-24 06:30:06 +00:00
Lukas Ertl	a2237c41fc	-) Correct sdcount for a plex when removing or adding subdisks. -) Set correct sizes for plexes and volumes a subdisk has been removed. Submitted by: Ulf Lilleengen <lulf_AT_freebsd.org>	2007-04-12 17:54:35 +00:00
Lukas Ertl	9e357b05da	Avoid infinite loop if the device string given for a drive only consists of "/". Submitted by: Ulf Lilleengen <lulf_AT_freebsd.org>	2007-04-12 17:40:44 +00:00
Pawel Jakub Dawidek	df3aed4f96	Use root_mounted().	2007-04-08 23:54:23 +00:00
Hidetoshi Shimokawa	54911451d5	Fix a bug for over 4GB media. MFC after: 3 days	2007-04-07 02:52:13 +00:00
Pawel Jakub Dawidek	68474f1930	Sysctl description is not a format string, so one % is enough.	2007-04-06 12:53:54 +00:00
Xin LI	a92b7d4982	- Be more verbose when saying "foo" not found. - In gctl_get_geom(), don't issue error when we were not provided with an parameter, like gctl_get_provider() did. Reviewed by: pjd	2007-03-30 16:32:08 +00:00
Kris Kennaway	17e910a261	make_dev(9) can be (and is) called without Giant, so there is no need to drop the topology lock and acquire Giant around this call. Reviewed by: phk	2007-03-26 21:47:03 +00:00
Pawel Jakub Dawidek	52b509e738	Add missing \n.	2007-03-22 15:42:13 +00:00
Sam Leffler	6810ad6f2a	Overhaul driver/subsystem api's: o make all crypto drivers have a device_t; pseudo drivers like the s/w crypto driver synthesize one o change the api between the crypto subsystem and drivers to use kobj; cryptodev_if.m defines this api o use the fact that all crypto drivers now have a device_t to add support for specifying which of several potential devices to use when doing crypto operations o add new ioctls that allow user apps to select a specific crypto device to use (previous ioctls maintained for compatibility) o overhaul crypto subsystem code to eliminate lots of cruft and hide implementation details from drivers o bring in numerous fixes from Michale Richardson/hifn; mostly for 795x parts o add an optional mechanism for mmap'ing the hifn 795x public key h/w to user space for use by openssl (not enabled by default) o update crypto test tools to use new ioctl's and add cmd line options to specify a device to use for tests These changes will also enable much future work on improving the core crypto subsystem; including proper load balancing and interposing code between the core and drivers to dispatch small operations to the s/w driver as appropriate. These changes were instigated by the work of Michael Richardson. Reviewed by: pjd Approved by: re	2007-03-21 03:42:51 +00:00
Pawel Jakub Dawidek	97a669a3b2	Warn when user use sectorsize bigger than the page size, which will lead to problems when the geli device is used with file system or as a swap. Hopefully will prevent problems like kern/98742 in the future. MFC after: 1 week	2007-03-05 12:41:44 +00:00
Pawel Jakub Dawidek	b942093961	Fix geli after last commit for UP systems that are running SMP kernel. Submitted by: Hyo geol, Lee <hyogeollee@gmail.com> MFC after: 1 week	2007-03-02 09:38:16 +00:00
John Baldwin	4d70511ac3	Use pause() rather than tsleep() on stack variables and function pointers.	2007-02-27 17:23:29 +00:00
Matt Jacob	e770bc6bf5	First cut at GEOM based multipath. This is an active/passive{/passive...} arrangement that has no intrinsic internal knowledge of whether devices it is given are truly multipath devices. As such, this is a simplistic approach, but still a useful one. The basic approach is to (at present- this will change soon) use camcontrol to find likely identical devices and and label the trailing sector of the first one. This label contains both a full UUID and a name. The name is what is presented in /dev/multipath, but the UUID is used as a true distinguishor at g_taste time, thus making sure we don't have chaos on a shared SAN where everyone names their data multipath as "Fred". The first of N identical devices (and N may be 1!) becomes the active path until a BIO request is failed with EIO or ENXIO. When this occurs, the active disk is ripped away and the next in a list is picked to (retry and) continue with. During g_taste events new disks that meet the match criteria for existing multipath geoms get added to the tail end of the list. Thus, this active/passive setup actually does work for devices which go away and come back, as do (now) mpt(4) and isp(4) SAN based disks. There is still a lot to do to improve this- like about 5 of the 12 recommendations I've received about it, but it's been functional enough for a while that it deserves a broader test base. Reviewed by: pjd Sponsored by: IronPort Systems MFC: 2 months	2007-02-27 04:01:58 +00:00
John Baldwin	6e50e38fcc	Use tsleep() rather than msleep() with a NULL mtx parameter.	2007-02-23 23:06:10 +00:00
Nick Hibma	55fe33a350	Reduce the noise when plugging in (USB) mass storage devices, like a 4 port flash card reader. Also remove an 'Opened da0 -> <random number>' which is not needed on a daily basis (available through bootverbose). Reviewed by: phk, ken MFC after: 1 week	2007-02-21 07:45:02 +00:00
Craig Rodrigues	898b5f434b	#include <sys/systm.h> before <sys/geom.h> to get KASSERT(), and fix LINT build.	2007-02-08 04:02:56 +00:00
Marcel Moolenaar	1d3aed33e8	Evolve the ctlreq interface added to geom_gpt into a generic partitioning class that supports multiple schemes. Current schemes supported are APM (Apple Partition Map) and GPT. Change all GEOM_APPLE anf GEOM_GPT options into GEOM_PART_APM and GEOM_PART_GPT (resp). The ctlreq interface supports verbs to create and destroy partitioning schemes on a disk; to add, delete and modify partitions; and to commit or undo changes made.	2007-02-07 18:55:31 +00:00
Pawel Jakub Dawidek	1ded77b222	We expect 'bio_data != NULL' for BIO_{READ,WRITE,GETATTR}, but for BIO_{DELETE,FLUSH} we expect 'bio_data == NULL'. Reviewed by: phk	2007-01-28 23:36:07 +00:00
Pawel Jakub Dawidek	a1ea1a22e9	It is possible that GEOM taste provider before SMP is started. We can't bind to a CPU which is not yet on-line, so add code that wait for CPUs to go on-line before binding to them. Reported by: Alin-Adrian Anton <aanton@spintech.ro> MFC after: 2 weeks	2007-01-28 20:29:12 +00:00
Konstantin Belousov	2cc7d26f7f	Cylinder group bitmaps and blocks containing inode for a snapshot file are after snaplock, while other ffs device buffers are before snaplock in global lock order. By itself, this could cause deadlock when bdwrite() tries to flush dirty buffers on snapshotted ffs. If, during the flush, COW activity for snapshot needs to allocate block and ffs_alloccg() selects the cylinder group that is being written by bdwrite(), then kernel would panic due to recursive buffer lock acquision. Avoid dealing with buffers in bdwrite() that are from other side of snaplock divisor in the lock order then the buffer being written. Add new BOP, bop_bdwrite(), to do dirty buffer flushing for same vnode in the bdwrite(). Default implementation, bufbdflush(), refactors the code from bdwrite(). For ffs device buffers, specialized implementation is used. Reviewed by: tegge, jeff, Russell Cattelan (cattelan xfs org, xfs changes) Tested by: Peter Holm X-MFC after: 3 weeks (if ever: it changes ABI)	2007-01-23 10:01:19 +00:00
Pawel Jakub Dawidek	9cb9930ea6	Softc may be NULL in g_journal_orphan(), so don't be surprised.	2006-12-02 09:10:29 +00:00
Pawel Jakub Dawidek	95de128d55	Fix ia64 build breakage.	2006-11-02 16:24:18 +00:00
Pawel Jakub Dawidek	41517ab2e9	- Use g_duplicate_bio() instead of g_clone_bio(), so there memory is allocated with M_WAITOK flag. - Check 'buf' instead of 'error' so Prevent is not confused. CID: 1562, 1563 Found by: Coverity Prevent analysis tool	2006-11-02 09:14:18 +00:00
Pawel Jakub Dawidek	1506db2163	I want CPU number here. Noticed by: ru	2006-11-02 09:01:34 +00:00
Pawel Jakub Dawidek	3398f41fc0	Grr, fix one more build breakage.	2006-11-02 00:37:39 +00:00
Pawel Jakub Dawidek	501250ba60	Now, that we have gjournal in the tree add possibility to configure gmirror and graid3 in a way that it is not resynchronized after a power failure or system crash. It is safe when gjournal is running on top of gmirror/graid3.	2006-11-01 22:51:49 +00:00
Pawel Jakub Dawidek	f187490a2d	Change spaces to tabs where needed.	2006-11-01 22:16:53 +00:00
Pawel Jakub Dawidek	eba8f13797	Skip disabled CPU, because after we sched_bind() to a disabled CPU, we won't be able to exit from the thread. Function g_eli_cpu_is_disabled() stoled from kern_pmc.c. PR: 104669 Reported by: Nikolay Mirin <nik@optim.com.ru> MFC after: 1 week	2006-11-01 16:05:06 +00:00
Pawel Jakub Dawidek	8e570b35f7	Forgot to remove this line. Reported by: maxim	2006-11-01 14:09:59 +00:00
Pawel Jakub Dawidek	118c814ee8	Add BIO_FLUSH support to GSHSEC class.	2006-11-01 12:30:51 +00:00
Pawel Jakub Dawidek	0c554c7884	Add BIO_FLUSH support to GPT class.	2006-11-01 12:29:49 +00:00
Pawel Jakub Dawidek	3911a26e74	Update the code to the current sync(2) version: - Do not modify mnt_flag without mount interlock held. - Do not touch MNT_ASYNC flag, as this can lead to a race with nmount(2). Pointed out by: tegge Reviewed by: tegge	2006-11-01 09:37:11 +00:00
Pawel Jakub Dawidek	ff4ff59d64	Remove debugging code I accidentally committed.	2006-11-01 01:19:13 +00:00
Pawel Jakub Dawidek	a23d879f34	Add gjournal GEOM class (kernel side), which implements block level journaling and can be tought about marking file system as clean before doing journal switch, which easly allows to add journaling to file systems that don't have this feature. Sponsored by: home.pl	2006-10-31 21:31:00 +00:00
Pawel Jakub Dawidek	42461fba65	Implement BIO_FLUSH handling by simply passing it down to the components. Sponsored by: home.pl	2006-10-31 21:23:51 +00:00
Pawel Jakub Dawidek	1d2aee20b8	Add a new disk flag - DISKFLAG_CANFLUSHCACHE, which indicates that the disk can handle BIO_FLUSH requests. Sponsored by: home.pl	2006-10-31 21:12:43 +00:00
Pawel Jakub Dawidek	c3618c657a	Add a new I/O request - BIO_FLUSH, which basically tells providers below to flush their caches. For now will mostly be used by disks to flush their write cache. Sponsored by: home.pl	2006-10-31 21:11:21 +00:00
Pawel Jakub Dawidek	11b2174f58	Guard against invalid metadata. MFC after: 1 week	2006-10-10 15:01:47 +00:00
Ruslan Ermilov	04c7da702f	A GEOM cache can speed up read performance by sending fixed size read requests to its consumer. It has been developed to address the problem of a horrible read performance of a 64k blocksize FS residing on a RAID3 array with 8 data components, where a single disk component would only get 8k read requests, thus effectively killing disk performance under high load. Documentation will be provided later. I'd like to thank Vsevolod Lobko for his bright ideas, and Pawel Jakub Dawidek for helping me fix the nasty bug.	2006-10-06 08:27:07 +00:00
Pawel Jakub Dawidek	b7beab8d22	One more white space fix.	2006-09-30 08:23:06 +00:00
Pawel Jakub Dawidek	469e952070	Remove trailing spaces.	2006-09-30 08:16:49 +00:00
Pawel Jakub Dawidek	1517bdc897	Remove trailing spaces.	2006-09-30 08:01:11 +00:00
Pawel Jakub Dawidek	f8aa16c66c	Fix detecting of UFS1 label when mediasize%fragsize != 0. Submitted by: Stanislav Sedov PR: kern/84637 MFC after: 1 week	2006-09-16 11:24:41 +00:00
Pawel Jakub Dawidek	8abd1ad101	Add 'configure' subcommand which for now only allows setting and removing of the BOOT flag. It can be performed on both attached and detached providers. Requested by: Matthias Lederhofer <matled@gmx.net> MFC after: 1 week	2006-09-16 10:43:17 +00:00
Pawel Jakub Dawidek	5e165262f1	Add __printflike() to gctl_error(). Approved by: phk MFC after: 1 week	2006-09-16 10:39:07 +00:00
Pawel Jakub Dawidek	5a20446db8	Small fixes after adding __printflike() to gctl_error(). Approved by: phk MFC after: 3 days	2006-09-16 09:48:29 +00:00
Pawel Jakub Dawidek	dec53cdd32	Remove extra arguments. MFC after: 3 days	2006-09-16 07:47:57 +00:00
Pawel Jakub Dawidek	679f8b7e7a	Add 'show geom [addr]' ddb(4) command, which prints entire GEOM topology if no additional argument is given or details about the given GEOM object (class, geom, provider or consumer). Approved by: phk	2006-09-15 16:36:45 +00:00
Pawel Jakub Dawidek	8e007c52fd	Fix synchronization in gmirror and graid3 which I broken. Synchronization request can still have bio_to set to sc_provider (this is READ part of a synchronization request) and in this case g_{mirror,raid3}_sync() wasn't called as it should be. MFC after: 1 week	2006-09-13 15:46:49 +00:00
Pawel Jakub Dawidek	d6b910d295	Delay an orphan event if provider has still in-flight I/O requests. This way GEOM classes can safely detach from provider when an orphan event is received. This fixes 'detach with active requests' panic for gstripe/gconcat under load. PR: kern/102766 Submitted by: mjacob OK'ed by: phk MFC after: 1 week	2006-09-10 09:11:54 +00:00
John-Mark Gurney	0cca572e64	move created/detected/activated under debug level 1 to quiet the common case.. add count of active and total components to the launched line so you can see at a glance if your mirror/raid3 is complete... now: GEOM_MIRROR: Device mirror/sam launched (2/2). Reviewed by: pjd	2006-09-09 21:45:37 +00:00
Pawel Jakub Dawidek	46ee0837c2	Fix format character. Reported by: andre	2006-09-08 13:46:18 +00:00
Pawel Jakub Dawidek	fc024f7a45	Bump copyright year.	2006-09-08 10:20:44 +00:00
Pawel Jakub Dawidek	c076790223	Use __FBSDID in .c files.	2006-09-08 10:19:24 +00:00
Pawel Jakub Dawidek	6a146a1989	- Split failure probability configuration into read failure probability and write failure probability. - Allow to specify an error number to return of failure. MFC after: 3 days	2006-09-08 09:21:21 +00:00
Pawel Jakub Dawidek	7ffb6e0f6a	Fix problems with destroy and forcible destroy functionality: - hold/release device in start/done routines, this will probably slow down things a bit, but previous code was racy; - only release device if g_gate_destroy() failed - if it succeeded device is dead and there is nothing to release; - various other changes which makes forcible destruction reliable. MFC after: 3 days	2006-09-05 21:56:00 +00:00
Warner Losh	1a3c917f9d	while (0); -> while (0) in multi-line macros	2006-08-17 22:50:33 +00:00
Pawel Jakub Dawidek	1894472106	Handle MSDOS file systems properly. Before the change file systems created on Windows XP (and others maybe) were not detected. We detected only those created with newfs_msdos(8). Submitted by: Tobias Reifenberger <treif@mayn.de> style(9)ified by: pjd	2006-08-12 15:34:15 +00:00
Pawel Jakub Dawidek	d88fe2bfc7	Verify if a label doesn't point to the parent directory.	2006-08-12 15:30:24 +00:00
Pawel Jakub Dawidek	2bd4ade694	Before using byte offset for IV creation, covert it to little endian. This way one will be able to use provider encrypted on eg. i386 on eg. sparc64. This doesn't really buy us much today, because UFS isn't endian agnostic. We retain backward compatibility by setting G_ELI_FLAG_NATIVE_BYTE_ORDER flag on devices with version number less than 2 and not converting the offset.	2006-08-11 19:09:12 +00:00
Pawel Jakub Dawidek	d04c304ddf	Forgot to bump version number after G_ELI_FLAG_READONLY flag addition.	2006-08-11 18:39:58 +00:00
Marcel Moolenaar	f56b5a43dc	Strengthen the check for a PMBR: o PMBR partitions count to the number of partitions on the disk, which means that if a PMBR entry is invalid we will not treat the MBR as a PMBR by virtue of it not describing any partitions. Previously the checks were inconsistent in that an invalid PMBR entry would be harmless when no other partitions exist (we would treat the MBR as a PMBR by virtue of it being empty), but it would be fatal when there is at least one other partition. o The partition size of a PMBR partition is one less than the media size because the GPT starts at the second sector (LBA 1) and extends to the end of the media. For backward bug-compatibility we accept a size that's exactly the media size (FreeBSD bug). Also, when the partition size can not be represented in a 32-bit integral, the partition size in the MBR is to be set to 0xFFFFFFFF. Accept this as a valid size, even if the size can be represented.	2006-08-09 20:53:01 +00:00
Pawel Jakub Dawidek	850590166f	Allow geli to operate on read-only providers. Initial patch from: vd MFC after: 2 weeks	2006-08-09 18:11:14 +00:00
Pawel Jakub Dawidek	de6f1c7c6c	Not only a request from us can be passed to g_{mirror,raid3}_worker() function, but also a request to us, in which case checking bio_cflags is wrong, because the class above us is controling it, not we. MFC after: 1 week	2006-08-09 09:41:53 +00:00
Marcel Moolenaar	d1a5c5275c	Fix a phase-ordering bug: check the mediasize and sectorsize after we obtained access. It is possible that GPT gets to taste a disk first, which means the disk has not been opened before and it will not get opened until after we checked the mediasize and sectorsize. However, since the mediasize and sectorsize are determined at open and that happens when access is optained, checking the mediasize and sectorsize before obtaining access may result in GPT rejecting the disk.	2006-08-08 21:33:26 +00:00
Yaroslav Tykhiy	776fc0e90e	Commit the results of the typo hunt by Darren Pilgrim. This change affects documentation and comments only, no real code involved. PR: misc/101245 Submitted by: Darren Pilgrim <darren pilgrim bitfreak org> Tested by: md5(1) MFC after: 1 week	2006-08-04 07:56:35 +00:00
Pawel Jakub Dawidek	3c57a41d7b	Don't use f-word in comments. We are gentlemans. Pointed out by: Maciej Sobczak	2006-08-01 23:17:33 +00:00
Yaroslav Tykhiy	f6829a059f	Fix what looks like a typo: MODULE_DEPEND() takes module names, not KLD file names; and GELI module's name is g_eli, not geom_eli. Approved by: pjd (silence) MFC after: 5 days	2006-07-27 11:52:12 +00:00
Pawel Jakub Dawidek	8cfab1debb	Don't forget to initialize crp_olen field, which is used to calculate bio_completed value.	2006-07-22 10:05:55 +00:00
Pawel Jakub Dawidek	86ed3c25e1	Always allow to specify components with /dev/ prefix. MFC after: 3 days	2006-07-13 20:37:59 +00:00
Pawel Jakub Dawidek	e8c85a50ae	Only check if we're freeing a valid object if we hold the topology lock. This prevents panic under heavy load with DIAGNOSTIC compiled in.	2006-07-12 15:44:00 +00:00
Pawel Jakub Dawidek	3525bb6b98	Use proper defines instead of magic values. MFC after: 1 week	2006-07-10 21:18:00 +00:00
Pawel Jakub Dawidek	ed940a828d	When kern.geom.raid3.use_malloc tunnable is set to 1, malloc(9) instead of uma(9) will be used for memory allocation. In case of problems or tracking bugs, there are more useful tools for malloc(9) debugging than for uma(9) debugging, like memguard(9) and redzone(9). MFC after: 1 week	2006-07-09 12:25:56 +00:00
Pawel Jakub Dawidek	a3cdde5564	Remove bogus assertion. Reported by: Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 3 days	2006-07-07 14:32:27 +00:00
Pawel Jakub Dawidek	1f7fec3cb5	Allow to close access even if device is already destroyed. Reported by: Ulrich Spoerlein <uspoerlein@gmail.com> PR: kern/98093 MFC after: 1 week	2006-07-03 10:32:38 +00:00
Maxim Sobolev	d5046da865	Improve check for protective MBR. Instead of assiming that protective MBR should have only one entry of type 0xEE, consider protective MBR to be one, that has at least one entry of type 0xEE covering the whole unit. This makes GEOM_GPT compatible with disks partitioned by the Apple's BootCamp. Approved in principle by: marcel MFC After: 1 month	2006-06-26 00:32:54 +00:00
Simon L. B. Nielsen	274ede62a8	In g_dev_strategy(), when failing an IO request with EINVAL due to offset or request size which is not a multiple of the sector size, make sure that the bio is set to indicate that no data has actually been transferred. The result of this is that the file offset is no longer incremented for these requests. The fact that the file offset was incremented broke fdisk(8)'s probing of sector size for non-512 byte sector sizes. Reviewed by: phk, cperciva Submitted by: mdodd MFC after: 2 weeks	2006-06-18 22:01:15 +00:00
Pawel Jakub Dawidek	c84efdca04	Allow to use the old -a option to specify an encryption algorithm to use (for backward compatibility), but print a warning to inform about the change.	2006-06-06 22:06:24 +00:00
Pawel Jakub Dawidek	15d6ee8de5	- Unbreak the build when geli is compiled into the kernel (on as module), by silencing unfounded compiler warning. Reported by:	2006-06-06 14:48:19 +00:00
Pawel Jakub Dawidek	eaa3b91996	Implement data integrity verification (data authentication) for geli(8). Supported by: Wheel Sp. z o.o. (http://www.wheel.pl)	2006-06-05 21:38:54 +00:00
Pawel Jakub Dawidek	05bf5e8a0a	Make kern.geom.eli.overwrites sysctl a tunable as well.	2006-06-05 21:25:19 +00:00
Pawel Jakub Dawidek	4bec0ff1c4	Add g_duplicate_bio() function which does the same thing what g_clone_bio() is doing, but g_duplicate_bio() allocates new bio with M_WAITOK flag.	2006-06-05 21:13:22 +00:00
Marcel Moolenaar	ae04949bff	Fix unaligned memory accesses on Alpha and possible other platforms. By using a pointer to struct dos_partition, we implicitly tell the compiler that the pointer is 4-bytes aligned, even though we know that's not the case. The fact that we only dereference the pointer to access a byte-wide field (field dp_ptyp) is not a guarantee that the compiler will in fact use a byte-wide load. On some platforms it's more efficient to use long word or quad word loads and use bit-shifting and bit-masking to get the intended byte. On those platforms an misaligned load will be the result. The fix is to use byte-wide pointer arithmetic based on sizeof() and offsetof() to avoid invalid casts which avoids that the compiler makes invalid assumptions. Backtrace provided by: wilko@ MFC after: 1 week	2006-06-04 20:26:13 +00:00
Ceri Davies	fccfbec9f2	Remove the trailing half of a sentence which was clearly superceded by the preceding one some time during editing.	2006-05-24 11:02:32 +00:00
Pawel Jakub Dawidek	ee40c7aa76	Use G_RAID3_FOREACH_SAFE_BIO() macro instead of G_RAID3_FOREACH_BIO() in two places where g_io_request() is called. g_io_request() can free bio structure so we can't reference it after and G_RAID3_FOREACH_BIO() macro was doing this. Found by: Coverity Prevent analysis tool (with my new models) MFC after: 1 day	2006-05-04 13:01:16 +00:00
Pawel Jakub Dawidek	ffd106f5a3	We shouldn't lock the topology here - we will panic on assertion inside g_raid3_bump_syncid(). Reported by: Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 1 day	2006-04-30 22:14:17 +00:00
Pawel Jakub Dawidek	84edb86df6	- Don't hold the device sx lock when going to sleep. - Prevent possible live-lock in case of memory problems by freeing already completed requests first. Reported and tested by: markus, Bradley W. Dutton <brad-fbsd-stable@duttonbros.com> MFC after: 1 day	2006-04-28 12:18:03 +00:00
Pawel Jakub Dawidek	a2fe5c6676	- Remove dead code. - Comment possible event miss, which isn't critical, but probably can be fixed by replacing the event lock usage with the queue lock. MFC after: 2 weeks	2006-04-28 12:13:49 +00:00
Pawel Jakub Dawidek	18486a5ee3	Be sure to not destroy device twice. This is not possible in theory, but with this change there is even no theoretical race. MFC after: 2 weeks	2006-04-28 11:52:45 +00:00
Pawel Jakub Dawidek	a063667622	Be sure to not destroy device twice. This is not possible in theory, but with this change there is even no theoretical race. MFC after: 2 weeks	2006-04-28 11:47:28 +00:00
Pawel Jakub Dawidek	5af2ae28f6	geli(8) provides keys on newsession time, so remove CRD_F_KEY_EXPLICIT flag as HW crypto drivers don't support it.	2006-04-20 06:33:46 +00:00
Pawel Jakub Dawidek	c082905bb6	Fix storing offset of already synchronized data. Offset in entire array was stored in metadata instead of an offset in single disk. After reboot/crash synchronization process started from a wrong offset skipping (not synchronizing) part of the component which can lead to data corrutpion (when synchronization process was interrupted on initial synchronization) or other strange situations like 'graid3 status' showing value more than 100%. Reported, reviewed and tested by: ru Reported by: Dmitry Morozovsky <marck@rinet.ru> MFC after: 1 day	2006-04-18 13:52:11 +00:00
Pawel Jakub Dawidek	cd0d707eb7	Correct debug: we are sending child bio here, not parent bio. MFC after: 1 week	2006-04-15 18:30:42 +00:00
Martin Cracauer	3f4f4a1465	Make CCD be able to read and write Linux software raids. Supported for raid-0 with <n> disks, raid-1 with 2 disks. Manpages have examples, warnings etc. Test scripts on http://www.cons.org/cracauer/ccdconfig-linux/ Reviewed by: alfred	2006-04-13 20:35:31 +00:00
Pawel Jakub Dawidek	d3a1be900a	Pass BIO_GETATTR requests down. MFC after: 1 week	2006-04-12 12:18:44 +00:00
Pawel Jakub Dawidek	712fe9bd7a	Introduce and use delayed-destruction functionality from a pre-sync hook, which means that devices will be destroyed on last close. This fixes destruction order problems when, eg. RAID3 array is build on top of RAID1 arrays. Requested, reviewed and tested by: ru MFC after: 2 weeks	2006-04-10 10:32:22 +00:00

... 2 3 4 5 6 ...

1458 Commits