freebsd-nq

Author	SHA1	Message	Date
Andrey V. Elsukov	733a9e2783	Prevent access after free to table entry in case when user deletes partition that not yet created (changes doesn't committed to disk). PR: 148687 Approved by: mav (mentor) MFC after: 7 days	2010-07-23 06:30:01 +00:00
Ruslan Ermilov	cf1457e4fd	Fixed cache size decoding read from a label. PR: kern/144732 Submitted by: Eugene Grosbein MFC after: 3 days	2010-07-14 08:22:00 +00:00
Rui Paulo	c6b2b6fce6	Add NTFS partition type to GEOM_MBR.	2010-06-26 13:20:40 +00:00
Pawel Jakub Dawidek	2aa15ffdab	'unit' can be negative, so use signed type for it. Found by: Coverity Prevent CID: 3731 MFC after: 3 days	2010-06-14 21:58:55 +00:00
Pawel Jakub Dawidek	15725379d0	BIO_DELETE contains range we want to delete and doesn't provide any useful data, so there is no need to copy it to userland. MFC after: 3 days	2010-06-14 21:56:24 +00:00
Andriy Gapon	1bdfff2252	fix a few cases where a string is passed via format argument instead of via %s Most of the cases looked harmless, but this is done for the sake of correctness. In one case it even allowed to drop an intermediate buffer. Found by: clang MFC after: 2 week	2010-06-11 19:27:21 +00:00
Edward Tomasz Napierala	7ce513a52a	Untangle g_print_bio(), silencing Coverity. Found with: Coverity Prevent CID: 3566, 3567	2010-06-10 17:49:36 +00:00
Matt Jacob	59ccfe8176	Try and narrow the gap in which you act on an event that has been canceled. Obtained from: Jaako Heinonen MFC after: 1 month	2010-06-08 22:40:02 +00:00
Edward Tomasz Napierala	c01eb2f36b	Make sure not to pass NULL to g_orphan_provider(). Found with: Coverity Prevent CID: 3411	2010-06-05 08:00:52 +00:00
Marius Strobl	36066952e5	Don't leak memory on destruction. Reviewed by: marcel MFC after: 3 days	2010-06-02 17:17:11 +00:00
Andriy Gapon	56b3acd001	g_label: fix possible NULL pointer dereference in case glabel debug level is >= 1 and gp->provider list is empty for some reason Found by: clang static analyzer MFC after: 4 days	2010-05-31 09:10:39 +00:00
Marius Strobl	785c3f7ea4	Fix some whitespace nits.	2010-05-24 17:33:02 +00:00
Nathan Whitehorn	0532c3a5a5	Teach gpart about bootcode on APM.	2010-05-16 22:21:33 +00:00
Matt Jacob	87e7f7be89	Yet another potential dereference of a dead provider. Sponsored by: Panasas MFC after: 1 week	2010-05-14 21:27:39 +00:00
Matt Jacob	1371a457d9	Make sure to check that the active provider pointer points to something before dereferencing the pointer. Sponsored by: Pansas MFC after: 1 week	2010-05-14 16:56:18 +00:00
Jaakko Heinonen	3535526b15	- Don't return EAGAIN from gv_unload(). It was used to work around the deadlock fixed in r207671. - Wait for worker process to exit at class unload. The worker process was not guaranteed to exit before the linker unloaded the module. - Use 0 as the worker process exit status instead of ENXIO and style the NOTREACHED comment. Reviewed by: lulf X-MFC after: r207671	2010-05-10 19:12:23 +00:00
Jaakko Heinonen	5a279fc5fc	In g_zero_destroy_geom(), return 0 instead of EBUSY in the success case. EBUSY was probably used as a workaround for the deadlock fixed in r207671. Approved by: pjd X-MFC after: r207671	2010-05-10 19:08:53 +00:00
Ulf Lilleengen	42a9ad6697	- Remove obsolete flags. MFC after: 1 week	2010-05-08 16:19:17 +00:00
Jaakko Heinonen	9061251f9a	Fix deadlock between GEOM class unloading and withering. Withering can't proceed while g_unload_class() blocks the event thread. Fix this by not running g_unload_class() as a GEOM event and dropping the topology lock when withering needs to proceed. PR: kern/139847 Silence on: freebsd-geom	2010-05-05 18:53:24 +00:00
Marcel Moolenaar	c74f160cb0	Re-calculate a geometry when reprobing as well. PR: kern/145452 Reported by: "Andrey V. Elsukov" <bu7cher@yandex.ru>	2010-04-25 01:56:39 +00:00
Marcel Moolenaar	6f702278e6	Fix undo for schemes that have internal partitions. Internal partitions do not constitute user-visible or active partitions and as such should not prevent undoing pending operations. While here, initialize the last usable sector for the placeholder geom based on the null scheme, created to allow undoing the destruction of a scheme. This gives consistent output with "gpart show". Based on a patch from: "Andrey V. Elsukov" <bu7cher@yandex.ru>	2010-04-25 00:54:11 +00:00
Marcel Moolenaar	3f71c319f4	Implement the resize verb and add support for resizing partitions for all schemes but EBR. Quality work by Andrey! Submitted by: "Andrey V. Elsukov" <bu7cher@yandex.ru>	2010-04-23 03:11:39 +00:00
Jaakko Heinonen	002d1d1c38	Fix ddb(4) "show geom addr" command when INVARIANTS is enabled. Don't assert that the topology lock is held when g_valid_obj() is called from debugger. MFC after: 1 week	2010-04-19 20:07:35 +00:00
Pawel Jakub Dawidek	31c4cef715	Use lower priority for GELI worker threads. This improves system responsiveness under heavy GELI load. MFC after: 3 days	2010-04-15 16:34:06 +00:00
Andriy Gapon	2a842317eb	g_io_check: respond to zero pp->mediasize with ENXIO Previsouly this condition was reported with EIO by bio_offset > mediasize check. Perhaps that check should be extended to bio_offset+bio_length > mediasize. MFC after: 1 week	2010-04-15 08:39:56 +00:00
Luigi Rizzo	83f8218814	fix copyright format, as requested by Joel Dahl	2010-04-13 09:56:17 +00:00
Luigi Rizzo	c36cf6fbbc	make code compile with KTR	2010-04-13 09:53:08 +00:00
Luigi Rizzo	1831a90ac5	Bring in geom_sched, support for scheduling disk I/O requests in a device independent manner. Also include an example anticipatory scheduler, gsched_rr, which gives very nice performance improvements in presence of competing random access patterns. This is joint work with Fabio Checconi, developed last year and presented at BSDCan 2009. You can find details in the README file or at http://info.iet.unipi.it/~luigi/geom_sched/	2010-04-12 16:37:45 +00:00
Andriy Gapon	8f128ff559	g_vfs_open: allow only one mount per device vnode In other words, deny multiple read-only mounts of the same device. Shared read-only mounts should theoretically be possible, but, unfortunately, can not be implemented correctly using current buffer cache code/interface and results in an eventual system crash. Also, using nullfs seems to be a more efficient way to achieve the same goal. This gets us back to where we were before GEOM and where other BSDs are. Submitted by: pjd (idea for checking for shared mounting) Discussed with: phk, pjd Silence from: fs@, geom@ MFC after: 2 weeks	2010-04-03 08:53:53 +00:00
Andriy Gapon	1b4bc5f851	bo_bsize: revert r205860 and take an alternative approch in getblk In r205860 I missed the fact that there is code that strongly assumes that devvp bo_bsize is equal to underlying provider's sectorsize. In those places it is hard to obtain the sectorsize in an alternative way if devvp bo_bsize is set to something else. So, I am reverting bo_bsize assigment in g_vfs_open. Instead, in getblk I use DEV_BSIZE block size for b_offset calculation if vp is a disk vp as reported by vn_isdisk. This should coinside with vp being a devvp. Reported by: Mykola Dzham <i@levsha.me> Tested by: Mykola Dzham <i@levsha.me> Pointyhat to: avg MFC after: 2 weeks X-ToDo: convert bread(devvp) in all fs to use bo_bsize-d blocks	2010-04-02 15:12:31 +00:00
Andriy Gapon	0c04f06072	g_vfs_open: correctly set devvp.v_bufobj.bo_bsize to DEV_BSIZE Because of how breadn -> bufstrategy -> g_vfs_strategy are currently implemented, bread on devvp always expects DEV_BSIZE block size. Thus, devvp bo_bsize must always be DEV_BSIZE irrespective of media properties or filesystem implementation details. Reviewed by: mckusick MFC after: 2 weeks	2010-03-29 20:34:25 +00:00
Matt Jacob	2b4969ff9e	Change how multipath labels are created and managed. This makes it easier to support various storage boxes which really aren't active-active. We only write the label on the first provider. For all other providers we just "add" the disk. This also allows for an "add" verb. A usage implication is that you should specificy the currently active storage path as the first provider. Note that this does not add RDAC-like functionality, but better allows for autovolumefailover configurations (additional checkins elsewhere will support this). Sponsored by: Panasas MFC after: 1 month	2010-03-29 18:04:06 +00:00
Alexander Motin	a5be8eb530	Do not fetch precise time of request start when stats collection disabled. Reviewed by: pjd, phk	2010-03-24 18:04:25 +00:00
Matt Jacob	b5dce617d8	Add 'rotate' and 'getactive' verbs to provide some control and information about what the currently active path is. Sponsored by: Panasas MFC after: 1 month	2010-03-21 15:02:47 +00:00
Jaakko Heinonen	a41aa4a789	Escape characters unsafe for XML output in GEOM class, instance and provider names. - Characters in range 0x01-0x1f except '\t', '\n', and '\r' are replaced with '?'. Those characters are disallowed in XML. - '&', '<', '>', '\'', '"' and characters in range 0x7f-0xff are replaced with XML numeric character reference. If the kern.geom.confxml sysctl provides invalid XML, libgeom geom_xml2tree() fails and utilities using it do not work. Unsafe characters are common in msdosfs and cd9660 labels. PR: kern/104389 Submitted by: Doug Steinwand (original version) Reviewed by: pjd Discussed on: freebsd-geom MFC after: 3 weeks	2010-03-20 16:16:13 +00:00
Pawel Jakub Dawidek	b0990a1dae	Simplify loops.	2010-03-18 13:11:43 +00:00
Ulf Lilleengen	77d2a01ea8	- Set missing flag when initiating a plex rebuild with the rebuildparity command. - Check if plex is already syncing or rebuilding before initiating a parity rebuild or check.	2010-03-08 21:16:28 +00:00
Pawel Jakub Dawidek	32115b105a	Please welcome HAST - Highly Avalable Storage. HAST allows to transparently store data on two physically separated machines connected over the TCP/IP network. HAST works in Primary-Secondary (Master-Backup, Master-Slave) configuration, which means that only one of the cluster nodes can be active at any given time. Only Primary node is able to handle I/O requests to HAST-managed devices. Currently HAST is limited to two cluster nodes in total. HAST operates on block level - it provides disk-like devices in /dev/hast/ directory for use by file systems and/or applications. Working on block level makes it transparent for file systems and applications. There in no difference between using HAST-provided device and raw disk, partition, etc. All of them are just regular GEOM providers in FreeBSD. For more information please consult hastd(8), hastctl(8) and hast.conf(5) manual pages, as well as http://wiki.FreeBSD.org/HAST. Sponsored by: FreeBSD Foundation Sponsored by: OMCnet Internet Service GmbH Sponsored by: TransIP BV	2010-02-18 23:16:19 +00:00
Pawel Jakub Dawidek	12f35a615a	- Style fixes. - Prefer strlcpy() over strncpy().	2010-02-18 22:29:35 +00:00
Pawel Jakub Dawidek	f24bf7522d	Correct comment.	2010-02-18 22:28:12 +00:00
Pawel Jakub Dawidek	e5131ab452	Log attach just like we log detach.	2010-02-18 22:27:38 +00:00
Oleksandr Tymoshenko	45a7687f90	- Give geom_redboot taste of flash/spi. Now there is another provider of redboot partitions. This patch was missed during merge from projects/mips.	2010-02-03 01:12:19 +00:00
Xin LI	38907b4cc7	Prevent NULL deference by checking return value of gctl_get_asciiparam. MFC after: 2 weeks	2010-02-02 22:25:22 +00:00
Marcel Moolenaar	cd18ad8347	Export the UUID of the partition in the XML. The partition UUID is used by EFI's device path to identify a partition. In order for FreeBSD to add EFI boot options, proper device paths need to be constructed.	2010-01-30 23:13:19 +00:00
Ivan Voras	49e232f2c9	Go through with write_metadata() non-error-handling and make it return "void". This is mostly to avoid dead variable assignment warning by LLVM. No functional change. Pointed out by: trasz Approved by: gnn (mentor)	2010-01-25 20:51:40 +00:00
Edward Tomasz Napierala	fdf64c5752	Remove unneeded variables. Found with: clang	2010-01-25 17:00:21 +00:00
Edward Tomasz Napierala	1373012510	Remove pointless assignment. Found with: clang	2010-01-25 16:58:58 +00:00
Edward Tomasz Napierala	dc9098605e	Remove some pointless variable assignments. Found with: clang	2010-01-25 16:55:30 +00:00
Edward Tomasz Napierala	0a36cb97a8	Remove unused variable. Found with: clang	2010-01-25 16:10:22 +00:00
Xin LI	35daa28f30	Expose stripe offset and stripe size through libgeom and geom(8) userland utilities. Reviewed by: pjd, mav (earlier version)	2010-01-17 06:20:30 +00:00
Edward Tomasz Napierala	b3f9d8c804	Add gmountver, disk mount verification GEOM class. Note that due to e.g. write throttling ('wdrain'), it can stall all the disk I/O instead of just the device it's configured for. Using it for removable media is therefore not a good idea. Reviewed by: pjd (earlier version)	2010-01-16 09:52:49 +00:00
Alexander Motin	0c8fd0c8ac	Change the way in which zero stripesize is handled. Instead of reporting zero stripeoffset in such case (as if device has no stripes), report offset from the beginning of the media (as if device has single infinite stripe). This gives partitioning tools information, required to guess better partition alignment, in case if hardware doesn't report it's stripe size. For example, it should give disklabel info about odd offset made by fdisk.	2010-01-06 13:14:37 +00:00
Alexander Motin	8de5811320	Move wakeup() out of mutex to reduce contention.	2010-01-05 10:52:21 +00:00
Alexander Motin	86de0ca52c	Move wakeup() out of mutex to reduce contention.	2010-01-05 10:30:56 +00:00
Alexander Motin	06b215fd3a	Slightly optimize XOR calculation.	2010-01-05 02:06:05 +00:00
Marcel Moolenaar	665bb830e2	Properly return the UUID represented by the alias. PR: 142174 Submitted by: Przemyslaw Laczynski <torindel@gmail.com> Pointy hat to: rpaulo	2010-01-02 01:02:59 +00:00
Alexander Motin	0d883b11e3	Call wakeup() only for the first request on the queue.	2009-12-30 17:23:27 +00:00
Antoine Brodin	13e403fdea	(S)LIST_HEAD_INITIALIZER takes a (S)LIST_HEAD as an argument. Fix some wrong usages. Note: this does not affect generated binaries as this argument is not used. PR: 137213 Submitted by: Eygene Ryabinkin (initial version) MFC after: 1 month	2009-12-28 22:56:30 +00:00
Alexander Motin	1c80ec0a6b	Add BIO_DELETE support to ada(4): - For SSDs use TRIM feature of DATA SET MANAGEMENT command, as defined by ACS-2 specification working draft. - For CompactFlash use CFA ERASE command, same as ad(4) does. With this patch, `newfs -E /dev/ada1` was able to restore write speed of my heavily weared OCZ Vertex SSD (firmware 1.4) up to the initial level for the most part of it's capacity. Previous 1.3 firmware, even reportiong TRIM capabilty bit set, was not working, reporting ABORT error for every DSM command. I have no idea whether it is normal, but for some reason it takes 200ms to handle any TRIM command on this drive, that was making delete extremely slow. But TRIM command is able to accept long list of LBAs and the length of that list seems doesn't affect it's execution time. Implemented request clusting algorithm allowed me to rise delete rate up to reasonable numbers, when many parallel DELETE requests running.	2009-12-28 20:08:01 +00:00
Alexander Motin	5f9b1143ac	Make geom_concat to passthrough stripe parameters of the first component, hoping that rest will fit.	2009-12-24 14:32:21 +00:00
Alexander Motin	113d8e5046	As soon as geom_raid3 reports it's own stripe as sector size, report largest underlying provider's stripe, multiplied by number of data disks in array, due to transformation done, as array stripe.	2009-12-24 13:38:02 +00:00
Alexander Motin	92f60381d9	As soon as mirror has no own stripes, report largest stripe of unrerlying components, hoping others fit, if they are not equal.	2009-12-24 12:17:22 +00:00
Alexander Motin	8b30323843	Add two disk ioctls, giving user-level tools information about disk/array stripe (optimal access block) size and offset.	2009-12-24 11:05:23 +00:00
Alexander Motin	f00919d2fc	Make geom_stripe report it's stripe size to upper layers.	2009-12-24 10:43:44 +00:00
Alexander Motin	d4060fa67d	Make graid3 fallback to malloc() when component request size is bigger then maximal prepared UMA zone size. This fixes crash with MAXPHYS > 128K.	2009-12-21 23:31:03 +00:00
Rui Paulo	33f7a4124d	Add Microsoft and NetBSD partition types handling.	2009-12-14 20:26:27 +00:00
Rui Paulo	f13174303d	Simplify partition type parsing by using a data-oriented model. While there add more Apple and Linux partition types.	2009-12-14 20:04:06 +00:00
Alexander Motin	891852cc12	Change 'load' balancing mode algorithm: - Instead of measuring last request execution time for each drive and choosing one with smallest time, use averaged number of requests, running on each drive. This information is more accurate and timely. It allows to distribute load between drives in more even and predictable way. - For each drive track offset of the last submitted request. If new request offset matches previous one or close for some drive, prefer that drive. It allows to significantly speedup simultaneous sequential reads. PR: kern/113885 Reviewed by: sobomax	2009-12-03 21:47:51 +00:00
Edward Tomasz Napierala	3ce9ca8947	Provide a set of sysctls and tunables to disable device node creation for specific "kinds" of disk labels - for example, GPT UUIDs. Reason for this is that sometimes, other GEOM classes attach to these device nodes instead of the proper ones - e.g. they attach to /dev/gptid/XXX instead of /dev/ada0p2, which is annoying. Reviewed by: pjd (earlier version) MFC after: 1 month	2009-11-28 11:57:43 +00:00
Rui Paulo	f9d551f7df	Add a missing check for Apple HFS partitions. MFC after: 1 week	2009-11-12 19:30:49 +00:00
Robert Noland	a59a131093	We need to allocate space for the header in the create path also. This fixes a null pointer dereference with "gpart create -s GPT" after the previous commit. Reported by: Yuri Pankov Pointyhat to: me MFC after: 1 week	2009-11-12 16:28:39 +00:00
Robert Noland	1c2dee3cc9	Fix handling of GPT headers when size is > 92 bytes. It is valid for an on-disk GPT header to report a header size which is greater than 92 bytes. Previously, we would read in the sector and copy only the 92 bytes that we know how to deal with before calculating the checksum for comparison. This meant that when we did the checksum, we overshot the buffer and took in random memory, so the checksum would fail. We now determine the size of the header and allocate enough space to preserve the entire on-disk contents. This allows us to be correctly calculate the checksum and be able to modify and write the header back to the disk, while preserving data that we might not understand. Reported by: Kris Weston Approved by: marcel@ MFC after: 2 weeks	2009-11-07 17:29:03 +00:00
Robert Noland	e80d42dda2	Set the active flag in the PMBR when we install bootcode on a GPT partitioned disk. Some BIOS require this to be set before they will boot the device. Approved by: marcel MFC after: 2 weeks	2009-10-14 19:24:01 +00:00
Pawel Jakub Dawidek	f8727e71d7	If provider is open for writing when we taste it, skip it for classes that depend on on-disk metadata. This was we won't attach to providers that are used by other classes. For example we don't want to configure partitions on da0 if it is part of gmirror, what we really want is partitions on mirror/foo. During regular work it works like this: if provider is open for writing a class receives the spoiled event from GEOM and detaches, once provider is closed the taste event is send again and class can rediscover its metadata if it is still there. This doesn't work that way when new class arrives, because GEOM gives all existing providers for it to taste, also those open for writing. Classes have to decided on their own if they want to deal with such providers (eg. geom_dev) or not (classes modified by this commit). Reported by: des, Oliver Lehmann <lehmann@ans-netz.de> Tested by: des, Oliver Lehmann <lehmann@ans-netz.de> Discussed with: phk, marcel Reviewed by: marcel MFC after: 3 days	2009-10-09 09:42:22 +00:00
Ulf Lilleengen	a8a3cd7d9d	- Improve error message consistency and wording.	2009-10-05 08:44:31 +00:00
Marcel Moolenaar	b61808630d	The first 96 bytes may not be zeroes. It can contain trivial boot code that merely emits an error and waits for a key press before rebooting. The error being that extended partitions are not bootable. The origin is presumed to be Windows 2000; Windows XP does not do this... For now, ignore the first 96 bytes when checking that the EBR is (for the most part) all zeroes. Tested by: Mario Lobo <mlobo@digiart.art.br> MFC after: 1 week	2009-09-28 23:52:47 +00:00
Marcel Moolenaar	87f4470620	Don't create more partitions than can fit in the table by checking that the index is within bounds.	2009-09-24 06:00:49 +00:00
Edward Tomasz Napierala	bb3fd7ff4f	Remove unused variable.	2009-09-08 17:20:17 +00:00
Alexander Motin	18e42503ed	Do not check proper request alignment here in geom_dev in production. It will be checked any way later by g_io_check() in g_io_schedule_down(). It is only needed here to not trigger panic from additional check, when INVARIANTS enabled. So cover it with #ifdef INVARIANTS. It saves two 64bit divisions per request.	2009-09-08 05:46:38 +00:00
Alexander Motin	7fc019af65	MFp4: Remove msleep() timeout from g_io_schedule_up/down(). It works fine without it, saving few percents of CPU on high request rates without need to rearm callout twice per request.	2009-09-06 19:33:13 +00:00
Pawel Jakub Dawidek	b740e905a4	Add support for changing providers priority. Submitted by: Mel Flynn	2009-09-06 06:52:06 +00:00
Alexander Motin	af582ea7af	Remove artificial MAX_IO_SIZE constant, equal to DFLTPHYS * 2. Use MAXPHYS instead. It is NULL change for GENERIC kernel, but allows 'fast' mode to work on systems with increased MAXPHYS.	2009-09-04 19:20:46 +00:00
Pawel Jakub Dawidek	e93f5e4d25	Simplify g_disk_ident_adjust() function and allow any printable character in serial number. Discussed with: trasz Obtained from: Wheel Sp. z o.o. (http://www.wheel.pl)	2009-09-04 09:39:06 +00:00
Pawel Jakub Dawidek	07a93e6b3c	There's no need for checking result of M_WAITOK allocation.	2009-08-27 08:40:51 +00:00
Pawel Jakub Dawidek	c16ce31b31	Fix an obvious topology lock leak. MFC after: 3 days	2009-08-27 08:28:34 +00:00
Marcel Moolenaar	8530137252	The start of the EFI GPT partition in the PMBR can always be represented by CHS addressing. Don't define these fields as 0xff, but rather define them correctly. This prevents boot problems on PCs where GPT is being used. PR: 115406 Submitted by: Kent Hauser <kent@khauser.net> Approved by: re (kib)	2009-08-17 16:16:46 +00:00
Ulf Lilleengen	b79cac0f92	- Fix the issue with read access count modification on RAID-5 plexes properly. If the access counts were not increased and decreased in equal numbers by gvinum consumers, the read access count would be inconsistent with the write access count. Instead, modify the read access count with the write access count directly to prevent any inconsistencies. Approved by: re (kib)	2009-07-18 11:12:48 +00:00
Marcel Moolenaar	f43b57e32a	Revert revisions 188839 and 188868. Use of the ioctl in geom_dev.c is invalid because the ioctl happens without prior open. The ioctl got introduced to provide backward compatibility for extended partitions, but it ended up not being used because it didn't work as expected. Since there are no consumers of the ioctl and the implementation is broken, the best fix is to remove the code entirely. Spotted by: phk Approved by: re (kensmith)	2009-07-08 05:56:14 +00:00
Edward Tomasz Napierala	8edfe76ab5	Fix a panic which (reportedly) can happen when unmounting a filesystem with I/O requests in flight on kernels compiled with "options INVARIANTS". Also, make it obvious it's not right to call g_valid_obj() (and macros using it, e.g. G_VALID_CONSUMER()) without topology lock held. Approved by: re (kib) Reported by: pho	2009-07-01 20:16:29 +00:00
Edward Tomasz Napierala	fb231f3627	Make gjournal work with kernel compiled with "options DIAGNOSTIC". Previously, it would panic immediately. Reviewed by: pjd Approved by: re (kib)	2009-06-30 14:34:06 +00:00
Ulf Lilleengen	ac2a008e69	- Apply the same naming rules of LVM names as done in the LVM code itself. PR: kern/135874	2009-06-24 22:09:30 +00:00
John Hay	65a4957806	Do not stop the loop when an empty or deleted directory entry is found. Rather just skip over it.	2009-06-24 06:42:13 +00:00
Ivan Voras	63f4d880e0	Fix tabs, slightly improve comments. Approved by: gnn (mentor) (original) Noticed by: stas	2009-06-18 11:12:11 +00:00
Ivan Voras	452f657cb9	Add support for labels derived from GPT metadata. Approved by: gnn (mentor) Reviewed by: pjd PR: 128398 Submitted by: Marius Nuennerich < marius at nuenneri.ch >	2009-06-13 00:27:03 +00:00
Luigi Rizzo	6231f75bcf	As discussed in the devsummit, introduce two fields in the struct bio to store classification information, and a hook for classifier functions that can be called by g_io_request(). This code is from Fabio Checconi as part of his GSOC work.	2009-06-11 09:55:26 +00:00
Pawel Jakub Dawidek	cb9b72ce4a	Simplify.	2009-06-05 23:35:43 +00:00
Doug Barton	8b3bfb0509	Crank the debug level necessary to display the "Label foo is removed" and "Label for provider ..." messages up from 0 to 1.	2009-05-30 22:31:52 +00:00
Jamie Gritton	76ca6f88da	Place hostnames and similar information fully under the prison system. The system hostname is now stored in prison0, and the global variable "hostname" has been removed, as has the hostname_mtx mutex. Jails may have their own host information, or they may inherit it from the parent/system. The proper way to read the hostname is via getcredhostname(), which will copy either the hostname associated with the passed cred, or the system hostname if you pass NULL. The system hostname can still be accessed directly (and without locking) at prison0.pr_host, but that should be avoided where possible. The "similar information" referred to is domainname, hostid, and hostuuid, which have also become prison parameters and had their associated global variables removed. Approved by: bz (mentor)	2009-05-29 21:27:12 +00:00
Ulf Lilleengen	4147dd02cd	- Unbreak 64 bit platforms by casting off_t to intmax.	2009-05-26 14:15:06 +00:00
Ulf Lilleengen	6d66da20b7	- Fix wrong print on BIO_DONE. - Use db_printf instead of printf. While here, apply this to other ddb commands as well. Pointed out by: pjd	2009-05-26 10:03:44 +00:00
Ulf Lilleengen	bf7d2c1797	- Add 'show bio' DDB command. MFC after: 3 weeks	2009-05-26 07:29:17 +00:00
Edward Tomasz Napierala	916cd41c47	Check return value of gctl_get_asciiparam(). Found with: Coverity Prevent(tm) CID: 1118	2009-05-12 16:59:50 +00:00
Attilio Rao	dfd233edd5	Remove the thread argument from the FSD (File-System Dependent) parts of the VFS. Now all the VFS_* functions and relating parts don't want the context as long as it always refers to curthread. In some points, in particular when dealing with VOPs and functions living in the same namespace (eg. vflush) which still need to be converted, pass curthread explicitly in order to retain the old behaviour. Such loose ends will be fixed ASAP. While here fix a bug: now, UFS_EXTATTR can be compiled alone without the UFS_EXTATTR_AUTOSTART option. VFS KPI is heavilly changed by this commit so thirdy parts modules needs to be recompiled. Bump __FreeBSD_version in order to signal such situation.	2009-05-11 15:33:26 +00:00
Ulf Lilleengen	d8d015cddc	- Split up the BIO queue into a queue for new and one for completed requests. This is necessary for two reasons: 1) In order to avoid collisions with the use of a BIOs flags set by a consumer or a provider 2) Because GV_BIO_DONE was used to mark a BIO as done, not enough flags was available, so the consumer flags of a BIO had to be misused in order to support enough flags. The new queue makes it possible to recycle the GV_BIO_DONE flag into GV_BIO_GROW. As a consequence, gvinum will now work with any other GEOM class under it or on top of it. - Use bio_pflags for storing internal flags on downgoing BIOs, as the requests appear to come from a consumer of a gvinum volume. Use bio_cflags only for cloned BIOs. - Move gv_post_bio to be used internally for maintenance requests. - Remove some cases where flags where set without need. PR: kern/133604	2009-05-06 19:34:32 +00:00
Ulf Lilleengen	41944888fe	- Fix a case where a RAID5 volume would think that it is supposed to grow a new subdisk after a parity rebuild.	2009-05-06 19:18:19 +00:00
Ulf Lilleengen	11c4adc49e	- Check if any plexes are doing internal maintenance before removing them.	2009-05-06 19:06:28 +00:00
Ulf Lilleengen	5a0fa8531c	- Add forgotten KASSERT.	2009-05-06 18:37:32 +00:00
Ulf Lilleengen	1d8dfc60f4	- Fix a bug where the bio_data field of the wrong BIO is freed if an error occurs when doing a RAID5 request.	2009-05-06 18:27:28 +00:00
Ulf Lilleengen	451b95f489	- GV_BIO_RETRY is not used, and it is actually impossible with more than 8 values for bio_cflags/bio_pflags.	2009-05-06 18:24:56 +00:00
Ulf Lilleengen	040272465d	- Split the queue mutex into one for the event queue and one for the BIO queue, as they do not really relate and to prepare for an additional queue to be covered by the BIO queue mutex. - Implement wrappers for fetching the next element from the event queue as well as for putting a new element into the BIO queue.	2009-05-06 18:21:48 +00:00
Ulf Lilleengen	ad75dd77e0	- Make the gvinum softc invisible to userland, as it is not needed.	2009-05-04 17:30:20 +00:00
Ulf Lilleengen	697ab8be86	- Remove assertion of topology lock remaining from 7.x gvinum. It is not needed, as the renaming only changes internal gvinum names and will not alter the geom topology. - The topology lock was not held when calling g_wither_geom after renaming.	2009-04-18 16:36:27 +00:00
Marcel Moolenaar	cce94b6583	Precision '*' expects an int and strlen() returns a size_t. Compensate.	2009-04-16 05:52:47 +00:00
Marcel Moolenaar	6ad9a99f21	Add a compat option to the EBR scheme that controls the naming of the partitions (GEOM_PART_EBR_COMPAT). When compatibility is enabled, changes to the partitioning are disallowed. Remove the device name aliasing added previously to provide backward compatibility, but which in practice doesn't give us anything. Enable compatibility on amd64 and i386.	2009-04-15 22:38:22 +00:00
Ulf Lilleengen	1de45ea74d	- Move out allocation part of different gvinum objects into its own routine and make use of it in the gvinum userland code.	2009-04-10 08:50:14 +00:00
Andrew Thompson	853a10a581	Revert r190676,190677 The geom and CAM changes for root_hold are the wrong solution for USB design quirks. Requested by: scottl	2009-04-10 04:08:34 +00:00
Marcel Moolenaar	94fe30d0ca	Don't use hexadecimal in the EBR partition names, because 'a'..'f' are more commonly known as BSD partition names. Discussed with: ivoras@	2009-04-08 16:18:16 +00:00
Andrew Thompson	31da42bab2	Add interleaving root hold tokens from the CAM probe to disk_create and geom provider tasting. This is needed for disk attachments that happen after threads are running in the boot process. Tested by: rnoland	2009-04-03 19:49:33 +00:00
Andrew Thompson	626fc9fe3d	Add a how argument to root_mount_hold() so it can be passed NOWAIT and be called in situations where sleeping isnt allowed.	2009-04-03 19:46:12 +00:00
Marcel Moolenaar	daca55549f	The 9 bytes immediately prior to the partition table can contain signatures or disk serial numbers. Don't assume those to be zero in all cases. This fixes a false negative. Tested by: avatar@mmlab.cse.yzu.edu.tw	2009-04-03 05:54:49 +00:00
Marcel Moolenaar	c146965cd1	Sharpen the saw: o PC98 uses 32-bit block numbers. Limit the scheme to 2^32-1 blocks when the media is larger. The 32-bit block numbers are implicit (16-bit cylinder * 8-bit head * 8-bit sector).	2009-03-30 01:03:58 +00:00
Marcel Moolenaar	6154e492ec	Sharpen the saw: o MBR uses 32-bit block numbers. Limit the scheme to 2^32-1 blocks when the media is larger.	2009-03-30 00:53:46 +00:00
Marcel Moolenaar	f5f875ed84	Sharpen the saw: o EBR uses 32-bit block numbers. Limit the scheme to 2^32-1 blocks when the media is larger. o Calculate the number of entries based on the rounded media size, rather than the raw media size.	2009-03-30 00:48:42 +00:00
Marcel Moolenaar	2a1c00ff2f	Sharpen the saw: o Don't create a GPT scheme underneath another scheme when the probe doesn't allow it.	2009-03-30 00:33:43 +00:00
Ulf Lilleengen	bd9337ce80	- Add files that should have been added in r190507.	2009-03-28 21:06:59 +00:00
Ulf Lilleengen	c0b9797aa8	Import the gvinum work that have been done during and after Summer of Code 2007. The work have been under testing and fixing since then, and it is mature enough to be put into HEAD for further testing. A lot have changed in this time, and here are the most important: - Gvinum now uses one single workerthread instead of one thread for each volume and each plex. The reason for this is that the previous scheme was very complex, and was the cause of many of the bugs discovered in gvinum. Instead, gvinum now uses one worker thread with an event queue, quite similar to what used in gmirror. - The rebuild/grow/initialize/parity check routines no longer runs in separate threads, but are run as regular I/O requests with special flags. This made it easier to support mounted growing and parity rebuild. - Support for growing striped and raid5-plexes, meaning that one can extend the volumes for these plex types in addition to the concat type. Also works while the volume is mounted. - Implementation of many of the missing commands from the old vinum: attach/detach, start (was partially implemented), stop (was partially implemented), concat, mirror, stripe, raid5 (shortcuts for creating volumes with one plex of these organizations). - The parity check and rebuild no longer goes between userland/kernel, meaning that the gvinum command will not stay and wait forever for the rebuild to finish. You can instead watch the status with the list command. - Many problems with gvinum have been reported since 5.x, and some has been hard to fix due to the complicated architecture. Hopefully, it should be more stable and better handle edge cases that previously made gvinum crash. - Failed drives no longer disappears entirely, but now leave behind a dummy drive that makes sure the original state is not forgotten in case the system is rebooted between drive failures/swaps. - Update manpage to reflect new commands and extend it with some examples. Sponsored by: Google Summer of Code 2007 Mentored by: le Tested by: Rick C. Petty <rick-freebsd2008 -at- kiwi-computer.com>	2009-03-28 17:20:08 +00:00
Marcel Moolenaar	ee94c7ef01	Sharpen the saw: o BSD uses 32-bit block numbers. Limit the scheme to 2^32-1 blocks when the media is larger.	2009-03-27 05:48:42 +00:00
Marcel Moolenaar	d01a198be7	Sharpen the saw: o Don't create an APM scheme underneath another scheme when the probe doesn't allow it. o APM uses 32-bit block numbers. Limit the scheme to 2^32-1 blocks when the media is larger.	2009-03-27 05:35:12 +00:00
Marcel Moolenaar	f3548c023e	Change the priority from high to normal. This makes sure that the BSD or GPT schemes can take precedence as appropriate.	2009-03-26 16:42:24 +00:00
Ivan Voras	f7b16839ba	Create GEOM labels from UFS IDs, e.g. /dev/ufsid/49c97b1faa2adc43. UFS IDs are always present and can be used to identify file systems (useful if hardware devices move often). Actually-by: pjd Approved by: gnn (mentor)	2009-03-25 20:38:57 +00:00
Ivan Voras	15c48b9a20	Be more explicit and complain if kernel dumps are perfomed on unsupported partition types. This is to help users used to the old behaviour. Reviewed by: marcel Approved by: gnn (mentor)	2009-03-22 00:29:48 +00:00
Ivan Voras	94dd506d54	Make GEOM provider names starting with "/dev/" acceptable as well as their "raw" names. While there, change the formatting of extended MSDOS partitions so that the dot (".") is not used to separate two numbers (which kind of looks like the whole is a decimal number). Use "+" instead, which also hints that the second part of the name is the offset from the start of the partition in the first part of the name. Also change the offset from decimal to hexadecimal notation, simply for aesthetic reasons and future compatibility. GEOM_PART is the default in 8-CURRENT but not yet in 7-STABLE so this changeset can be MFC-ed without causing major problems from the second part. Reviewed by: marcel Approved by: gnn (mentor) MFC after: 2 weeks	2009-03-19 14:23:17 +00:00
Pawel Jakub Dawidek	c5d387d010	Detach GELI providers on shutdown/reboot, which will allow providers underneath to close properly. Reported, reviewed and tested by: guido MFC after: 1 week	2009-03-16 19:31:08 +00:00
Guido van Rooij	921eec2694	Backout this commit whil a better solution is developed	2009-03-13 08:13:51 +00:00
Yoshihiro Takahashi	8753b93d95	Move the PC98_[MS]ID_* defines from g_part_pc98.c to diskpc98.h. Reviewed by: marcel	2009-03-11 13:15:42 +00:00
Sam Leffler	664c0b48bf	o disallow write to RedBoot and FIS directory partitions; these are painful to resurrect (maybe honor foot shooting bit in kern.geom_debugflags) o fix match macro so we now recognize we want to merge FIS dir with RedBoot config parameters even if we don't actually do it	2009-03-11 01:12:52 +00:00
Guido van Rooij	c5f79858ff	When attaching a geli on boot make sure that it is detached upon last close. (needed for a gmirror to properly shutdown upon reboot when a geli is on top the gmirror)	2009-03-10 15:23:43 +00:00
Yoshihiro Takahashi	9541ada9a0	Restore the return statement. It was accidentally removed by rev 188429.	2009-03-10 11:14:03 +00:00
Sam Leffler	443f1e7991	add geom_redboot, a geom module that exports RedBoot FIS partitions as named slices in dev/redboot/*	2009-03-09 23:18:36 +00:00
Marcel Moolenaar	232c8bf888	o When creating the EBR scheme, set the number of entries properly. Otherwise the minimum of 1 is used and you can only insert a single partition/slice and only at sector 0 (index 1). o When adding a partition/slice, recalculate the index after the start and size of the partition/slice are adjusted to make them a multiple of the track size. Since the precheck method sets the index based on the start of the partition as provided by the user, we know that we're off by at most 1 and adjusting the index is safe.	2009-02-21 19:25:13 +00:00
Marcel Moolenaar	4dedfc44e7	Add bootcode handling.	2009-02-21 07:01:21 +00:00
Marcel Moolenaar	59c532c500	Provide compatibility symlink for logical partitions: 1. Extend geom_dev by having it create the symlink (i.e. call make_dev_alias) based on the DIOCGPROVIDERALIAS ioctl. In this way the functionaility is generic and thus usable by any geom/provider. 2. Have g_part handle said ioctl through the devalias method, so that it's under control of the scheme itself. By design the alias will not be created for newly added partitions.	2009-02-20 04:48:40 +00:00
Marcel Moolenaar	507a0d4a6c	Fix an infinite loop created when the last logical partition is removed.	2009-02-20 04:10:31 +00:00
Marcel Moolenaar	09a278e14d	Add a default implementation for pre-check. It should always succeed if not implemented. Pointy hat: marcel	2009-02-17 18:24:58 +00:00
Marcel Moolenaar	832cdc2ca7	Remove gpt_offset and related code. It was introduced for use by the BSD scheme, ended up not to be needed. Remove to avoid abuse and to keep the bloat to a minimum.	2009-02-17 04:12:10 +00:00
Marcel Moolenaar	e24c8a3fb8	Add support to add, delete and modify logical partitions, as well as to create and destroy the extended partitioning scheme. In other words: full support.	2009-02-16 03:54:28 +00:00
Marcel Moolenaar	7ca4fa83ec	Add method precheck to the g_part interface. The precheck method allows schemes to reject the ctl request, pre-check the parameters and/or modify/set parameters. There are 2 use cases that triggered the addition: 1. When implementing a R/O scheme, deletes will still happen to the in-memory representation. The scheme is not involved in that operation. The pre-check method can be used to fail the delete up-front. Without this the write to disk will typically fail, but at that time the delete already happened. 2. The EBR scheme uses a linked list to record slices. There's no index. The EBR scheme defines the index as a function of the start LBA of the partition. The add verb picks an index for the range and then invokes the add method of the scheme to fill in the blanks. It is too late for the add method to change the index. The pre-check is used to set the index up-front. This also (silently) overrides/nullifies any (pointless) user-specified index value.	2009-02-15 22:18:16 +00:00
Ulf Lilleengen	af2c6a1332	- Use the correct argument when determining the buffer size. PR: kern/131575 MFC after: 2 days	2009-02-11 18:13:20 +00:00
Warner Losh	51f53a08e0	Fix g_part_dumpconf and g_part_name prototpyes. Submitted by: marcel@	2009-02-10 02:43:07 +00:00
Marcel Moolenaar	5d68db5bc8	Add the EBR scheme. The EBR scheme supports the Extended Boot Records found inside extended partitions and used to create logical partitions. At this time write/modify support is not (yet) present. The EBR and MBR schemes both check the parent scheme. The MBR will back-off when nested under another MBR, whereas the EBR only nests under a MBR.	2009-02-08 23:51:44 +00:00
Marcel Moolenaar	665557a4ec	Allow gpe_offset to be set by the scheme. When gpe_offset is zero, or invalid, initialize it to the start of the partition. Adjust the mediasize when the offset lies somewhere inside the partition.	2009-02-08 23:39:30 +00:00
Marcel Moolenaar	165651a553	o Add the "PART::scheme" attribute that returns the name of the underlying partitioning scheme. o Put the start and end of the partition in the XML configuration. The start and end are the LBAs of the first and last sector (resp.) of the partition. They are currently identical to the offset and size attributes, which describe the partition as an offset and size in bytes, but may not in the future. The start and end will be used for the logical partition boundaries and may include metadata. The offset and size will always represent the useful storage space within the partition. Typically these two notions are the same, but for logical partitions in an extended partition, the EBR is more naturally treated as being part of the partition.	2009-02-08 20:15:08 +00:00
Warner Losh	f4fddf53c7	Fix g_part_dumpconf to return void to match kobj definition. Fix g_part_name to return a const char * rather than a char *.	2009-02-08 07:05:23 +00:00
Marcel Moolenaar	da3ee90988	In g_handleattr(), set bp->bio_completed also for the case where len is 0. Otherwise g_getattr() will never succeed when it is handled by g_handleattr_str().	2009-02-03 07:07:13 +00:00
Marcel Moolenaar	709a626613	Constify val in g_handleattr() and str in g_handleattr_str(). This allows passing string constants to g_handleattr_str().	2009-02-01 01:50:09 +00:00
Ed Schouten	739b705c7e	Remove unused unrhdr from GEOM character device module. Now that make_dev() doesn't require unit numbers to be unique, there is no need to use an unrhdr here to generate the numbers. Remove the entire init-routine, because it is optional.	2009-01-24 18:23:19 +00:00
Edward Tomasz Napierala	38153e80f7	Prevent a panic that happens on SMP machines when removing a disk with many writes queued up. Reviewed by: phk, scottl Approved by: rwatson (mentor) Sponsored by: FreeBSD Foundation	2009-01-11 13:51:04 +00:00
Marius Strobl	c825397790	- Don't enforce an upper-bound to the number of sectors or heads, allowing the full 16-bit width of the corresponding fields in the VTOC8 label to be used. The removed limits basically only held true for providers labeled using the synthetic geometry provided by cam_calc_geometry(9) but neither SCSI disks labeled with Solaris nor sufficiently large ATA disks. - Given that providers (originally) labeled with Solaris typically use the native geometry as reported by the target while FreeBSD typically uses a synthetic one put the message complaining about mismatching geometries between what the label indicates and what GEOM thinks the provider has, which we generally can't help, under bootverbose in order to not unnecessarily scare users. - For informational purposes add the non-matching values to the message complaining about them, similar to what r186501 did for g_part_bsd_read() except also indicating the origin of the values. - Make it clear that the messages emitted by this code refer to the VTOC8 support rather than to another existing scheme or to VTOC32.	2009-01-06 14:10:10 +00:00
Marcel Moolenaar	3d556594df	Don't enforce an upper-bound to the number of sectors or heads that that the provider has. The limits we imposed were PC BIOS specific and not always applicable.	2009-01-06 06:47:53 +00:00
Marcel Moolenaar	90886ebcc2	Improve probing. o Don't check the dummy fields. o The entry is unused if either dp_mid is 0 or dp_sid is 0. o The start or end cylinder cannot be 0. o The start CHS cannot be equal to the end CHS. Submitted by: nyan	2009-01-04 07:32:06 +00:00
Ulf Lilleengen	2155338ac1	- Fix an issue with access permissions to underlying disks used by a gvinum plex. If the plex is a raid5 plex, and is being written to, parity data might have to be read from the underlying disks, requiring them to be opened for reading as well as writing. MFC after: 1 week	2008-12-27 14:32:39 +00:00
David E. O'Brien	62a353c0bd	When the geometry does not match the label, print out the values.	2008-12-26 20:27:32 +00:00
Edward Tomasz Napierala	ce8be7b8b0	Implement g_vfs_orphan(). Without it, the filesystem never closes the device, which means refcount on periph drivers never drops, which means cam_sim_free() never returns, which results in umass sleeping there ad infinitum. Submitted by: pjd Reviewed by: scottl, pjd Approved by: rwatson (mentor) Sponsored by: FreeBSD Foundation	2008-12-16 17:04:52 +00:00
Ulf Lilleengen	fa13e9bb0b	- Add missing word in comment.	2008-12-08 17:09:02 +00:00
Edward Tomasz Napierala	d27a975f72	Make it possible to use gjournal for the root filesystem. Previously, an unclean shutdown would make it impossible to mount rootfs at boot. PR: kern/128529 Reviewed by: pjd Approved by: rwatson (mentor) Sponsored by: FreeBSD Foundation	2008-12-06 11:33:10 +00:00
Ivan Voras	499c86ebd5	Trivial patch to show on which geom has the error been detected. Submitted by: Rick C. Petty Approved by: gnn (mentor) MFC after: 1 month	2008-12-01 15:02:00 +00:00
Marcel Moolenaar	6647711279	Allow boot code to be smaller than what the scheme expects. This effectively changes the boot code size to be an upper bound and makes the interface more flexible.	2008-12-01 00:07:17 +00:00
Marcel Moolenaar	95fc269897	Allow dumpon to a partition of type FS_UNUSED as well.	2008-11-26 05:18:27 +00:00
Ulf Lilleengen	251048a1ab	- Fix a potential NULL pointer reference. Note that this should not happen in practice, but it is a good programming practice and allows the kernel to not depend on userland correctness. - While there, make sizeof usage match the rest of the code. Found with: Coverity Prevent(tm) CID: 660, 662	2008-11-25 20:28:33 +00:00
Ulf Lilleengen	7e11b694f4	- Fix a potential NULL pointer reference. Note that this cannot happen in practice, but it is a good programming practice nontheless and it allows the kernel to not depend on userland correctness. Found with: Coverity Prevent(tm) CID: 655-659, 664-667	2008-11-25 19:13:58 +00:00
Marcel Moolenaar	1a696559de	Partition type FS_UNUSED does not mean the partition entry is unused. Unused partition entries have a partition size of zero. Therefore, partitions can have type FS_UNUSED. MFC after: 3 days	2008-11-18 05:55:58 +00:00
Marcel Moolenaar	dd0db05da2	Fix a panic caused by a corrupted table when the header is still valid. We were checking the state of the header and not the table. PR: 119868 Based on a patch from: Jaakko Heinonen <jh@saunalahti.fi> MFC after: 1 week	2008-11-06 16:51:33 +00:00
Attilio Rao	83b3bdbc8a	Improve VFS locking: - Implement real draining for vfs consumers by not relying on the mnt_lock and using instead a refcount in order to keep track of lock requesters. - Due to the change above, remove the mnt_lock lockmgr because it is now useless. - Due to the change above, vfs_busy() is no more linked to a lockmgr. Change so its KPI by removing the interlock argument and defining 2 new flags for it: MBF_NOWAIT which basically replaces the LK_NOWAIT of the old version (which was unlinked from the lockmgr alredy) and MBF_MNTLSTLOCK which provides the ability to drop the mountlist_mtx once the mnt interlock is held (ability still desired by most consumers). - The stub used into vfs_mount_destroy(), that allows to override the mnt_ref if running for more than 3 seconds, make it totally useless. Remove it as it was thought to work into older versions. If a problem of "refcount held never going away" should appear, we will need to fix properly instead than trust on such hackish solution. - Fix a bug where returning (with an error) from dounmount() was still leaving the MNTK_MWAIT flag on even if it the waiters were actually woken up. Just a place in vfs_mount_destroy() is left because it is going to recycle the structure in any case, so it doesn't matter. - Remove the markercnt refcount as it is useless. This patch modifies VFS ABI and breaks KPI for vfs_busy() so manpages and __FreeBSD_version will be modified accordingly. Discussed with: kib Tested by: pho	2008-11-02 10:15:42 +00:00
Warner Losh	0952268ecd	Add support for reading Tivo Series 1 partitioning. This likely needs a little refinement, but is good enough to commit as is. # Should look to see if I should move swab(3) into the kernel or just # provide the unoptimized routine here. Reviewed by: marcel@	2008-11-02 03:02:56 +00:00
Konstantin Belousov	f5dfdb519f	Revert r184136. Instead, push the check for crashdumpmap overflow into the MD i386 and amd64 dump code. Requested by: jhb Retested by: pho MFC after: 3 days (+ 176304 + 184136)	2008-10-31 10:11:35 +00:00
Ulf Lilleengen	86b3c6f5bc	- Import macros used in gmirror for printing gvinum debug messages and making the output more standardized. - Add a sysctl to set the verbosity of the debug messages. - While there, fixup typos and wording in the messages.	2008-10-26 17:20:37 +00:00
Marcel Moolenaar	38a2db2eb0	Invalid BSD disklabels have been created by sysinstall and are possibly still being created. The d_secperunit field contains the number of sectors of the disk and not of the slice/partition to which the disklabel applies. Rather than reject the disklabel, we now silently adjust the field. Existing code, like bslabel(8), does not seem to check the label that extensively and seems to adjust fields as a side-effect as well. In other words, it's not that important apparently, so gpart should not be too strict about it. Reported by: nyan@ Reported by: Andriy Gapon <avg@icyb.net.ua>	2008-10-25 17:21:46 +00:00
Marcel Moolenaar	66fedfca7b	Allow dumps to partitions with a tag of 0. The legacy sunlabel implementation in FreeBSD does not use VTOC information and as such as no partition types.	2008-10-22 02:08:54 +00:00
Konstantin Belousov	7a882637fe	Do not overflow crashdumpmap. Reported and tested by: pho Reviewed by: jhb MFC after: 1 week	2008-10-21 18:52:38 +00:00
Marcel Moolenaar	e1d7111a53	The active and bootable flags are not part of the type. Export the active and bootable flags as attributes in the configuration XML and allow them to be manipulated with the set/unset commands. Since libdisk treats the flags as part of the partition type, preserve behavior by keeping them included in the configuration text.	2008-10-20 04:50:47 +00:00
Attilio Rao	0d7935fd01	Remove the struct thread unuseful argument from bufobj interface. In particular following functions KPI results modified: - bufobj_invalbuf() - bufsync() and BO_SYNC() "virtual method" of the buffer objects set. Main consumers of bufobj functions are affected by this change too and, in particular, functions which changed their KPI are: - vinvalbuf() - g_vfs_close() Due to the KPI breakage, __FreeBSD_version will be bumped in a later commit. As a side note, please consider just temporary the 'curthread' argument passing to VOP_SYNC() (in bufsync()) as it will be axed out ASAP Reviewed by: kib Tested by: Giovanni Trematerra <giovanni dot trematerra at gmail dot com>	2008-10-10 21:23:50 +00:00
Ulf Lilleengen	64d62b289e	- Use the new gv_write_header function to write out the header when removing a drive to make sure that the header is in the correct format.	2008-10-02 10:01:05 +00:00
Ulf Lilleengen	d4b28d5b0f	- Remove unneeded macro since the config_length field in the header was changed to 64 bit in the new format.	2008-10-02 09:35:47 +00:00
Ulf Lilleengen	46ceb66ad3	- Make gvinum header on-disk structure consistent on all platforms by storing the gvinum header in fields of fixed size and in a big endian byte order rather than the size and byte order of the actual platform. Note that the change is backwards compatible with the old gvinum configuration format, but will save the configuration in the new format when the 'saveconfig' command is executed. Submitted by: Rick C. Petty <rick-freebsd -at- kiwi-computer.com>	2008-10-01 14:50:36 +00:00
Marcel Moolenaar	a87faebb8c	Return G_PART_PROBE_PRI_HIGH instead of G_PART_PROBE_PRI_NORM if the probe succeeds. This guarantees that the BSD scheme wins over the MBR scheme when MBR gets to probe first. Build- or link-time conditions can cause schemes to end up in the linker set in a different order. Normally BSD is before MBR in the linker set and as such get to probe first. But typically when the kernel gets rebuild or relinked, this can change.	2008-09-29 02:48:22 +00:00
Marcel Moolenaar	1e2fbcfa44	Insert the null scheme at the head. This does not change any functionality, but creates an invariant: the first element on the list is always the null scheme.	2008-09-29 02:39:02 +00:00
Marcel Moolenaar	4bf0267894	Export the partition name in the conftxt and confxml output. The conftxt output is used by libdisk, and the confxml output is used by gpart itself (gpart show -l). Submitted by: nyan@	2008-09-27 19:58:11 +00:00
Marcel Moolenaar	a455de093b	Hold the root mount while we're tasting. It is possible that a nested partition (typically the BSD disklabel) is not done tasting while the root file system is being mounted. While this is rare, it's still possible.	2008-09-27 19:29:52 +00:00
Marcel Moolenaar	404cfb5e20	Allow 255 sectors/track for the BSD disklabel. The previous limit of 63 sectors/track is too PC BIOS specific. On pc98, where the BSD disklabel is used as well, 255 sectors/track is not uncommon. Submitted by: nyan@	2008-09-27 15:28:15 +00:00
Ed Schouten	d3ce832719	Remove unit2minor() use from kernel code. When I changed kern_conf.c three months ago I made device unit numbers equal to (unneeded) device minor numbers. We used to require bitshifting, because there were eight bits in the middle that were reserved for a device major number. Not very long after I turned dev2unit(), minor(), unit2minor() and minor2unit() into macro's. The unit2minor() and minor2unit() macro's were no-ops. We'd better not remove these four macro's from the kernel, because there is a lot of (external) code that may still depend on them. For now it's harmless to remove all invocations of unit2minor() and minor2unit(). Reviewed by: kib	2008-09-26 14:19:52 +00:00
Sean Bruno	c4901b6798	Just a fixup for a KTRACE message I stumbled upon many moons ago. Reviewed by: Scott Long MFC after: 2 days	2008-09-18 15:02:19 +00:00
Ulf Lilleengen	f805f204b6	- Add a new ioctl for getting the provider name of a geom provider. - Add a routine for looking up a device and checking if it is a valid geom provider given a partial or full path to its device node. Reviewed by: phk Approved by: pjd (mentor)	2008-09-07 13:54:57 +00:00
Rui Paulo	89ea496520	Fix build.	2008-09-05 18:11:18 +00:00
Rui Paulo	87662ab391	Keep entries sorted.	2008-09-05 18:09:49 +00:00
Rui Paulo	35fa2f1bcd	Include the vendor in the partition name.	2008-09-05 16:54:07 +00:00
Rui Paulo	d7255ff42e	Detect Apple HFS GPT slices.	2008-09-05 12:49:14 +00:00
Attilio Rao	59d4932531	Decontextualize vfs_busy(), vfs_unbusy() and vfs_mount_alloc() functions. Manpages are updated accordingly. Tested by: Diego Sardina <siarodx at gmail dot com>	2008-08-31 14:26:08 +00:00
Bjoern A. Zeeb	603724d3ab	Commit step 1 of the vimage project, (network stack) virtualization work done by Marko Zec (zec@). This is the first in a series of commits over the course of the next few weeks. Mark all uses of global variables to be virtualized with a V_ prefix. Use macros to map them back to their global names for now, so this is a NOP change only. We hope to have caught at least 85-90% of what is needed so we do not invalidate a lot of outstanding patches again. Obtained from: //depot/projects/vimage-commit2/... Reviewed by: brooks, des, ed, mav, julian, jamie, kris, rwatson, zec, ... (various people I forgot, different versions) md5 (with a bit of help) Sponsored by: NLnet Foundation, The FreeBSD Foundation X-MFC after: never V_Commit_Message_Reviewed_By: more people than the patch	2008-08-17 23:27:27 +00:00
Pawel Jakub Dawidek	ed6c3e478f	Style(9).	2008-08-12 20:19:08 +00:00
Dag-Erling Smørgrav	2616144e43	Add sbuf_new_auto as a shortcut for the very common case of creating a completely dynamic sbuf. Obtained from: Varnish MFC after: 2 weeks	2008-08-09 11:14:05 +00:00
Peter Wemm	fbbc785240	Trivial commit to attempt to diagnose a svn problem. Add comment that Tivo disks are APM, but do not have a DDR record.	2008-07-22 18:05:50 +00:00
Pawel Jakub Dawidek	5527ecd9a5	Clear passphrase buffer after use. Submitted by: Fabian Keil <fk@fabiankeil.de> (a bit different version)	2008-07-20 19:56:13 +00:00
Ulf Lilleengen	14e96b45e8	- When renaming a drive, also set the drive name in the gvinum header. PR: kern/125632 Approved by: pjd (mentor) MFC after: 3 days	2008-07-19 13:53:11 +00:00
Ulf Lilleengen	56af4c6141	- Fix a logic error when updating plex configuration. Approved by: pjd (mentor)	2008-07-11 16:46:29 +00:00
Robert Watson	4f7d1876d5	Introduce a new lock, hostname_mtx, and use it to synchronize access to global hostname and domainname variables. Where necessary, copy to or from a stack-local buffer before performing copyin() or copyout(). A few uses, such as in cd9660 and daemon_saver, remain under-synchronized and will require further updates. Correct a bug in which a failed copyin() of domainname would leave domainname potentially corrupted. MFC after: 3 weeks	2008-07-05 13:10:10 +00:00
Xin LI	6c97c325ff	Avoid NULL deference. Reviewed by: ivoras	2008-06-30 15:21:42 +00:00
Ulf Lilleengen	7e7a4e1d18	- Fix spelling errors. Approved by: kib (mentor) PR: kern/124788 Submitted by: Hywel Mallett <Hywel -at- hmallett.co.uk>	2008-06-20 19:48:18 +00:00
Marcel Moolenaar	f6aa3fccce	Add the set and unset verbs used to set and clear attributes for partition entries. Implement the setunset method for the MBR scheme to control the active flag.	2008-06-18 01:13:34 +00:00
Marcel Moolenaar	d3532631de	Finish the support for partition labels and add it to the XML.	2008-06-12 19:34:07 +00:00
Marcel Moolenaar	9a764aac3f	Add the raw partition type to the XML.	2008-06-12 06:34:14 +00:00
Marcel Moolenaar	eab484f822	Add the raw partition type to the XML.	2008-06-12 06:26:36 +00:00
Marcel Moolenaar	a3354bb4a7	Add the raw partition type to the XML.	2008-06-12 05:56:03 +00:00
Marcel Moolenaar	0c132595dd	Add the raw partiton type to the XML.	2008-06-12 05:28:47 +00:00
Marcel Moolenaar	40b075d366	Add the raw partition type to the XML.	2008-06-12 05:27:23 +00:00
Marcel Moolenaar	ab1e8f04c8	Add the partition label and the raw partition type to the XML.	2008-06-12 04:43:34 +00:00
Ed Schouten	06d425f92e	Remove the distinction between device minor and unit numbers. Even though we got rid of device major numbers some time ago, device drivers still need to provide unique device minor numbers to make_dev(). These numbers are only used inside the kernel. They are not related to device major and minor numbers which are visible in devfs. These are actually based on the inode number of the device. It would eventually be nice to remove minor numbers entirely, but we don't want to be too agressive here. Because the 8-15 bits of the device number field (si_drv0) are still reserved for the major number, there is no 1:1 mapping of the device minor and unit numbers. Because this is now unused, remove the restrictions on these numbers. The MAXMAJOR definition was actually used for two purposes. It was used to convert both the userspace and kernelspace device numbers to their major/minor pair, which is why it is now named UMINORMASK. minor2unit() and unit2minor() have now become useless. Both minor() and dev2unit() now serve the same purpose. We should eventually remove some of them, at least turning them into macro's. If devfs would become completely minor number unaware, we could consider using si_drv0 directly, just like si_drv1 and si_drv2. Approved by: philip (mentor)	2008-05-29 12:50:46 +00:00
Ulf Lilleengen	4e70f1decf	- Recognize the 'volume' parameter when creating a plex. PR: kern/75632 Approved by: pjd (mentor) MFC after: 1 day	2008-05-22 10:27:03 +00:00
Pawel Jakub Dawidek	9097a8e66e	- Assert that we don't send new provider event for a provider which has G_PF_WITHER flag set. - Fix typo in assertion condition (sorry, but I forgot who report that).	2008-05-18 22:50:50 +00:00
Pawel Jakub Dawidek	f02642d79e	Play nice with DDB pager. Educated by: jhb's BSDCan presentation	2008-05-18 21:13:10 +00:00
Marcel Moolenaar	5db670520f	Implement the G_PART_DUMPCONF method for all 6 schemes. Also call the method for the (indent == NULL) case (i.e. the kern.geom.conftxt sysctl). The purpose is to extend the conftxt output with scheme- specific fields which can be used by libdisk. In particular, have the schemes dump the xs and xt fields, which contain the backward compatible values for class type and partition type. This allows libdisk to work with the legacy slicers as well as with gpart and helps/promotes migration.	2008-04-23 20:13:05 +00:00
Marcel Moolenaar	4d32fcb42b	Add the bootcode verb for installing boot code. Boot code is supported for the MBR, GPT and PC98 schemes, where GPT installs boot code into the PMBR.	2008-04-13 19:54:54 +00:00
Marcel Moolenaar	e0fbffe617	Change the order from SI_ORDER_FIRST to SI_ORDER_ANY (within SI_SUB_DRIVERS) to avoid loading schemes before all the GEOM classes have been loaded and initialized. Otherwise we may end up using mutexes that haven't been initialized (due to g_retaste() posting an event).	2008-03-29 17:33:29 +00:00
Marcel Moolenaar	b03fab128b	Add support for PC-9800 partition tables.	2008-03-28 17:58:55 +00:00
Marcel Moolenaar	856744ba93	When retasting, wither any existing GEOMs of the same class. This allows the class to create a different GEOM for the same provider as well as avoid that we end up with multiple GEOMs of the same class with the same name. For example, when a disk contains a PC98 partition table but only MBR is supported, then the partition table can be treated as a MBR. If support for PC98 is later loaded as a module, the MBR scheme is pre-empted for the PC98 scheme as expected.	2008-03-28 06:31:12 +00:00
Marcel Moolenaar	4ffca444a5	Redefine G_PART_SCHEME_DECLARE() from populating a private linker set to declaring a proper module. The module event handler is part of the gpart core and will add the scheme to an internal list on module load and will remove the scheme from the internal list on module unload. This makes it possible to dynamically load and unload partitioning schemes.	2008-03-23 01:31:59 +00:00
Marcel Moolenaar	8a8fcb0089	Add g_retaste(), which given a class will present all non-open providers to it for tasting. This is useful when the class, through means outside the scope of GEOM, can claim providers previously unclaimed. The g_retaste() function posts an event which is handled by the g_retaste_event(). Event suggested by: phk	2008-03-23 01:23:35 +00:00
Ulf Lilleengen	1cf9b83c6d	- Fix a memory leak when re-discovering a gvinum configuration. Approved by: pjd (mentor) MFC after: 1 week	2008-03-18 08:48:51 +00:00
Marcel Moolenaar	909f20c80d	Add support for VTOC8 labels (aka sun disk labels). When a label does not have VTOC information about the partitions, it will be created. This is because the VTOC information is used for the partition type and FreeBSD's sunlabel(8) does not create nor use VTOC information. For this purpose, new tags have been added to support FreeBSD's partition types.	2008-03-02 00:52:49 +00:00
Marcel Moolenaar	028de8786a	Follow-up improvements to the handling of false positives: If the partition table is empty, check to see if we have something that looks sufficiently like a BPB. On non-i386 machines, the boot sector typically doesn't contain boot code; the end of the boot sector is all zeroes. This is also where the partition table is for MBRs. We only check the sector size and cluster size, as that seems to be the most reliable across implementations, BPB versions and platforms.	2008-02-29 22:41:36 +00:00
Marcel Moolenaar	6291ef2d80	Better handle false positives. The MBR differs from the boot sector only because there's a partition table where the boot sector has boot code. Boot sectors without boot code look like a MBR for all practical purposes. This change adds a check for the partition table and fails the probe when it's obvously invalid. The assumption being that the sector contains a boot sector and not a MBR. More checks are needed to distinguish a boot secto without boot code from a (empty) MBR.	2008-02-28 22:30:41 +00:00
Andrew Thompson	764fa86761	geom_lvm(4) is now known as geom_linux_lvm(4).	2008-02-20 07:52:43 +00:00
Andrew Thompson	1332875338	Add a geom class to map Linux LVM logical volumes. The logical disks will appear as /dev/lvm/<vol group>-<logical vol>, for instance /dev/lvm/vg0-home. G_LINUX_LVM currently supports linear stripes with segments on multiple physical disks. The metadata is read only, logical volumes can not be allocated or resized. Reviewed by: Ivan Voras Previously known as geom_lvm(4), rename requested by des, phk.	2008-02-20 07:45:36 +00:00
Scott Long	7bbd40c57e	Teach the dump and minidump code to respect the maxioszie attribute of the disk; the hard-coded assumption of 64K doesn't work in all cases.	2008-02-15 06:26:25 +00:00
Andrew Thompson	15df4265ef	Unbreak build, size_t is larger on 64bit platforms.	2008-02-11 09:20:01 +00:00
Andrew Thompson	77b65eef19	Add a geom class to map Linux LVM logical volumes. The logical disks will appear as /dev/lvm/<vol group>-<logical vol>, for instance /dev/lvm/vg0-home. GLVM currently supports linear stripes with segments on multiple physical disks. The metadata is read only, logical volumes can not be allocated or resized. Reviewed by: Ivan Voras	2008-02-11 03:05:11 +00:00
Marcel Moolenaar	392ffade03	Various fixes: o BSD disklabels have relative offsets. Even for the BSD in MBR slice setup, except when the mbroffset ioctl is supported. Since we don't support that ioctl, bsdlabel(8) expects relative offsets. So, when reading an existing disklabel, correct for disklabels that mistakenly have the mbroffset offsets. o Don't take the geometry seriously, because it's untrustworthy. We do expect the numbers to be within range. This means that the secperunit field will not be computed from secpercyl and ncyls, but simply is the mediasize in sectors. o Don't enforce partitions to be aligned to track boundaries. The default label, constructed by bsdlabel(8), puts partition a at offset BBSIZE bytes, which commonly means sector 16.	2007-12-24 01:01:59 +00:00
Poul-Henning Kamp	015a11e695	Chop DIOCGDELETE from userland up in 1024 sector chunks to give geom_disk or any other bio chopping geom a reasonable size of work. Check for delivered signals between chunks, because the request size and service time is unbounded.	2007-12-16 19:38:26 +00:00
Poul-Henning Kamp	eed6cda966	Don't limit BIO_DELETE requests to MAXPHYS, they perform no data transfers, so they are not subject to the VM system limitation.	2007-12-16 18:03:31 +00:00
Marcel Moolenaar	3959198cc5	Decode as many or as few partition entries as the label claims there are. We have already checked it against the caller provided maxpart.	2007-12-09 22:44:22 +00:00
Marcel Moolenaar	4275d83ab5	Fix a bug in the add verb, where we failed to keep the list of partitions in index-order. This is assumed by the APM, MBR and BSD partitioning schemes.	2007-12-09 22:26:42 +00:00
Marcel Moolenaar	04a814ef90	Internal partitions can not be deleted or modified.	2007-12-08 23:08:42 +00:00
Marcel Moolenaar	d6bbbeebd9	Skip internal partitions in the check for (user) partitions for the destroy command. Previously a freshly created BSD disklabel could not be destroyed because of the internal partition.	2007-12-08 22:06:17 +00:00
Marcel Moolenaar	ddba264187	Add support for FS_ZFS.	2007-12-08 07:01:10 +00:00
John Baldwin	f97a705a99	Only attach to a GPT partition if it has the GPT_ENT_TYPE_FREEBSD type. XXX: This only works currently with GEOM_GPT which only exists in 6.x. XXX: I didn't add 'mbroffset' support for a GPT partition holding a BSD label as I'm not sure if they use relative or absolute offsets. MFC after: 3 days	2007-12-06 09:20:27 +00:00
Marcel Moolenaar	5aaa8fefdf	Add a BSD disklabel backend to g_part: o Disklabels can have between 8 and 20 partitions (inclusive). o No device special file is created for the raw partition. o Switch ia64 to use this backend. o No support for boot code yet.	2007-12-06 02:32:42 +00:00
John Birrell	18b0b6d137	On some arches, openssl is built with OPENSSL_NO_CAMELLIA, so the code here needs to depend on that too.	2007-11-19 08:59:32 +00:00
Maxim Konovalov	e70553c775	o s/resiserfs_sb/reiserfs_sb/. Submitted by: Ighighi	2007-11-16 19:43:26 +00:00
Pawel Jakub Dawidek	b656c1b836	Save stack only when KTR_GEOM is both compiled into the kernel and enabled in debug.ktr.mask. Because saving stack is very expensive, it's better only to do it when one really wants to. Reported by: Dan Nelson	2007-10-26 06:55:00 +00:00
John Baldwin	f352a0d45f	First cut at support for booting a GPT labeled disk via the BIOS bootstrap on i386 and amd64 machines. The overall process is that /boot/pmbr lives in the PMBR (similar to /boot/mbr for MBR disks) and is responsible for locating and loading /boot/gptboot. /boot/gptboot is similar to /boot/boot except that it groks GPT rather than MBR + bsdlabel. Unlike /boot/boot, /boot/gptboot lives in its own dedicated GPT partition with a new "FreeBSD boot" type. This partition does not have a fixed size in that /boot/pmbr will load the entire partition into the lower 640k. However, it is limited in that it can only be 545k. That's still a lot better than the current 7.5k limit for boot2 on MBR. gptboot mostly acts just like boot2 in that it reads /boot.config and loads up /boot/loader. Some more details: - Include uuid_equal() and uuid_is_nil() in libstand. - Add a new 'boot' command to gpt(8) which makes a GPT disk bootable using /boot/pmbr and /boot/gptboot. Note that the disk must have some free space for the boot partition. - This required exposing the backend of the 'add' function as a gpt_add_part() function to the rest of gpt(8). 'boot' uses this to create a boot partition if needed. - Don't cripple cgbase() in the UFS boot code for /boot/gptboot so that it can handle a filesystem > 1.5 TB. - /boot/gptboot has a simple loader (gptldr) that doesn't do any I/O unlike boot1 since /boot/pmbr loads all of gptboot up front. The C portion of gptboot (gptboot.c) has been repocopied from boot2.c. The primary changes are to parse the GPT to find a root filesystem and to use 64-bit disk addresses. Currently gptboot assumes that the first UFS partition on the disk is the / filesystem, but this algorithm will likely be improved in the future. - Teach the biosdisk driver in /boot/loader to understand GPT tables. GPT partitions are identified as 'disk0pX:' (e.g. disk0p2:) which is similar to the /dev names the kernel uses (e.g. /dev/ad0p2). - Add a new "freebsd-boot" alias to g_part() for the new boot UUID. MFC after: 1 month Discussed with: marcel (some things might still change, but am committing what I have so far)	2007-10-24 21:33:00 +00:00
Marcel Moolenaar	a1fedf914f	Add the freebsd-zfs alias. Both APM and GPT have ZFS partition types.	2007-10-21 20:02:57 +00:00

... 3 4 5 6 7 ...

1680 Commits