freebsd-dev

Author	SHA1	Message	Date
Alexander Motin	2b9c925ff0	Implement volume deletion if disk has more then one partition.	2012-05-01 09:21:21 +00:00
Alexander Motin	47e980965c	Improve DDF metadata writing.	2012-05-01 08:19:29 +00:00
Alexander Motin	00f32ecbd0	Add to GEOM RAID class module, supporting the DDF metadata format, as defined by the SNIA Common RAID Disk Data Format Specification v2.0. Supports multiple volumes per array and multiple partitions per disk. Supports standard big-endian and Adaptec's little-endian byte ordering. Supports all single-layer RAID levels. Dual-layer RAID levels except RAID10 are not supported now because of GEOM RAID design limitations. Some work is still to be done, but the present code already manages basic interoperation with RAID BIOS of the Adaptec 1430SA SATA RAID controller. MFC after: 1 month Sponsored by: iXsystems, Inc.	2012-04-30 17:53:02 +00:00
Alexander Motin	c9f545e5f9	s/gmirror/graid/	2012-04-29 19:40:50 +00:00
Alexander Motin	7b2a8d7823	Fix RAID5 level names changed at r234603.	2012-04-27 08:49:15 +00:00
Alexander Motin	bafd0b5b0a	Fix copy-paste typo in r234603. Submitted by: kan	2012-04-23 16:35:19 +00:00
Alexander Motin	dbb2e75504	Add names for all primary RAID levels defined by DDF 2.0 specification.	2012-04-23 13:04:02 +00:00
Alexander Motin	e26083ca69	Add sos@ copyrights to RAID metadata modules, respecting his efforts in decoding metadata formats in ataraid(4) code.	2012-04-23 09:39:39 +00:00
Alexander Motin	fc1de96060	Add to GEOM RAID class module for reading non-degraded RAID5 volumes and some environment to differentiate 4 possible RAID5 on-disk layouts. Tested with Intel and AMD RAID BIOSes. MFC after: 2 weeks	2012-04-19 12:30:12 +00:00
Dmitry Morozovsky	b20e4de387	VMware environments are not unusual now. Add VMware partitions recognition (both MBR for ESXi <= 4.1 and GPT for ESXi 5) to g_part. Reviewed by: ae Approved by: ae MFC after: 2 weeks	2012-04-18 11:59:03 +00:00
Alexander Motin	63297dfd4a	Some improvements to GEOM MULTIPATH: - Implement "configure" command to allow switching operation mode of running device on-fly without destroying and recreation. - Implement Active/Read mode as hybrid of Active/Active and Active/Passive. In this mode all paths not marked FAIL may handle reads same time, but unlike Active/Active only one path handles write requests at any point in time. It allows to closer follow original write request order if above layers need it for data consistency (not waiting for requisite write completion before sending dependent write). - Hide duplicate messages about device status change. - Remove periodic thread wake up with 10Hz rate. MFC after: 2 weeks Sponsored by: iXsystems, Inc.	2012-04-18 09:42:14 +00:00
Kirk McKusick	85121b0979	Expand locking around identification of filesystem mount point when accounting for I/O counts at completion of I/O operation. Also switch from using global devmtx to vnode mutex to reduce contention. Suggested and reviewed by: kib	2012-04-08 06:20:21 +00:00
Andrey V. Elsukov	ba289b84b0	VMDB offset should be greater than logical volume size only for MBR.	2012-03-29 07:29:27 +00:00
Andrey V. Elsukov	1c45872b03	Do proper cleanup for the GPT case when an error occurs.	2012-03-29 06:37:02 +00:00
Kirk McKusick	1faacf5d09	Keep track of the mount point associated with a special device to enable the collection of counts of synchronous and asynchronous reads and writes for its associated filesystem. The counts are displayed using `mount -v'. Ensure that buffers used for paging indicate the vnode from which they are operating so that counts of paging I/O operations from the filesystem are collected. This checkin only adds the setting of the mount point for the UFS/FFS filesystem, but it would be trivial to add the setting and clearing of the mount point at filesystem mount/unmount time for other filesystems too. Reviewed by: kib	2012-03-28 20:49:11 +00:00
Andrey V. Elsukov	472794bb9f	Check that scheme is not already registered. This may happens when a KLD is preloaded with loader(8) and leads to infinity loops. Also do not return EEXIST error code from MOD_LOAD handler, because we have undocumented(?) ability replace kernel's module with preloaded one. And if we have so, then preloaded module will be initialized first. Thus error in MOD_LOAD handler will be triggered for the kernel. PR: kern/165573 MFC after: 3 weeks	2012-03-23 07:26:17 +00:00
Andrey V. Elsukov	f1104f7190	Add CTLFLAG_TUN to sysctls. MFC after: 1 month	2012-03-19 13:21:10 +00:00
Andrey V. Elsukov	37d1a121d9	Add new GEOM_PART_LDM module that implements the Logical Disk Manager scheme. The LDM is a logical volume manager for MS Windows NT and it is also known as dynamic volumes. It supports about 2000 partitions and also provides the capability for software RAID implementations. This version implements only partitioning scheme capability and based on the linux-ntfs project documentation and several publications across the Web. NOTE: JBOD, RAID0 and RAID5 volumes aren't supported. An access to the LDM metadata is read-only. When LDM is on the disk partitioned with MBR we can also destroy metadata. For the GPT partitioned disks destroy action is not supported. Reviewed by: ivoras (previous version) MFC after: 1 month	2012-03-19 13:14:44 +00:00
Andrey V. Elsukov	422783e365	Make kern.geom.part node not static. Also add CTLFLAG_TUN to the check_integrity sysctl. MFC after: 1 month	2012-03-19 12:57:52 +00:00
Andrey V. Elsukov	5284aff594	Add MODULE_DEPEND() to geom_part modules. MFC after: 2 weeks	2012-03-15 08:39:10 +00:00
Ed Maste	972f6945b8	Remove unactionable message about label geometry It's not clear to a user what they should do after seeing the "geometry does not match label" kernel message, and it does not appear to present a problem in practice. Thus, just remove the messages. Approved by: marcel	2012-03-08 01:48:44 +00:00
Andrey V. Elsukov	5357f27569	If nested scheme allows dump kernel to its partition, we may allow dump for the parent partition too. MFC after: 2 weeks	2012-02-20 06:35:52 +00:00
Andrey V. Elsukov	c3f9f306d2	Add alias for the partition type 0x0f. Now "ebr" name is used for both types 0x05 and 0x0f, but 0x05 is preferred and used when partition is created with "gpart add -t ebr ...". This should keep EBR partitions accessible after r231754 for those, who have EBR on the partition with type 0x0f.	2012-02-20 05:48:57 +00:00
Andrey V. Elsukov	3bcf7d7191	Add additional check to EBR probe and create methods: don't try probe and create EBR scheme when parent partition type is not "ebr". This fixes error messages about corrupted EBR for some partitions where is actually another partition scheme. NOTE: if you have EBR on the partition with different than "ebr" (0x05) type, then you will lost access to partitions until it will be changed. MFC after: 2 weeks	2012-02-15 10:33:29 +00:00
Andrey V. Elsukov	0d8bc07eba	Add PART::type attribute handler. It returns partition type as string. MFC after: 2 weeks	2012-02-15 10:02:19 +00:00
Andrey V. Elsukov	48ef46e55a	Add alias for the partition with type 0x42 to the MBR scheme. MFC after: 1 week	2012-02-10 09:55:18 +00:00
Andrey V. Elsukov	f44d97bd0c	Let's be more realistic and limit maximum number of partition to 4k. MFC after: 1 week	2012-02-10 06:44:30 +00:00
Konstantin Belousov	c480f781ea	Current implementations of sync(2) and syncer vnode fsync() VOP uses mnt_noasync counter to temporary remove MNTK_ASYNC mount option, which is needed to guarantee a synchronous completion of the initiated i/o before syscall or VOP return. Global removal of MNTK_ASYNC option is harmful because not only i/o started from corresponding thread becomes synchronous, but all i/o is synchronous on the filesystem which is initiated during sync(2) or syncer activity. Instead of removing MNTK_ASYNC from mnt_kern_flag, provide a local thread flag to disable async i/o for current thread only. Use the opportunity to move DOINGASYNC() macro into sys/vnode.h and consistently use it through places which tested for MNTK_ASYNC. Some testing demonstrated 60-70% improvements in run time for the metadata-intensive operations on async-mounted UFS volumes, but still with great deviation due to other reasons. Reviewed by: mckusick Tested by: scottl MFC after: 2 weeks	2012-02-06 11:04:36 +00:00
Ed Maste	23f6856fff	Correct typo in comment (numbver)	2012-02-04 18:14:39 +00:00
Andrey V. Elsukov	7b540236bb	The scheme code may not know about some inconsistency in the metadata. So, add an integrity check after recovery attempt. MFC after: 1 week	2012-02-01 09:28:16 +00:00
Attilio Rao	5d7380f8e3	Avoid to check the same cache line/variable from all the locking primitives by breaking stop_scheduler into a per-thread variable. Also, store the new td_stopsched very close to td_*locks members as they will be accessed mostly in the same codepaths as td_stopsched and this results in avoiding a further cache-line pollution, possibly. STOP_SCHEDULER() was pondered to use a new 'thread' argument, in order to take advantage of already cached curthread, but in the end there should not really be a performance benefit, while introducing a KPI breakage. In collabouration with: flo Reviewed by: avg MFC after: 3 months (or never) X-MFC: r228424	2012-01-28 14:00:21 +00:00
Nathan Whitehorn	090dd24636	Experimental support for booting CHRP-type PowerPC systems from hard disks.	2012-01-25 03:37:39 +00:00
Don Lewis	b5bad28182	Allow an MBR primary or extended Linux swap partition to be specified as the system dump device. This was already allowed for GPT. The Linux swap metadata at the beginning of the partition should not be disturbed because the crash dump is written at the end. Reviewed by: alfred, pjd, marcel MFC after: 2 weeks	2012-01-13 18:32:56 +00:00
Jim Harris	c1ad3fcf6a	Add support for >2TB disks in GEOM RAID for Intel metadata format. Reviewed by: mav Approved by: scottl MFC after: 1 week	2012-01-09 23:01:42 +00:00
Aleksandr Rybalko	ce96bb7942	GEOM_UNCOMPRESS module, can be used with uzip images and with new ulzma images. Approved by: adrian (mentor)	2012-01-04 23:39:11 +00:00
Andriy Gapon	f6ce353e58	replace uses of libkern gets with cngets MFC after: 2 months	2011-12-17 15:26:34 +00:00
Alexander Motin	a2fa37fe67	Close race between geom destruction on g_vfs_close() when softc destroyed and g_vfs_orphan() call that tries to access softc, intruced at r227015. PR: kern/162997	2011-12-02 17:09:48 +00:00
Andrey V. Elsukov	a85a0d469e	Add an ability to increase number of allocated APM entries when we have reserved free space in the APM area. Also instead of one write request per each APM entry, use MAXPHY sized writes when we are updating APM. MFC after: 1 month	2011-11-28 16:07:26 +00:00
Andrey V. Elsukov	64c4a83782	The size of APM could be bigger than number of already allocated entries. And the first usable sector should not start from the inside of APM area. MFC after: 1 month	2011-11-28 12:38:24 +00:00
Alexander Motin	107c1508fa	Temporary revert r227009 to fix freeze on UP systems without PREEMPTION. Before r215687, if some withered geom or provider could not be destroyed, g_event thread went to sleep for 0.1s before retrying. After that change it is just restarting immediately. r227009 made orphaned (withered) provider to not detach immediately, but only after context switch. That made loop inside g_event thread infinite on UP systems without PREEMPTION. To address original problem with possible dead lock addressed by r227009 we have to fix r215687 change first, that needs some time to think and test.	2011-11-14 19:32:05 +00:00
Alexander Motin	0c883cef45	Major GEOM MULTIPATH class rewrite: - Improved locking and destruction process to fix crashes. - Improved "automatic" configuration method to make it consistent and safe by reading metadata back from all specified paths after writing to one. - Added provider size check to reduce chance of ordering conflict with other GEOM classes. - Added "manual" configuration method without using on-disk metadata. - Added "add" and "remove" commands to allow manage paths manually. - Failed paths are no longer dropped from geom, but only marked as FAIL and excluded from I/O operations. - Automatically restore failed paths when all others paths are marked as failed, for example, because of device-caused (not transport) errors. - Added "fail" and "restore" commands to manually control FAIL flag. - geom is now destroyed on last path disconnection. - Added optional Active/Active mode support. Unlike Active/Passive mode, load evenly distributed between all working paths. If supported by the device, it allows to significantly improve performance, utilizing bandwidth of all paths. It is controlled by -A option during creation. Disabled by default now. - Improved `status` and `list` commands output. Sponsored by: iXsystems, inc. MFC after: 1 month	2011-11-12 09:52:27 +00:00
Ed Schouten	6472ac3d8a	Mark all SYSCTL_NODEs static that have no corresponding SYSCTL_DECLs. The SYSCTL_NODE macro defines a list that stores all child-elements of that node. If there's no SYSCTL_DECL macro anywhere else, there's no reason why it shouldn't be static.	2011-11-07 15:43:11 +00:00
Ed Schouten	d745c852be	Mark MALLOC_DEFINEs static that have no corresponding MALLOC_DECLAREs. This means that their use is restricted to a single C file.	2011-11-07 06:44:47 +00:00
Alexander Motin	ea5791d7ab	Add mutex and two flags to make orphan() call properly asynchronous: - delay consumer closing and detaching on orphan() until all I/Os complete; - prevent new I/Os submission after orphan() called. Previous implementation could destroy consumers still having active requests and worked only because of global workaround made on GEOM level.	2011-11-02 09:24:59 +00:00
Alexander Motin	755d1ea5b5	Make orphan() method in geom_dev asynchronous using destroy_dev_sched_cb() instead of destroy_dev(). It moves device destruction waiting out of the topology lock and so fixes dead lock between orphanization and closing. Real provider and geom destruction called from swi context after device destroyed as callback of the destroy_dev_sched_cb().	2011-11-01 23:12:22 +00:00
Alexander Motin	df96fd6e14	Refactor disk disconnection and geom destruction handling sequences. Do not close/destroy opened consumer directly in case of disconnect. Instead keep it existing until it will be closed in regular way in response to upstream provider destruction. Delay geom destruction in the same way. Previous implementation could destroy consumers still having active requests and worked only because of global workaround made on GEOM level.	2011-11-01 20:56:19 +00:00
Alexander Motin	0849a53fc0	Refactor disk disconnection and geom destruction handling sequences. Do not close/destroy opened consumer directly in case of disconnect. Instead keep it existing until it will be closed in regular way in response to upstream provider destruction. Delay geom destruction in the same way. Previous implementation could destroy consumers still having active requests and worked only because of global workaround made on GEOM level.	2011-11-01 17:04:42 +00:00
Alexander Motin	20a5d5dc60	Workaround the problem introduced by combination of r162200 and r215687. r162200 delays provider orphanization until all running requests complete, to workaround broken orphan() method implementation in some classes. r215687 removes persistent periodic (10Hz) event thread wake ups. Together these changes can indefinitely delay orphanization until some other event wake up the event thread. One consequence of this is inability of CAM to destroy device disconnected when busy and, as consequence, create new one after reconnection. While the best solution would be to revert r162200, it is not easy, as some classes still look broken in that way. Instead conditionally wake up event thread if there are some providers waiting for orphanization. MFC after: 1 week	2011-11-01 08:57:49 +00:00
Andrey V. Elsukov	aea26bc05a	Our geom withering function could take some time before geom with its providers and consumers will be destroyed. Before take some actions with a geom, check that it is not destroyed at the moment. Tested by: nwhitehorn MFC after: 1 week	2011-10-28 11:45:24 +00:00
Pawel Jakub Dawidek	0c879bd990	Before this change when GELI detected hardware crypto acceleration it will start only one worker thread. For software crypto it will start by default N worker threads where N is the number of available CPUs. This is not optimal if hardware crypto is AES-NI, which uses CPU for AES calculations. Change that to always start one worker thread for every available CPU. Number of worker threads per GELI provider can be easly reduced with kern.geom.eli.threads sysctl/tunable and even for software crypto it should be reduced when using more providers. While here, when number of threads exceeds number of CPUs avilable don't reduce this number, assume the user knows what he is doing. Reported by: Yuri Karaban <dev@dev97.com> MFC after: 3 days	2011-10-27 16:12:25 +00:00
Alexander Motin	733a1f3f52	Clarify disks/volumes above 2TiB support in geom_raid: - add support for volumes above 2TiB with Promise metadata format; - enforse and document other limitations: - Intel and Promise metadata formats do not support disks above 2TiB; - NVIDIA metadata format does not support volumes above 2TiB. Sponsored by: iXsystems, Inc. MFC after: 2 weeks	2011-10-26 21:50:10 +00:00
Pawel Jakub Dawidek	92f84a9fae	Allow upper layers to discover than BIO_DELETE and/or BIO_FLUSH is not supported by returning EOPNOTSUPP instead of 0 or ENODEV. MFC after: 3 days	2011-10-25 14:07:17 +00:00
Pawel Jakub Dawidek	37f0f0a75e	Improve style a bit. MFC after: 3 days	2011-10-25 14:05:39 +00:00
Pawel Jakub Dawidek	9495476273	Simplify disk_alloc(). MFC after: 3 days	2011-10-25 14:04:59 +00:00
Pawel Jakub Dawidek	1f8c92e6fa	Add support for creating GELI devices with older metadata version for use with older FreeBSD versions: - Add -V option to 'geli init' to specify version number. If no -V is given the most recent version is used. - If -V is given don't allow to use features not supported by this version. - Print version in 'geli list' output. - Update manual page and add table describing which GELI version is supported by which FreeBSD version, so one can use it when preparing GELI device for older FreeBSD version. Inspired by: Garrett Cooper <yanegomi@gmail.com> MFC after: 3 days	2011-10-25 13:57:50 +00:00
Pawel Jakub Dawidek	effb9912c7	When decoding metadata, check magic string, so we know this is not GELI device before we check its version. We don't want to report that some garbage is unsupported version if this is not even GELI provider. MFC after: 3 days	2011-10-25 13:44:23 +00:00
Pawel Jakub Dawidek	0e236b6c47	Prefer G_ELI_VERSION_* defines for version numbers over plain digits. MFC after: 3 days	2011-10-25 13:09:22 +00:00
Pawel Jakub Dawidek	038c55adcc	Fit lines into 80 chars. MFC after: 3 days	2011-10-25 13:08:03 +00:00
Pawel Jakub Dawidek	e880ff0062	When metadata is at newer version than the highest supported, return EOPNOTSUPP when decoding. MFC after: 3 days	2011-10-25 07:48:53 +00:00
Marcel Moolenaar	369fe59de8	Add support for Boot Camp. The support is defined as follows: o Detect when Boot Camp is enabled (i.e. the MBR mirrors the GPT). o When Boot Camp is enabled, update the MBR whenever we write the GPT. o Creation of a Boot Camp enabled GPT is not supported. o Automatically disable Boot Camp when the GPT has been changed so that there's either no EFI partition or no HFS+ partition. o The first 4 partitions (by index) get mirrored in the MBR. Requested by, discussed with and tested by: kris@pcbsd.org MFC after: 1 week	2011-10-23 02:51:23 +00:00
Marius Strobl	479a4ef021	Allow to dump on Solaris swap partitions. PR: 161764 Submitted by: Peter Jeremy	2011-10-18 20:16:02 +00:00
Pawel Jakub Dawidek	8d680f2cc9	Add some spare fields to the g_class and g_geom structures needed to implement direct I/O handling and provider's property changes handling.	2011-07-17 20:35:30 +00:00
Andrey V. Elsukov	0857ee8cb8	Remove include of sys/sbuf.h from geom/geom.h. sbuf support is not always required for geom/geom.h users, and no need to depend from it. PR: kern/158398	2011-07-11 10:02:27 +00:00
Andrey V. Elsukov	5d807a0e1a	Include sys/sbuf.h directly. Reviewed by: pjd	2011-07-11 05:22:31 +00:00
Kirk McKusick	8795189c98	Allow disk partitions associated with UFS read-only mounted filesystems to be opened for writing. This functionality used to be special-cased for just the root filesystem, but with this change is now available for all UFS filesystems. This change is needed for journaled soft updates recovery. Discussed with: Jeff Roberson	2011-07-10 00:41:31 +00:00
Andrey V. Elsukov	2b9be05588	Initialize elements of state array when creating the GPT table. This fixes the problem, when the secondary GPT header is not erased when partition table destroyed. Move equal operations from g_part_gpt_create and g_part_gpt_recover to the separate function g_gpt_set_defaults. Reported by: dwhite MFC after: 1 week	2011-06-29 05:41:14 +00:00
Andrey V. Elsukov	671dfdbf11	EBR could contain an early stage of boot code. But we do not support it. Remove message about non empty bootcode, we can not break something while GEOM_PART_EBR_COMPAT is defined. But without GEOM_PART_EBR_COMPAT any changes in EBR are allowed and we can accidentally wipe the boot code. To do not break anything save the first EBR chunk and keep it untouched each time when we are changing EBR. Note that we are still not support boot code for EBR. PR: kern/141235 MFC after: 1 month	2011-06-27 12:42:48 +00:00
Andrey V. Elsukov	61162e857a	MS Windows NT+ uses 4 bytes at offset 0x1b8 in the MBR to identify disk drive. The boot0cfg(8) utility preserves these 4 bytes when is writing bootcode to keep a multiboot ability. Change gpart's bootcode method to keep DSN if it is not zero. Also do not allow writing bootcode with size not equal to MBRSIZE. PR: kern/157819 Tested by: Eir Nym MFC after: 1 month	2011-06-27 10:42:06 +00:00
Andrey V. Elsukov	503e6682cd	Change the way how we update bootcode for BSD scheme. Since the only parameter that we check is size of bootcode, then allow only two sizes: size of boot1 and size of /boot/boot. This partially protects users from losing ability to boot if incorrect bootcode is specified. Requested by: ru	2011-06-20 12:22:30 +00:00
Justin T. Gibbs	416494d7c9	Plumb device physical path reporting from CAM devices, through GEOM and DEVFS, and make it accessible via the diskinfo utility. Extend GEOM's generic attribute query mechanism into generic disk consumers. sys/geom/geom_disk.c: sys/geom/geom_disk.h: sys/cam/scsi/scsi_da.c: sys/cam/ata/ata_da.c: - Allow disk providers to implement a new method which can override the default BIO_GETATTR response, d_getattr(struct bio *). This function returns -1 if not handled, otherwise it returns 0 or an errno to be passed to g_io_deliver(). sys/cam/scsi/scsi_da.c: sys/cam/ata/ata_da.c: - Don't copy the serial number to dp->d_ident anymore, as the CAM XPT is now responsible for returning this information via d_getattr()->(a)dagetattr()->xpt_getatr(). sys/geom/geom_dev.c: - Implement a new ioctl, DIOCGPHYSPATH, which returns the GEOM attribute "GEOM::physpath", if possible. If the attribute request returns a zero-length string, ENOENT is returned. usr.sbin/diskinfo/diskinfo.c: - If the DIOCGPHYSPATH ioctl is successful, report physical path data when diskinfo is executed with the '-v' option. Submitted by: will Reviewed by: gibbs Sponsored by: Spectra Logic Corporation Add generic attribute change notification support to GEOM. sys/sys/geom/geom.h: Add a new attrchanged method field to both g_class and g_geom. sys/sys/geom/geom.h: sys/geom/geom_event.c: - Provide the g_attr_changed() function that providers can use to advertise attribute changes. - Perform delivery of attribute change notifications from a thread context via the standard GEOM event mechanism. sys/geom/geom_subr.c: Inherit the attrchanged method from class to geom (class instance). sys/geom/geom_disk.c: Provide disk_attr_changed() to provide g_attr_changed() access to consumers of the disk API. sys/cam/scsi/scsi_pass.c: sys/cam/scsi/scsi_da.c: sys/geom/geom_dev.c: sys/geom/geom_disk.c: Use attribute changed events to track updates to physical path information. sys/cam/scsi/scsi_da.c: Add AC_ADVINFO_CHANGED to the registered asynchronous CAM events for this driver. When this event occurs, and the updated buffer type references our physical path attribute, emit a GEOM attribute changed event via the disk_attr_changed() API. sys/cam/scsi/scsi_pass.c: Add AC_ADVINFO_CHANGED to the registered asynchronous CAM events for this driver. When this event occurs, update the physical patch devfs alias for this pass instance. Submitted by: gibbs Sponsored by: Spectra Logic Corporation	2011-06-14 17:10:32 +00:00
Attilio Rao	d7073a2b3b	MFC	2011-06-03 17:09:15 +00:00
Alexander Motin	0330cb3bf7	Update disk's stripesize and stripeoffset parameters on provider open. They are media-dependent and may change in run-time, same as sectorsize and/or mediasize. SCSI devices return physical sector size and offset via READ CAPACITY(16) command and so can not report it until media inserted or at least until probe sequence completed. UNMAP support is also reported there.	2011-06-03 13:49:18 +00:00
Andrey V. Elsukov	38c64884ff	Add diagnostic message about not aligned partitions. Idea from: ivoras	2011-06-03 06:58:24 +00:00
Attilio Rao	3bf1ec3a9a	MFC	2011-06-02 14:09:30 +00:00
Andrey V. Elsukov	d15033b3f8	Do not hide stripeoffset from libgeom(3), it may be useful even when stripesize is zero. MFC after: 1 week	2011-06-02 12:49:45 +00:00
Attilio Rao	9cb46334ee	MFC	2011-05-27 16:09:10 +00:00
Andrey V. Elsukov	9854b4eeee	Some partitioning tools may have a different opinion about disk geometry and partitions may start from withing the first track. If we found such partitions, then do not reserve space of the first track, only first sector.	2011-05-27 06:37:42 +00:00
Attilio Rao	7fcdc9a26f	MFC	2011-05-26 17:38:00 +00:00
Andrey V. Elsukov	ceef8f2477	Prevent non-aligned reading from provider while tasting. Reject providers with unsupported sectorsize. Reported by: Joerg Wunsch MFC after: 1 week	2011-05-25 11:14:26 +00:00
Andrey V. Elsukov	6fd1e2e013	Do not truncate available disk space to the closest track boundary.	2011-05-25 09:45:13 +00:00
Andrey V. Elsukov	23a3490034	Do not truncate available disk space to the closest track boundary.	2011-05-25 09:38:12 +00:00
Andrey V. Elsukov	db48d4a92e	Do not truncate available disk space to the closest track boundary.	2011-05-25 09:32:19 +00:00
Andrey V. Elsukov	49d12fd5be	Remove unused variable. MFC after: 1 week	2011-05-24 06:46:07 +00:00
Andrey V. Elsukov	e471361279	Remove unused variable. MFC after: 1 week	2011-05-24 06:44:16 +00:00
Attilio Rao	3ac3f6002b	MFC	2011-05-23 23:58:02 +00:00
Pawel Jakub Dawidek	204a4e196a	Recognize BIO_FLUSH requests and pass them to userland. MFC after: 1 week	2011-05-23 21:00:37 +00:00
Attilio Rao	7e7a34e520	MFC	2011-05-16 16:34:03 +00:00
Andrey V. Elsukov	d0c8ecb812	Make diagnostic messages more specific. With bootverbose print out all inconsistencies of integrity in the partition table, not first found only. Requested by: kib	2011-05-16 15:59:50 +00:00
Andrey V. Elsukov	b6c4978f6f	Add diagnostic messages for integrity checks.	2011-05-16 12:00:32 +00:00
Andrey V. Elsukov	6e81b75a3c	Add a sysctl kern.geom.part.check_integrity for those who has corrupt partition tables and lost an ability to boot after r221788. Also unhide an error message from bootverbose, this would help to easier determine the problem.	2011-05-15 20:03:54 +00:00
Attilio Rao	447274a88b	MFC	2011-05-15 15:47:16 +00:00
Mikolaj Golub	76cc7f6dd6	Fix a memory leak possible in g_eli_key_allocate() if the key with the same keyno is added while we aren't holding the lock. Approved by: pjd (mentor) MFC after: 1 week	2011-05-15 12:39:30 +00:00
Attilio Rao	ef607a6aa3	MFC	2011-05-12 14:01:40 +00:00
Andrew Thompson	b2901e999b	Move the three geom kprocs as threads under a single pid. Reviewed by: julian	2011-05-11 21:47:30 +00:00
Andrey V. Elsukov	c63e8fe201	Add basic metadata integrity check. In case when partition table was probed and read successfull, but it contains invalid values (e.g. overlapped partitions, offset or size is out of bounds), then table will be rejected. MFC after: 1 month	2011-05-11 19:59:43 +00:00
Attilio Rao	521bd6b433	MFC	2011-05-08 14:56:02 +00:00
Andrey V. Elsukov	f30b6bcb60	Limit number of sectors that can be addressed. MFC after: 1 week	2011-05-08 12:28:13 +00:00
Andrey V. Elsukov	284a82d0bb	Limit number of sectors that can be addressed. MFC after: 1 week	2011-05-08 12:20:30 +00:00
Andrey V. Elsukov	6017ae3fdd	Limit number of sectors that can be addressed. Reject table if blkcount from metadata is greater than provider.	2011-05-08 12:16:39 +00:00
Andrey V. Elsukov	2920db1713	Limit number of sectors that can be addressed. MFC after: 1 week	2011-05-08 12:11:16 +00:00
Andrey V. Elsukov	4675b2b65f	Replace UINT_MAX to UINT32_MAX. Pointed out by: kib MFC after: 1 week	2011-05-08 11:42:51 +00:00
Andrey V. Elsukov	ab0ffb4c88	Limit number of sectors that can be addressed. MFC after: 1 week	2011-05-08 11:20:27 +00:00
Andrey V. Elsukov	cfbdf6c3c5	Limit number of sectors that can be addressed. MFC after: 1 week	2011-05-08 11:16:17 +00:00
Pawel Jakub Dawidek	a1f4a8c447	Export GELI class version via sysctl kern.geom.eli.version. MFC after: 1 week	2011-05-08 09:29:21 +00:00
Pawel Jakub Dawidek	731adc8682	Version 6 is compatible with version 5 when it comes to control commands. MFC after: 1 week	2011-05-08 09:25:54 +00:00
Pawel Jakub Dawidek	964d172cbe	Detect and handle metadata of version 6. MFC after: 1 week	2011-05-08 09:25:16 +00:00
Pawel Jakub Dawidek	ad0a523639	When support for multiple encryption keys was committed, GELI integrity mode was not updated to pass CRD_F_KEY_EXPLICIT flag to opencrypto. This resulted in always using first key. We need to support providers created with this bug, so set special G_ELI_FLAG_FIRST_KEY flag for GELI provider in integrity mode with version smaller than 6 and pass the CRD_F_KEY_EXPLICIT flag to opencrypto only if G_ELI_FLAG_FIRST_KEY doesn't exist. Reported by: Anton Yuzhaninov <citrin@citrin.ru> MFC after: 1 week	2011-05-08 09:17:56 +00:00
Pawel Jakub Dawidek	9d644a4032	Remove prototype for a function that no longer exist. MFC after: 1 week	2011-05-08 09:11:04 +00:00
Pawel Jakub Dawidek	937959f0a7	Drop proper key. MFC after: 1 week	2011-05-08 09:09:49 +00:00
Pawel Jakub Dawidek	9104c920b4	Add magic field to the g_eli_key structure to detect if we are really operating on proper structures. MFC after: 1 week	2011-05-08 09:08:50 +00:00
Attilio Rao	aa8b9e0706	MFC	2011-05-06 22:45:33 +00:00
Adrian Chadd	c60fd25d34	Updates to geom_map from the author. The major update here is to support 64 bit size/offsets. There's also style related changes. Submitted by: ray@dlink.ua	2011-05-05 14:43:09 +00:00
Attilio Rao	71a19bdc64	Commit the support for removing cpumask_t and replacing it directly with cpuset_t objects. That is going to offer the underlying support for a simple bump of MAXCPU and then support for number of cpus > 32 (as it is today). Right now, cpumask_t is an int, 32 bits on all our supported architecture. cpumask_t on the other side is implemented as an array of longs, and easilly extendible by definition. The architectures touched by this commit are the following: - amd64 - i386 - pc98 - arm - ia64 - XEN while the others are still missing. Userland is believed to be fully converted with the changes contained here. Some technical notes: - This commit may be considered an ABI nop for all the architectures different from amd64 and ia64 (and sparc64 in the future) - per-cpu members, which are now converted to cpuset_t, needs to be accessed avoiding migration, because the size of cpuset_t should be considered unknown - size of cpuset_t objects is different from kernel and userland (this is primirally done in order to leave some more space in userland to cope with KBI extensions). If you need to access kernel cpuset_t from the userland please refer to example in this patch on how to do that correctly (kgdb may be a good source, for example). - Support for other architectures is going to be added soon - Only MAXCPU for amd64 is bumped now The patch has been tested by sbruno and Nicholas Esborn on opteron 4 x 12 pack CPUs. More testing on big SMP is expected to came soon. pluknet tested the patch with his 8-ways on both amd64 and i386. Tested by: pluknet, sbruno, gianni, Nicholas Esborn Reviewed by: jeff, jhb, sbruno	2011-05-05 14:39:14 +00:00
Andrey V. Elsukov	9a7defbda0	Remove unneeded code. MFC after: 1 week	2011-05-04 18:41:26 +00:00
Andrey V. Elsukov	eb8e9abe72	Remove unneeded code. MFC after: 1 week	2011-05-04 18:26:45 +00:00
Andrey V. Elsukov	ceb1c69a84	Remove unneeded code. MFC after: 1 week	2011-05-04 18:17:21 +00:00
Andrey V. Elsukov	2fbefe4829	Removed KASSERT, g_new_providerf() can not fail. MFC after: 1 week	2011-05-04 18:06:40 +00:00
Andrey V. Elsukov	c211af0352	Remove "for a moment" assignment. struct g_geom zeroed when allocated. MFC after: 1 week	2011-05-04 17:56:53 +00:00
Andrey V. Elsukov	e62dffbf5d	Remove unneeded checks, g_new_xxx functions can not fail. MFC after: 1 week	2011-05-04 17:37:37 +00:00
Andrey V. Elsukov	370efd743a	When checking existence of providers skip those which are orphaned. PR: kern/132273 MFC after: 2 week	2011-05-04 12:59:11 +00:00
Alexander Motin	bd5c368604	Use make_dev_alias_p() added in r221397 to create alias dev entry. It removes panic in case if alias name is already busy for some reason.	2011-05-03 19:12:42 +00:00
Alexander Motin	90f2be2430	Implement relaxed comparision for hardcoded provider names to make it ignore adX/adaY difference in both directions to simplify migration to the CAM-based ATA or back.	2011-04-27 00:10:26 +00:00
Alexander Motin	0d307e0905	- Add shim to simplify migration to the CAM-based ATA. For each new adaX device in /dev/ create symbolic link with adY name, trying to mimic old ATA numbering. Imitation is not complete, but should be enough in most cases to mount file systems without touching /etc/fstab. - To know what behavior to mimic, restore ATA_STATIC_ID option in cases where it was present before. - Add some more details to UPDATING.	2011-04-26 17:01:49 +00:00
Pawel Jakub Dawidek	16a174b5c5	One key is expected from providers smaller than or equal to (2^20)*sectorsize bytes. Remove bogus assertion and while here remove another too obvious assertion. Reported by: Fabian Keil <freebsd-listen@fabiankeil.de> MFC after: 2 weeks	2011-04-24 10:41:13 +00:00
Pawel Jakub Dawidek	5bd8adc750	If number of keys for the given provider doesn't exceed the limit, allocate all of them at attach time. This allows to avoid moving keys around in the most-recently-used queue and needs no mutex synchronization nor refcounting. MFC after: 2 weeks	2011-04-21 13:35:20 +00:00
Pawel Jakub Dawidek	1e09ff3dc3	Instead of allocating memory for all the keys at device attach, create reasonably large cache for the keys that is filled when needed. The previous version was problematic for very large providers (hundreds of terabytes or serval petabytes). Every terabyte of data needs around 256kB for keys. Make the default cache limit big enough to fit all the keys needed for 4TB providers, which will eat at most 1MB of memory. MFC after: 2 weeks	2011-04-21 13:31:43 +00:00
Alexander Motin	fe51d6c1d1	Reduce geom_raid log verbosity.	2011-04-18 16:15:59 +00:00
Gavin Atkinson	47bae5fd09	Remove an incorrect be16toh() that prevented geom_part_apm from working on little-endian machines. Reviewed by: marcel MFC after: 2 weeks	2011-04-15 12:32:52 +00:00
Adrian Chadd	27afdbaa51	Introduce geom_map, a GEOM provider designed for use by embedded flash stores. Some devices - notably those with uboot - don't have an explicit partition table (eg like Redboot's FIS.) geom_map thus provides an easy way to export the hard-coded flash layout as geom providers for use by filesystems and other tools. It also includes a "search" function which allows for dynamic creation of partition layouts where the device only has a single hard-coded partition. For example, if there is a "kernel+rootfs" partition, a single image can be created which appends the rootfs after the kernel with an appropriate search string. geom_map can be told to search for said search string and create a partition beginning after it. Submitted by: Aleksandr Rybalko <ray@dlink.ua>	2011-04-12 08:10:25 +00:00
Mikolaj Golub	90574b0a79	In g_eli_read_done() and g_eli_write_done(), for a bio with bio_children > 1, g_destroy_bio() is never called and the bio leaks. Fix this by calling g_destroy_bio() earlier, before the check. Submitted by: Victor Balada Diaz <victor@bsdes.net> (initial version) Approved by: pjd (mentor) MFC after: 1 week	2011-04-03 17:38:12 +00:00
Pawel Jakub Dawidek	63a6c5c12b	GEOM has an internal mechanism to deal with ENOMEM errors returned via g_io_deliver(). In such case it increases 'pace' counter on each ENOMEM and reschedules the request. The 'pace' counter is decreased for each request going down, but until 'pace' is greater than zero, GEOM will handle at most 10 requests per second. For GEOM GATE users that are proxy to local GEOM providers (like ggatel(8) and HAST) we can end up with almost permanent slow down of GEOM down queue. This is because once we reach GEOM GATE queue limit, we return ENOMEM to the GEOM. This means that we have, eg. 1024 I/O requests in the GEOM GATE queue. To make room in the queue and stop returning ENOMEM we need to proceed the requests of course, but those requests are handled by userland daemons that handle them by reading/writing also from/to local GEOM providers. For example with HAST, a new requests comes to /dev/hast/data, which is GEOM GATE provider. GEOM GATE passes the request to hastd(8) and hastd(8) reads/writes from/to /dev/da0. Once we reach GEOM GATE queue limit, to free up a slot in GEOM GATE queue, hastd(8) has to read/write from/to /dev/da0, but this request will also be very slow, because GEOM now slows down all the requests. We end up with full queue that we can unload at the speed of 10 requests per second. This simply looks like a deadlock. Fix it by allowing userland daemons that work with both GEOM GATE and local GEOM providers to specify unlimited queue size, so GEOM GATE will never return ENOMEM to the GEOM. MFC after: 1 week	2011-04-02 06:56:06 +00:00
Alexander Motin	14e2cd0a00	Bunch of small bugfixes and cleanups. Found with: Clang Static Analyzer	2011-03-31 16:19:53 +00:00
Alexander Motin	636076752a	Bunch of small bugfixes and cleanups. Found with: Coverity Prevent(tm) CID: 9656, 9658, 9693, 9705, 9706, 9707, 9808, 9809, 9810, 9711, 9712, 9713, 9714	2011-03-31 16:14:35 +00:00
Andrey V. Elsukov	53ff3d1e9c	Remove unneeded checks, g_new_xxx functions can not return NULL. Reviewed by: pjd MFC after: 1 week	2011-03-31 06:30:59 +00:00
Mikolaj Golub	bd119384c7	Increase debug level on g_gate device destruction and add message on device creation. Suggested by: danger Approved by: pjd (mentor) MFC after: 3 days	2011-03-30 21:40:14 +00:00
Mikolaj Golub	baf63f65ae	In g_gate_create() there is a window between when g_gate_softc is registered in g_gate_units array and when its sc_provider field is filled. If during this period g_gate_units is accessed by another thread that is checking for provider name collision the crash is possible. Fix this by adding sc_name field to struct g_gate_softc. In g_gate_create() when g_gate_softc is created but sc_provider is still not sc_name points to provider name stored in the local array. Approved by: pjd (mentor) Reported by: Freddie Cash <fjwcash@gmail.com> MFC after: 1 week	2011-03-27 19:56:55 +00:00
Alexander Motin	89b172238a	MFgraid/head: Add new RAID GEOM class, that is going to replace ataraid(4) in supporting various BIOS-based software RAIDs. Unlike ataraid(4) this implementation does not depend on legacy ata(4) subsystem and can be used with any disk drivers, including new CAM-based ones (ahci(4), siis(4), mvs(4), ata(4) with `options ATA_CAM`). To make code more readable and extensible, this implementation follows modular design, including core part and two sets of modules, implementing support for different metadata formats and RAID levels. Support for such popular metadata formats is now implemented: Intel, JMicron, NVIDIA, Promise (also used by AMD/ATI) and SiliconImage. Such RAID levels are now supported: RAID0, RAID1, RAID1E, RAID10, SINGLE, CONCAT. For any all of these RAID levels and metadata formats this class supports full cycle of volume operations: reading, writing, creation, deletion, disk removal and insertion, rebuilding, dirty shutdown detection and resynchronization, bad sector recovery, faulty disks tracking, hot-spare disks. For Intel and Promise formats there is support multiple volumes per disk set. Look graid(8) manual page for additional details. Co-authored by: imp Sponsored by: Cisco Systems, Inc. and iXsystems, Inc.	2011-03-24 21:31:32 +00:00
Alexander Motin	c6d4ed3a32	MFgraid/head r218212, r218257: Introduce new type of BIO_GETATTR -- GEOM::setstate, used to inform lower GEOM about state of it's providers from the point of upper layers. Make geom_disk use led(4) subsystem to illuminate states in such fashion: FAILED - "1" (on), REBUILD - "f5" (slow blink), RESYNC - "f1" (fast blink), ACTIVE - "0" (off). LED name should be set for each disk via kern.geom.disk.%s.led sysctl. Later disk API could be extended to allow disk driver to report this info in custom way via it's own facilities.	2011-03-24 19:23:42 +00:00
Alexander Motin	06f4c96d39	MFgraid/head r217827: Change BIO_GETATTR("GEOM::kerneldump") API to make set_dumper() called by consumer (geom_dev) instead of provider (geom_disk). This allows any geom insert it's code into the dump call chain, implementing more sophisticated functionality then just disk partitioning.	2011-03-24 08:37:48 +00:00
Maxim Sobolev	20cc2dc42e	Some linux distros put mount point into the ext2fs labels, such as '/', or '/boot', which confuses the devfs code and can cause userland programs to fail reading /dev/ext2fs directory with weird error code, such as any program that uses pwlib. Strip any leading slashes before feeding the label to the geom_label code. Sponsored by: Sippy Software, Inc. MFC after: 1 week	2011-03-08 17:00:31 +00:00
Nathan Whitehorn	65cb6238bd	Add the disk ident and a human-meaningful description (here, the disk model string) to the geom_disk config XML so that they are easily accessible from userland. MFC after: 1 week	2011-02-26 14:58:54 +00:00
Alexander Leidinger	cb08c2cc83	Add some FEATURE macros for various GEOM classes. No FreeBSD version bump, the userland application to query the features will be committed last and can serve as an indication of the availablility if needed. Sponsored by: Google Summer of Code 2010 Submitted by: kibab Reviewed by: silence on geom@ during 2 weeks X-MFC after: to be determined in last commit with code from this project	2011-02-25 10:24:35 +00:00
Rebecca Cran	6bccea7c2b	Fix typos - remove duplicate "the". PR: bin/154928 Submitted by: Eitan Adler <lists at eitanadler.com> MFC after: 3 days	2011-02-21 09:01:34 +00:00
Yoshihiro Takahashi	9f0f6d5fd7	Add support to set a slice name.	2011-02-19 11:09:38 +00:00
Luigi Rizzo	67c1af9d00	Correct a subtle bug in the 'gsched_rr' disk scheduler. The algorithm is supposed to work as follows: in order to prevent starvation, when a new client starts being served we record the start time and reset the counter of bytes served. We then switch to a new client after a certain amount of time or bytes, even if the current one still has pending requests. To avoid charging a new client the time of the first seek, we start counting time when the first request is served. Unfortunately a bug in the previous version of the code failed to set the start time in certain cases, resulting in some processes exceeding their timeslice. The fix (in this patch) is trivial, though it took a while to find out and replicate the bug. Thanks to Tommaso Caprai for investigating and fixing the problem. Submitted by: Tommaso Caprai MFC after: 1 week	2011-02-14 08:09:02 +00:00
Marcel Moolenaar	1e189c0839	Use the preload_fetch_addr() and preload_fetch_size() convenience functions to obtain the address and size of the preloaded key files. Sponsored by: Juniper Networks.	2011-02-13 19:34:48 +00:00
Yoshihiro Takahashi	5d627bb558	Add support to write boot menu.	2011-02-11 13:18:00 +00:00
Andrey V. Elsukov	88007f6102	Add new user-friendly aliases for partition types for the MBR and EBR schemes: fat32, ebr, linux-data, linux-raid, linux-swap and linux-lvm. Add bios-boot GUID and alias for the GPT scheme. It used by GRUB 2 loader. Also do sorting definitions of types in diskmbr.h and in g_part.c. PR: bin/120990, kern/147664 MFC after: 2 weeks	2011-01-28 11:13:01 +00:00
Andrey V. Elsukov	1313160649	While inspecting the disklabel check that start offset of partition is within provider's bounds. If not then reject this disklabel. Mark bbarea as NULL to do not free it again in destroy method. MFC after: 1 week	2011-01-27 08:02:26 +00:00
Matthew D Fleming	73d6f8516d	Remove the CTLFLAG_NOLOCK as it seems to be both unused and unfunctional. Wiring the user buffer has only been done explicitly since r101422. Mark the kern.disks sysctl as MPSAFE since it is and it seems to have been mis-using the NOLOCK flag. Partially break the KPI (but not the KBI) for the sysctl_req 'lock' field since this member should be private and the "REQ_LOCKED" state seems meaningless now.	2011-01-26 22:48:09 +00:00

1 2 3 4 5 ...

1791 Commits