freebsd-skq

Author	SHA1	Message	Date
ed	0c56cf839d	Mark all SYSCTL_NODEs static that have no corresponding SYSCTL_DECLs. The SYSCTL_NODE macro defines a list that stores all child-elements of that node. If there's no SYSCTL_DECL macro anywhere else, there's no reason why it shouldn't be static.	2011-11-07 15:43:11 +00:00
ae	972deb0b1f	Include sys/sbuf.h directly. Reviewed by: pjd	2011-07-11 05:22:31 +00:00
mav	79493720f3	Implement relaxed comparision for hardcoded provider names to make it ignore adX/adaY difference in both directions to simplify migration to the CAM-based ATA or back.	2011-04-27 00:10:26 +00:00
netchild	6bf702a55b	Add some FEATURE macros for various GEOM classes. No FreeBSD version bump, the userland application to query the features will be committed last and can serve as an indication of the availablility if needed. Sponsored by: Google Summer of Code 2010 Submitted by: kibab Reviewed by: silence on geom@ during 2 weeks X-MFC after: to be determined in last commit with code from this project	2011-02-25 10:24:35 +00:00
pjd	6f96b7c228	- Allow to specify value as const pointers. - Make optional string values always an empty string.	2010-09-13 08:56:07 +00:00
mav	95a3f24d16	Remove bintime_cmp() function, unused since r200086. MFC after: 1 week	2010-08-18 15:38:10 +00:00
mav	1073c59bbd	Move wakeup() out of mutex to reduce contention.	2010-01-05 10:30:56 +00:00
mav	e6038335e8	As soon as mirror has no own stripes, report largest stripe of unrerlying components, hoping others fit, if they are not equal.	2009-12-24 12:17:22 +00:00
mav	f018f4f599	Change 'load' balancing mode algorithm: - Instead of measuring last request execution time for each drive and choosing one with smallest time, use averaged number of requests, running on each drive. This information is more accurate and timely. It allows to distribute load between drives in more even and predictable way. - For each drive track offset of the last submitted request. If new request offset matches previous one or close for some drive, prefer that drive. It allows to significantly speedup simultaneous sequential reads. PR: kern/113885 Reviewed by: sobomax	2009-12-03 21:47:51 +00:00
pjd	e816f77286	Add support for changing providers priority. Submitted by: Mel Flynn	2009-09-06 06:52:06 +00:00
thompsa	39714cb212	Revert r190676,190677 The geom and CAM changes for root_hold are the wrong solution for USB design quirks. Requested by: scottl	2009-04-10 04:08:34 +00:00
thompsa	fe5458f665	Add a how argument to root_mount_hold() so it can be passed NOWAIT and be called in situations where sleeping isnt allowed.	2009-04-03 19:46:12 +00:00
julian	51d643caa6	Rename the kthread_xxx (e.g. kthread_create()) calls to kproc_xxx as they actually make whole processes. Thos makes way for us to add REAL kthread_create() and friends that actually make theads. it turns out that most of these calls actually end up being moved back to the thread version when it's added. but we need to make this cosmetic change first. I'd LOVE to do this rename in 7.0 so that we can eventually MFC the new kthread_xxx() calls.	2007-10-20 23:23:23 +00:00
jeff	91d1501790	Commit 14/14 of sched_lock decomposition. - Use thread_lock() rather than sched_lock for per-thread scheduling sychronization. - Use the per-process spinlock rather than the sched_lock for per-process scheduling synchronization. Tested by: kris, current@ Tested on: i386, amd64, ULE, 4BSD, libthr, libkse, PREEMPTION, etc. Discussed with: kris, attilio, kmacy, jhb, julian, bde (small parts each)	2007-06-05 00:00:57 +00:00
pjd	b34fb80d83	Now, that we have gjournal in the tree add possibility to configure gmirror and graid3 in a way that it is not resynchronized after a power failure or system crash. It is safe when gjournal is running on top of gmirror/graid3.	2006-11-01 22:51:49 +00:00
pjd	c33849dc41	Implement BIO_FLUSH handling by simply passing it down to the components. Sponsored by: home.pl	2006-10-31 21:23:51 +00:00
pjd	610c4b7a06	Fix synchronization in gmirror and graid3 which I broken. Synchronization request can still have bio_to set to sc_provider (this is READ part of a synchronization request) and in this case g_{mirror,raid3}_sync() wasn't called as it should be. MFC after: 1 week	2006-09-13 15:46:49 +00:00
jmg	ecd9e77d3e	move created/detected/activated under debug level 1 to quiet the common case.. add count of active and total components to the launched line so you can see at a glance if your mirror/raid3 is complete... now: GEOM_MIRROR: Device mirror/sam launched (2/2). Reviewed by: pjd	2006-09-09 21:45:37 +00:00
pjd	b539048d28	Not only a request from us can be passed to g_{mirror,raid3}_worker() function, but also a request to us, in which case checking bio_cflags is wrong, because the class above us is controling it, not we. MFC after: 1 week	2006-08-09 09:41:53 +00:00
yar	209e4786e7	Commit the results of the typo hunt by Darren Pilgrim. This change affects documentation and comments only, no real code involved. PR: misc/101245 Submitted by: Darren Pilgrim <darren pilgrim bitfreak org> Tested by: md5(1) MFC after: 1 week	2006-08-04 07:56:35 +00:00
pjd	38bff79a10	Don't use f-word in comments. We are gentlemans. Pointed out by: Maciej Sobczak	2006-08-01 23:17:33 +00:00
pjd	27c2ca3212	Always allow to specify components with /dev/ prefix. MFC after: 3 days	2006-07-13 20:37:59 +00:00
pjd	ee41eea403	Use proper defines instead of magic values. MFC after: 1 week	2006-07-10 21:18:00 +00:00
pjd	444d196b29	Allow to close access even if device is already destroyed. Reported by: Ulrich Spoerlein <uspoerlein@gmail.com> PR: kern/98093 MFC after: 1 week	2006-07-03 10:32:38 +00:00
pjd	c930d9ab2f	- Remove dead code. - Comment possible event miss, which isn't critical, but probably can be fixed by replacing the event lock usage with the queue lock. MFC after: 2 weeks	2006-04-28 12:13:49 +00:00
pjd	f430b234fb	Be sure to not destroy device twice. This is not possible in theory, but with this change there is even no theoretical race. MFC after: 2 weeks	2006-04-28 11:47:28 +00:00
pjd	d7eb5b2fe9	Introduce and use delayed-destruction functionality from a pre-sync hook, which means that devices will be destroyed on last close. This fixes destruction order problems when, eg. RAID3 array is build on top of RAID1 arrays. Requested, reviewed and tested by: ru MFC after: 2 weeks	2006-04-10 10:32:22 +00:00
pjd	568ba3bc0f	- 'ndisks' variable is not boolean, so compare it with a value. - Keep conditions order consistent with the comment above. MFC after: 3 days	2006-03-30 12:15:41 +00:00
pjd	ba3414666e	Update copyright for 2006.	2006-03-19 12:55:51 +00:00
pjd	fadb519311	kern.geom.mirror.sync_requests=2 seems to be a better default - it still keeps disks very busy, but makes system much more responsive. While here, kill extra space.	2006-03-19 10:49:05 +00:00
ru	9348187bf1	Fix build on 64-bit platforms.	2006-03-13 14:48:45 +00:00
pjd	11cbb2f275	- Speed up synchronization process by using configurable number of I/O requests in parallel. + Add kern.geom.mirror.sync_requests tunable which defines how many parallel I/O requests should be used. + Retire kern.geom.mirror.reqs_per_sync and kern.geom.mirror.syncs_per_sec sysctls. - Fix race between regular and synchronization requests. - Reimplement mirror's data synchronization - do not use the topology lock for this purpose, as it may case deadlocks. - Stop synchronization from pre-sync hook. - Fix some other minor issues. MFC after: 3 days	2006-03-13 00:58:41 +00:00
pjd	f0925fcaf9	When inserting a new component md_provsize metadata field wasn't set, which means that old problem was triggered (when two providers end at the same offset, eg. ad0 and ad0s1 and the wrong was is picked up by gmirror/graid3). Reported by: Michal Suszko <dry@dry.pl> MFC after: 3 days	2006-03-10 07:41:31 +00:00
pjd	1c595687a8	Allow to dump kernel to gmirror providers. Some conditions have to be met to make it work properly. This will be described in the manual page. MFC after: 3 days	2006-03-08 08:27:33 +00:00
pjd	dded50a417	On component state change to ACTIVE don't forget to update metadata. MFC after: 3 days	2006-02-12 17:38:09 +00:00
pjd	a9a29a4821	Use time_uptime instead of time_second, as the latter may go backwards. Suggested by: ru MFC after: 3 days	2006-02-12 17:36:09 +00:00
pjd	392d25e4bc	- Add kern.geom.mirror.disconnect_on_failure sysctl/tunnable (default to 1 to preserve currect behaviour). When set to 0, components are not disconnected - gmirror will try to still use them (only first error will be logged). This is helpful when we have two broken components, but in different places, so actually all data is available. Such buggy component will be visible in 'gmirror list' output with flag BROKEN. - Never disconnect the last valid component. If we detect errors there we will just pass them up. This wasn't reasonable to deny access to the whole provider because of one broken sector. Prodded by: ru MFC after: 3 days	2006-02-11 17:39:29 +00:00
pjd	ef80617741	Mark array as CLEAN when there are no write requests in kern.geom.mirror.idletime seconds. Write, not any requests. Mark array as clean immediatelly on last write close. Prodded by: ru MFC after: 3 days	2006-02-11 14:42:23 +00:00
pjd	6f074b6d64	Remove trailing spaces.	2006-02-01 12:06:01 +00:00
pjd	82bba3d6cd	Remove dead code. Found by: Coverity Prevent(tm) Coverity ID: CID104 MFC after: 3 days	2006-01-18 21:42:19 +00:00
sobomax	29543921ea	Check for g_read_data(9) errors properly: o The only indication of error condition is NULL value returned by the function; o value pointed to by error argument is undefined in the case when operation completes successfully. Discussed with: phk	2005-11-30 19:24:51 +00:00
rwatson	be4f357149	Normalize a significant number of kernel malloc type names: - Prefer '_' to ' ', as it results in more easily parsed results in memory monitoring tools such as vmstat. - Remove punctuation that is incompatible with using memory type names as file names, such as '/' characters. - Disambiguate some collisions by adding subsystem prefixes to some memory types. - Generally prefer lower case to upper case. - If the same type is defined in multiple architecture directories, attempt to use the same name in additional cases. Not all instances were caught in this change, so more work is required to finish this conversion. Similar changes are required for UMA zone names.	2005-10-31 15:41:29 +00:00
pjd	b1db94ccb3	After provider creation!!	2005-05-25 15:54:17 +00:00
pjd	f5dbb79246	- Call root_mount_rel() when provider IS created, not earlier. This should close the race observed by Daniel Eriksson. - Remove redundant wakeup().	2005-05-25 13:10:04 +00:00
pjd	ae767d0b06	Add some debug code to diagnose root-on-mirror problems with recent -current. Reported by: Daniel Eriksson	2005-05-23 13:05:07 +00:00
pjd	08790cba80	Add KASSERT() to be sure there is an active component. Suggested by: Coverity Prevent analysis tool	2005-05-11 18:13:51 +00:00
pjd	93f62be898	Fix provider's size check for 'insert' command. Before this fix one was able to insert one sector too small provider. MFC after: 3 days	2005-04-25 10:41:26 +00:00
pjd	15eddd96be	Remove the hack which allowed to use gmirror for root file system, use root_mount KPI instead.	2005-04-19 21:47:25 +00:00
pjd	e13782ca35	Make the code more obvious - when an error occurs in g_mirror_connect_disk(), detach and destroy consumer before returning.	2005-03-26 17:23:01 +00:00
pjd	90f14e00b7	Check for return values. Submitted by: sam Found by: Coverity Prevent analysis tool	2005-03-26 16:51:19 +00:00
pjd	668a028670	- Add md_provsize field to metadata, which will help with shared-last-sector problem. After this change, even if there is more than one provider with the same last sector, the proper one will be chosen based on its size. It still doesn't fix the 'c' partition problem (when da0s1 can be confused with da0s1c) and situation when 'a' partition starts at offset 0 (then da0s1a can be confused with da0s1 and da0s1c). One can use '-h' option there, when creating device or avoid sharing last sector. Actually, when providers share the same last sector and their size is equal, they provide exactly the same data, so the name (da0s1, da0s1a, da0s1c) isn't important at all. - Provide backward compatibility. - Update copyright's year. MFC after: 1 week	2005-02-27 23:07:47 +00:00
pjd	c05fe5379b	Update copyright in files changed this year.	2005-02-16 22:14:52 +00:00
pjd	a063b2627e	Increase default synchronization speed. MFC after: 3 days	2005-01-09 14:43:39 +00:00
pjd	c1f23c6a62	Spoiling is now not possible, because we keep consumers open for writing all the time. Remove unused code then. MFC after: 4 days	2005-01-04 12:11:49 +00:00
pjd	fcf90f45eb	Fix 'rebuild' command (we ignore retaste event now, so don't relay on it).	2005-01-03 19:42:37 +00:00
jhb	7b611b0cb2	Stop explicitly touching td_base_pri outside of the scheduler and simply set a thread's priority via sched_prio() when that is the desired action. The schedulers will start managing td_base_pri internally shortly.	2004-12-30 20:29:58 +00:00
pjd	9efcf672b9	Update disk->d_genid field when increasing sc->sc_genid.	2004-12-23 21:15:15 +00:00
pjd	b58db25ebe	- Add genid field to the metadata which will allow to improve reliability a bit. After this change, when component is disconnected because of an I/O error, it will not be connected and synchronized automatically, it will be logged as broken and skipped. Autosynchronization can occur, when component is disconnected (on orphan event) and connected again - there were no I/O error, so there is no need to not connected the component, but when there were writes while it wasn't connected, it will be synchronized. This fix cases, when component is disconnected because of I/O error and can be connected again and again. - Bump version number. - Add version change history. - Implement backward compatibility mechanism. After this change when metadata in old version is detected, it is automatically upgraded to the new (current) version.	2004-12-22 23:09:32 +00:00
pjd	968f03faa7	Now, when force device destruction is done on shutdown, hide warning, that device cannot be destroyed immediately, under debug=1. Suggested by: simon	2004-12-21 19:50:18 +00:00
pjd	c5ff5344f3	This should not be permitted, but some GEOM classes held the topology lock while doing g_(read\|write)_data() (e.g. BSD). This can cause a deadlock in MIRROR class. Not sure if this is safe to drop the topology lock in BSD class, so change the code in MIRROR class to avoid this deadlock.	2004-12-21 18:42:51 +00:00
pjd	d359cbb8ec	Remove unused variables.	2004-12-19 23:55:49 +00:00
pjd	5d72d5354a	- Argument 'flags' in g_mirror_destroy_consumer() function is unsed - mark it as such. - Before closing consumer check if it is open. It can be closed here when g_mirror_connect_disk() fails on g_access().	2004-12-19 23:33:59 +00:00
pjd	8c72a601a3	Some major cleanups. Keeping consumers open when device is closed is very hard. We need to open consumers sometimes to update metadata, etc. Many hacks was introduced in the past to made it possible. You cannot be sure that you can open consumer for writing always, even if you think it should be allowed. If one of the mirror components is for example da0 and you try to open it, you can get EPERM when da0s1 is opened for reading (because BSD class opens consumers (da0) with an extra 'e' bit set). Waiting for the events queue to be empty may do the trick, but it makes code much uglier (as you cannot always call g_waitidle()), it doesn't solve all edge cases and it can introduce deadlocks if there are events in the queue that wait for gmirror. I removed those hacks. Now all consumers are open r1w1e1 always, even if device is closed. Maybe it is less clean from GEOM perspective, but simpify code a lot and make it much more reliable. The only issue was retaste event which is sent when we close consumers opened for writing. I ignore retaste event by not detaching consumer immediately (so retaste event is not send to my class) and sending event right after it to detach and destroy consumer.	2004-12-19 23:12:00 +00:00
pjd	fccc23ca68	Don't quit on first failure, just skip failures.	2004-12-19 22:58:25 +00:00
pjd	64bf081bc3	Before trying to update metadata (so open consumer for writing), be sure that the events queue is empty. In other case we're able to hit the race where for example da0s1 is tasted by some other class, which means that da0 is open with exclusive bit set, which means that we can't open da0 for writing if it is our component. Reported by: Attila Nagy <bra@fsn.hu> (and somebody else sometime ago, but I cannot find who it was)	2004-11-09 23:27:21 +00:00
pjd	a2d5ae235d	Don't rely on DIRTY flag to be sure that consumer if open, because DIRTY flag can be removed in idle process. Use consumer's acw field instead to avoid opening consumer twice.	2004-11-09 23:15:40 +00:00
pjd	592adb99c2	Drop Giant lock before grabbing the topology lock.	2004-11-09 00:35:08 +00:00
pjd	b9bb54bcb8	If device is marked as beeing destroyed, deny all access requests.	2004-11-08 20:23:53 +00:00
pjd	d62be2e0d6	Don't forget to make sure that there are no not-finished requests before marking components as clean. Pointed out by: scottl	2004-11-05 17:18:39 +00:00
pjd	268c658b69	Use shutdown hooks to mark mirrors as clean after all file systems are unmounted. Suggested by: scottl	2004-11-05 12:35:21 +00:00
pjd	de4e5b4e88	Remove unused #include.	2004-11-05 12:31:32 +00:00
pjd	0bd7b4d36a	- Add a sysctl kern.geom.mirror.idletime, so one can specify after how many seconds of idling, DRITY flags are removed. - If mirror is in idle state or is not open for writing, sleep without timeout when waiting for I/O requests. - Don't use atomic operations, for now sysctls are protected by Giant. - Update debugs.	2004-11-05 10:55:04 +00:00
pjd	d0890a743f	MFp4: - Fix for good (I hope) force-stopping mirrors and some filure cases (e.g. the last good component dies when synchronization is in progress). Don't use ->nstart/->nend consumer's fields, as this could be racy, because those fields are used in g_down/g_up, use ->index consumer's field instead for tracking number of not finished requests. Reported by: marcel - After 5 seconds of idle time (this should be configurable) mark all dirty providers as clean, so when mirror is not used in 5 seconds and there will be power failure, no synchronization on boot is needed. Idea from: sorry, I can't find who suggested this - When there are no ACTIVE components and no NEW components destroy whole mirror, not only provider. - Fix one debug to show information about I/O request, before we change its command.	2004-11-05 09:05:15 +00:00
pjd	ae8741cdf4	Ehh. Introduce a hack: Wait for 3 seconds, so GEOM is able to give us providers for tasting. Before this hack, race below is possible: SI_SUB_RAID (no not-fully-configured geoms, so don't block) GEOM tasting (now geoms are created) SI_SUB_MOUNT_ROOT (if root file system is placed on a mirror, it is possible that this mirror is not fully configured yet) There is a lot of work to do to avoid such hacks and I need a working solution before 5.3, sorry. Reported by: John Hay <jhay@icomtek.csir.co.za>	2004-10-14 07:55:29 +00:00
pjd	8ad6178d29	Be sure to always return 0 for negative access requests. Reported by: Maciej Kucharz <qk@comp.waw.pl>	2004-10-07 20:13:23 +00:00
pjd	49a5dba557	Geoms without softc are geoms which are initialized, so wait for them.	2004-10-06 18:47:15 +00:00
pjd	66f574f537	Look out for geoms without softc. Reported by: tegge	2004-10-06 14:15:47 +00:00
pjd	3f28bf167b	Before root file system is mounted, wait for mirrors in degraded state.	2004-10-05 11:17:08 +00:00
pjd	63dd0f756b	Just use MAXPHYS as maximum I/O request size, instead of using my own #define for this purpose. No functional change.	2004-09-28 07:33:37 +00:00
pjd	cad6af1c8f	Minor, but very important condition fix. The current one can never be true.	2004-09-27 19:32:26 +00:00
pjd	4081d9d5d1	Decrease kern.geom.mirror.timeout to 4, so it is smaller than vfs.root.mountdelay by default.	2004-09-27 13:47:37 +00:00
pjd	9b6a1c588a	Forgot to commit addition of ds_resync field.	2004-09-26 20:42:35 +00:00
pjd	9871848b34	Avoid race while synchronizing components. It is very hard to bump into, but it is possible: 1. Read data from good component for synchronization. 2. Write data to the same area. 3. Write synchronization data, which are now stale. Found by: tegge	2004-09-26 20:41:07 +00:00
pjd	48183ebbc3	Simplify code a bit.	2004-09-26 20:30:15 +00:00
pjd	d7954bf77f	This is not needed anymore, it is forced in GEOM now. Actually, it can even cause some problems, because GEOM requires sectorsize to be more than 0 on first access, not on provider creation, so we can skip valid providers by doing this check here. Reported by: Divacky Roman <xdivac02@stud.fit.vutbr.cz> Sven Willenberger <sven@dmv.com>	2004-09-20 17:26:25 +00:00
pjd	555e9e698d	Show current status of mirror device directly. Suggested by: Krzysztof Ciep³ucha <kris@home.pl>	2004-09-08 16:37:22 +00:00
pjd	4689077c9e	Allow to configure debug level from /boot/loader.conf.	2004-08-30 18:50:06 +00:00
pjd	3bedfb04b2	GCC, ehh.	2004-08-29 14:29:30 +00:00
pjd	7f46afc9bf	Skip providers with not defined sector size. Reported by: kuriyama	2004-08-26 12:42:47 +00:00
pjd	02488a6b3c	Allow to set kern.geom.mirror.timeout from /boot/loader.conf.	2004-08-23 20:42:34 +00:00
pjd	ad8a5e508d	We really don't want to receive spoil event for synchroniztion consumers.	2004-08-18 23:33:37 +00:00
pjd	7a2e943ef3	Bump synchronization ID if we are sure, that we have ACTIVE components.	2004-08-18 07:28:48 +00:00
pjd	210c7636d4	Avoid code duplication by introducing g_mirror_write_metadata() function, which is used now by g_mirror_clear_metadata() function and g_mirror_update_metadata() function.	2004-08-15 13:58:29 +00:00
pjd	d1919d7938	MFp4: Simplify code a bit: - Remove kern.geom.mirror.sync_block_size sysctl. It is quite obvious that we want to use the biggest size possible. - Do not use UMA zone for sync data allocations. There could be only one synchronization request per synchronized disk at a time, so allocate memory for one request on whole synchronization process related to one disk. Tested by synchronizing one component (out of three) and by synchronizing two components (out of three) in parallel.	2004-08-11 23:41:53 +00:00
pjd	2f865036c5	Actually, HARDCODED flag isn't stored in metadata, so don't bother dumping it.	2004-08-11 22:16:42 +00:00
pjd	e5e3810748	- Fix typo. - Dump HARDCODED flag.	2004-08-11 22:12:44 +00:00
pjd	dd8a1c6e2a	Try harder to not panic on 'stop -f'. After the commit, this command should be really safe to use.	2004-08-11 11:10:46 +00:00
pjd	2d1d801e5f	- Recognize HARDCODED flag when dumping consumer configuration. - Improve code readabilty a bit.	2004-08-10 19:53:31 +00:00
pjd	f0d4b9a881	Forgot to commit those: introduce hardcoded provider functionality, which allow to store provider's name in the metadata and avoid problems when few providers share the same last sector.	2004-08-10 19:52:12 +00:00
pjd	a98f255700	- Introduce option for hardcoding providers' names into metadata. It allows to fix problems when last provider's sector is shared between few providers. - Bump version number for CONCAT and STRIPE and add code for backward compatibility. - Do not bump version number of MIRROR, as it wasn't officially introduced yet. Even if someone started to play with it, there is no big deal, because wrong MD5 sum of metadata will deny those providers. - Update manual pages. - Add version history to g_(stripe\|concat).h files.	2004-08-09 11:29:42 +00:00

1 2 3 4

167 Commits