freebsd-dev

Author	SHA1	Message	Date
Pawel Jakub Dawidek	5ed118d861	- On primary worker reload, update hr_exec field. - Update comment. MFC after: 1 week	2011-01-22 22:31:55 +00:00
Pawel Jakub Dawidek	9cc97e5803	Install default signal handlers before masking signals we want to handle. It is possible that the parent process ignores some of them and sigtimedwait() will never see them, even though they are masked. The most common situation for this to happen is the boot process, where init(8) ignores SIGHUP before starting to execute /etc/rc. This in turn caused hastd(8) to ignore SIGHUP. Reported by: trasz Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com MFC after: 3 days	2011-01-12 14:35:29 +00:00
Pawel Jakub Dawidek	347bde360a	Log the fact of launching and include protocol version number. MFC after: 3 days	2010-12-16 07:28:40 +00:00
Pawel Jakub Dawidek	2964aeb34a	Load geom_gate.ko module after parsing arguments. MFC after: 3 days	2010-10-24 15:38:58 +00:00
Pawel Jakub Dawidek	3f562cce40	Log correct connection when canceling half-open connection. Submitted by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-10-17 15:47:27 +00:00
Pawel Jakub Dawidek	428ad0a9c4	Decrease report interval to 5 seconds, as this also means we will check for signals every 5 seconds and not every 10 seconds as before. MFC after: 3 days	2010-10-04 21:44:26 +00:00
Pawel Jakub Dawidek	5f24b330df	hook_check() is now only used to report about long-running hooks, so the argument is redundant, remove it. MFC after: 3 days	2010-10-04 21:43:06 +00:00
Pawel Jakub Dawidek	41013c0b21	We can't mask an ignored signal, so install a dummy signal handler for SIGCHLD before masking it. This fixes bogus reports about hooks running for too long and other problems related to garbage-collecting child processes. Reported by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-10-04 21:41:18 +00:00
Pawel Jakub Dawidek	9dd5a6cb0f	Switch to sigprocmask(2) API also in the main process and secondary process. This way the primary process inherits signal mask from the main process, which fixes a race where signal is delivered to the primary process before configuring signal mask. Reported by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-09-22 19:08:11 +00:00
Pawel Jakub Dawidek	196abd3518	Assert that descriptor numbers are sane. MFC after: 3 days	2010-09-22 19:05:54 +00:00
Pawel Jakub Dawidek	0c24d8e2a1	Fix descriptor leaks: when child exits, we have to close control and event socket pairs. We did that only in one case out of three. MFC after: 3 days	2010-09-22 18:57:06 +00:00
Pawel Jakub Dawidek	5bdff860e7	Because it is very hard to make fork(2) from a threaded process safe (we are limited to async-signal safe functions in the child process), move all hook execution to the main (non-threaded) process. Do it by maintaining a connection (socketpair) between child and parent and sending events from the child to the parent, so it can execute the hook. This is a step in the right direction for other reasons too. For example, there is one less problem to solve before dropping privs in worker processes. MFC after: 2 weeks Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com	2010-08-30 23:26:10 +00:00
Pawel Jakub Dawidek	6b276294af	We only want to know if descriptors are ready for reading. MFC after: 2 weeks Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com	2010-08-30 23:19:21 +00:00
Pawel Jakub Dawidek	ecc99c890e	Allow to run hooks from the main hastd process. MFC after: 2 weeks Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com	2010-08-29 21:41:53 +00:00
Pawel Jakub Dawidek	5da2320932	When SIGTERM or SIGINT is received, terminate worker processes. MFC after: 2 weeks Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com	2010-08-27 21:28:02 +00:00
Pawel Jakub Dawidek	0becad39a7	Allow to execute specified program on various HAST events. MFC after: 2 weeks Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com	2010-08-27 15:16:52 +00:00
Pawel Jakub Dawidek	0989854d45	Implement configuration reload on SIGHUP. This includes: - Load added resources. - Stop and forget removed resources. - Update modified resources in least intrusive way, ie. don't touch /dev/hast/<name> unless path to local component or provider name were modified. Obtained from: Wheel Systems Sp. z o.o. http://www.wheelsystems.com MFC after: 1 month	2010-08-05 19:16:31 +00:00
Pawel Jakub Dawidek	bbbb114cda	Prepare configuration parsing code to be called multiple times: - Don't exit on errors if not requested. - Don't keep configuration in global variable, but allocate memory for configuration. - Call yyrestart() before yyparse() so that on error in configuration file we will start from the beginning next time and not from the place we left off. MFC after: 1 month	2010-08-05 19:08:54 +00:00
Pawel Jakub Dawidek	005f438bf5	- Use pjdlog_exitx() to log errors and exit instead of errx(). - Use 'unable to' (instead of 'cannot') consistently. MFC after: 1 month	2010-08-05 18:56:24 +00:00
Pawel Jakub Dawidek	f3bd74124a	Correct various log messages. Submitted by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-06-14 21:46:48 +00:00
Pawel Jakub Dawidek	9fab3c1b94	Remove macros that are not really needed. The idea was to have them in case we grow more descriptors, but I'll reconsider readding them once we get there. Passing (a = b) expression to FD_ISSET() is bad idea, as FD_ISSET() evaluates its argument twice. Found by: Coverity Prevent CID: 5243 MFC after: 3 days	2010-06-14 21:18:58 +00:00
Pawel Jakub Dawidek	c6ddcbe009	- Check if the worker process was killed by signal and restart it. - Improve logging. Pointed out by: Garrett Cooper <yanefbsd@gmail.com> MFC after: 3 days	2010-04-29 15:42:24 +00:00
Pawel Jakub Dawidek	5571414ca8	Fix a problem where hastd would get stuck in recv(2) after sending a request to a secondary that died between send(2) and recv(2). Do it by adding a timeout to recv(2) for primary incoming and outgoing sockets and the secondary outgoing socket. Reported by: Mikolaj Golub <to.my.trociny@gmail.com> Tested by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-04-29 15:36:32 +00:00
Pawel Jakub Dawidek	83a5671405	Restart worker thread only if the problem was temporary. In case of persistent problem we don't want to loop forever. MFC after: 3 days	2010-04-28 22:41:06 +00:00
Pawel Jakub Dawidek	06c117d1d1	Use WEXITSTATUS() to obtain real exit code. MFC after: 3 days	2010-04-28 22:26:30 +00:00
Pawel Jakub Dawidek	09398e9bd4	Fix control socket leak when worker process exits. Submitted by: Mikolaj Golub <to.my.trociny@gmail.com> MFC after: 3 days	2010-04-16 06:47:29 +00:00
Pawel Jakub Dawidek	32115b105a	Please welcome HAST - Highly Available Storage. HAST allows to transparently store data on two physically separated machines connected over the TCP/IP network. HAST works in Primary-Secondary (Master-Backup, Master-Slave) configuration, which means that only one of the cluster nodes can be active at any given time. Only Primary node is able to handle I/O requests to HAST-managed devices. Currently HAST is limited to two cluster nodes in total. HAST operates on block level - it provides disk-like devices in /dev/hast/ directory for use by file systems and/or applications. Working on block level makes it transparent for file systems and applications. There is no difference between using HAST-provided device and raw disk, partition, etc. All of them are just regular GEOM providers in FreeBSD. For more information please consult hastd(8), hastctl(8) and hast.conf(5) manual pages, as well as http://wiki.FreeBSD.org/HAST. Sponsored by: FreeBSD Foundation Sponsored by: OMCnet Internet Service GmbH Sponsored by: TransIP BV	2010-02-18 23:16:19 +00:00

27 Commits
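
Commits 9cc97e5803 and 41013c0b21 both depend on the same ordering rule: a signal's disposition has to be reset before the signal is masked, otherwise an inherited "ignore" setting can cause it to be discarded and sigtimedwait() never sees it. The following is a minimal sketch of that ordering, not the hastd code itself. hastd is written in C; Python is used here only because its signal module exposes the same POSIX calls under the same names. The function name wait_for_sighup and the one-second timeout are assumptions made for illustration.

```python
import os
import signal


def wait_for_sighup(timeout=1.0):
    """Return the signal number collected by sigtimedwait(), or None on timeout.

    Hypothetical helper for illustration; it does not exist in hastd.
    """
    # Simulate a parent (such as init(8) during boot) that left SIGHUP ignored.
    signal.signal(signal.SIGHUP, signal.SIG_IGN)

    # Step 1: restore the default disposition so the signal is no longer ignored.
    signal.signal(signal.SIGHUP, signal.SIG_DFL)

    # Step 2: block the signal so it stays pending instead of terminating us.
    signal.pthread_sigmask(signal.SIG_BLOCK, {signal.SIGHUP})

    # Send SIGHUP to ourselves; because it is blocked, it is queued as pending.
    os.kill(os.getpid(), signal.SIGHUP)

    # Step 3: collect the pending signal synchronously, with a timeout.
    info = signal.sigtimedwait({signal.SIGHUP}, timeout)
    return None if info is None else info.si_signo


if __name__ == "__main__":
    print(wait_for_sighup() == signal.SIGHUP)
```

Whether a blocked signal that is also ignored gets discarded varies between operating systems, so the sketch demonstrates only the corrected ordering and does not try to reproduce the failure.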