freebsd-skq

Author	SHA1	Message	Date
glebius	54ea1a10a5	More whitespace fixes. Checked with: md5, diff -x -w	2011-04-18 22:09:03 +00:00
glebius	57baf3da41	Whitespace fixes. Checked with: md5, diff -w	2011-04-18 21:18:22 +00:00
ru	54ee91ad9a	Changed "conscontrol unset" to accept an existing virtual console device as an argument. Unsetting virtual console using /dev/console seems to have never worked. MFC after: 3 days	2011-04-18 20:28:07 +00:00
trociny	4ef25c883e	Remove hast_proto_recv(). It was used only in one place, where hast_proto_recv_hdr() may be used. This also fixes the issue (introduced by r220523) with hastctl, which crashed on assert in hast_proto_recv_data(). Suggested and approved by: pjd (mentor)	2011-04-17 16:18:45 +00:00
sobomax	169935c85b	If we can retrieve interface address sleep for one second and try again. This can happen during start-up, when natd starts before dhclient has a chance to receive IP address from the upstream provider. MFC after: 2 weeks	2011-04-17 06:05:37 +00:00
pluknet	1263a866ca	Remove vestiges of disklabel(5). Reviewed by: uqs MFC after: 5 days	2011-04-14 08:53:04 +00:00
pjd	8dd5170ed6	The replication mode that is currently support is fullsync, not memsync. Correct this and print a warning if different replication mode is configured. MFC after: 1 week	2011-04-12 19:13:10 +00:00
trociny	cafda9c73f	In hast_proto_recv() remove unnecessary check. The size is checked later in hast_proto_recv_data(). Approved by: pjd (mentor) MFC after: 1 week	2011-04-10 15:28:37 +00:00
trociny	cbdb24d2fb	In hast_proto_recv_data() check that the size of the data to be received does not exceed the buffer size. Approved by: pjd (mentor) MFC after: 1 week	2011-04-10 15:21:46 +00:00
trociny	4ff6b11c08	Fix a typo in comments. Approved by: pjd (mentor) MFC after: 3 days	2011-04-10 15:11:19 +00:00
trociny	334d4b863a	hastd(8) maintains a map of dirty extents, not hastctl(8). Fix this. Approved by: pjd (mentor) MFC after: 3 days	2011-04-10 15:07:54 +00:00
gjb	63a1889bd7	Correct 'list scan' description in the examples. The previous description was incorrect - 'list scan' does not actually do a scan, but instead lists the results of the background 'scan' cache. Submitted by: Fabian Keil (freebsd-listen of fabiankeil de) (via email) Discussed with: bschmidt MFC after: 3 days	2011-04-06 11:20:54 +00:00
obrien	cddd210098	* Add the readline(3) API to libedit. The libedit versions of {readline,history}.h are in /usr/include/edit so as to not conflict with the GNU libreadline versions. To use the libedit readline(3) one should add "-I/usr/include/edit" to their Makefile (spelled "-I${DESTDIR}/${INCLUDEDIR}/edit" within the FreeBSD source tree). * Enable its use in the BSD licensed utilities that support readline(3). * To make it easier to sync libedit development with NetBSD, histedit.h is moved into libedit's directory as history shows shown we keep merging it into that location. Obtained from: NetBSD Sponsored by: Juniper Networks	2011-04-05 18:41:01 +00:00
pjd	005ff1d986	Increase default timeout from 5 seconds to 20 seconds. 5 seconds is definitely to short under heavy load and I was experiencing those timeouts in my recent tests. MFC after: 1 week	2011-04-02 09:34:33 +00:00
pjd	6a5b532558	Handle ENOBUFS on send(2) by retrying for a while and logging the problem. MFC after: 1 week	2011-04-02 09:31:02 +00:00
pjd	ee9b74620b	When we are operating on blocking socket and get EAGAIN on send(2) or recv(2) this means that request timed out. Translate the meaningless EAGAIN to ETIMEDOUT to give administrator a hint that he might need to increase timeout in configuration file. MFC after: 1 month	2011-04-02 09:29:53 +00:00
pjd	52d273ec99	Declare directions for sockets between primary and secondary. In HAST we use two sockets - one for only sending the data and one for only receiving the data. MFC after: 1 month	2011-04-02 09:25:13 +00:00
pjd	0a787e13eb	Allow to disable sends or receives on a socket using shutdown(2) by interpreting NULL 'data' argument passed to proto_common_send() or proto_common_recv() as a will to do so. MFC after: 1 month	2011-04-02 09:22:06 +00:00
pjd	02471b97ed	Handle the problem described in r220264 by using GEOM GATE queue of unlimited length. This should fix deadlocks reported by HAST users. MFC after: 1 week	2011-04-02 07:01:09 +00:00
pjd	025061f4e0	Because ggatel(8) operates on local GEOM providers, use unlimited queue size in GEOM GATE to fix the issue described in r220264. This also means that we no longer need -q option, remove it. Don't bother to leaving it as a no-op, as ggatel(8) is just an example utility.	2011-04-02 06:59:05 +00:00
pjd	4f52368813	Add mapsize to the header just before sending the packet. Before it could change later and we were sending invalid mapsize. Some time ago I added optimization where when nodes are connected for the first time and there were no writes to them yet, there is no initial full synchronization. This bug prevented it from working. MFC after: 1 week	2011-03-25 20:19:15 +00:00
pjd	7c9a800121	Use timeout from configuration file not only when sending and receiving, but also when establishing connection. MFC after: 1 week	2011-03-25 20:15:16 +00:00
pjd	f6f9894d9f	Use role2str() when setting process title. MFC after: 1 week	2011-03-25 20:13:38 +00:00
mav	8dab5b0501	MFgraid/head: Add new RAID GEOM class, that is going to replace ataraid(4) in supporting various BIOS-based software RAIDs. Unlike ataraid(4) this implementation does not depend on legacy ata(4) subsystem and can be used with any disk drivers, including new CAM-based ones (ahci(4), siis(4), mvs(4), ata(4) with `options ATA_CAM`). To make code more readable and extensible, this implementation follows modular design, including core part and two sets of modules, implementing support for different metadata formats and RAID levels. Support for such popular metadata formats is now implemented: Intel, JMicron, NVIDIA, Promise (also used by AMD/ATI) and SiliconImage. Such RAID levels are now supported: RAID0, RAID1, RAID1E, RAID10, SINGLE, CONCAT. For any all of these RAID levels and metadata formats this class supports full cycle of volume operations: reading, writing, creation, deletion, disk removal and insertion, rebuilding, dirty shutdown detection and resynchronization, bad sector recovery, faulty disks tracking, hot-spare disks. For Intel and Promise formats there is support multiple volumes per disk set. Look graid(8) manual page for additional details. Co-authored by: imp Sponsored by: Cisco Systems, Inc. and iXsystems, Inc.	2011-03-24 21:31:32 +00:00
mav	da5bb011df	MFgraid/head r217014: Make `geom XXX list` and `geom XXX status` outputs more consistent: Add -a options to print all geoms, not only ones with providers. Add -g option for `status` to report geom's names, not provider's. Make `status` by default report provider's status (if present), not geom's. Make `status` report consumer's statuses, not only "synchronized" field.	2011-03-24 19:11:05 +00:00
ru	f4d51464c6	It's possible to unmount multiple items at once, make it clear.	2011-03-24 12:35:09 +00:00
glebius	db85233367	Fix SYNOPSIS.	2011-03-23 13:44:09 +00:00
pjd	4845f0f342	Don't create socketpair for connection forwarding between parent and secondary. Secondary doesn't need to connect anywhere. MFC after: 1 week	2011-03-23 11:09:04 +00:00
jmallett	1c16fc7189	Add support for displaying newfs flags for SU+J and TRIM.	2011-03-23 08:33:12 +00:00
pjd	d7f3432550	Add my copyright. MFC after: 1 week	2011-03-22 21:19:51 +00:00
pjd	059b3a821c	Forgot to commit this as part of r219873. MFC after: 1 week	2011-03-22 21:11:36 +00:00
trociny	9a2a9da631	After synchronization is complete we should make primary counters be equal to secondary counters: primary_localcnt = secondary_remotecnt primary_remotecnt = secondary_localcnt Previously it was done wrong and split-brain was observed after primary had synchronized up-to-date data from secondary. Approved by: pjd (mentor) MFC after: 1 week	2011-03-22 20:27:26 +00:00
trociny	03d954b2b3	For requests that are sent only to remote component use the error from remote. Approved by: pjd (mentor) MFC after: 1 week	2011-03-22 19:49:27 +00:00
pjd	b5016a737c	The proto API is a general purpose API, so don't use 'hast' in structures or function names. It can now be used outside of HAST. MFC after: 1 week	2011-03-22 16:21:11 +00:00
pjd	f29604a547	White space cleanups. MFC after: 1 week	2011-03-22 10:39:34 +00:00
pjd	b84a0251e3	When dropping privileges prefer capsicum over chroot+setgid+setuid. We can use capsicum for secondary worker processes and hastctl. When working as primary we drop privileges using chroot+setgid+setuid still as we need to send ioctl(2)s to ggate device, for which capsicum doesn't allow (yet). X-MFC after: capsicum is merged to stable/8	2011-03-21 21:31:50 +00:00
pjd	a444cd5681	Initialize localcnt on first write. This fixes assertion when we create resource, set role to primary, do no writes, then sent it to secondary and accept connection from primary. MFC after: 1 week	2011-03-21 21:16:12 +00:00
pjd	a53d08470e	Fix typo. MFC after: 1 week	2011-03-21 21:14:07 +00:00
pjd	8270eb5240	Before handling any events on descriptors check signals so we can update our info about worker processes if any of them was terminated in the meantime. This fixes the problem with 'hastctl status' running from a hook called on split-brain: 1. Secondary calls a hooks and terminates. 2. Hook asks for resource status via 'hastctl status'. 3. The main hastd handles the status request by sending it to the secondary worker who is already dead, but because signals weren't checked yet he doesn't know that and we get EPIPE. MFC after: 1 week	2011-03-21 15:29:20 +00:00
gjb	7bae0e7098	Bump date from previous commit.	2011-03-21 15:17:02 +00:00
pjd	2b67f741a9	Remove stale comment. Yes, it is valid to set role back to init. MFC after: 1 week	2011-03-21 15:08:10 +00:00
pjd	d3221d1a45	Increase debug level of "Checking hooks." message. MFC after: 1 week	2011-03-21 14:53:27 +00:00
pjd	6ddd70cb46	Be pedantic and free nvout before exiting. MFC after: 1 week	2011-03-21 14:51:16 +00:00
pjd	421e84b77e	Detect situation where resource internal identifier differs. This means that both nodes have separately managed resources that don't have the same data. MFC after: 1 week	2011-03-21 14:50:12 +00:00
pjd	81c99b19dc	Forgot to commit this as a part of r219818. MFC after: 1 week	2011-03-21 11:52:00 +00:00
pjd	3420a73611	In hast.conf we define the other node's address in 'remote' variable. This way we know how to connect to secondary node when we are primary. The same variable is used by the secondary node - it only accepts connections from the address stored in 'remote' variable. In cluster configurations it is common that each node has its individual IP address and there is one addtional shared IP address which is assigned to primary node. It seems it is possible that if the shared IP address is from the same network as the individual IP address it might be choosen by the kernel as a source address for connection with the secondary node. Such connection will be rejected by secondary, as it doesn't come from primary node individual IP. Add 'source' variable that allows to specify source IP address we want to bind to before connecting to the secondary node. MFC after: 1 week	2011-03-21 08:54:59 +00:00
pjd	ee6f63ce17	Log when we start hooks checking and when we execute a hook. MFC after: 1 week	2011-03-21 08:38:24 +00:00
pjd	e06bfb0a1b	Use snprlcat() instead of two strlcat(3)s. MFC after: 1 week	2011-03-21 08:37:50 +00:00
pjd	895e0371a1	Add snprlcat() and vsnprlcat() - the functions I'm always missing. They work as a combination of snprintf(3) and strlcat(3) - the caller can append a string build based on the given format. MFC after: 1 week	2011-03-21 08:36:50 +00:00
pjd	5b7b8f48e3	When creating connection on behalf of primary worker, set pjdlog prefix to resource name and role, so that any logs related to that can be identified properly. MFC after: 1 week	2011-03-21 08:33:58 +00:00

1 2 3 4 5 ...

6576 Commits