with large mmap files mapped into many processes, this saves hundreds of
megabytes of RAM.
pv entries were individually allocated and had two tailq entries and two
pointers (or addresses). Each pv entry was linked to a vm_page_t and
a process's address space (pmap). It had the virtual address and a
pointer to the pmap.
This change replaces the individual allocation with a per-process
allocation system. A page ("pv chunk") is allocated and this provides
168 pv entries for that process. We can now eliminate one of the 16 byte
tailq entries because we can simply iterate through the pv chunks to find
all the pv entries for a process. We can eliminate one of the 8 byte
pointers because the location of the pv entry implies the containing
pv chunk, which has the pointer. After accounting for the overhead of
the pv chunk bitmap and tailq linkage, each pv entry has an effective
size of 24.38 bytes (one 4096 byte page divided by 168 entries).
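The arithmetic above can be checked with a small userland sketch. The
struct layouts, field names, the 4096 byte page size and the two spare
padding words are assumptions made for illustration; they are not taken
from the actual pmap code. Pointers are modelled as 64-bit integers so
the sizes hold on any host.

```c
#include <stdint.h>
#include <stdlib.h>

#define PV_PAGE_SIZE 4096u  /* assumed page size */
#define NPCPV        168    /* pv entries per chunk (from the text) */
#define NPCM         3      /* 64-bit bitmap words: 3 * 64 = 192 >= 168 */

/* Hypothetical 24-byte pv entry: the va plus the one remaining tailq entry. */
struct pv_entry {
	uint64_t pv_va;        /* virtual address */
	uint64_t pv_link[2];   /* stand-in for a 16 byte tailq entry */
};

/* Hypothetical chunk: a 64 byte header followed by the entries. */
struct pv_chunk {
	uint64_t pc_pmap;      /* stand-in for the pmap pointer */
	uint64_t pc_link[2];   /* stand-in for the chunk tailq linkage */
	uint64_t pc_map[NPCM]; /* free-entry bitmap */
	uint64_t pc_spare[2];  /* assumed padding to fill the page exactly */
	struct pv_entry pc_pventry[NPCPV];
};

_Static_assert(sizeof(struct pv_entry) == 24, "pv entry is 24 bytes");
_Static_assert(sizeof(struct pv_chunk) == PV_PAGE_SIZE, "chunk fills a page");

/* Allocate one page-aligned chunk; alignment is what makes masking work. */
static struct pv_chunk *
pv_chunk_alloc(void)
{
	return aligned_alloc(PV_PAGE_SIZE, sizeof(struct pv_chunk));
}

/* The location of a pv entry implies its chunk: mask off the page offset. */
static struct pv_chunk *
pv_to_chunk(struct pv_entry *pv)
{
	return (struct pv_chunk *)((uintptr_t)pv &
	    ~(uintptr_t)(PV_PAGE_SIZE - 1));
}

/* Effective bytes per entry, in hundredths, rounded to nearest. */
static unsigned
pv_effective_size_x100(void)
{
	return (PV_PAGE_SIZE * 100u + NPCPV / 2) / NPCPV;
}
```

With this layout, pv_effective_size_x100() yields 2438, i.e. 24.38 bytes
per entry, and pv_to_chunk() recovers the chunk from any entry inside it
without a per-entry back pointer.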
Future work still required, and other problems:
* when running low on pv entries or system RAM, we may need to defrag
the chunk pages and free any spares. The stats (vm.pmap.*) show that
this doesn't seem to be that much of a problem, but it can be done if
needed.
* running low on pv entries is now a much bigger problem. The old
get_pv_entry() routine just needed to reclaim one other pv entry.
Now, since they are per-process, we can only use pv entries that are
assigned to our current process, or steal an entire page's worth
from another process. Under normal circumstances, the pmap_collect()
code should be able to dislodge some pv entries from the current
process. But if needed, it can still reclaim entire pv chunk pages
from other processes.
* This should port to i386 really easily, except there it would reduce
pv entries from 24 bytes to about 12 bytes.
(I have integrated Alan's recent changes.)
- Use FBSDID in trap.c
- Make the global trap_sig[] static as it's not used outside of trap.c.
- In sendsig() remove an unused variable.
- In trap() sync with the other archs; for fast data access MMU miss and
data access protection traps set ksi_addr to the SFAR reg which contains
the faulting address and otherwise to the TPC reg. Generally the TPC reg
contains the address of the instruction that caused the exception; for
fast instruction access traps (and some others; more refinement may
be needed here) that address is also the faulting address.
Previously sendsig() always set si_addr to the SFAR reg which is wrong
for most traps.
- In sendsig() add support for FreeBSD old-style signals.
These changes are inspired by kmacy's sun4v changes and allow libsigsegv
to build on FreeBSD/sparc64, but it doesn't yet pass all of the checks
and tests that it should.
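The address selection described for trap() can be sketched as follows.
The enum values and the register struct are hypothetical stand-ins, not
the real sparc64 trap numbers or trapframe layout; only the rule (SFAR
for data access traps, TPC otherwise) comes from the text above.

```c
#include <stdint.h>

/* Hypothetical trap classes; the real sparc64 trap numbers differ. */
enum trap_kind {
	TRAP_FAST_DATA_ACCESS_MMU_MISS,
	TRAP_DATA_ACCESS_PROTECTION,
	TRAP_FAST_INSTRUCTION_ACCESS_MMU_MISS,
	TRAP_ILLEGAL_INSTRUCTION
};

/* Hypothetical subset of the saved trap state. */
struct trap_regs {
	uint64_t sfar;  /* fault address register */
	uint64_t tpc;   /* trap program counter */
};

/* Pick the value to report as the signal's fault address. */
static uint64_t
fault_addr(enum trap_kind kind, const struct trap_regs *r)
{
	switch (kind) {
	case TRAP_FAST_DATA_ACCESS_MMU_MISS:
	case TRAP_DATA_ACCESS_PROTECTION:
		return r->sfar;  /* data faults: address that was accessed */
	default:
		return r->tpc;   /* everything else: faulting instruction */
	}
}
```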
MFC after: 5 days
in the commit log) submitted support for some NO_* knobs for delete-old*
and check-old. I converted it to the new WITHOUT_* knobs (more correctly:
MK_*) and added some dummy ones so that people can see what's missing.
Volunteers can have a look at http://phk.freebsd.dk/misc/build_options/
for a list of files.
The location looks a little bit odd to me, but I don't care about the
color of this bikeshed and follow the suggestion of our build
infrastructure guru to place it "somewhere under src/tools/ please". [1]
The build/mk/ directory looks more sane to me than the other ones there.
Submitted by: milosz.galazka@gmail.com
Suggested by: ru [1]
intr_disable() and intr_restore() resp. Previously, critical
regions would have interrupts disabled, but that was changed.
Consequently, the debugger could run with interrupts enabled.
This could cause problems for the low-level console code where
received characters would trigger an interrupt that causes
the interrupt handler to read the character instead of the
cngetc() function.
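The save/disable/restore pattern looks roughly like the userland
simulation below. The intr_disable() and intr_restore() here are stubs
operating on a simulated flag, and debugger_body() and enter_debugger()
are hypothetical names; none of this is the kernel implementation.

```c
#include <stdbool.h>

static bool intr_enabled = true;       /* simulated interrupt-enable bit */
static bool seen_enabled_in_debugger;  /* set if the body ran unprotected */

/* Stub: disable interrupts and return the previous state. */
static bool
intr_disable(void)
{
	bool old = intr_enabled;

	intr_enabled = false;
	return old;
}

/* Stub: put back whatever state the caller saved. */
static void
intr_restore(bool old)
{
	intr_enabled = old;
}

/* Hypothetical debugger work, e.g. polling the console for input. */
static void
debugger_body(void)
{
	if (intr_enabled)
		seen_enabled_in_debugger = true;
}

/* Run the debugger with interrupts off, then restore the prior state. */
static void
enter_debugger(void)
{
	bool saved = intr_disable();

	debugger_body();
	intr_restore(saved);
}
```

Restoring the saved state, rather than unconditionally enabling, keeps
interrupts off afterwards if they were already off on entry.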
The INP_DROPPED check replaces the current NULL checks; the INP_TIMEWAIT
checks appear to have always been required, but not been there, which
is/was a bug. This avoids unconditionally casting inp_ppcb to a tcpcb,
when it may be a twtcb, which may have resulted in obscure ICMP-related
panics in earlier releases.
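A minimal sketch of checking the flags before casting, under stated
assumptions: the struct layouts, the flag values and the field name
inp_state_flags are placeholders, and inp_to_live_tcpcb() is a
hypothetical helper rather than a function from the tree.

```c
#include <stddef.h>

#define INP_TIMEWAIT 0x1  /* placeholder value */
#define INP_DROPPED  0x2  /* placeholder value */

struct tcpcb { int t_state; };     /* stand-in */
struct tcptw { int tw_unused; };   /* stand-in */

struct inpcb {
	int   inp_state_flags;     /* hypothetical field holding the flags */
	void *inp_ppcb;            /* tcpcb, tcptw, or NULL */
};

/*
 * Return the tcpcb only when inp_ppcb really is one.  In time wait the
 * pointer refers to a tcptw, and after a drop there is nothing to use,
 * so neither case may be cast to struct tcpcb *.
 */
static struct tcpcb *
inp_to_live_tcpcb(struct inpcb *inp)
{
	if (inp->inp_state_flags & (INP_TIMEWAIT | INP_DROPPED))
		return NULL;
	return (struct tcpcb *)inp->inp_ppcb;
}
```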
MFC after: 3 months
casts.
Consistently use intotw() to cast inp_ppcb pointers to struct tcptw *
pointers.
Consistently use intotcpcb() to cast inp_ppcb pointers to struct tcpcb *
pointers.
Don't assign tp to the result of intotcpcb() during variable declaration
at the top of functions, as that is before the asserts relating to
locking have been performed. Do this later in the function after
appropriate assertions have run to allow that operation to be considered
safe.
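The ordering can be sketched in userland as below. The structs are
stand-ins, the inp_locked field and the assert() replace the kernel's
lock assertion, and tcp_state_of() is a hypothetical function; only the
accessor names intotcpcb() and intotw() come from the text.

```c
#include <assert.h>
#include <stdbool.h>

struct tcpcb { int t_state; };         /* stand-in */
struct tcptw { int tw_unused; };       /* stand-in */

struct inpcb {
	bool  inp_locked;              /* hypothetical stand-in for the lock */
	void *inp_ppcb;
};

/* One accessor per target type, instead of open-coded casts. */
#define intotcpcb(ip) ((struct tcpcb *)(ip)->inp_ppcb)
#define intotw(ip)    ((struct tcptw *)(ip)->inp_ppcb)

static int
tcp_state_of(struct inpcb *inp)
{
	struct tcpcb *tp;              /* declared, deliberately not assigned */

	assert(inp->inp_locked);       /* stand-in for the lock assertion */
	tp = intotcpcb(inp);           /* only now is reading inp_ppcb safe */
	return tp->t_state;
}
```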
MFC after: 3 months
immediately rather than jumping to the normal output handling, which
assumes we've pulled out the inpcb, which hasn't happened at this
point (and isn't necessary).
Return ECONNABORTED instead of EINVAL when the inpcb has entered
INP_TIMEWAIT or INP_DROPPED, as this is the documented error value.
This may correct the panic seen by Ganbold.
MFC after: 1 month
Reported by: Ganbold <ganbold at micom dot mng dot net>
the NS8250 class driver. The UART has FIFOs if sc_rxfifosz>1, so
test for that instead.
While here, properly initialize sc_rxfifosz and sc_txfifosz in the
case where the UART doesn't have FIFOs.
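A sketch of the test, assuming that a UART without FIFOs is given a
size of 1 for both directions (the text does not state the value). The
struct is a two-field stand-in for the real softc, and both helper
functions are hypothetical.

```c
#include <stdbool.h>

/* Stand-in for the two softc fields named in the text. */
struct uart_softc {
	int sc_rxfifosz;
	int sc_txfifosz;
};

/* Hypothetical probe result handling; 1 means "no FIFO" (assumed). */
static void
uart_set_fifo_sizes(struct uart_softc *sc, bool has_fifo, int depth)
{
	if (has_fifo) {
		sc->sc_rxfifosz = depth;
		sc->sc_txfifosz = depth;
	} else {
		sc->sc_rxfifosz = 1;
		sc->sc_txfifosz = 1;
	}
}

/* The UART has FIFOs if sc_rxfifosz > 1. */
static bool
uart_has_fifo(const struct uart_softc *sc)
{
	return sc->sc_rxfifosz > 1;
}
```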
is just ". file" according to POSIX, however many other shells allow
arguments to be passed after the file. For compatibility (we even use that
feature in buildworld) additional arguments are not considered to be an
error, even though this shell does not do anything with the arguments at all.
new member to struct builtincmd and set it to 1 if -s was specified. This
is done because there are cases where special builtins must be treated
differently from other builtins.
Obtained from: NetBSD (builtins.def part)
disconnect for fully connected sockets was dropped, meaning that if
the socket was closed while the connection was alive, it would be
leaked. Structure tcp_usr_detach() so that there are two clear
parts: initiating disconnect, and reclaiming state, and reintroduce
the tcp_disconnect() call in the first part.
MFC after: 3 months
hwpmc(4) and pmcstat(8) profiling support for dynamically-loaded
objects, and pmcstat(8) network logging support added[1].
Fix incorrect entries:
gmirror/graid3 now use parallel I/O requests for synchronization.
Parallel I/O requests themselves were already supported[2].
Spotted by: jkoshy[1] and pjd[2]
kbdmux(4) GENERIC support currently in amd64 and i386 only[1],
uart(4) GENERIC support currently not in pc98[1],
speaker(4) on amd64 entry needs arch="amd64"[2],
hptmv(4) update entry needs arch="amd64,i386"[2], and
OpenSSH 4.3p1 import has not been merged yet[2].
Spotted by: nyan[1] and delphij[2]
takes a host:port specification.
- Update the manual page and add an example showing how to log
over the network using pmcstat(8) and nc(1). Document the
current inability to process logs in a cross-platform manner.
- Have pmcstat_open_log() call err(3) directly in case
of an error; this simplifies error handling in its caller.
MFC after: 1 week
the first part before starting, or the TCP port we want to bind may still
be in use. Sleep for a short period between tests.
Use SIGTERM instead of SIGKILL.
file descriptor handling bug fixed,
support for copying console messages to remote gdb(1),
kbdmux(4) now in GENERIC by default,
scc(4) for generic serial devices added,
net.inet.ip.portrange.reserved[high|low] for IPv6,
ata(4) USB mass storage class support,
gmirror kernel crash dump support,
gmirror and graid3 parallel I/O request support,
mfi(4) for LSI MegaRAID SAS controller family added,
getfacl(1) -q option added,
gvinum(8) resetconfig sub-command support,
libarchive(3) tp format support removed,
libarchive(3) POSIX.1e EA support added,
libc symbol map added,
libm symbol map added,
ls(1) -U option added,
mdconfig(8) XML device listing support,
mdconfig(8) -u option now supports comma-separated multiple devices,
BIND9 DNS resolver library imported,
strtonum(3) from OpenBSD,
rc.d/ike removed,
GNU Readline library updated to 5.1,
OpenSSH updated to 4.3p1,
hostapd updated to 0.4.8,
WPA supplicant updated to 0.4.8,
zlib updated to 1.2.3,
pkg_add(1) -F option added,
portsnap(1) HTTP_PROXY_AUTH handling bug fixed,
"make showconfig" in src/Makefile added, and
/etc/src.conf added.
a pv entry if the number of entries is below the high water mark for pv
entries.
Use pmap_try_insert_pv_entry() in pmap_copy() instead of
pmap_insert_entry(). This avoids possible recursion on a pmap lock in
get_pv_entry().
Eliminate the explicit low-memory checks in pmap_copy(). The check that
the number of pv entries was below the high water mark was largely
ineffective because it was located in the outer loop rather than the
inner loop where pv entries were allocated. Instead of checking, we
attempt the allocation and handle the failure.
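The "attempt and handle the failure" approach can be sketched like
this. The counters, the high water value and both function names are
hypothetical, and stopping the copy at the first failed allocation is an
assumption about how the failure is handled.

```c
#include <stdbool.h>

static int pv_entry_count;           /* simulated global entry count */
static int pv_entry_high_water = 4;  /* simulated high water mark */

/* Allocate only while the count is below the high water mark. */
static bool
try_insert_pv_entry(void)
{
	if (pv_entry_count >= pv_entry_high_water)
		return false;
	pv_entry_count++;
	return true;
}

/*
 * Copy up to n mappings.  There is no up-front low-memory check; each
 * allocation is attempted in the inner loop and a failure ends the copy.
 * Returns the number of mappings copied.
 */
static int
copy_mappings(int n)
{
	int copied = 0;

	for (int i = 0; i < n; i++) {
		if (!try_insert_pv_entry())
			break;
		copied++;
	}
	return copied;
}
```

Because the limit is tested at the point of allocation, it cannot be
bypassed by a long inner loop the way an outer-loop check could.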
Reviewed by: tegge
Reported by: kris
MFC after: 5 days
The following bug was just identified in OpenBSD and it looks like the same
bug exists in the other BSDen NFS servers.
A Linux client (don't know which version, but you can look at
http://bugzilla.kernel.org/show_bug.cgi?id=6256)
does a Setattr of mtime to the server's time, where the file is mode 0664 and
the client user has group access (ie. caller is not the file owner).
The BSD servers fail the Setattr with EPERM, since the VA_UTIMES_NULL flag
isn't set before doing the VOP_SETATTR.
It seems to me that this should be allowed, since it is allowed for a local
utimes(2). If so, the fix is to set VA_UTIMES_NULL for the
"set-time-to-server-time" cases of setting atime and/or mtime.
Submitted by: rick@snowhite.cis.uoguelph.ca
Reviewed by: cel
Approved by: silby
MFC after: 1 week
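For the NFS Setattr fix described above, the flag handling can be
sketched as follows. The enum borrows the NFSv3 time_how names, the
struct is a one-field stand-in for struct vattr, the flag value is a
placeholder, and apply_time_how() is a hypothetical helper rather than
the server code.

```c
/* How the client asked for a timestamp to be set (NFSv3 time_how). */
enum time_how {
	DONT_CHANGE = 0,
	SET_TO_SERVER_TIME = 1,
	SET_TO_CLIENT_TIME = 2
};

#define VA_UTIMES_NULL 0x01            /* placeholder value */

/* One-field stand-in for struct vattr. */
struct vattr_sketch {
	unsigned va_vaflags;
};

/*
 * Mark the request as equivalent to utimes(2) with a NULL times pointer
 * whenever atime and/or mtime is to be set to the server's time, so
 * that callers with write access who are not the owner are allowed.
 */
static void
apply_time_how(struct vattr_sketch *va, enum time_how atime_how,
    enum time_how mtime_how)
{
	if (atime_how == SET_TO_SERVER_TIME ||
	    mtime_how == SET_TO_SERVER_TIME)
		va->va_vaflags |= VA_UTIMES_NULL;
}
```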
socket can have a tcp connection that has entered time wait
attached to it, in the event that shutdown() is called on the
socket and the FINs are properly exchanged before close(). In this
case we don't detach or free the inpcb, just leave the tcptw
detached and freed, but we must release the inpcb lock (which we
didn't previously).
MFC after: 3 months
acpi(4) HPET time counter support,
acpi_ibm(4) fan control support,
ddb(4) show lock,
ddb(4) show sleepq,
firmware(9) added,
random(4) MPSAFE,
new sysctl kern.sigqueue.queue_sigchild,
brandinfo BI_CAN_EXEC_DYN flag,
new sysctl kern.forcesigexit,
RedZone, a buffer corruption protection for kernel's malloc(9),
security.mac.biba.interfaces_equal for mac_biba,
POSIX_TIMERS support updated to 200112L,
initial support for POSIX message queue,
Xbox support,
DEFAULTS kernel configuration files for each arch,
cardbus(4) /dev/cardbus%d.cis device node added,
ce(4) for Cronyx Tau-PCI/32 added,
ipmi(4), OpenIPMI (Intelligent Platform Management Interface)
driver added,
kbdmux(4) integrated into syscons(4) and kbd,
uart(4) now in GENERIC kernel,
uart(4) LOM and RSC support,
snd_atiixp(4) added and suspend/resume support,
snd_solo(4) MPSAFE,
speaker(4) amd64 support,
uaudio(4) 24/32 bit audio support,
ath(4) updated to version 0.9.16.16,
bge(4) Jumbo frame support, big-endian arch support, MPSAFE,
em(4) updated to version 3.2.18, big-endian arch support,
performance improvement, suspend/resume support,
iwi(4) big-endian arch support,
le(4) for AMD Am7900 LANCE added,
myri10ge(4) for Myricom Myri10GE adapter added,
nve(4) updated to version 1.0-0310,
ti(4) big-endian arch support,
ufoma(4) for FOMA 3G mobile phone in Japan added,
vgapci(4) stub driver added,
arp(8) retransmission algorithm revised,
new sysctl net.link.ether.inet.log_arp_permanent_modify,
support for -i <if> with -d -a,
an experimental BPF Just-In-Time compiler added,
if_bridge(4) span ports support added,
if_bridge(4) RFC 3378 EtherIP support,
ipfw(4) now supports action argument substitution from table lookup,
ng_bpf(4) BPF Just-In-Time compiler support,
bug related to NFS over TCP reconnection fixed,
IPV6_V6ONLY now works for UDP,
amr(4) performance improvement, ioctl support for MegaRaid Tools,
ata(4) DMA for kernel dump and dumping to ataraid(4) devices,
ataraid(4) now supports JMicron ATA RAID metadata,
gmirror and graid3 disconnect_on_failure sysctls added,
g_md.ko renamed to geom_md.ko,
mpt(4) SAS HBA and 64-bit PCI support,
twa(4) updated to 9.3.0.1,
geli(8) now allows loading keyfiles before root file system is mounted,
initial support for SGI's XFS added,
ACPI-CA updated to 20051021,
DRM updated to 20051202,
TrustedBSD OpenBSM version 1.0 alpha 5 imported,
bsnmpd(1) Host Resources MIB in RFC 2790 support,
config(8) "nocpu" directive added,
config(8) now reads DEFAULTS, if present, before the specified config file,
csh(1) NLS catalog support,
csup(1), CVSup-compatible client written in C imported,
devd(8) -f option,
ftpd(8) change related to PID file creation,
gbde(8) -k and -K option,
gpt(8) GPT partition label setting support,
gvinum(8) now supports moving a subdisk between drives,
GSS-API version 2 (RFC2743 and RFC2744) implemented,
jail(8) -J option,
kdump(1) -H and -s option,
kgdb(1) -w option,
libarchive(3) tp format support,
ln(1) -F option,
locate(1) -I option,
mdmfs(8) -P and -E option,
mergemaster(8) -A option,
mount(8) "nodev" option removed,
netstat(1) IPsec protocol stats support,
periodic(8) daily gmirror, graid3, gstripe, gconcat support,
pkill(1) -I option,
rfcomm_pppd(8) -c servicename support,
rtld(1) ELF symbol versioning support,
sh(1) "times" built-in command support,
truss(1) -s option,
truss(1) now works on FreeBSD/ppc,
usbd(8) removed in favor of devd(8),
xargs(1) -r option,
rc.d/auditd added,
rc.d/bluetooth, rc.d/hcsecd, rc.d/sdpd added,
rc.d/ftpd added,
rc.d/hostapd added,
rc.d/netif ipv4_addrs_<ifn> support,
rc.d/rcconf.sh removed and early_late_divider variable added,
rc.initdiskless now uses tar(1) instead of pax(1),
rc.d/pccard removed,
rc.d/ppp-user added (renamed from ppp),
removable_interfaces variable removed,
bsnmpd updated from 1.11 to 1.12,
pkg_add(1) -P option,
pkg_add(1) and pkg_create(1) -K option,
pkg_create(1) -x, -E, and -G options,
local_startup directory now evaluated by rcorder(8) with
scripts in the base system,
suffix of startup scripts removed,
variables "ldconfig_local_dirs" and "ldconfig_local32_dirs" added,
@cwd in pkg-plist now allows no directory argument, and
CHECKSUM.MD5's checksum in CHECKSUM.MD5 problem fixed.