freebsd-nq

Author	SHA1	Message	Date
Gleb Smirnoff	4e76af6a41	Merge from projects/counters: counter(9). Introduce counter(9) API, that implements fast and raceless counters, provided (but not limited to) for gathering of statistical data. See http://lists.freebsd.org/pipermail/freebsd-arch/2013-April/014204.html for more details. In collaboration with: kib Reviewed by: luigi Tested by: ae, ray Sponsored by: Nginx, Inc.	2013-04-08 19:40:53 +00:00
Gleb Smirnoff	17dece86fe	Merge from projects/counters: Pad struct pcpu so that its size is denominator of PAGE_SIZE. This is done to reduce memory waste in UMA_PCPU_ZONE zones. Sponsored by: Nginx, Inc.	2013-04-08 19:19:10 +00:00
Hans Petter Selasky	3550618d0b	Fix regression issue after r248910. PR: arm/177685 Submitted by: Christoph Mallon <christoph.mallon@gmx.de>	2013-04-07 13:03:57 +00:00
Oleksandr Tymoshenko	08dab5832b	Properly clean "spurious interrupt" state Suggested by: Ian Lepore	2013-04-06 03:31:28 +00:00
Andrew Turner	64277b97f9	Hide non-assembler bits behind #ifndef __ASSEMBLER__	2013-04-06 00:47:33 +00:00
Andrew Turner	c46c2523c2	Add the hw.floatingpoint sysctl to ARM to tell us if we have vfp support in the kernel and the hardware includes a vfp unit.	2013-04-05 23:35:23 +00:00
Alexander Motin	45f6d66569	Remove all legacy ATA code parts, not used since options ATA_CAM enabled in most kernels before FreeBSD 9.0. Remove such modules and respective kernel options: atadisk, ataraid, atapicd, atapifd, atapist, atapicam. Remove the atacontrol utility and some man pages. Remove useless now options ATA_CAM. No objections: current@, stable@ MFC after: never	2013-04-04 07:12:24 +00:00
Ian Lepore	4d7abca057	Fix low-level uart drivers that set their fifo sizes in the softc too late. uart(4) allocates send and receiver buffers in attach() before it calls the low-level driver's attach routine. Many low-level drivers set the fifo sizes in their attach routine, which is too late. Other drivers set them in the probe() routine, so that they're available when uart(4) allocates buffers. This fixes the ones that were setting the values too late by moving the code to probe().	2013-04-01 00:44:20 +00:00
Ian Lepore	5ea561e03a	Enable hardware flow control and high speed bulk data transfer in at91 uarts. Changes to make rtc/cts flow control work... This does not turn on the builtin hardware flow control on the SoC's usart device, because that doesn't work on uart1 due to a chip erratum (they forgot to wire up pin PA21 to RTS0 internally). Instead it uses the hardware flow control logic where the tty layer calls the driver to assert and de-assert the flow control lines as needed. This prevents overruns at the tty layer (app doesn't read fast enough), but does nothing for overruns at the driver layer (interrupts not serviced fast enough). To work around the wiring problem with RTS0, the driver reassigns that pin as a GPIO and controls it manually. It only does so if given permission via hint.uart.1.use_rts0_workaround=1, to prevent accidentally driving the pin if uart1 is used without flow control (because something not related to serial IO could be wired to that pin). In addition to the RTS0 workaround, driver changes were needed in the area of reading the current set of DCE signals. A priming read is now done at attach() time, and the interrupt routine now sets SER_INT_SIGCHG when any of the DCE signals change. Without these changes, nothing could ever be transmitted, because the tty layer thought CTS was de-asserted (when in fact we had just never read the status register, and the hwsig variable was init'd to CTS de-asserted). Changes to support bulk high-speed (230kbps and higher) data reception... Allow the receive fifo size to be tuned with hint.uart.<dev>.fifo_bytes. For high speed receive, a fifo size of 1024 works well. The default is still 128 bytes if no hint is provided. Using a value larger than 384 requires a change in dev/uart/uart_core.c to size the intermediate buffer as MAX(384, 3*sc->sc_rxfifosize). Recalculate the receive timeout whenever the baud rate changes. At low baud rates (19.2kbps and below) the timeout is the number of bits in 2 characters. At higher speed it's calculated to be 500 microseconds worth of bits. The idea is to compromise between being responsive in interactive situations and not timing out prematurely during a brief pause in bulk data flow. The old fixed timeout of 1.5 characters was just 32 microseconds at 460kbps. At interrupt time, check for receiver holding register overrun status and set the corresponding status bit in the return value. When handling a buffer overrun, get a single buffer emptied and handed back to the hardware as quickly as possible, then deal with the second buffer. This at least minimizes data loss compared to the old logic that fully processed both buffers before restarting the hardware. Rewrite the logic for handling buffers after a receive timeout. The original author speculated in a comment that there may be a race with high speed data. There was, although it was rare. The code now handles all three possible scenarios on receive timeout: two empty buffers, one empty and one partial buffer, or one full and one partial buffer. Reviewed by: imp	2013-04-01 00:00:10 +00:00
Ian Lepore	070cf887bb	When running on armv6, set alignment checking to modulo-4 mode rather than modulo-8, because clang emits ldrd and strd instructions for addresses that are only 4-byte aligned.	2013-03-31 22:42:25 +00:00
Ian Lepore	27aa887af3	Fix a typo in the CF device driver name that prevented instantiation.	2013-03-31 12:51:56 +00:00
Ian Lepore	63cdf42e8c	Add userland access to at91 gpio functionality via ioctl calls. Also, add the ability for userland to be notified of changes on gpio pins via a select(2)/read(2) interface. Change the interrupt handler from filtered to threaded. Because of the uiomove() calls in the new interface, change locking from standard mutex to sx. Add / restore the at91_gpio_high_z() function. Reviewed by: imp (long ago)	2013-03-29 19:52:57 +00:00
Ian Lepore	914421fa79	Change the API for at91_pio_gpio_get() to return the entire masked set of bits, not just a 0/1 indicating whether any of the masked bits are on. This is compatible with the single in-tree caller of this function right now (at91_vbus_poll() in dev/usb/controller/at91dci_atemelarm.c).	2013-03-29 19:04:18 +00:00
Ian Lepore	b39ec0de86	Call soc_info.soc_data->soc_clock_init() before at91_pmc_init_clock(), so that the latter correctly fills in the clock data structures based on proper hardware-specific shift and mask values from the soc_data structure.	2013-03-29 18:47:08 +00:00
Ian Lepore	fce4536cfd	Add a couple forward declarations, so that board support routines don't have to pre-include a bunch of header files they don't need just to use this one.	2013-03-29 18:43:10 +00:00
Ian Lepore	5c4938ee48	Redo the workaround for at91rm9200 erratum #26 in a way that doesn't cause a lockup on some rm92 hardware.	2013-03-29 18:17:51 +00:00
Ian Lepore	c29eb73802	Fix a typo: the RXD0 pin is PA18, not PA19.	2013-03-29 18:06:54 +00:00
Ian Lepore	14146e9a04	Remove a really noisy printf left over from debugging hardware errata.	2013-03-29 17:57:24 +00:00
Ian Lepore	49addc5755	Don't check and warn about pmap mismatch on every call to busdma sync. With some recent busdma refactoring, sometimes it happens that a sync op gets called when bus_dmamap_load() never got called, which results in a spurious warning about a map mismatch when no sync operations will actually happen anyway. Now the check is done only if a sync operation is actually performed, and the result of the check is a panic, not just a printf. Reviewed by: cognet (who prevented me from donning a point hat)	2013-03-23 17:17:06 +00:00
Alexander Motin	359b47db97	Minimal timer period of 100us introduced in r244758 is overkill. While original 2us are indeed not enough, 3us are working quite well on my tests. To be more safe set minimal period to 5us and to be even more safe replicate here from HPET mechanism of rereading counter after programming comparator. This change allows to handle 30K of short nanosleep() calls per second on Raspberry Pi instead of just 8K before. Discussed with: gonzo	2013-03-21 15:42:41 +00:00
Aleksandr Rybalko	a2c472e741	Integrate Efika MX project back to home. Sponsored by: The FreeBSD Foundation	2013-03-20 15:39:27 +00:00
Konstantin Belousov	ee75e7de7b	Implement the concept of the unmapped VMIO buffers, i.e. buffers which do not map the b_pages pages into buffer_map KVA. The use of the unmapped buffers eliminate the need to perform TLB shootdown for mapping on the buffer creation and reuse, greatly reducing the amount of IPIs for shootdown on big-SMP machines and eliminating up to 25-30% of the system time on i/o intensive workloads. The unmapped buffer should be explicitely requested by the GB_UNMAPPED flag by the consumer. For unmapped buffer, no KVA reservation is performed at all. The consumer might request unmapped buffer which does have a KVA reserve, to manually map it without recursing into buffer cache and blocking, with the GB_KVAALLOC flag. When the mapped buffer is requested and unmapped buffer already exists, the cache performs an upgrade, possibly reusing the KVA reservation. Unmapped buffer is translated into unmapped bio in g_vfs_strategy(). Unmapped bio carry a pointer to the vm_page_t array, offset and length instead of the data pointer. The provider which processes the bio should explicitely specify a readiness to accept unmapped bio, otherwise g_down geom thread performs the transient upgrade of the bio request by mapping the pages into the new bio_transient_map KVA submap. The bio_transient_map submap claims up to 10% of the buffer map, and the total buffer_map + bio_transient_map KVA usage stays the same. Still, it could be manually tuned by kern.bio_transient_maxcnt tunable, in the units of the transient mappings. Eventually, the bio_transient_map could be removed after all geom classes and drivers can accept unmapped i/o requests. Unmapped support can be turned off by the vfs.unmapped_buf_allowed tunable, disabling which makes the buffer (or cluster) creation requests to ignore GB_UNMAPPED and GB_KVAALLOC flags. Unmapped buffers are only enabled by default on the architectures where pmap_copy_page() was implemented and tested. In the rework, filesystem metadata is not the subject to maxbufspace limit anymore. Since the metadata buffers are always mapped, the buffers still have to fit into the buffer map, which provides a reasonable (but practically unreachable) upper bound on it. The non-metadata buffer allocations, both mapped and unmapped, is accounted against maxbufspace, as before. Effectively, this means that the maxbufspace is forced on mapped and unmapped buffers separately. The pre-patch bufspace limiting code did not worked, because buffer_map fragmentation does not allow the limit to be reached. By Jeff Roberson request, the getnewbuf() function was split into smaller single-purpose functions. Sponsored by: The FreeBSD Foundation Discussed with: jeff (previous version) Tested by: pho, scottl (previous version), jhb, bf MFC after: 2 weeks	2013-03-19 14:13:12 +00:00
Aleksandr Rybalko	4117c1db9e	o Switch to use physical addresses in rman for FDT. o Remove vtophys used to translate virtual address to physical in case rman carry virtual. Sponsored by: The FreeBSD Foundation	2013-03-18 15:18:55 +00:00
Ian Lepore	b479b38c0a	Eliminate an intermediate buffer and some memcpy() operations, and do DMA directly to/from the buffers passed in from higher layer drivers. Reviewed by: gonzo	2013-03-17 16:31:09 +00:00
Ian Lepore	33ff10ea55	Add a macro that gets the physical address of a memory mapped device register from a bus space resource. Note that this macro is just for ARM, and is intended to have a short lifespan. The DMA engines in some SoCs need the physical address of a memory-mapped device register as one of the arguments for the transfer. Several scattered ad-hoc solutions have been converted to use this macro, which now also serves to mark the places where a more complete fix needs to be applied (after that fix has been designed).	2013-03-17 03:04:43 +00:00
Andrew Turner	5cc8d54ca0	Move the __aeabi_unwind_cpp_pr{0,1,2} functions to libkern so they can be referenced in a non-debug kernel.	2013-03-16 04:06:49 +00:00
Andrew Turner	f8cd79f38b	Implement the required but unused __aeabi_unwind_cpp_* functions in the trampoline kernel.	2013-03-16 03:50:27 +00:00
Andrew Turner	573447b6a5	Add an END macro to ARM. This is mostly used to tell gas where the bounds of the functions are when creating the EABI unwind tables.	2013-03-16 02:48:49 +00:00
Konstantin Belousov	e8a4a618cf	Add pmap function pmap_copy_pages(), which copies the content of the pages around, taking array of vm_page_t both for source and destination. Starting offsets and total transfer size are specified. The function implements optimal algorithm for copying using the platform-specific optimizations. For instance, on the architectures were the direct map is available, no transient mappings are created, for i386 the per-cpu ephemeral page frame is used. The code was typically borrowed from the pmap_copy_page() for the same architecture. Only i386/amd64, powerpc aim and arm/arm-v6 implementations were tested at the time of commit. High-level code, not committed yet to the tree, ensures that the use of the function is only allowed after explicit enablement. For sparc64, the existing code has known issues and a stab is added instead, to allow the kernel linking. Sponsored by: The FreeBSD Foundation Tested by: pho (i386, amd64), scottl (amd64), ian (arm and arm-v6) MFC after: 2 weeks	2013-03-14 20:18:12 +00:00
Gleb Smirnoff	41a7572b26	Functions m_getm2() and m_get2() have different order of arguments, and that can drive someone crazy. While m_get2() is young and not documented yet, change its order of arguments to match m_getm2(). Sorry for churn, but better now than later.	2013-03-12 13:42:47 +00:00
Gleb Smirnoff	9ccde34069	Use m_get2() to get an mbuf of appropriate size. Reviewed by: marius Sponsored by: Nginx, Inc.	2013-03-12 10:05:36 +00:00
Olivier Houchard	6aee0b4448	Don't use an empty struct.	2013-03-11 10:56:46 +00:00
Andrew Turner	0234dfbfdc	Fix a typo where db_printf was spelt printf.	2013-03-10 02:44:06 +00:00
Andrew Turner	d80f56e15e	Update how we read the stack pointer to work on both GCC and clang.	2013-03-10 02:40:50 +00:00
Andrew Turner	536daf817a	Tell the unwinder we can't unwind swi_entry. This fixes an infinite loop when the kernel attempts to unwind through this function. The .fnstart and .fnend in this function should be moved to macros but we are currently missing an END macro on ARM.	2013-03-10 02:38:35 +00:00
Andrew Turner	fb769e0f72	__FreeBSD_ARCH_armv6__ is undefined on clang. We can use __ARM_ARCH in it's place. This makes 'uname -p' correctly output 'armv6' on a kernel built with clang.	2013-03-09 23:55:23 +00:00
Attilio Rao	89f6b8632c	Switch the vm_object mutex to be a rwlock. This will enable in the future further optimizations where the vm_object lock will be held in read mode most of the time the page cache resident pool of pages are accessed for reading purposes. The change is mostly mechanical but few notes are reported: * The KPI changes as follow: - VM_OBJECT_LOCK() -> VM_OBJECT_WLOCK() - VM_OBJECT_TRYLOCK() -> VM_OBJECT_TRYWLOCK() - VM_OBJECT_UNLOCK() -> VM_OBJECT_WUNLOCK() - VM_OBJECT_LOCK_ASSERT(MA_OWNED) -> VM_OBJECT_ASSERT_WLOCKED() (in order to avoid visibility of implementation details) - The read-mode operations are added: VM_OBJECT_RLOCK(), VM_OBJECT_TRYRLOCK(), VM_OBJECT_RUNLOCK(), VM_OBJECT_ASSERT_RLOCKED(), VM_OBJECT_ASSERT_LOCKED() * The vm/vm_pager.h namespace pollution avoidance (forcing requiring sys/mutex.h in consumers directly to cater its inlining functions using VM_OBJECT_LOCK()) imposes that all the vm/vm_pager.h consumers now must include also sys/rwlock.h. * zfs requires a quite convoluted fix to include FreeBSD rwlocks into the compat layer because the name clash between FreeBSD and solaris versions must be avoided. At this purpose zfs redefines the vm_object locking functions directly, isolating the FreeBSD components in specific compat stubs. The KPI results heavilly broken by this commit. Thirdy part ports must be updated accordingly (I can think off-hand of VirtualBox, for example). Sponsored by: EMC / Isilon storage division Reviewed by: jeff Reviewed by: pjd (ZFS specific review) Discussed with: alc Tested by: pho	2013-03-09 02:32:23 +00:00
Tim Kientzle	08907adea3	This file is specific to arm11x6 processors, so tell the assembler it's okay to use arm11x6 instructions.	2013-03-08 03:29:05 +00:00
Andrew Turner	078996e049	Fix stack alignment in the kernel to be on an 8 byte boundary as required by AAPCS.	2013-03-06 06:19:56 +00:00
Oleksandr Tymoshenko	e9401a9e0e	- Reset DMA channel if error occured - Initialize info field in bcm_dma_reset Submitted by: Daisuke Aoyama <aoyama@peach.ne.jp>	2013-03-05 20:00:11 +00:00
Olivier Houchard	8fd49af627	If we're using a PIPT L2 cache, only merge 2 segments if both the virtual and the physical addreses are contiguous. Submitted by: Thomas Skibo <ThomasSkibo@sbcglobal.net>	2013-03-04 10:41:54 +00:00
Andrew Turner	e40f53aa44	Move some virtual memory constants to the top of the file where they are on other architectures [1]. While here: - Remove an unused and commented out include. - Add a comment describing the file that other copies have. - Fix the style of the defines and add a comment on what each one is. Suggested by: [1] alc	2013-03-02 05:02:29 +00:00
Andrew Turner	6f02c16b63	Build the Raspberry Pi dtb file when building the kernel so we can copy it to the boot partition for U-Boot.	2013-03-02 03:23:14 +00:00
Andrew Turner	61fc9468e0	Ensure the stack is correctly aligned before calling the first C function.	2013-03-02 02:19:04 +00:00
Andrew Turner	5f61931668	Increase the maximum text size on ARM to 64MiB. Without this clang would be sent a SIGABRT when it is loaded as it is too large. This is the smallest power of two MiB value that allows us to execute clang. While here wrap it in an #ifndef to be consistent with the other architectures. Submitted by: Daisuke Aoyama <aoyama at peach.ne.jp>	2013-03-01 21:59:23 +00:00
Alan Cox	1f9d53d893	Eliminate a redundant #include: machine/pmap.h is already included through vm/pmap.h.	2013-03-01 19:02:41 +00:00
Alan Cox	99c8999856	Copy the definition of VM_MAX_AUTOTUNE_MAXUSERS from i386. (See r242847.) Tested by: andrew	2013-03-01 08:30:31 +00:00
Ganbold Tsagaankhuu	341d7df3da	Enable uart driver for A10. Approved by: gonzo@	2013-03-01 01:47:11 +00:00
Oleksandr Tymoshenko	adc99a8aa6	Add platform DMA support to SDHCI driver for BCM2835 Submitted by: Daisuke Aoyama <aoyama at peach.ne.jp> Reviewed by: ian@	2013-02-28 19:51:30 +00:00
Oleksandr Tymoshenko	cdafe74e84	Add driver for BCM2835's DMA engine This is a version of code submitted by Daisuke Aoyama <aoyama at peach.ne.jp> with some architectural changes.	2013-02-28 19:48:19 +00:00

1 2 3 4 5 ...

1858 Commits