This change was originally intended to account for test kthreads under
the nvmecontrol process, but jhb indicated it may not be safe to
associate kthreads with userland processes and that doing so could
have unintended consequences.
I did not observe any problems with this change, but my testing didn't
exhaust the kinds of corner cases that could cause problems. It is not
that important to account for these test threads under nvmecontrol, so I
am just reverting this change for now.
On a related note, the portion of this patch intended for <= 7.x fails
to compile, so the revert fixes that as well.
Suggested by: jhb
current CPU and not always CPU 0.
This has the added benefit of significantly reducing contention on
CPU 0's callout_cpu spinlock.
Sponsored by: Intel
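
A minimal sketch of the pattern involved, assuming a per-qpair timer
(the function and structure names here are illustrative, not the
driver's actual symbols): callout_reset() keeps firing a callout on
the CPU it was last scheduled on (historically CPU 0), while
callout_reset_curcpu() queues it on the caller's CPU.

    #include <sys/param.h>
    #include <sys/systm.h>
    #include <sys/kernel.h>
    #include <sys/callout.h>

    static void
    nvme_qpair_timer_fn(void *arg)
    {
            /* ... periodic per-qpair work ... */
    }

    static void
    nvme_qpair_arm_timer(struct callout *timer, void *qpair)
    {
            /*
             * Before: callout_reset() left every timer on CPU 0's
             * callout wheel, funneling all arm/disarm operations
             * through CPU 0's callout_cpu spinlock.
             *
             * After: queue the callout on the CPU doing the arming.
             */
            callout_reset_curcpu(timer, hz, nvme_qpair_timer_fn, qpair);
    }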
This eliminates the need to manage queue depth at the nvd(4) level for
Chatham prototype board workarounds, and also adds the ability to
accept a number of requests on a single qpair that is much larger
than the number of trackers allocated.
Sponsored by: Intel
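
A rough sketch of the submission path this enables (structure layout
and names are illustrative, not the committed code): when every
tracker is in flight, the request is parked on a queue inside the
qpair instead of being rejected or throttled by the consumer.

    #include <sys/param.h>
    #include <sys/systm.h>
    #include <sys/queue.h>
    #include <sys/lock.h>
    #include <sys/mutex.h>

    struct nvme_request {
            STAILQ_ENTRY(nvme_request)      stailq;
            /* ... command, payload, completion callback ... */
    };

    struct nvme_tracker {
            TAILQ_ENTRY(nvme_tracker)       tailq;
            struct nvme_request             *req;
    };

    struct nvme_qpair {
            struct mtx                      lock;
            TAILQ_HEAD(, nvme_tracker)      free_tr;
            STAILQ_HEAD(, nvme_request)     queued_req;
    };

    static void
    nvme_qpair_submit_request(struct nvme_qpair *qpair,
        struct nvme_request *req)
    {
            struct nvme_tracker *tr;

            mtx_lock(&qpair->lock);
            tr = TAILQ_FIRST(&qpair->free_tr);
            if (tr == NULL) {
                    /* All trackers busy: defer instead of failing. */
                    STAILQ_INSERT_TAIL(&qpair->queued_req, req, stailq);
            } else {
                    TAILQ_REMOVE(&qpair->free_tr, tr, tailq);
                    tr->req = req;
                    /* ... copy command into the submission queue,
                     * ring the doorbell ... */
            }
            mtx_unlock(&qpair->lock);
    }

Completion processing would then pull the next entry off queued_req as
trackers free up, which is what lets a qpair accept far more requests
than it has trackers.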
nvme_ctrlr_submit_io_request().
While here, also fix the case where a uio has more than one iovec.
NVMe's definition of SGEs (called PRPs) only allows for the first SGE to
start on a non-page boundary. The simplest way to handle this is to
construct a temporary uio for each iovec, and submit an NVMe request
for each.
Sponsored by: Intel
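
An illustrative sketch of the per-iovec split described above (the
helper names are assumptions, not the driver's actual symbols): since
only the first PRP may start off a page boundary, a uio whose second
or later iovec starts mid-page cannot be expressed as one request, so
each iovec gets its own single-entry uio and its own NVMe request.

    #include <sys/param.h>
    #include <sys/uio.h>

    struct nvme_namespace;

    /* Hypothetical helper: submits exactly one NVMe request for a uio
     * whose iovecs satisfy the PRP alignment rules. */
    static void nvme_ns_submit_one(struct nvme_namespace *ns,
        struct uio *uio);

    static void
    nvme_ns_split_uio(struct nvme_namespace *ns, struct uio *uio)
    {
            struct uio      tmp_uio;
            int             i;

            tmp_uio = *uio;
            tmp_uio.uio_iovcnt = 1;

            for (i = 0; i < uio->uio_iovcnt; i++) {
                    tmp_uio.uio_iov = &uio->uio_iov[i];
                    tmp_uio.uio_resid = uio->uio_iov[i].iov_len;
                    /* Each iovec becomes its own request, so each may
                     * legally start at a non-page-aligned address. */
                    nvme_ns_submit_one(ns, &tmp_uio);
                    tmp_uio.uio_offset += uio->uio_iov[i].iov_len;
            }
    }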
from an NVMe consumer.
This allows us to mostly build NVMe command buffers without holding the
qpair lock, and also allows for future queueing of nvme_request objects
in cases where the submission queue is full and no nvme_tracker objects
are available.
Sponsored by: Intel
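
A minimal sketch of the split this enables, using malloc(9) for
brevity (the field names and callback type are simplified
assumptions): everything about the request except tracker assignment
is prepared with no locks held, so the qpair lock only covers the
hand-off to hardware.

    #include <sys/param.h>
    #include <sys/kernel.h>
    #include <sys/malloc.h>
    #include <sys/systm.h>

    MALLOC_DEFINE(M_NVME_REQ, "nvme_req", "illustrative nvme requests");

    typedef void (*nvme_cb_fn_t)(void *arg);

    struct nvme_request {
            void            *payload;
            uint32_t         payload_size;
            nvme_cb_fn_t     cb_fn;
            void            *cb_arg;
    };

    struct nvme_request *
    nvme_allocate_request(void *payload, uint32_t payload_size,
        nvme_cb_fn_t cb_fn, void *cb_arg)
    {
            struct nvme_request *req;

            /* No qpair lock held: allocation and command construction
             * happen entirely outside the submission critical section. */
            req = malloc(sizeof(*req), M_NVME_REQ, M_NOWAIT | M_ZERO);
            if (req == NULL)
                    return (NULL);
            req->payload = payload;
            req->payload_size = payload_size;
            req->cb_fn = cb_fn;
            req->cb_arg = cb_arg;
            return (req);
    }

The unlocked request can then be handed to the locked submission path,
or queued if no tracker is available.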
This simplifies the driver significantly where it is constructing
commands to be submitted to hardware. Reducing the number of
PRPs (NVMe parlance for SGEs) from 128 to 32 ensures we do not
allocate too much memory for the more common smaller I/O sizes, while
still supporting I/O sizes up to 128KB.
This also paves the way for pre-allocation of nvme_tracker objects
for each queue which will simplify the I/O path even further.
Sponsored by: Intel
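
The arithmetic behind the numbers above, assuming the 4KB page size of
amd64/i386 (the macro names are illustrative): each PRP entry maps one
page, so 32 entries are exactly enough to cover a 128KB transfer, and
the old 128-entry limit reserved four times that per-tracker memory
without benefiting the common, smaller I/Os.

    #include <sys/param.h>

    #define NVME_MAX_XFER_SIZE      (128 * 1024)    /* 128KB maximum I/O */

    /* One PRP entry per page: 128KB / 4KB = 32 entries. */
    #define NVME_MAX_PRP_ENTRIES    (NVME_MAX_XFER_SIZE / PAGE_SIZE)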
This fixes an issue in nvmecontrol(8), where clang throws a cast-align
warning when a __packed structure pointer is cast to a uint32_t
pointer as part of printing raw hex output.
Reported by: dhw
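
A sketch of the usual shape of such a fix (not necessarily the exact
committed change): dereferencing a uint32_t * produced by casting a
__packed structure pointer assumes 4-byte alignment the structure may
not have, so copying each word through memcpy() into an aligned local
silences clang and is safe on strict-alignment machines.

    #include <inttypes.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    static void
    print_hex(const void *data, size_t length)
    {
            const uint8_t   *p = data;
            uint32_t         word;
            size_t           i;

            for (i = 0; i + sizeof(word) <= length; i += sizeof(word)) {
                    /* memcpy() makes no alignment assumptions, unlike
                     * *(const uint32_t *)(p + i). */
                    memcpy(&word, p + i, sizeof(word));
                    printf("%08" PRIx32 " ", word);
            }
            printf("\n");
    }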
support to FreeBSD. A full description of the overall functionality
being added is below. nvmexpress.org defines NVM Express as "an optimized
register interface, command set and feature set for PCI Express (PCIe)-based
Solid-State Drives (SSDs)."
This commit adds nvme(4) and nvd(4) driver source code and Makefiles
to the tree.
Full NVMe functionality description:
Add nvme(4) and nvd(4) drivers and nvmecontrol(8) for NVM Express (NVMe)
device support.
There will continue to be ongoing work on NVM Express support, but there
is more than enough here to allow for evaluation of pre-production NVM
Express devices and to solicit feedback. Questions and feedback are welcome.
nvme(4) implements NVMe hardware abstraction and is a provider of NVMe
namespaces. The closest equivalent of an NVMe namespace is a SCSI LUN.
nvd(4) is an NVMe consumer, surfacing NVMe namespaces as GEOM disks.
nvmecontrol(8) is used for NVMe configuration and management.
The following are currently supported:
nvme(4)
- full mandatory NVM command set support
- per-CPU IO queues (enabled by default but configurable)
- per-queue sysctls for statistics and full command/completion queue
dumps for debugging
- registration API for NVMe namespace consumers (sketched after these lists)
- I/O error handling (except for timeouts; see below)
- compilation switches for support back to stable-7
nvd(4)
- BIO_DELETE and BIO_FLUSH (if supported by controller)
- proper BIO_ORDERED handling
nvmecontrol(8)
- devlist: list NVMe controllers and their namespaces
- identify: display controller or namespace identify data in
human-readable or hex format
- perftest: quick and dirty performance test to measure raw
performance of an NVMe device without userspace/physio/GEOM
overhead
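
A rough sketch of how the namespace consumer registration mentioned
above fits together (the prototypes are an approximation for
illustration, not a documented stable API): a consumer such as nvd(4)
registers a callback, and nvme(4) invokes it for each namespace it
discovers.

    struct nvme_namespace;
    struct nvme_consumer;

    typedef void (*nvme_consumer_cb_fn_t)(void *cb_arg,
        struct nvme_namespace *ns);

    struct nvme_consumer *nvme_register_consumer(nvme_consumer_cb_fn_t cb_fn,
        void *cb_arg);
    void nvme_unregister_consumer(struct nvme_consumer *cons);

    /* nvd(4)-style usage: surface each namespace as a GEOM disk. */
    static struct nvme_consumer *nvd_consumer;

    static void
    nvd_new_ns(void *cb_arg, struct nvme_namespace *ns)
    {
            /* ... allocate and fill a struct disk, call disk_create() ... */
    }

    static void
    nvd_load(void *arg)
    {
            nvd_consumer = nvme_register_consumer(nvd_new_ns, NULL);
    }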
The following are still work in progress and will be completed over the
next 3-6 months in rough priority order:
- complete man pages
- firmware download and activation
- asynchronous error requests
- command timeout error handling
- controller resets
- nvmecontrol(8) log page retrieval
This has been primarily tested on amd64, with light testing on i386. I
would be happy to provide assistance to anyone interested in porting
this to other architectures, but am not currently planning to do this
work myself. Big-endian byte ordering and dmamap syncing for the
command/completion queues are the main areas that would need to be
addressed.
The nvme(4) driver currently has references to Chatham, which is an
Intel-developed prototype board which is not fully spec compliant.
These references will all be removed over time.
Sponsored by: Intel
Contributions from: Joe Golio/EMC <joseph dot golio at emc dot com>