numam-spdk

Author	SHA1	Message	Date
Changpeng Liu	f903336eed	nvmf: fix the wrong cleanup logic in the RDMA initialization process Print the error information when the kernel RNIC driver did not load properly, and fix the cleanup logic for the exceptional exit. Change-Id: I97a45e73d830280b994818f3defc491bc2b6b020 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>	2016-11-02 10:30:39 -07:00
Daniel Verkamp	1290f02f78	nvmf/rdma: add event_channel NULL check in fini If the RDMA transport failed to initialize, g_rdma.event_channel may be NULL. Change-Id: I4510ee5893389f244f0fbaa1cd4a182868939b25 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-10-31 10:15:36 -07:00
Ben Walker	20dc7f7d19	nvmf: For iWARP, register buffers with IBV_ACCESS_REMOTE_WRITE For iWARP devices, buffers that are intended to be the target of an RDMA read initiated by the target must additionally have IBV_ACCESS_REMOTE_WRITE permission. This is because iWARP's RDMA read path essentially requests the remote side to do an RDMA write. This is unfortunate because there is no way to differentiate between memory that the remote side can do an RDMA write to and memory that will only be the target of RDMA reads initiated by the target. There is nothing we can do about this serious deficiency in the specification, however, so we have to live with it. Change-Id: I3d2f2814ce0cb1df4e5347296ef371db4d16be21 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-31 08:58:40 -07:00
Ben Walker	f64690d49a	nvmf: Handle NICs that don't report RDMA initiator_depth I believe this is required for NICs to report, but handle the case where it isn't reported. Change-Id: I38d10c3590d1df8bb902ab312af0f9e01b9e5032 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-21 09:09:06 -07:00
Ben Walker	a1a085cf6b	nvmf: Transports now allocate sessions This makes it consistent with the way connections and requests work. Change-Id: Ifb97499ba72f7dfd02ac54ba1b622726d266262c Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-21 09:09:05 -07:00
Ben Walker	8b487155fa	nvmf: Fail when sessions span RNICs The shared memory pool for a session is associated with a particular RNIC via the protection domain. New connections attempting to join a session that came in on a different RNIC can't use that memory, so must be rejected. Change-Id: Ibd79fe90566a231f76b7472e5e9b484c3e528454 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-21 09:09:05 -07:00
Ben Walker	349295caf2	nvmf: Rearrange rdma file to match callback order Rearrange the functions in rdma.c to match the order of the function pointers in the transport. No other code changes. Change-Id: I9dbc68912ecd5dfdf53f20b4807d4116933a3c3a Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-21 09:09:05 -07:00
Ben Walker	b0c1149953	nvmf: Replace rdma_reg_* with ibv_reg_mr Use the lower level registration functions. The RDMA-CM examples use the ibv_* versions, so who knows if the rdma_reg_* wrappers are even well tested. Change-Id: I8e8250ab09a1401e636aebe2fc04a60806f7a827 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-21 09:09:05 -07:00
Ziye Yang	379ebca018	nvmf,rdma: only call spdk_nvmf_rdma_conn_destroy to free rdma_conn Previously, we mixed use free and spdk_nvmf_rdma_conn_destroy to free allocated spdk_nvmf_rdma_conn structure, which sounds not exactly free all the resources. Change-Id: I2917b442c34d63ba5c014add58f429ae4b831595 Signed-off-by: Ziye Yang <ziye.yang@intel.com>	2016-10-20 16:24:38 -07:00
Ben Walker	769468fc52	nvmf: Allocate RDMA wr on heap instead of stack. The RDMA API doesn't say whether the wr is copied, so be safe and allocate it on the heap. Change-Id: I091af50aa031e1861333f19d864eb52335d6b756 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-20 16:22:58 -07:00
Ben Walker	cfafcc3edd	nvmf: Init transport before adding listeners Change-Id: I1fbaca75c6b95127ff2b6ac7c0ea0f0a918b9405 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-20 16:22:58 -07:00
Ben Walker	1e6ffa0394	nvmf: Add a public API header This changes as little code as possible while still creating a single public API header. This enables future clean up of the public API and clarification of the exposed concepts. Change-Id: I780e7a5a9afd27acf0276516bd71b896ad301c50 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-14 15:26:31 -07:00
Ben Walker	18a2cc11c1	env: Remove unused DPDK headers. Remove #includes for all DPDK headers that weren't necessary. Change-Id: Ib02522e0f04e64a1c98afceb7508cc0e8d931a9d Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-12 09:53:32 -07:00
Ben Walker	0aa2986475	Replace rte_get_tsc calls with spdk_get_ticks Change-Id: I809b900321433693ff9f2498183ad0dcdbb15030 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-11 13:34:09 -07:00
Ben Walker	2224ff2162	env: Replace rte_malloc with spdk_zmalloc Use the env library to perform all memory allocations that previously called DPDK directly. Change-Id: I6d33e85bde99796e0c85277d6d4880521c34f10d Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-11 13:34:09 -07:00
Ben Walker	d4652fd37f	nvmf: Share completion queue channels All completion queues for the same listen address now share a common completion queue channel. Change-Id: I42c149fe7e221951e8a3826b1713482c37a265b8 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-07 09:15:11 -07:00
Ben Walker	a0a92ff4c2	nvmf: Combine acceptor_init/fini with transport init/fini These 4 callbacks can be condensed into two callbacks, which simplifies the API. Change-Id: I069da00de34b252753cdc8961439e13a75d1cc68 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-10-07 09:14:53 -07:00
Ben Walker	a17ad921e2	Replace RTE_VERIFY with assert We already require the assert header from the C standard library, so use that instead of RTE_VERIFY to further isolate DPDK dependencies. Change-Id: I4a718af858c88aff6080e33e6c3dd533c077b8f4 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-09-28 10:15:56 -07:00
Ziye Yang	b1e3279ae0	nvmf: change nvmf_session to spdk_nvmf_session Make the structure definition consistent Change-Id: If147066cee041a2a357bc7efcb1f882f914e5b42 Signed-off-by: Ziye Yang <ziye.yang@intel.com>	2016-09-27 09:57:17 -07:00
Ben Walker	f3d90c9165	nvmf: Reap entire batches of completions for RDMA at once Instead of polling for only 1 completion at at time, poll for batches of 32. Change-Id: I5ef99a270489e7b3d2a58cb765915f187775a93e Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-09-23 10:08:10 -07:00
Ben Walker	4bf7878521	nvmf: Combine nvmf recv and send cqs ibv_poll_cq is actually an expensive call to make, so take steps to begin to minimize the number of times it is called. Change-Id: I6fc64979604220eb8cacd612b46e3a3b1bca0924 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-09-22 09:11:38 -07:00
Cunyin Chang	b3ca958328	nvmf: Make the function spdk_nvmf_rdma_acceptor_init() reentrant. Change-Id: Ie715affe7e451fc517c5f57f1597d0bc70f69947 Signed-off-by: Cunyin Chang <cunyin.chang@intel.com>	2016-09-13 09:49:25 -07:00
Roland Dreier	4c0aab3f2a	nvmf: Correct log messages when polling recv CQ Fix copy-and-paste errors - when polling the recv CQ, we should print "Recv" instead of "Send" in log messages. Signed-off-by: Roland Dreier <roland@purestorage.com>	2016-09-13 08:26:31 -07:00
Ben Walker	af8b5732d3	nvmf: Optimize nvmf_ibv_send_wr_init This function always shows up as one of the hottest functions when profiling. I believe it is the memset that is expensive, so instead use default initialization when the wr is declared on the stack and just set the members that need to be updated in the function. Also make the function inline for good measure. Change-Id: I29e24cdd375311fa033b5a6df772ff4f73e35302 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-09-12 15:33:43 -07:00
Daniel Verkamp	bc0867dcaf	nvmf: move subsytem poller to nvmf_tgt app The application is now entirely responsible for scheduling subsystem pollers and sending events between threads. Change-Id: I88da1f53b5e8852c7c4acd6f0a7a1e2219fbed41 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-09-12 10:10:27 -07:00
Ziye Yang	1c5093358e	nvmf,rdma: update spdk_nvmf_rdma_conn_destroy Merge two if judge statements into one. Change-Id: I72f7e4a936bc0b628c06a9ebb7c816150805f495 Signed-off-by: Ziye Yang <ziye.yang@intel.com>	2016-09-06 13:19:02 -07:00
Ziye Yang	1675e153fb	nvmf: Update get_rdma_conn and get_rdma_req The "+" is not correct, should be "-". Currently, the issue doest not happen since the offset is 0, then both + and - is OK. But if we adjust the location of spdk_nvmf_conn or spdk_nvmf_request, we can find this bug. Change-Id: Ib358dc729da901a69442d0402a6089989f49b05c Signed-off-by: Ziye Yang <ziye.yang@intel.com>	2016-09-06 13:18:26 -07:00
Daniel Verkamp	39cda18718	nvmf: rename trsvc -> trsvcid for spec consistency Change-Id: I5f73ef20ba231a7b1721562964dfd50de9b2c735 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-23 09:03:28 -07:00
Daniel Verkamp	37402f490e	nvmf: pad discovery TRSVCID and TRADDR with spaces These are specified as "ASCII string", which means they should be left-aligned and padded with spaces, according to the NVMe base specification. Change-Id: I25babe0ca417c2e16137b0bfc41fc7834277114e Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-22 12:58:59 -07:00
Changpeng Liu	2641c31af2	nvmf: Listen for incoming connections only on addresses specified Currently the NVMf target listens for new connections on any address. Instead, listen only on the addresses specified by the user. Change-Id: Idb6d37c422e442fc70a8673bd3fcfb9c27b57828 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>	2016-08-19 15:34:01 -07:00
Daniel Verkamp	4c6e4d4963	nvmf: move acceptor poller into nvmf_tgt app The NVMe over Fabrics target library now exposes a simple function call that polls the acceptor once, and the application handles registration of the poller. Also rename the transport function pointers related to the acceptor so they better reflect their purpose. Change-Id: I5fa0d516586bf17e73afeb88ff3c2d5b0d46794d Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-17 10:01:37 -07:00
Daniel Verkamp	47dde07521	nvmf: allow target to start without RDMA devices This will become more important when other transports are added. For now, it is also useful to be able to start nvmf_tgt on systems without RDMA hardware. Change-Id: I6b9002cc7711f928c4e6b73adcd9b677349ebdd6 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-17 10:01:37 -07:00
Changpeng Liu	8a23223e1b	nvmf: Allow users to configure which lcore each subsystem runs on Users can specify the core for each subsystem and the acceptor listen routine to run on different cores for performance consideration. Change-Id: I4bd1a96f39194c870863b4b778e6ea7cf8fc1a2d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>	2016-08-16 09:20:42 -07:00
Daniel Verkamp	5d8c94536a	event: hide struct spdk_poller internals This leaves more flexibility for future changes to the poller representation without requiring API changes (after this one). It also prevents the user from accidentally using poller fields in a non-thread-safe way, since they can't be accessed directly anymore. Change-Id: I7677d5b93668665d29ae39c5e0ba74333ad3f878 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-12 10:42:12 -07:00
Daniel Verkamp	184984603f	nvmf: use Connect command SQSIZE to manage SQHD The NVMe submission queue head wraparound point can be determined in the generic NVMe over Fabrics layer; it should not be using the RDMA connection queue depth. Change-Id: I9da8f09e4f057f8fdc1ff4c6cc5f48cea7123e11 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-08 08:45:38 -07:00
Daniel Verkamp	be0dae64be	nvmf: simplify spdk_nvmf_rdma_conn_create() Use the max_queue_depth parameter rather than rdma_conn->max_queue_depth so that we can start to eliminate rdma_conn->max_queue_depth. Change-Id: I1670c634e6d12aa004fb5a10338b7624850fbc4a Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-08 08:45:38 -07:00
Daniel Verkamp	60a91f33cc	nvmf: add messages for session_init failure cases Change-Id: I079df9a15ac023d160b842676aab162393f6da80 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-05 08:51:02 -07:00
Daniel Verkamp	5f6f676cbf	nvmf: fix rdma_reg_msgs size argument The large buffer pool allocation was using the per-connection queue depth, whereas the RDMA memory region registration was using the global RDMA max queue depth. These sizes need to match, so use the global RDMA max queue depth for both calls. Change-Id: Iae161b719e09e19ca3e81df6593b68a4a2e86614 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-05 08:51:02 -07:00
Daniel Verkamp	e7a6744a88	nvmf: switch RDMA acceptor to spdk_poller Use the new timer-based poller functionality to replace rte_timer. Change-Id: Ic40653306cc73b40139fe18e06bab29b35721a43 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>	2016-08-03 09:00:54 -07:00
Ben Walker	f1a584a9f7	nvmf: Use a shared memory pool for large data buffers. Change-Id: Iab66335cee2a1e6c1774edd34978735be6763ce1 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-29 15:36:10 -07:00
Ben Walker	caf8860900	nvmf: Allow higher queue depths Allow higher queue depths by allowing many more send/recv operations than read/write. Change-Id: I66c424a6463e5e09be6d5463667241ce9271404b Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 13:55:43 -07:00
Ben Walker	eee64c69f7	nvmf: Re-post the capsule immediately upon sending a completion The target can only provide updates to sq_head inside of completions. Therefore, we must update sq_head prior to sending the completion or we'll incorrectly get into queue full scenarios. Change-Id: If2925d39570bbc247801219f352e690d33132a2d Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 10:27:31 -07:00
Ben Walker	7e23841d28	nvmf: Separate the send and recv completion queues. This allows the target to poll for internal completions at higher priority. Change-Id: I895c33a594a7d7c0545aa3a8405a296be3c106fb Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 09:00:48 -07:00
Ben Walker	04beb5661e	nvmf: Send completions after RDMA write has completed. This ensures that the data buffers are not in use when we go to send the completion. Change-Id: I30467b3e3964001150f81b21e5b695dcd0974b0c Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 09:00:48 -07:00
Ben Walker	d7b8da3b81	nvmf: Add a transport specific session This is useful for holding session-wide buffer pools. Change-Id: I7024da24b210a2205bf1e159d5935e0093b81120 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 09:00:47 -07:00
Ben Walker	52a4a388fb	nvmf: Make RDMA WRITE operations signalled Change-Id: Iad9e216144d88c899b52220ae9b32c24e3cbb252 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-28 09:00:47 -07:00
Ben Walker	8a701c3f8d	nvmf: Use the inline SGL for keyed SGLs if the size is small enough For small SGLs, even if they are keyed and not inline, use the buffer we allocated for inline data. Change-Id: I5051c43aabacb20a4247b2feaf2af801dba5f5a9 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-26 15:24:01 -07:00
Ben Walker	ca0c13387a	nvmf: Calculate queue_depth separately from rw_depth Read/Write depth is much lower than Send/Recv depth. Calculate them separately to prepare for supporting a larger number of receives than read/writes. Currently, the target still only exposes a queue depth equal to the read/write depth. Change-Id: I08a7434d4ace8d696ae7e1eee241047004de7cc5 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-26 14:24:41 -07:00
Ben Walker	756df04485	nvmf: Remove g_nvmf_tgt global usage from transport layer Change-Id: Id788312f597abf6ea937beb7d1d1bd5a168ae0f0 Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-26 14:24:39 -07:00
Ben Walker	296add8bb1	nvmf: Add config options for inline and max I/O size These don't actually work quite yet, but pipe the configuration file data through to where it will be needed. Change-Id: I95512d718d45b936fa85c03c0b80689ce3c866bc Signed-off-by: Ben Walker <benjamin.walker@intel.com>	2016-07-26 14:23:25 -07:00

1 2 3

149 Commits