These were only intended for testing and should be replaced by a virtual blockdev that can be layered on top of any kind of bdev. Change-Id: I3ba2cc94630a6c6748d96e3401fee05aaabe20e0 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>
6.7 KiB
Getting Started Guide
The Intel(R) Storage Performance Development Kit iSCSI target application is named iscsi_tgt
.
This following section describes how to run iscsi from your cloned package.
Prerequisites
This guide starts by assuming that you can already build the standard SPDK distribution on your platform. The SPDK iSCSI target has been known to work on several Linux distributions, namely Ubuntu 14.04, 15.04, and 15.10, Fedora 21, 22, and 23, and CentOS 7.
Once built, the binary will be in app/iscsi_tgt
.
Configuring iSCSI Target
A iscsi_tgt
specific configuration file is used to configure the iSCSI target. A fully documented
example configuration file is located at etc/spdk/iscsi.conf.in
.
The configuration file is used to configure the SPDK iSCSI target. This file defines the following: TCP ports to use as iSCSI portals; general iSCSI parameters; initiator names and addresses to allow access to iSCSI target nodes; number and types of storage backends to export over iSCSI LUNs; iSCSI target node mappings between portal groups, initiator groups, and LUNs.
The SPDK iSCSI target supports several different types of storage backends. These backends will create SCSI LUNs which can be exported via iSCSI target nodes.
The storage backends can be configured in the iscsi.conf configuration file to specify the number or size of LUNs, block device names (for exporting in-kernel block devices), or other parameters.
Currently there are 3 types of storage backends supported by the iSCSI target:
Malloc
Configuration file syntax:
[Malloc]
NumberOfLuns 4
LunSizeInMB 64
Other TargetNode parameters go here (TargetName, Mapping, etc.):
[TargetNodeX]
LUN0 Malloc0
This exports a malloc'd target. The disk is a RAM disk that is a chunk of memory allocated by iscsi in user space. It will use offload engine to do the copy job instead of memcpy if the system has enough DMA channels.
Block Device
AIO devices are accessed using Linux AIO. O_DIRECT is used and thus unaligned writes will be double buffered. This option also bypasses the Linux page cache. This mode is probably as close to a typical kernel based target as a user space target can get without using a user-space driver.
Configuration file syntax:
[AIO]
#normal file or block device
AIO /dev/sdb
Other TargetNode parameters go here (TargetName, Mapping, etc.):
[TargetNodeX]
LUN0 AIO0
Ceph RBD
Ceph RBD devices are accessed via librbd and librados libraries to access the RADOS block device exported by Ceph.
Configuration file syntax:
[Ceph]
# The format of provided rbd info should be: Ceph rbd_pool_name rbd_name size.
# In the following example, rbd is the name of rbd_pool; foo is the name of
# rbd device exported by Ceph; value 512 represents the configured block size
# for this rbd, the block size should be a multiple of 512.
Ceph rbd foo 512
Other TargetNode parameters go here (TargetName, Mapping, etc.):
[TargetNodeX]
LUN0 Ceph0
NVMe
The SPDK NVMe driver by default binds to all NVMe controllers which are not bound to the kernel-mode nvme driver. Users can choose to bind to fewer controllers by setting the NumControllers parameter. Then the NVMe backend controls NVMe controller(s) directly from userspace and completely bypasses the kernel to avoid interrupts and context switching.
[Nvme]
# NVMe Device Whitelist
# Users may specify which NVMe devices to claim by their PCI
# domain, bus, device, and function. The format is dddd:bb:dd.f, which is
# the same format displayed by lspci or in /sys/bus/pci/devices. The second
# argument is a "name" for the device that can be anything. The name
# is referenced later in the Subsystem section.
#
# Alternatively, the user can specify ClaimAllDevices. All
# NVMe devices will be claimed.
BDF 0000:00:00.0
BDF 0000:01:00.0
# The number of attempts per I/O when an I/O fails. Do not include
# this key to get the default behavior.
NvmeRetryCount 4
# The maximum number of NVMe controllers to claim. Do not include this key to
# claim all of them.
NumControllers 2
[TargetNodeX]
# other TargetNode parameters go here (TargetName, Mapping, etc.)
# nvme with the following format: NvmeXnY, where X = the controller ID
# and Y = the namespace ID
# Note: NVMe namespace IDs always start at 1, not 0 - and most
# controllers have only 1 namespace.
LUN0 Nvme0n1
You should make a copy of the example configuration file, modify it to suit your environment, and then run the iscsi_tgt application and pass it the configuration file using the -c option. Right now, the target requires elevated privileges (root) to run.
app/iscsi_tgt/iscsi_tgt -c /path/to/iscsi.conf
Configuring iSCSI Initiator
The Linux initiator is open-iscsi.
Installing open-iscsi package Fedora:
yum install -y iscsi-initiator-utils
Ubuntu:
apt-get install -y open-iscsi
Setup
Edit /etc/iscsi/iscsid.conf
node.session.cmds_max = 4096
node.session.queue_depth = 128
iscsid must be restarted or receive SIGHUP for changes to take effect. To send SIGHUP, run:
killall -HUP iscsid
Recommended changes to /etc/sysctl.conf
net.ipv4.tcp_timestamps = 1
net.ipv4.tcp_sack = 0
net.ipv4.tcp_rmem = 10000000 10000000 10000000
net.ipv4.tcp_wmem = 10000000 10000000 10000000
net.ipv4.tcp_mem = 10000000 10000000 10000000
net.core.rmem_default = 524287
net.core.wmem_default = 524287
net.core.rmem_max = 524287
net.core.wmem_max = 524287
net.core.optmem_max = 524287
net.core.netdev_max_backlog = 300000
Discovery
Assume target is at 192.168.1.5
iscsiadm -m discovery -t sendtargets -p 192.168.1.5
Connect to target
iscsiadm -m node --login
At this point the iSCSI target should show up as SCSI disks. Check dmesg to see what they came up as.
Disconnect from target
iscsiadm -m node --logout
Deleting target node cache
iscsiadm -m node -o delete
This will cause the initiator to forget all previously discovered iSCSI target nodes.
Finding /dev/sdX nodes for iSCSI LUNs
iscsiadm -m session -P 3 | grep "Attached scsi disk" | awk '{print $4}'
This will show the /dev node name for each SCSI LUN in all logged in iSCSI sessions.
Tuning
After the targets are connected, they can be tuned. For example if /dev/sdc is an iSCSI disk then the following can be done: Set noop to scheduler
echo noop > /sys/block/sdc/queue/scheduler
Disable merging/coalescing (can be useful for precise workload measurements)
echo "2" > /sys/block/sdc/queue/nomerges
Increase requests for block queue
echo "1024" > /sys/block/sdc/queue/nr_requests