efccac8c7e
These were only intended for testing and should be replaced by a virtual blockdev that can be layered on top of any kind of bdev. Change-Id: I3ba2cc94630a6c6748d96e3401fee05aaabe20e0 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com>
241 lines
6.7 KiB
Markdown
241 lines
6.7 KiB
Markdown
# Getting Started Guide {#iscsi_getting_started}
|
|
|
|
The Intel(R) Storage Performance Development Kit iSCSI target application is named `iscsi_tgt`.
|
|
This following section describes how to run iscsi from your cloned package.
|
|
|
|
# Prerequisites {#iscsi_prereqs}
|
|
|
|
This guide starts by assuming that you can already build the standard SPDK distribution on your
|
|
platform. The SPDK iSCSI target has been known to work on several Linux distributions, namely
|
|
Ubuntu 14.04, 15.04, and 15.10, Fedora 21, 22, and 23, and CentOS 7.
|
|
|
|
Once built, the binary will be in `app/iscsi_tgt`.
|
|
|
|
# Configuring iSCSI Target {#iscsi_config}
|
|
|
|
A `iscsi_tgt` specific configuration file is used to configure the iSCSI target. A fully documented
|
|
example configuration file is located at `etc/spdk/iscsi.conf.in`.
|
|
|
|
The configuration file is used to configure the SPDK iSCSI target. This file defines the following:
|
|
TCP ports to use as iSCSI portals; general iSCSI parameters; initiator names and addresses to allow
|
|
access to iSCSI target nodes; number and types of storage backends to export over iSCSI LUNs; iSCSI
|
|
target node mappings between portal groups, initiator groups, and LUNs.
|
|
|
|
The SPDK iSCSI target supports several different types of storage backends. These backends will create
|
|
SCSI LUNs which can be exported via iSCSI target nodes.
|
|
|
|
The storage backends can be configured in the iscsi.conf configuration file to specify the number or
|
|
size of LUNs, block device names (for exporting in-kernel block devices), or other parameters.
|
|
|
|
Currently there are 3 types of storage backends supported by the iSCSI target:
|
|
|
|
## Malloc
|
|
|
|
Configuration file syntax:
|
|
~~~
|
|
[Malloc]
|
|
NumberOfLuns 4
|
|
LunSizeInMB 64
|
|
~~~
|
|
|
|
Other TargetNode parameters go here (TargetName, Mapping, etc.):
|
|
~~~
|
|
[TargetNodeX]
|
|
LUN0 Malloc0
|
|
~~~
|
|
|
|
This exports a malloc'd target. The disk is a RAM disk that is a chunk of memory allocated by iscsi in
|
|
user space. It will use offload engine to do the copy job instead of memcpy if the system has enough DMA
|
|
channels.
|
|
|
|
## Block Device
|
|
|
|
AIO devices are accessed using Linux AIO. O_DIRECT is used and thus unaligned writes will be double buffered.
|
|
This option also bypasses the Linux page cache. This mode is probably as close to a typical kernel based
|
|
target as a user space target can get without using a user-space driver.
|
|
|
|
Configuration file syntax:
|
|
|
|
~~~
|
|
[AIO]
|
|
#normal file or block device
|
|
AIO /dev/sdb
|
|
~~~
|
|
|
|
Other TargetNode parameters go here (TargetName, Mapping, etc.):
|
|
~~~
|
|
[TargetNodeX]
|
|
LUN0 AIO0
|
|
~~~
|
|
|
|
## Ceph RBD
|
|
|
|
Ceph RBD devices are accessed via librbd and librados libraries to access the RADOS block device
|
|
exported by Ceph.
|
|
|
|
Configuration file syntax:
|
|
|
|
~~~
|
|
[Ceph]
|
|
# The format of provided rbd info should be: Ceph rbd_pool_name rbd_name size.
|
|
# In the following example, rbd is the name of rbd_pool; foo is the name of
|
|
# rbd device exported by Ceph; value 512 represents the configured block size
|
|
# for this rbd, the block size should be a multiple of 512.
|
|
Ceph rbd foo 512
|
|
~~~
|
|
|
|
Other TargetNode parameters go here (TargetName, Mapping, etc.):
|
|
~~~
|
|
[TargetNodeX]
|
|
LUN0 Ceph0
|
|
~~~
|
|
|
|
## NVMe
|
|
|
|
The SPDK NVMe driver by default binds to all NVMe controllers which are not bound to the kernel-mode
|
|
nvme driver. Users can choose to bind to fewer controllers by setting the NumControllers parameter.
|
|
Then the NVMe backend controls NVMe controller(s) directly from userspace and completely bypasses
|
|
the kernel to avoid interrupts and context switching.
|
|
|
|
~~~
|
|
[Nvme]
|
|
# NVMe Device Whitelist
|
|
# Users may specify which NVMe devices to claim by their PCI
|
|
# domain, bus, device, and function. The format is dddd:bb:dd.f, which is
|
|
# the same format displayed by lspci or in /sys/bus/pci/devices. The second
|
|
# argument is a "name" for the device that can be anything. The name
|
|
# is referenced later in the Subsystem section.
|
|
#
|
|
# Alternatively, the user can specify ClaimAllDevices. All
|
|
# NVMe devices will be claimed.
|
|
BDF 0000:00:00.0
|
|
BDF 0000:01:00.0
|
|
|
|
# The number of attempts per I/O when an I/O fails. Do not include
|
|
# this key to get the default behavior.
|
|
NvmeRetryCount 4
|
|
# The maximum number of NVMe controllers to claim. Do not include this key to
|
|
# claim all of them.
|
|
NumControllers 2
|
|
|
|
[TargetNodeX]
|
|
# other TargetNode parameters go here (TargetName, Mapping, etc.)
|
|
# nvme with the following format: NvmeXnY, where X = the controller ID
|
|
# and Y = the namespace ID
|
|
# Note: NVMe namespace IDs always start at 1, not 0 - and most
|
|
# controllers have only 1 namespace.
|
|
LUN0 Nvme0n1
|
|
~~~
|
|
|
|
You should make a copy of the example configuration file, modify it to suit your environment, and
|
|
then run the iscsi_tgt application and pass it the configuration file using the -c option. Right now,
|
|
the target requires elevated privileges (root) to run.
|
|
|
|
~~~
|
|
app/iscsi_tgt/iscsi_tgt -c /path/to/iscsi.conf
|
|
~~~
|
|
|
|
# Configuring iSCSI Initiator {#iscsi_initiator}
|
|
|
|
The Linux initiator is open-iscsi.
|
|
|
|
Installing open-iscsi package
|
|
Fedora:
|
|
~~~
|
|
yum install -y iscsi-initiator-utils
|
|
~~~
|
|
|
|
Ubuntu:
|
|
~~~
|
|
apt-get install -y open-iscsi
|
|
~~~
|
|
|
|
## Setup
|
|
|
|
Edit /etc/iscsi/iscsid.conf
|
|
~~~
|
|
node.session.cmds_max = 4096
|
|
node.session.queue_depth = 128
|
|
~~~
|
|
|
|
iscsid must be restarted or receive SIGHUP for changes to take effect. To send SIGHUP, run:
|
|
~~~
|
|
killall -HUP iscsid
|
|
~~~
|
|
|
|
Recommended changes to /etc/sysctl.conf
|
|
~~~
|
|
net.ipv4.tcp_timestamps = 1
|
|
net.ipv4.tcp_sack = 0
|
|
|
|
net.ipv4.tcp_rmem = 10000000 10000000 10000000
|
|
net.ipv4.tcp_wmem = 10000000 10000000 10000000
|
|
net.ipv4.tcp_mem = 10000000 10000000 10000000
|
|
net.core.rmem_default = 524287
|
|
net.core.wmem_default = 524287
|
|
net.core.rmem_max = 524287
|
|
net.core.wmem_max = 524287
|
|
net.core.optmem_max = 524287
|
|
net.core.netdev_max_backlog = 300000
|
|
~~~
|
|
|
|
### Discovery
|
|
|
|
Assume target is at 192.168.1.5
|
|
~~~
|
|
iscsiadm -m discovery -t sendtargets -p 192.168.1.5
|
|
~~~
|
|
|
|
### Connect to target
|
|
|
|
~~~
|
|
iscsiadm -m node --login
|
|
~~~
|
|
|
|
At this point the iSCSI target should show up as SCSI disks. Check dmesg to see what
|
|
they came up as.
|
|
|
|
### Disconnect from target
|
|
|
|
~~~
|
|
iscsiadm -m node --logout
|
|
~~~
|
|
|
|
### Deleting target node cache
|
|
|
|
~~~
|
|
iscsiadm -m node -o delete
|
|
~~~
|
|
|
|
This will cause the initiator to forget all previously discovered iSCSI target nodes.
|
|
|
|
### Finding /dev/sdX nodes for iSCSI LUNs
|
|
|
|
~~~
|
|
iscsiadm -m session -P 3 | grep "Attached scsi disk" | awk '{print $4}'
|
|
~~~
|
|
|
|
This will show the /dev node name for each SCSI LUN in all logged in iSCSI sessions.
|
|
|
|
### Tuning
|
|
|
|
After the targets are connected, they can be tuned. For example if /dev/sdc is
|
|
an iSCSI disk then the following can be done:
|
|
Set noop to scheduler
|
|
|
|
~~~
|
|
echo noop > /sys/block/sdc/queue/scheduler
|
|
~~~
|
|
|
|
Disable merging/coalescing (can be useful for precise workload measurements)
|
|
|
|
~~~
|
|
echo "2" > /sys/block/sdc/queue/nomerges
|
|
~~~
|
|
|
|
Increase requests for block queue
|
|
|
|
~~~
|
|
echo "1024" > /sys/block/sdc/queue/nr_requests
|
|
~~~
|