218c4e68c1
Rather than using linuxapp and bsdapp everywhere, we can change things to use the, more readable, terms "linux" and "freebsd" in our build configs. Rather than renaming the configs we can just duplicate the existing ones with the new names using symlinks, and use the new names exclusively internally. ["make showconfigs" also only shows the new names to keep the list short] The result is that backward compatibility is kept fully but any new builds or development can be done using the newer names, i.e. both "make config T=x86_64-native-linuxapp-gcc" and "T=x86_64-native-linux-gcc" work. Signed-off-by: Bruce Richardson <bruce.richardson@intel.com>
145 lines
4.7 KiB
ReStructuredText
145 lines
4.7 KiB
ReStructuredText
.. SPDX-License-Identifier: BSD-3-Clause
|
|
Copyright(c) 2015-2016 Intel Corporation.
|
|
|
|
Keep Alive Sample Application
|
|
=============================
|
|
|
|
The Keep Alive application is a simple example of a
|
|
heartbeat/watchdog for packet processing cores. It demonstrates how
|
|
to detect 'failed' DPDK cores and notify a fault management entity
|
|
of this failure. Its purpose is to ensure the failure of the core
|
|
does not result in a fault that is not detectable by a management
|
|
entity.
|
|
|
|
|
|
Overview
|
|
--------
|
|
|
|
The application demonstrates how to protect against 'silent outages'
|
|
on packet processing cores. A Keep Alive Monitor Agent Core (master)
|
|
monitors the state of packet processing cores (worker cores) by
|
|
dispatching pings at a regular time interval (default is 5ms) and
|
|
monitoring the state of the cores. Cores states are: Alive, MIA, Dead
|
|
or Buried. MIA indicates a missed ping, and Dead indicates two missed
|
|
pings within the specified time interval. When a core is Dead, a
|
|
callback function is invoked to restart the packet processing core;
|
|
A real life application might use this callback function to notify a
|
|
higher level fault management entity of the core failure in order to
|
|
take the appropriate corrective action.
|
|
|
|
Note: Only the worker cores are monitored. A local (on the host) mechanism
|
|
or agent to supervise the Keep Alive Monitor Agent Core DPDK core is required
|
|
to detect its failure.
|
|
|
|
Note: This application is based on the :doc:`l2_forward_real_virtual`. As
|
|
such, the initialization and run-time paths are very similar to those
|
|
of the L2 forwarding application.
|
|
|
|
Compiling the Application
|
|
-------------------------
|
|
|
|
To compile the sample application see :doc:`compiling`.
|
|
|
|
The application is located in the ``l2fwd_keep_alive`` sub-directory.
|
|
|
|
Running the Application
|
|
-----------------------
|
|
|
|
The application has a number of command line options:
|
|
|
|
.. code-block:: console
|
|
|
|
./build/l2fwd-keepalive [EAL options] \
|
|
-- -p PORTMASK [-q NQ] [-K PERIOD] [-T PERIOD]
|
|
|
|
where,
|
|
|
|
* ``p PORTMASK``: A hexadecimal bitmask of the ports to configure
|
|
|
|
* ``q NQ``: A number of queues (=ports) per lcore (default is 1)
|
|
|
|
* ``K PERIOD``: Heartbeat check period in ms(5ms default; 86400 max)
|
|
|
|
* ``T PERIOD``: statistics will be refreshed each PERIOD seconds (0 to
|
|
disable, 10 default, 86400 maximum).
|
|
|
|
To run the application in linux environment with 4 lcores, 16 ports
|
|
8 RX queues per lcore and a ping interval of 10ms, issue the command:
|
|
|
|
.. code-block:: console
|
|
|
|
./build/l2fwd-keepalive -l 0-3 -n 4 -- -q 8 -p ffff -K 10
|
|
|
|
Refer to the *DPDK Getting Started Guide* for general information on
|
|
running applications and the Environment Abstraction Layer (EAL)
|
|
options.
|
|
|
|
|
|
Explanation
|
|
-----------
|
|
|
|
The following sections provide some explanation of the The
|
|
Keep-Alive/'Liveliness' conceptual scheme. As mentioned in the
|
|
overview section, the initialization and run-time paths are very
|
|
similar to those of the :doc:`l2_forward_real_virtual`.
|
|
|
|
The Keep-Alive/'Liveliness' conceptual scheme:
|
|
|
|
* A Keep- Alive Agent Runs every N Milliseconds.
|
|
|
|
* DPDK Cores respond to the keep-alive agent.
|
|
|
|
* If keep-alive agent detects time-outs, it notifies the
|
|
fault management entity through a callback function.
|
|
|
|
The following sections provide some explanation of the code aspects
|
|
that are specific to the Keep Alive sample application.
|
|
|
|
The keepalive functionality is initialized with a struct
|
|
rte_keepalive and the callback function to invoke in the
|
|
case of a timeout.
|
|
|
|
.. code-block:: c
|
|
|
|
rte_global_keepalive_info = rte_keepalive_create(&dead_core, NULL);
|
|
if (rte_global_keepalive_info == NULL)
|
|
rte_exit(EXIT_FAILURE, "keepalive_create() failed");
|
|
|
|
The function that issues the pings keepalive_dispatch_pings()
|
|
is configured to run every check_period milliseconds.
|
|
|
|
.. code-block:: c
|
|
|
|
if (rte_timer_reset(&hb_timer,
|
|
(check_period * rte_get_timer_hz()) / 1000,
|
|
PERIODICAL,
|
|
rte_lcore_id(),
|
|
&rte_keepalive_dispatch_pings,
|
|
rte_global_keepalive_info
|
|
) != 0 )
|
|
rte_exit(EXIT_FAILURE, "Keepalive setup failure.\n");
|
|
|
|
The rest of the initialization and run-time path follows
|
|
the same paths as the L2 forwarding application. The only
|
|
addition to the main processing loop is the mark alive
|
|
functionality and the example random failures.
|
|
|
|
.. code-block:: c
|
|
|
|
rte_keepalive_mark_alive(&rte_global_keepalive_info);
|
|
cur_tsc = rte_rdtsc();
|
|
|
|
/* Die randomly within 7 secs for demo purposes.. */
|
|
if (cur_tsc - tsc_initial > tsc_lifetime)
|
|
break;
|
|
|
|
The rte_keepalive_mark_alive function simply sets the core state to alive.
|
|
|
|
.. code-block:: c
|
|
|
|
static inline void
|
|
rte_keepalive_mark_alive(struct rte_keepalive *keepcfg)
|
|
{
|
|
keepcfg->live_data[rte_lcore_id()].core_state = RTE_KA_STATE_ALIVE;
|
|
}
|