Block I/O

Provides high-performance asynchronous access to block devices (NVMe SSDs, etc.) integrated with libevpl’s event loop.

Overview

libevpl’s Block I/O subsystem offers:

Asynchronous operations - Read, write, and flush integrate with the event loop
Zero-copy I/O - Uses iovec-based scatter-gather
Multiple backends - io_uring and VFIO-NVMe for maximum performance
High IOPS - Optimized for NVMe SSDs capable of millions of IOPS
Per-thread queues - Lock-free operation within event loops

Supported Backends

io_uring

Description: Linux io_uring asynchronous I/O interface

Availability: Linux kernel 5.1+ with liburing

Characteristics:

Kernel-mediated I/O
Works with any block device
Lower CPU overhead than traditional AIO
Excellent for general-purpose storage

Use cases: General NVMe/SSD access, compatibility across devices

VFIO-NVMe

Description: Direct userspace NVMe access via VFIO

Availability: Linux with VFIO and IOMMU support

Characteristics:

Bypasses the kernel on the I/O path
Direct hardware access
Ultra-low latency
Requires device unbinding from kernel driver
Requires root or appropriate VFIO permissions

Use cases: Ultra-low latency storage, maximum IOPS, dedicated storage devices

Types

`struct evpl_block_device`

Opaque structure representing an opened block device.

`struct evpl_block_queue`

Opaque structure representing a per-thread I/O queue for a block device. Each event loop creates its own queue.

`enum evpl_block_protocol_id`

Identifies block device backend:

| Protocol | Description |
|---|---|
| `EVPL_BLOCK_PROTOCOL_IO_URING` | Linux io_uring |
| `EVPL_BLOCK_PROTOCOL_VFIO` | VFIO-NVMe direct access |

`evpl_block_callback_t`

typedef void (*evpl_block_callback_t)(
    struct evpl *evpl,
    int          status,
    void        *private_data);

Callback invoked when a block operation completes.

Parameters:

evpl - Event loop
status - 0 on success, negative error code on failure
private_data - User-provided context
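As a minimal sketch of how a completion callback might hand its result back to the application, the handler below matches the `evpl_block_callback_t` signature and records the status in a caller-owned context. The `struct io_result` type and the `on_block_complete` name are hypothetical and not part of libevpl; `struct evpl` is only forward-declared so the sketch stands alone.

```c
#include <assert.h>
#include <stddef.h>

struct evpl; /* opaque event loop type, forward-declared for this sketch */

/* Hypothetical per-request context; not part of libevpl. */
struct io_result {
    int done;   /* set to 1 once the callback has run */
    int status; /* 0 on success, negative error code on failure */
};

/* Matches the evpl_block_callback_t signature shown above. */
static void
on_block_complete(struct evpl *evpl, int status, void *private_data)
{
    struct io_result *res = private_data;

    (void) evpl; /* the event loop is not needed in this sketch */
    assert(res != NULL);

    res->status = status;
    res->done   = 1;
}
```

The context pointer would be passed as `private_data` when the operation is issued, and it must stay valid until the callback has run.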

Functions

Device Management

`evpl_block_open_device`

struct evpl_block_device *evpl_block_open_device(
    enum evpl_block_protocol_id protocol,
    const char                 *uri);

Open a block device. Each device should be opened once globally for the whole process.

Parameters:

protocol - Backend protocol to use
uri - Device identifier (protocol-specific)

Returns: Block device handle, or NULL on failure

URI Formats:

io_uring:

Device path: /dev/nvme0n1
File path: /tmp/testfile

VFIO-NVMe:

PCI address: 0000:01:00.0
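Because the two backends expect differently shaped URIs, an application that accepts a device string from configuration may want to check which form it has before choosing a protocol. The helper below is a hypothetical sketch (not a libevpl function) that tests only the textual shape `DDDD:BB:DD.F` of a PCI address; it does not verify that the device exists.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Returns 1 if s is shaped like a PCI address such as "0000:01:00.0"
 * (hex digits separated by ':' and '.'), 0 otherwise. */
static int
looks_like_pci_address(const char *s)
{
    static const char pattern[] = "xxxx:xx:xx.x"; /* 'x' = one hex digit */
    size_t            i;

    assert(s != NULL);

    if (strlen(s) != sizeof(pattern) - 1) {
        return 0;
    }

    for (i = 0; pattern[i] != '\0'; i++) {
        if (pattern[i] == 'x') {
            if (!isxdigit((unsigned char) s[i])) {
                return 0;
            }
        } else if (s[i] != pattern[i]) {
            return 0;
        }
    }

    return 1;
}
```

A string that passes this check would be a candidate for `EVPL_BLOCK_PROTOCOL_VFIO`; anything else would be treated as a device or file path for `EVPL_BLOCK_PROTOCOL_IO_URING`.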

`evpl_block_close_device`

void evpl_block_close_device(struct evpl_block_device *blockdev);

Close a block device. All queues must be closed first.

Parameters:

blockdev - Device to close

`evpl_block_size`

uint64_t evpl_block_size(struct evpl_block_device *blockdev);

Get the size of a block device in bytes.

Parameters:

blockdev - Device to query

Returns: Device size in bytes
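One common use of the device size is to validate a request before submitting it. The following sketch assumes the size has already been obtained from `evpl_block_size()`; the helper names are hypothetical. The comparison is written so that `offset + length` is never computed, which avoids 64-bit overflow.

```c
#include <assert.h>
#include <stdint.h>

/* Largest length that can be transferred starting at offset. */
static uint64_t
max_length_at(uint64_t offset, uint64_t device_size)
{
    assert(offset <= device_size); /* caller must pass an in-range offset */
    return device_size - offset;
}

/* Returns 1 if [offset, offset + length) lies within the device. */
static int
range_fits_device(uint64_t offset, uint64_t length, uint64_t device_size)
{
    if (offset > device_size) {
        return 0;
    }
    return length <= max_length_at(offset, device_size);
}
```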

`evpl_block_max_request_size`

uint64_t evpl_block_max_request_size(struct evpl_block_device *blockdev);

Get the maximum size for a single I/O request.

Parameters:

blockdev - Device to query

Returns: Maximum request size in bytes

Note: Requests larger than this must be split into multiple operations.
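The splitting described in the note can be sketched as follows. The function assumes the limit was obtained from `evpl_block_max_request_size()` and only computes the offset and length of each piece; `struct io_chunk` and `split_request` are hypothetical names, and each resulting chunk would still need its own read or write call.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical description of one piece of a larger request. */
struct io_chunk {
    uint64_t offset;
    uint64_t length;
};

/* Splits [offset, offset + length) into pieces of at most max_request
 * bytes. Returns the number of chunks written, or 0 if length is zero
 * or chunks[] is too small to hold every piece. */
static size_t
split_request(uint64_t offset, uint64_t length, uint64_t max_request,
              struct io_chunk *chunks, size_t max_chunks)
{
    size_t n = 0;

    assert(max_request > 0);

    while (length > 0 && n < max_chunks) {
        uint64_t len = length < max_request ? length : max_request;

        chunks[n].offset = offset;
        chunks[n].length = len;

        offset += len;
        length -= len;
        n++;
    }

    return length == 0 ? n : 0;
}
```

For example, a 10000-byte request with a 4096-byte limit yields chunks of 4096, 4096, and 1808 bytes.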

Queue Management

`evpl_block_open_queue`

struct evpl_block_queue *evpl_block_open_queue(
    struct evpl              *evpl,
    struct evpl_block_device *blockdev);

Create a thread-specific I/O queue for a block device and attach it to an event loop.

Parameters:

evpl - Event loop
blockdev - Block device

Returns: Queue handle, or NULL on failure

Thread Safety: Each thread/event loop must create its own queue. Queues are not shared.

`evpl_block_close_queue`

void evpl_block_close_queue(
    struct evpl             *evpl,
    struct evpl_block_queue *queue);

Close an I/O queue. All pending operations must complete first.

Parameters:

evpl - Event loop
queue - Queue to close
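Since a queue may only be closed once its pending operations have completed, the application needs some way to know when that point is reached. One possible approach, sketched here with hypothetical names that are not part of libevpl, is a per-queue counter that is incremented on submission and decremented in the completion callback.

```c
#include <assert.h>

/* Hypothetical bookkeeping kept alongside each evpl_block_queue. */
struct queue_tracker {
    unsigned pending; /* operations submitted but not yet completed */
    int      closing; /* set once the owner wants to close the queue */
};

/* Call immediately before submitting a read, write, or flush. */
static void
tracker_submit(struct queue_tracker *t)
{
    assert(!t->closing); /* no new I/O once shutdown has begun */
    t->pending++;
}

/* Call from the completion callback.
 * Returns 1 if the queue is now safe to close. */
static int
tracker_complete(struct queue_tracker *t)
{
    assert(t->pending > 0);
    t->pending--;
    return t->closing && t->pending == 0;
}

/* Call when shutdown is requested.
 * Returns 1 if the queue can be closed immediately. */
static int
tracker_begin_close(struct queue_tracker *t)
{
    t->closing = 1;
    return t->pending == 0;
}
```

When either function returns 1, the owning thread can call `evpl_block_close_queue()`. Because each queue belongs to a single thread, the counter needs no locking in this sketch.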

I/O Operations

`evpl_block_read`

void evpl_block_read(
    struct evpl             *evpl,
    struct evpl_block_queue *queue,
    struct evpl_iovec       *iov,
    int                      niov,
    uint64_t                 offset,
    evpl_block_callback_t    callback,
    void                    *private_data);

Read data from a block device asynchronously. The read is zero-copy when the backend supports it.

Parameters:

evpl - Event loop
queue - I/O queue
iov - Array of iovecs to read into
niov - Number of iovecs
offset - Byte offset in device
callback - Completion callback
private_data - User context

Note: Buffers must remain valid until the callback is invoked. Use evpl_iovec_addref() if needed.

`evpl_block_write`

void evpl_block_write(
    struct evpl             *evpl,
    struct evpl_block_queue *queue,
    const struct evpl_iovec *iov,
    int                      niov,
    uint64_t                 offset,
    int                      sync,
    evpl_block_callback_t    callback,
    void                    *private_data);

Write data to a block device asynchronously. The write is zero-copy when the backend supports it.

Parameters:

evpl - Event loop
queue - I/O queue
iov - Array of iovecs containing data to write
niov - Number of iovecs
offset - Byte offset in device
sync - 1 for a durable write (FUA), 0 to allow a non-durable write
callback - Completion callback
private_data - User context

Durable vs. non-durable writes (both complete asynchronously through the callback):

sync=0: The write may be cached. It completes once the device accepts it and may be lost on a power event.
sync=1: The write reaches persistent media before it is reported complete (Force Unit Access).

`evpl_block_flush`

void evpl_block_flush(
    struct evpl             *evpl,
    struct evpl_block_queue *queue,
    evpl_block_callback_t    callback,
    void                    *private_data);

Flush cached writes to persistent storage. A flush is only needed if non-FUA writes (sync=0) were previously issued.

For io_uring, this performs a sync(); for VFIO-NVMe, it performs an NVMe flush operation.

Parameters:

evpl - Event loop
queue - I/O queue
callback - Completion callback
private_data - User context
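Because a flush only matters after non-FUA writes, an application can avoid redundant flushes by tracking whether any such writes have completed since the last one. The sketch below is one hypothetical way to do that bookkeeping; none of these names exist in libevpl. It assumes a flush covers only the writes that had completed when the flush was issued, so the count is snapshotted at that moment.

```c
#include <assert.h>

/* Hypothetical per-queue record of writes not yet made durable. */
struct flush_tracker {
    unsigned unflushed; /* completed sync=0 writes not covered by a flush */
};

/* Call from the write completion callback after a successful write. */
static void
note_write_complete(struct flush_tracker *t, int sync)
{
    if (!sync) {
        t->unflushed++; /* FUA writes (sync=1) are already durable */
    }
}

/* Returns 1 if issuing a flush would accomplish anything. */
static int
flush_needed(const struct flush_tracker *t)
{
    return t->unflushed > 0;
}

/* Call when issuing a flush; returns how many writes it will cover. */
static unsigned
flush_begin(const struct flush_tracker *t)
{
    return t->unflushed;
}

/* Call from the flush callback with the value flush_begin returned. */
static void
flush_end(struct flush_tracker *t, unsigned covered, int status)
{
    assert(covered <= t->unflushed);
    if (status == 0) {
        t->unflushed -= covered; /* a failed flush covers nothing */
    }
}
```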

Backend Comparison

| Feature | io_uring | VFIO-NVMe |
|---|---|---|
| Latency | Low | Ultra-low |
| IOPS | Very High | Maximum |
| CPU Usage | Low | Very Low |
| Setup | Simple | Complex (device unbind) |
| Permissions | Standard | Root or VFIO setup |
| Device Support | All block devices | NVMe only |
| Kernel Dependency | Yes | No (userspace) |

Choosing a Backend

Use io_uring when:

You need compatibility across different devices
You want simple setup and configuration
You’re using standard filesystems or partitions
You need kernel features (filesystem, encryption, etc.)

Use VFIO-NVMe when:

You need absolute minimum latency
You can dedicate an entire NVMe device to the application
You have proper IOMMU and VFIO configuration
CPU efficiency is critical
You’re building a custom storage stack (no filesystem)

Performance Optimization

Alignment

io_uring generally requires page-aligned and block-aligned I/O to allow DMA.

Alignment requirements for VFIO-NVMe vary from device to device. All NVMe devices require sector-aligned I/O, but many have no memory alignment requirement.

All else being equal, use page aligned memory when possible.
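The alignment rules above can be checked and satisfied with a couple of small helpers. This is a sketch using only standard C11 (`aligned_alloc`), with hypothetical helper names. The alignment is passed in by the caller; the 4096 and 512 values in the usage note are assumed page and sector sizes, and real code should query them. Memory obtained this way is ordinary heap memory, so whether a given backend can use it directly is outside the scope of the sketch.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Returns 1 if value is a multiple of alignment (a power of two). */
static int
is_aligned(uint64_t value, uint64_t alignment)
{
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return (value & (alignment - 1)) == 0;
}

/* Allocates at least size bytes aligned to alignment (a power of two).
 * The size is rounded up to a multiple of the alignment, as C11
 * aligned_alloc expects. Release the buffer with free(). */
static void *
alloc_aligned(size_t alignment, size_t size)
{
    size_t rounded;

    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);

    rounded = (size + alignment - 1) & ~(alignment - 1);
    if (rounded < size) {
        return NULL; /* rounding overflowed */
    }

    return aligned_alloc(alignment, rounded);
}
```

A request could then be validated with `is_aligned(offset, 512)` and `is_aligned(length, 512)` before submission, with its buffer coming from `alloc_aligned(4096, length)`.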

Request Size

Larger requests generally improve throughput up to a point. Check the maximum request size with evpl_block_max_request_size() and issue requests as large as the workload allows within that limit.

Queue Depth

Multiple outstanding requests improve IOPS up to a point. Keep several requests in flight on each queue rather than issuing them one at a time.
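One way to hold a steady queue depth is to top the queue back up each time a completion arrives. The helper below is a hypothetical sketch of that calculation only; the target depth is an application-chosen tuning value, not something libevpl reports, and the caller is assumed to maintain the in-flight count itself.

```c
#include <assert.h>
#include <stdint.h>

/* Returns how many new requests to issue now so that the number in
 * flight reaches target_depth without exceeding the work remaining. */
static unsigned
requests_to_submit(unsigned target_depth, unsigned in_flight,
                   uint64_t remaining)
{
    unsigned room;

    assert(in_flight <= target_depth);

    room = target_depth - in_flight;
    return remaining < room ? (unsigned) remaining : room;
}
```

Calling this once at startup and again from each completion callback keeps the queue filled to the target until the work runs out.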
