From patchwork Tue Jan 7 16:05:14 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jens Axboe X-Patchwork-Id: 13929260 Received: from mail-il1-f169.google.com (mail-il1-f169.google.com [209.85.166.169]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 4F04B78F5E for ; Tue, 7 Jan 2025 16:06:23 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.166.169 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1736265987; cv=none; b=UvtdPbAM9ViRynx4MKvTaRk7nPTTS5OwUlZQnq3oOjBSQcfq5mUZn+kBDKUdindy98eV5mtvJm7rUKIzwHyn6jMtSUyXdPWHIAj1/PqzkV3kpWlfmC4Ug1Q3WL/0WpYRVQLgqaVtI+uDk2GJWjEZSjMD40qPUi4GRQtMjMS3REQ= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1736265987; c=relaxed/simple; bh=wtV4tWvdHCc0BYgs7ryYGPM+vvAyED0pDJgZpb7sMaw=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=gShajfMrdiNfe6HDHh2vgXMzZX7UEGPGta1a1nvg14/NNR0FhgP0teDN5ZEDGGm7QilyhyiT+Y2Tz8k0V19sw0MQ7SpANqa5K6xPudxXiDwDTgKYltXqSoHy+wh+AYArsgBBcL7W3m50Jqn2Q6AkA+2iZjbqj682dDdPvJvMT00= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=Hw/6sKTL; arc=none smtp.client-ip=209.85.166.169 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="Hw/6sKTL" Received: by mail-il1-f169.google.com with SMTP id e9e14a558f8ab-3cdd57060acso12936675ab.0 for ; Tue, 07 Jan 2025 08:06:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1736265982; x=1736870782; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=vMWy4lVwJHJ3LX8Yqx0H0hDkU0C1A4oW2X7MUAtdqMM=; b=Hw/6sKTLlo4JC9/D2wfIxA+BhVoUQYvwmn48tM8lRmH8gcagWQPhT3suY7Fk0izDqW WUhcOtAl4TdQKNeI6Olu4u4CX1jhDpIfSIde3poKSaCHyF2VDNKNFZQKpq8fEyUBcEJN IGCPCbQEDIp0wlTqWngfy8PDKtvhUCYoD5ONigR053fC9mzAN4aaDB+mu+VZVgOV/4ES 5nYz5/Abz/6UT0giKz2uAs1VsuFhUymXgN7GHvbkYtp/K5hNngJe7kyyeBwCQQiJfemU Vymfkgv5wa4h6wTbkIZfafuqmNktGxK9dQk9GoZ6kyo4vAaScLIbvg2WEOx7GY+Bd9Up +vxQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1736265982; x=1736870782; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=vMWy4lVwJHJ3LX8Yqx0H0hDkU0C1A4oW2X7MUAtdqMM=; b=GCxHIgE+Ti2n3UorEEb3OpIBw62cpb4HMHE7zz0C7i9rVZrzb0zh8eySiC3ShnTOnj hl/bMzNm0TyqrbMT69tK+67hH7t3K3kTL6E1p6DfIxh6tP9QwtewcjeJstEHY8zZcZ47 CGsO+O/g2dSu0WAbaDhQ9qwHRX1wB1M1rPa4f9h3G+e+0Fs6frsr/4bVfjpOwWuswnHQ emTqNMxpyYzBgCJRfKONbp0JyJVhe2QuUbJPEXQovQFRErbTv0+OVCRnR3y9367kG8zs YjLEwRZ3Ur20d35ClAoLs+aFyUpj242/DrTAuCPemGrXnM4LlTXQW2bEiwVHsdCijoLc Loog== X-Forwarded-Encrypted: i=1; AJvYcCVFBCFoMZ0ptrBDx+Uj6Shs4ZfkKn5TPZc9CfW4vPo5KYzugI+Ievcf0Nco1zVK1/fF3mSabRSj@vger.kernel.org X-Gm-Message-State: AOJu0YwxztO1BEf8233gTnC/hFsj8Fc0vttJNgooDVtPaYByc80ie2WK IsfvjCXX/gkf953o4yrY/cQH2VNoYpldc/Rqn/X+TqRXIO7HNnHgbAQyaG5jmEM= X-Gm-Gg: ASbGncuHy3VgZR5eamVTAaA853lYKYQUyBQymYp7l94Jh1gdAa0zOE17xr9BzSxvrzG 3qpR1Yv43Oo72xzm6k3d3m5Pqrgvv+KTJ/qg+Mkosb1TA/V70mmrofh7tbVVs8HLtAofjxTQBgr gw0wvrbo399nu6pwOt5YT7LKsjq4w91wekfvrMjGQEuVOn+lFzaWRSxqv15mXCmH6n1FwedS2SD qKUBBMJzut2mcpzuCW3mexcFYBiTJ5sfEpDxwdH0ZhugC3SwAgHs3JY5qhr X-Google-Smtp-Source: AGHT+IF+HaeIiFVpeL/VbxKOkBi8d7+5Gppi0Wqx6gx5ir8wCeq05WbmVZbnDk1YVrh8ON0Rf5cWeg== X-Received: by 2002:a05:6e02:3201:b0:3a7:c3aa:a82b with SMTP id e9e14a558f8ab-3c2d1aa2b75mr516367585ab.1.1736265982360; Tue, 07 Jan 2025 08:06:22 -0800 (PST) Received: from localhost.localdomain ([96.43.243.2]) by smtp.gmail.com with ESMTPSA id 8926c6da1cb9f-4e68c22455csm9529955173.160.2025.01.07.08.06.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 07 Jan 2025 08:06:21 -0800 (PST) From: Jens Axboe To: zlang@kernel.org Cc: djwong@kernel.org, fstests@vger.kernel.org, Jens Axboe Subject: [PATCH 1/2] fsstress: add support for RWF_DONTCACHE Date: Tue, 7 Jan 2025 09:05:14 -0700 Message-ID: <20250107160617.222775-2-axboe@kernel.dk> X-Mailer: git-send-email 2.47.1 In-Reply-To: <20250107160617.222775-1-axboe@kernel.dk> References: <20250107160617.222775-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: fstests@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Using RWF_DONTCACHE tells the kernel that any page cache instantiated by this operation should get pruned once the operation completes. If data is in cache prior to the operation it will remain there. Add ops for testing both the read and write side of this. If the kernel being tested doesn't support RWF_DONTCACHE, then operations are performed with regular read/write buffered IO. See the kernel posting adding support: https://lore.kernel.org/linux-fsdevel/20241220154831.1086649-1-axboe@kernel.dk/ Signed-off-by: Jens Axboe Reviewed-by: "Darrick J. Wong" --- ltp/fsstress.c | 144 +++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 144 insertions(+) diff --git a/ltp/fsstress.c b/ltp/fsstress.c index 3d248ee25791..c5554d651b67 100644 --- a/ltp/fsstress.c +++ b/ltp/fsstress.c @@ -82,6 +82,16 @@ static int renameat2(int dfd1, const char *path1, #define RENAME_WHITEOUT (1 << 2) /* Whiteout source */ #endif +#ifndef RWF_DONTCACHE +#define RWF_DONTCACHE 0x80 +#endif + +/* + * Default to RWF_DONTCACHE being available, turn it off if -EOPNOTSUPP + * is seen during issue. + */ +static int have_rwf_dontcache = 1; + #define FILELEN_MAX (32*4096) typedef enum { @@ -117,6 +127,7 @@ typedef enum { OP_COLLAPSE, OP_INSERT, OP_READ, + OP_READ_DONTCACHE, OP_READLINK, OP_READV, OP_REMOVEFATTR, @@ -143,6 +154,7 @@ typedef enum { OP_URING_READ, OP_URING_WRITE, OP_WRITE, + OP_WRITE_DONTCACHE, OP_WRITEV, OP_EXCHANGE_RANGE, OP_LAST @@ -248,6 +260,7 @@ void zero_f(opnum_t, long); void collapse_f(opnum_t, long); void insert_f(opnum_t, long); void unshare_f(opnum_t, long); +void read_dontcache_f(opnum_t, long); void read_f(opnum_t, long); void readlink_f(opnum_t, long); void readv_f(opnum_t, long); @@ -273,6 +286,7 @@ void unlink_f(opnum_t, long); void unresvsp_f(opnum_t, long); void uring_read_f(opnum_t, long); void uring_write_f(opnum_t, long); +void write_dontcache_f(opnum_t, long); void write_f(opnum_t, long); void writev_f(opnum_t, long); void exchangerange_f(opnum_t, long); @@ -315,6 +329,7 @@ struct opdesc ops[OP_LAST] = { [OP_COLLAPSE] = {"collapse", collapse_f, 1, 1 }, [OP_INSERT] = {"insert", insert_f, 1, 1 }, [OP_READ] = {"read", read_f, 1, 0 }, + [OP_READ_DONTCACHE] = {"read_dontcache", read_dontcache_f, 1, 0 }, [OP_READLINK] = {"readlink", readlink_f, 1, 0 }, [OP_READV] = {"readv", readv_f, 1, 0 }, /* remove (delete) extended attribute */ @@ -346,6 +361,7 @@ struct opdesc ops[OP_LAST] = { [OP_URING_WRITE] = {"uring_write", uring_write_f, 1, 1 }, [OP_WRITE] = {"write", write_f, 4, 1 }, [OP_WRITEV] = {"writev", writev_f, 4, 1 }, + [OP_WRITE_DONTCACHE]= {"write_dontcache", write_dontcache_f,4, 1 }, [OP_EXCHANGE_RANGE]= {"exchangerange", exchangerange_f, 2, 1 }, }, *ops_end; @@ -4635,6 +4651,73 @@ readv_f(opnum_t opno, long r) close(fd); } +void +read_dontcache_f(opnum_t opno, long r) +{ + int e; + pathname_t f; + int fd; + int64_t lr; + off64_t off; + struct stat64 stb; + int v; + char st[1024]; + struct iovec iov; + int flags; + + init_pathname(&f); + if (!get_fname(FT_REGFILE, r, &f, NULL, NULL, &v)) { + if (v) + printf("%d/%lld: read - no filename\n", procid, opno); + free_pathname(&f); + return; + } + fd = open_path(&f, O_RDONLY); + e = fd < 0 ? errno : 0; + check_cwd(); + if (fd < 0) { + if (v) + printf("%d/%lld: read - open %s failed %d\n", + procid, opno, f.path, e); + free_pathname(&f); + return; + } + if (fstat64(fd, &stb) < 0) { + if (v) + printf("%d/%lld: read - fstat64 %s failed %d\n", + procid, opno, f.path, errno); + free_pathname(&f); + close(fd); + return; + } + inode_info(st, sizeof(st), &stb, v); + if (stb.st_size == 0) { + if (v) + printf("%d/%lld: read - %s%s zero size\n", procid, opno, + f.path, st); + free_pathname(&f); + close(fd); + return; + } + lr = ((int64_t)random() << 32) + random(); + off = (off64_t)(lr % stb.st_size); + iov.iov_len = (random() % FILELEN_MAX) + 1; + iov.iov_base = malloc(iov.iov_len); + flags = have_rwf_dontcache ? RWF_DONTCACHE : 0; + e = preadv2(fd, &iov, 1, off, flags) < 0 ? errno : 0; + if (have_rwf_dontcache && e == EOPNOTSUPP) { + have_rwf_dontcache = 0; + e = preadv2(fd, &iov, 1, off, 0) < 0 ? errno : 0; + } + free(iov.iov_base); + if (v) + printf("%d/%lld: read dontcache %s%s [%lld,%d] %d\n", + procid, opno, f.path, st, (long long)off, + (int)iov.iov_len, e); + free_pathname(&f); + close(fd); +} + void removefattr_f(opnum_t opno, long r) { @@ -5509,6 +5592,67 @@ writev_f(opnum_t opno, long r) close(fd); } +void +write_dontcache_f(opnum_t opno, long r) +{ + int e; + pathname_t f; + int fd; + int64_t lr; + off64_t off; + struct stat64 stb; + int v; + char st[1024]; + struct iovec iov; + int flags; + + init_pathname(&f); + if (!get_fname(FT_REGm, r, &f, NULL, NULL, &v)) { + if (v) + printf("%d/%lld: write - no filename\n", procid, opno); + free_pathname(&f); + return; + } + fd = open_path(&f, O_WRONLY); + e = fd < 0 ? errno : 0; + check_cwd(); + if (fd < 0) { + if (v) + printf("%d/%lld: write - open %s failed %d\n", + procid, opno, f.path, e); + free_pathname(&f); + return; + } + if (fstat64(fd, &stb) < 0) { + if (v) + printf("%d/%lld: write - fstat64 %s failed %d\n", + procid, opno, f.path, errno); + free_pathname(&f); + close(fd); + return; + } + inode_info(st, sizeof(st), &stb, v); + lr = ((int64_t)random() << 32) + random(); + off = (off64_t)(lr % MIN(stb.st_size + (1024 * 1024), MAXFSIZE)); + off %= maxfsize; + iov.iov_len = (random() % FILELEN_MAX) + 1; + iov.iov_base = malloc(iov.iov_len); + memset(iov.iov_base, nameseq & 0xff, iov.iov_len); + flags = have_rwf_dontcache ? RWF_DONTCACHE : 0; + e = pwritev2(fd, &iov, 1, off, flags) < 0 ? errno : 0; + if (have_rwf_dontcache && e == EOPNOTSUPP) { + have_rwf_dontcache = 0; + e = pwritev2(fd, &iov, 1, off, 0) < 0 ? errno : 0; + } + free(iov.iov_base); + if (v) + printf("%d/%lld: write dontcache %s%s [%lld,%d] %d\n", + procid, opno, f.path, st, (long long)off, + (int)iov.iov_len, e); + free_pathname(&f); + close(fd); +} + char * xattr_flag_to_string(int flag) { From patchwork Tue Jan 7 16:05:15 2025 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jens Axboe X-Patchwork-Id: 13929261 Received: from mail-io1-f43.google.com (mail-io1-f43.google.com [209.85.166.43]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id D407411CAF for ; Tue, 7 Jan 2025 16:06:24 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.166.43 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1736265988; cv=none; b=enPrPOS8qJFLNxPx0jjJQF80sEIvlnE7t7oXh5ohDWa28UTKUpr3x5c3TvhyW6gPQh9VKnPpCPPIL9EmR+Ww6YdQBnqURpeGyAjN89rGiMGdY7Be93QmO5h2w0kaWMD6fzYHvSh9LvtN2CTRPo21nr6N+mrJGu7+Xz0z023Voi4= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1736265988; c=relaxed/simple; bh=Ln/PhFpbf5GjSd84Py9k23dBz/p0fgtoulwsbcaHtFY=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=PIWLxx7SQEyoiGA/ruIBb+K2ItofgzlZl+i0mQChZcRMVfTu+Ii1bgPJt1rgSU/VwssTs9qjhEKzpUJUi9wb9wDNVAN+4mQpxqg+lK2v74wBShb9SjkELXX1lMV+gciMH3ivk+XIC5j+6b58sAdO9DOBy1MzwJWlAwGyRiti1ns= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk; spf=pass smtp.mailfrom=kernel.dk; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b=BypVQLju; arc=none smtp.client-ip=209.85.166.43 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.dk Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=kernel.dk Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel-dk.20230601.gappssmtp.com header.i=@kernel-dk.20230601.gappssmtp.com header.b="BypVQLju" Received: by mail-io1-f43.google.com with SMTP id ca18e2360f4ac-844e7bc6d84so606977339f.0 for ; Tue, 07 Jan 2025 08:06:24 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kernel-dk.20230601.gappssmtp.com; s=20230601; t=1736265984; x=1736870784; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=0XGT260sEidNoQlUv2QG3xH2gCfLfyJH70rVR3JcN6U=; b=BypVQLjuN6XgtSQuW2PkQ1YygmaEFul9CANTW/g6M7f2tn88083D2mKpwbNMlwZpgX Tph9LMet9ypSQJhNZI350OloJVUEvy4Vx+LKv9aQgtFMLY9W3swU790AxZmJ+xDuNKgv haasZ3kBaVmhT8fMGeZhdn2OY4uZQKq8GRrrMS5xZXE8qH1fRrx5C0vaWky9wiIzXGET QZWSB/khenaYOWeA61bl8RqEnhiJF8odtFM4Vgr8WWFtNHQuZs65+UmdV74ugTrT/YLY JLO0Xf3GWhTPOGh2OK1vRWoOTFt9SDXOY1YSYet1VPM9MHh8aI4Z6LihC8PIndRzWrtZ +QNg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1736265984; x=1736870784; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=0XGT260sEidNoQlUv2QG3xH2gCfLfyJH70rVR3JcN6U=; b=nFfXOB3hAtc6ZDmJswjdOz20GyTBEbDjqFpTJ/uexrYZP/LqFu1IofsG9hcODjtGgS Fiqn69NApxaAYR43+wBzwio43x0gaFcB83pZmnuVawAbFD0eIBwTd0/J9UXSg0O+/1iS 0XwM1panAmMWcVnrDRsmSn+iKGkO/qKPWDA9MVdd0oiOUv7KJUQB36EVcSomAksYLdx7 DZpSWy1MK0ur+8qIiK7f/+99xhzPlV4fJJC1G48M6Ml34ovfv0ytIfCd2jmtHo6G2RlG pyxeqHtSM4gKK9LuuDElm5RvcNZJvO4ede5gWAtSbflyHMa07pMLn1dWotiXZdW+ksLv zlGQ== X-Forwarded-Encrypted: i=1; AJvYcCVxy7edcFDSWTDhwi4dR4jTLoPQmltye6BPfMELTOUdvuYN5x01lZcRay9gZBZcEteTzqIdek88@vger.kernel.org X-Gm-Message-State: AOJu0YypK2nKY6dUYhnMYRk2JsyxA5X7mw69Sm64JgB0p0ATyDagoA3/ p5tlb3cz7V2mJpu5RjalXmQNBcusBx+GvfW+XTOJBD0Xb015MgaitLTfQyAJnrVrUGDgr/Dq4Zg 5 X-Gm-Gg: ASbGncsdjFdz7QOw+xU5gUQQa+dX/nPx9rKYOCludhnMqz63WV9tKQSXeZcKQY5p5aM QIy5sCj770RLVn0pB4ihcoVQEt3FK2VQ6YYU7PKR7spXcvSuz6W3c+GYO+ZUqlPOiEs5b0ehWRM XDkj4KleMCGVoJk+yxUdAmDZGwnmyyfnlBJFpnZk47yX/o1DorwDioAhgsiRzx+2pvSPpvf49X4 vmPCvMmHieyRxzKq/i3mdNqXB9NFFwyQP8TWNYNaTi35LpG78WfCVpSRlrx X-Google-Smtp-Source: AGHT+IGfzTnIaDi1dIDGCTDby/aq/7jRiAVHpE0thrDxNT6hGXqIWRxD0pwZMKh4Cdv/5a7g43GegQ== X-Received: by 2002:a05:6602:e41:b0:835:39a8:e201 with SMTP id ca18e2360f4ac-84cd2c9f339mr255859339f.0.1736265983645; Tue, 07 Jan 2025 08:06:23 -0800 (PST) Received: from localhost.localdomain ([96.43.243.2]) by smtp.gmail.com with ESMTPSA id 8926c6da1cb9f-4e68c22455csm9529955173.160.2025.01.07.08.06.22 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 07 Jan 2025 08:06:22 -0800 (PST) From: Jens Axboe To: zlang@kernel.org Cc: djwong@kernel.org, fstests@vger.kernel.org, Jens Axboe Subject: [PATCH 2/2] fsx: add support for RWF_DONTCACHE Date: Tue, 7 Jan 2025 09:05:15 -0700 Message-ID: <20250107160617.222775-3-axboe@kernel.dk> X-Mailer: git-send-email 2.47.1 In-Reply-To: <20250107160617.222775-1-axboe@kernel.dk> References: <20250107160617.222775-1-axboe@kernel.dk> Precedence: bulk X-Mailing-List: fstests@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Using RWF_DONTCACHE tells the kernel that any page cache instantiated by this operation should get pruned once the operation completes. If data is in cache prior to the operation it will remain there. Add ops for testing both the read and write side of this. At startup, kernel support for this feature is probed. If support isn't available, uncached/dontcache IO is performed as regular buffered IO. If -Z is used to turn on O_DIRECT, then uncached/dontcache IO isn't performed. Defaults to on if available, and adds a -T parameter to turn it off. See the kernel posting adding support: https://lore.kernel.org/linux-fsdevel/20241220154831.1086649-1-axboe@kernel.dk/ Signed-off-by: Jens Axboe Reviewed-by: "Darrick J. Wong" --- ltp/fsx.c | 114 ++++++++++++++++++++++++++++++++++++------------------ 1 file changed, 76 insertions(+), 38 deletions(-) diff --git a/ltp/fsx.c b/ltp/fsx.c index 41933354328a..9efd2f5c86d1 100644 --- a/ltp/fsx.c +++ b/ltp/fsx.c @@ -43,6 +43,10 @@ # define MAP_FILE 0 #endif +#ifndef RWF_DONTCACHE +#define RWF_DONTCACHE 0x80 +#endif + #define NUMPRINTCOLUMNS 32 /* # columns of data to print on each line */ /* Operation flags (bitmask) */ @@ -101,7 +105,9 @@ int logcount = 0; /* total ops */ enum { /* common operations */ OP_READ = 0, + OP_READ_DONTCACHE, OP_WRITE, + OP_WRITE_DONTCACHE, OP_MAPREAD, OP_MAPWRITE, OP_MAX_LITE, @@ -190,15 +196,16 @@ int o_direct; /* -Z */ int aio = 0; int uring = 0; int mark_nr = 0; +int dontcache_io = 1; int page_size; int page_mask; int mmap_mask; -int fsx_rw(int rw, int fd, char *buf, unsigned len, unsigned offset); +int fsx_rw(int rw, int fd, char *buf, unsigned len, unsigned offset, int flags); #define READ 0 #define WRITE 1 -#define fsxread(a,b,c,d) fsx_rw(READ, a,b,c,d) -#define fsxwrite(a,b,c,d) fsx_rw(WRITE, a,b,c,d) +#define fsxread(a,b,c,d,f) fsx_rw(READ, a,b,c,d,f) +#define fsxwrite(a,b,c,d,f) fsx_rw(WRITE, a,b,c,d,f) struct timespec deadline; @@ -266,7 +273,9 @@ prterr(const char *prefix) static const char *op_names[] = { [OP_READ] = "read", + [OP_READ_DONTCACHE] = "read_dontcache", [OP_WRITE] = "write", + [OP_WRITE_DONTCACHE] = "write_dontcache", [OP_MAPREAD] = "mapread", [OP_MAPWRITE] = "mapwrite", [OP_TRUNCATE] = "truncate", @@ -393,12 +402,14 @@ logdump(void) prt("\t******WWWW"); break; case OP_READ: + case OP_READ_DONTCACHE: prt("READ 0x%x thru 0x%x\t(0x%x bytes)", lp->args[0], lp->args[0] + lp->args[1] - 1, lp->args[1]); if (overlap) prt("\t***RRRR***"); break; + case OP_WRITE_DONTCACHE: case OP_WRITE: prt("WRITE 0x%x thru 0x%x\t(0x%x bytes)", lp->args[0], lp->args[0] + lp->args[1] - 1, @@ -784,9 +795,8 @@ doflush(unsigned offset, unsigned size) } void -doread(unsigned offset, unsigned size) +doread(unsigned offset, unsigned size, int flags) { - off_t ret; unsigned iret; offset -= offset % readbdy; @@ -818,12 +828,7 @@ doread(unsigned offset, unsigned size) (monitorend == -1 || offset <= monitorend)))))) prt("%lld read\t0x%x thru\t0x%x\t(0x%x bytes)\n", testcalls, offset, offset + size - 1, size); - ret = lseek(fd, (off_t)offset, SEEK_SET); - if (ret == (off_t)-1) { - prterr("doread: lseek"); - report_failure(140); - } - iret = fsxread(fd, temp_buf, size, offset); + iret = fsxread(fd, temp_buf, size, offset, flags); if (iret != size) { if (iret == -1) prterr("doread: read"); @@ -870,7 +875,6 @@ check_contents(void) unsigned map_offset; unsigned map_size; char *p; - off_t ret; unsigned iret; if (!check_buf) { @@ -885,13 +889,7 @@ check_contents(void) if (size == 0) return; - ret = lseek(fd, (off_t)offset, SEEK_SET); - if (ret == (off_t)-1) { - prterr("doread: lseek"); - report_failure(140); - } - - iret = fsxread(fd, check_buf, size, offset); + iret = fsxread(fd, check_buf, size, offset, 0); if (iret != size) { if (iret == -1) prterr("check_contents: read"); @@ -1064,9 +1062,8 @@ update_file_size(unsigned offset, unsigned size) } void -dowrite(unsigned offset, unsigned size) +dowrite(unsigned offset, unsigned size, int flags) { - off_t ret; unsigned iret; offset -= offset % writebdy; @@ -1099,14 +1096,9 @@ dowrite(unsigned offset, unsigned size) (monitorstart == -1 || (offset + size > monitorstart && (monitorend == -1 || offset <= monitorend)))))) - prt("%lld write\t0x%x thru\t0x%x\t(0x%x bytes)\n", testcalls, - offset, offset + size - 1, size); - ret = lseek(fd, (off_t)offset, SEEK_SET); - if (ret == (off_t)-1) { - prterr("dowrite: lseek"); - report_failure(150); - } - iret = fsxwrite(fd, good_buf + offset, size, offset); + prt("%lld write\t0x%x thru\t0x%x\t(0x%x bytes)\tdontcache=%d\n", testcalls, + offset, offset + size - 1, size, (flags & RWF_DONTCACHE) != 0); + iret = fsxwrite(fd, good_buf + offset, size, offset, flags); if (iret != size) { if (iret == -1) prterr("dowrite: write"); @@ -1954,6 +1946,26 @@ do_preallocate(unsigned offset, unsigned length, int keep_size, int unshare) } #endif +int +test_dontcache_io(void) +{ + char buf[4096]; + struct iovec iov = { .iov_base = buf, .iov_len = sizeof(buf) }; + int ret, e; + + ret = preadv2(fd, &iov, 1, 0, RWF_DONTCACHE); + e = ret < 0 ? errno : 0; + if (e == EOPNOTSUPP) { + if (!quiet) + fprintf(stderr, + "main: filesystem does not support " + "dontcache IO, disabling!\n"); + return 0; + } + + return 1; +} + void writefileimage() { @@ -2337,12 +2349,28 @@ have_op: switch (op) { case OP_READ: TRIM_OFF_LEN(offset, size, file_size); - doread(offset, size); + doread(offset, size, 0); + break; + + case OP_READ_DONTCACHE: + TRIM_OFF_LEN(offset, size, file_size); + if (dontcache_io) + doread(offset, size, RWF_DONTCACHE); + else + doread(offset, size, 0); break; case OP_WRITE: TRIM_OFF_LEN(offset, size, maxfilelen); - dowrite(offset, size); + dowrite(offset, size, 0); + break; + + case OP_WRITE_DONTCACHE: + TRIM_OFF_LEN(offset, size, maxfilelen); + if (dontcache_io) + dowrite(offset, size, RWF_DONTCACHE); + else + dowrite(offset, size, 0); break; case OP_MAPREAD: @@ -2538,6 +2566,7 @@ usage(void) " -0: Do not use exchange range calls\n" #endif " -K: Do not use keep size\n\ + -T: Do not use dontcache IO\n\ -L: fsxLite - no file creations & no file size changes\n\ -N numops: total # operations to do (default infinity)\n\ -O: use oplen (see -o flag) for every op (default random)\n\ @@ -2546,7 +2575,7 @@ usage(void) -S seed: for random # generator (default 1) 0 gets timestamp\n\ -W: mapped write operations DISabled\n\ -X: Read file and compare to good buffer after every operation\n\ - -Z: O_DIRECT (use -R, -W, -r and -w too)\n\ + -Z: O_DIRECT (use -R, -W, -r and -w too, excludes dontcache IO)\n\ --replay-ops=opsfile: replay ops from recorded .fsxops file\n\ --record-ops[=opsfile]: dump ops file also on success. optionally specify ops file name\n\ --duration=seconds: ignore any -N setting and run for this many seconds\n\ @@ -2702,7 +2731,7 @@ uring_setup() } int -uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) +uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset, int flags) { struct io_uring_sqe *sqe; struct io_uring_cqe *cqe; @@ -2733,6 +2762,7 @@ uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) } else { io_uring_prep_writev(sqe, fd, &iovec, 1, o); } + sqe->rw_flags = flags; ret = io_uring_submit_and_wait(&ring, 1); if (ret != 1) { @@ -2781,7 +2811,7 @@ uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) } #else int -uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) +uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset, int flags) { fprintf(stderr, "io_rw: need IO_URING support!\n"); exit(111); @@ -2789,19 +2819,21 @@ uring_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) #endif int -fsx_rw(int rw, int fd, char *buf, unsigned len, unsigned offset) +fsx_rw(int rw, int fd, char *buf, unsigned len, unsigned offset, int flags) { int ret; if (aio) { ret = aio_rw(rw, fd, buf, len, offset); } else if (uring) { - ret = uring_rw(rw, fd, buf, len, offset); + ret = uring_rw(rw, fd, buf, len, offset, flags); } else { + struct iovec iov = { .iov_base = buf, .iov_len = len }; + if (rw == READ) - ret = read(fd, buf, len); + ret = preadv2(fd, &iov, 1, offset, flags); else - ret = write(fd, buf, len); + ret = pwritev2(fd, &iov, 1, offset, flags); } return ret; } @@ -3065,6 +3097,9 @@ main(int argc, char **argv) if (seed < 0) usage(); break; + case 'T': + dontcache_io = 0; + break; case 'W': mapped_writes = 0; if (!quiet) @@ -3076,6 +3111,7 @@ main(int argc, char **argv) case 'Z': o_direct = O_DIRECT; o_flags |= O_DIRECT; + dontcache_io = 0; break; case 254: /* --duration */ if (!optarg) { @@ -3293,6 +3329,8 @@ main(int argc, char **argv) copy_range_calls = test_copy_range(); if (exchange_range_calls) exchange_range_calls = test_exchange_range(); + if (dontcache_io) + dontcache_io = test_dontcache_io(); while (keep_running()) if (!test())