[2/2] btrfs: send: avoid trashing the page cache

From: Filipe Manana <fdmanana@suse.com>

From: Filipe Manana <fdmanana@suse.com>

A send operation reads extent data using the buffered IO path for getting
extent data to send in write commands and this is both because it's simple
and to make use of the generic readahead infrastructure, which results in
a massive speedup.

However this fills the page cache with data that, most of the time, is
really only used by the send operation - once the write commands are sent,
it's not useful to have the data in the page cache anymore. For large
snapshots, bringing all data into the page cache eventually leads to the
need to evict other data from the page cache that may be more useful for
applications (and kernel susbsystems).

Even if extents are shared with the subvolume on which a snapshot is based
on and the data is currently on the page cache due to being read through
the subvolume, attempting to read the data through the snapshot will
always result in bringing a new copy of the data into another location in
the page cache (there's currently no shared memory for shared extents).

So make send evict the data it has read before if when it first opened
the inode, its mapping had no pages currently loaded: when
inode->i_mapping->nr_pages has a value of 0. Do this instead of deciding
based on the return value of filemap_range_has_page() before reading an
extent because the generic readahead mechanism may read pages beyond the
range we request (and it very often does it), which means a call to
filemap_range_has_page() will return true due to the readahead that was
triggered when processing a previous extent - we don't have a simple way
to distinguish this case from the case where the data was brought into
the page cache through someone else. So checking for the mapping number
of pages being 0 when we first open the inode is simple, cheap and it
generally accomplishes the goal of not trashing the page cache - the
only exception is if part of data was previously loaded into the page
cache through the snapshot by some other process, in that case we end
up not evicting any data send brings into the page cache, just like
before this change - but that however is not the common case.

Example scenario, on a box with 32G of RAM:

  $ btrfs subvolume create /mnt/sv1
  $ xfs_io -f -c "pwrite 0 4G" /mnt/sv1/file1

  $ btrfs subvolume snapshot -r /mnt/sv1 /mnt/snap1

  $ free -m
                 total        used        free      shared  buff/cache   available
  Mem:           31937         186       26866           0        4883       31297
  Swap:           8188           0        8188

  # After this we get less 4G of free memory.
  $ btrfs send /mnt/snap1 >/dev/null

  $ free -m
                 total        used        free      shared  buff/cache   available
  Mem:           31937         186       22814           0        8935       31297
  Swap:           8188           0        8188

The same, obviously, applies to an incremental send.

Signed-off-by: Filipe Manana <fdmanana@suse.com>
---
 fs/btrfs/send.c | 85 +++++++++++++++++++++++++++++++++++++++++++++++--
 1 file changed, 82 insertions(+), 3 deletions(-)

Message ID	2bab1466c746d6162a24cd8f31adb6b6fe954787.1652784088.git.fdmanana@suse.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-btrfs-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id C1E99C433EF for <linux-btrfs@archiver.kernel.org>; Tue, 17 May 2022 10:48:05 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S242197AbiEQKsA (ORCPT <rfc822;linux-btrfs@archiver.kernel.org>); Tue, 17 May 2022 06:48:00 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56290 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344652AbiEQKrm (ORCPT <rfc822;linux-btrfs@vger.kernel.org>); Tue, 17 May 2022 06:47:42 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 55F303FBEA for <linux-btrfs@vger.kernel.org>; Tue, 17 May 2022 03:47:38 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 2619AB81803 for <linux-btrfs@vger.kernel.org>; Tue, 17 May 2022 10:47:37 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 625C5C34113 for <linux-btrfs@vger.kernel.org>; Tue, 17 May 2022 10:47:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1652784455; bh=oCpBUN/kXXIEBa4acvsidnBZ6ffgjardAGo5u7gFNyE=; h=From:To:Subject:Date:In-Reply-To:References:From; b=O82LS9WckcMcWQc8IhIbNl0dEWMpDSUf86cq1fZB6nPkFuZkQaVGQIlaNIBMBLEw+ LvPoLj6qy+VLutbsOFxKNEfuvA+SHyXqGeBYl91Ix/0wvuXmNM5JznuYy036EzNvjH dxwFwEzurUZ9vPS9/eh1AHiy7xKn/cPlcUjZfG5yXsJyP0jDKUKPRVKJx67gq2x62B oNEAu+r9S+DN/QGOFL78M5H9N3y6IECedYJmkpydipnxruH1LYcLX+t9Lt+L4Q+uIp bGj0VawvK6v2OrFepfhMkgYhOQmzXZYvXV0MDVuMUjhV+BG9S0d7WxSDwwWusWd27k nCf30A/0D0ziA== From: fdmanana@kernel.org To: linux-btrfs@vger.kernel.org Subject: [PATCH 2/2] btrfs: send: avoid trashing the page cache Date: Tue, 17 May 2022 11:47:30 +0100 Message-Id: <2bab1466c746d6162a24cd8f31adb6b6fe954787.1652784088.git.fdmanana@suse.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <cover.1652784088.git.fdmanana@suse.com> References: <cover.1652784088.git.fdmanana@suse.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-btrfs.vger.kernel.org> X-Mailing-List: linux-btrfs@vger.kernel.org
Series	btrfs: teach send to avoid trashing the page cache with data \| expand [v2,0/2] btrfs: teach send to avoid trashing the page cache with data [1/2] btrfs: send: keep the current inode open while processing it [2/2] btrfs: send: avoid trashing the page cache

[2/2] btrfs: send: avoid trashing the page cache

Commit Message

Patch