[2/2] btrfs: send: avoid trashing the page cache

From: Filipe Manana <fdmanana@suse.com>

From: Filipe Manana <fdmanana@suse.com>

A send operation reads extent data using the buffered IO path for getting
extent data to send in write commands and this is both because it's simple
and to make use of the generic readahead infrastructure, which results in
a massive speedup.

However this fills the page cache with data that, most of the time, is
really only used by the send operation - once the write commands are sent,
it's not useful to have the data in the page cache anymore. For large
snapshots, bringing all data into the page cache eventually leads to the
need to evict other data from the page cache that may be more useful for
applications (and kernel susbsystems).

Even if extents are shared with the subvolume on which a snapshot is based
on and the data is currently on the page cache due to being read through
the subvolume, attempting to read the data through the snapshot will
always result in bringing a new copy of the data into another location in
the page cache (there's currently no shared memory for shared extents).

So make send evict the data it has read before if when it first opened
the inode, its mapping had no pages currently loaded: when
inode->i_mapping->nr_pages has a value of 0. Do this instead of deciding
based on the return value of filemap_range_has_page() before reading an
extent because the generic readahead mechanism may read pages beyond the
range we request (and it very often does it), which means a call to
filemap_range_has_page() will return true due to the readahead that was
triggered when processing a previous extent - we don't have a simple way
to distinguish this case from the case where the data was brought into
the page cache through someone else. So checking for the mapping number
of pages being 0 when we first open the inode is simple, cheap and it
generally accomplishes the goal of not trashing the page cache - the
only exception is if part of data was previously loaded into the page
cache through the snapshot by some other process, in that case we end
up not evicting any data send brings into the page cache, just like
before this change - but that however is not the common case.

Example scenario, on a box with 32G of RAM:

  $ btrfs subvolume create /mnt/sv1
  $ xfs_io -f -c "pwrite 0 4G" /mnt/sv1/file1

  $ btrfs subvolume snapshot -r /mnt/sv1 /mnt/snap1

  $ free -m
                 total        used        free      shared  buff/cache   available
  Mem:           31937         186       26866           0        4883       31297
  Swap:           8188           0        8188

  # After this we get less 4G of free memory.
  $ btrfs send /mnt/snap1 >/dev/null

  $ free -m
                 total        used        free      shared  buff/cache   available
  Mem:           31937         186       22814           0        8935       31297
  Swap:           8188           0        8188

The same, obviously, applies to an incremental send.

Signed-off-by: Filipe Manana <fdmanana@suse.com>
---
 fs/btrfs/send.c | 80 +++++++++++++++++++++++++++++++++++++++++++++++--
 1 file changed, 77 insertions(+), 3 deletions(-)

Message ID	41782eb393b3a3ba47f4a7fce1cbb33433c3f994.1651770555.git.fdmanana@suse.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-btrfs-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 4A3D1C433FE for <linux-btrfs@archiver.kernel.org>; Thu, 5 May 2022 17:16:28 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1382604AbiEERUG (ORCPT <rfc822;linux-btrfs@archiver.kernel.org>); Thu, 5 May 2022 13:20:06 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41054 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1382097AbiEERUE (ORCPT <rfc822;linux-btrfs@vger.kernel.org>); Thu, 5 May 2022 13:20:04 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 787775C35F for <linux-btrfs@vger.kernel.org>; Thu, 5 May 2022 10:16:23 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 354F3B82E0A for <linux-btrfs@vger.kernel.org>; Thu, 5 May 2022 17:16:22 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 78FA1C385A4 for <linux-btrfs@vger.kernel.org>; Thu, 5 May 2022 17:16:20 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1651770981; bh=sN4XP4aJZS4O9h6XehO/6ff+t1XCAeiDeeGorpSKPhY=; h=From:To:Subject:Date:In-Reply-To:References:From; b=Dc3r8Gnk5N8lZiJeytQjnS14lcPtRtT62nvMLhCIMWx9Af1TMH78T0BYgjtJDjLhE uBQTBlX+ZT6VTVamoGrJe2j55tKEk+Z6sjEOwBYwcSEDKE21zjLbr0zt0hi6MeQmjR hGAMHW4Uuu8CTlDbn3dURfUT6U7EdCGvvK86rUHoz7gBriLJSCA2YaCAZgeddzEhTN V9tiag65R1Kp6+ngb9iJPlxVaguJ7UPuEonALrmsgalbPHzKlEut4/6gLFE2j2x5Au MQz/Kk/PpYcPPzex4pS51oetzkdzMX72HXaP7nZBpB4YT2S0zW4jDi0JDBrll+jvW7 /UN29QxeM8E4w== From: fdmanana@kernel.org To: linux-btrfs@vger.kernel.org Subject: [PATCH 2/2] btrfs: send: avoid trashing the page cache Date: Thu, 5 May 2022 18:16:15 +0100 Message-Id: <41782eb393b3a3ba47f4a7fce1cbb33433c3f994.1651770555.git.fdmanana@suse.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <cover.1651770555.git.fdmanana@suse.com> References: <cover.1651770555.git.fdmanana@suse.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-btrfs.vger.kernel.org> X-Mailing-List: linux-btrfs@vger.kernel.org
Series	btrfs: teach send to avoid trashing the page cache with data \| expand [0/2] btrfs: teach send to avoid trashing the page cache with data [1/2] btrfs: send: keep the current inode open while processing it [2/2] btrfs: send: avoid trashing the page cache

[2/2] btrfs: send: avoid trashing the page cache

Commit Message

Comments

Patch