[v5,4/4] fs: unicode: Add utf8 module and a unicode layer

utf8data.h_shipped has a large database table which is an auto-generated
decodification trie for the unicode normalization functions.
It is not necessary to load this large table in the kernel if no
filesystem is using it, hence make UTF-8 encoding loadable by converting
it into a module.
Modify the file called unicode-core which will act as a layer for
unicode subsystem. It will load the UTF-8 module and access it's functions
whenever any filesystem that needs unicode is mounted.
Also, indirect calls using function pointers are slow, use static calls to
avoid overhead caused in case of repeated indirect calls. Static calls
improves the performance by directly calling the functions as opposed to
indirect calls.

Signed-off-by: Shreeya Patel <shreeya.patel@collabora.com>
---
Changes in v5
  - Rename global variables and default static call functions for better
    understanding
  - Make only config UNICODE_UTF8 visible and config UNICODE to be always
    enabled provided UNICODE_UTF8 is enabled.  
  - Improve the documentation for Kconfig
  - Improve the commit message.

Changes in v4
  - Return error from the static calls instead of doing nothing and
    succeeding even without loading the module.
  - Remove the complete usage of utf8_ops and use static calls at all
    places.
  - Restore the static calls to default values when module is unloaded.
  - Decrement the reference of module after calling the unload function.
  - Remove spinlock as there will be no race conditions after removing
    utf8_ops.

Changes in v3
  - Add a patch which checks if utf8 is loaded before calling utf8_unload()
    in ext4 and f2fs filesystems
  - Return error if strscpy() returns value < 0
  - Correct the conditions to prevent NULL pointer dereference while
    accessing functions via utf8_ops variable.
  - Add spinlock to avoid race conditions.
  - Use static_call() for preventing speculative execution attacks.

Changes in v2
  - Remove the duplicate file from the last patch.
  - Make the wrapper functions inline.
  - Remove msleep and use try_module_get() and module_put()
    for ensuring that module is loaded correctly and also
    doesn't get unloaded while in use.
  - Resolve the warning reported by kernel test robot.
  - Resolve all the checkpatch.pl warnings.

 fs/unicode/Kconfig        |  17 ++-
 fs/unicode/Makefile       |   5 +-
 fs/unicode/unicode-core.c | 241 +++++++----------------------------
 fs/unicode/unicode-utf8.c | 256 ++++++++++++++++++++++++++++++++++++++
 include/linux/unicode.h   | 123 +++++++++++++++---
 5 files changed, 426 insertions(+), 216 deletions(-)
 create mode 100644 fs/unicode/unicode-utf8.c

Message ID	20210329204240.359184-5-shreeya.patel@collabora.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-fsdevel-owner@kernel.org> sender: shreeya) with ESMTPSA id 08A971F40EFE From: Shreeya Patel <shreeya.patel@collabora.com> To: tytso@mit.edu, adilger.kernel@dilger.ca, jaegeuk@kernel.org, chao@kernel.org, krisman@collabora.com, ebiggers@google.com, drosen@google.com, ebiggers@kernel.org, yuchao0@huawei.com Cc: linux-ext4@vger.kernel.org, linux-kernel@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net, linux-fsdevel@vger.kernel.org, kernel@collabora.com, andre.almeida@collabora.com Subject: [PATCH v5 4/4] fs: unicode: Add utf8 module and a unicode layer Date: Tue, 30 Mar 2021 02:12:40 +0530 Message-Id: <20210329204240.359184-5-shreeya.patel@collabora.com> In-Reply-To: <20210329204240.359184-1-shreeya.patel@collabora.com> References: <20210329204240.359184-1-shreeya.patel@collabora.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	Make UTF-8 encoding loadable \| expand [v5,0/4] Make UTF-8 encoding loadable [v5,1/4] fs: unicode: Use strscpy() instead of strncpy() [v5,2/4] fs: unicode: Rename function names from utf8 to unicode [v5,3/4] fs: unicode: Rename utf8-core file to unicode-core [v5,4/4] fs: unicode: Add utf8 module and a unicode layer

[v5,4/4] fs: unicode: Add utf8 module and a unicode layer

Commit Message

Comments

Patch