LSM: Revive security_task_alloc() hook.

From: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>

Casey Schaufler wrote:
> On 1/4/2017 3:00 AM, Tetsuo Handa wrote:
> > We switched from "struct task_struct"->security to "struct cred"->security
> > in Linux 2.6.29. But not all LSM modules were happy with that change.
> > TOMOYO LSM module is an example which want to use per "struct task_struct"
> > security blob, for TOMOYO's security context is defined based on "struct
> > task_struct" rather than "struct cred". AppArmor LSM module is another
> > example which want to use it, for AppArmor is currently abusing the cred
> > a little bit to store the change_hat and setexeccon info. Although
> > security_task_free() hook was revived in Linux 3.4 because Yama LSM module
> > wanted to release per "struct task_struct" security blob,
> > security_task_alloc() hook and "struct task_struct"->security field were
> > not revived. Nowadays, we are getting proposals of lightweight LSM modules
> > which want to use per "struct task_struct" security blob. PTAGS LSM module
> > and CaitSith LSM module which are currently under proposal for inclusion
> > also want to use it. Therefore, it will be time to revive
> > security_task_alloc() hook.
> 
> I think you've made a pretty clear argument.
> 

Thank you.

> > We are already allowing multiple concurrent LSM modules (up to one fully
> > armored module which uses "struct cred"->security field or exclusive hooks
> > like security_xfrm_state_pol_flow_match(), plus unlimited number of
> > lightweight modules which do not use "struct cred"->security nor exclusive
> > hooks) as long as they are built into the kernel. Since multiple LSM
> > modules might want to use "struct task_struct"->security field, we need to
> > somehow calculate and allocate enough size for that field. Casey is trying
> > to calculate and allocate enough size for "struct cred"->security field,
> > but that approach is not applicable for LKM based LSM modules. On the other
> > hand, lightweight LSM modules (e.g. Yama) do not always allocate "struct
> > task_struct"->security field. If we tolerate managing per "struct
> > task_struct" security blobs outside of "struct task_struct", we don't need
> > to calculate and allocate enough size for "struct task_struct"->security
> > field, we can save memory by using address of "struct task_struct" as
> > a hash key for searching corresponding security blobs, and as a result
> > we can also allow LKM based LSM modules. Therefore, this patch does not
> > revive "struct task_struct"->security field.
> 
> While I agree with your conclusion that it's unnecessary to
> revive the security field in the task structure now, I think
> we are going to want it in the not too distant future. We can
> leave that for the first module that uses it, or we can start
> the inevitable fight with the owner of the task structure here.
> I also believe that the infrastructure managed allocation model
> can accommodate dynamic loading. I have not proposed that yet
> as it does complicate things somewhat, and the slope is already
> steep enough.
> 
> As for hashes and IDs, I hates 'em to pieces.

OK. I changed to an infrastructure managed allocation model.
Updated patch is attached in the bottom.

> > It would be possible to remember location in security_hook_heads.task_alloc
> > list and undo up to the corresponding security_hook_heads.task_free list
> > when task_alloc failed. But security_task_alloc() unlikely fails, Yama is
> > safe to call task_free even if security blob was not allocated, and LKM
> > based LSM modules will anyway have to be prepared for possibility of
> > calling task_free without corresponding task_alloc call. Therefore,
> > this patch calls security_task_free() even if security_task_alloc() failed.
> 
> Yes, that is an implication of having a free hook where the
> alloc hook might fail or might never have been called. In the
> infrastructure managed case it would require some level of
> bookkeeping to ensure that the module code can determine if
> the blob includes data for that module. I have a design for
> doing that. I don't plan to propose it this time around, but
> I am sure that I'm doing nothing to preclude it.

By embedding a boolean flag into "struct foo_blob" for module foo which
tells whether "struct foo_blob" is initialized, foo can do bookkeeping.
Thus, always calling security_task_free() will be fine.

On 1/4/2017 4:21 AM, Jose Bollo wrote:
>> + * @task_alloc:
>> + *      @task task being allocated.
>> + *      Handle allocation of task-related resources. Note that task_free is
>> + *      called even if task_alloc failed. This means that all task_free users
>> + *      have to be prepared for task_free being called without corresponding
>> + *      task_alloc call. Since the address of @task is guaranteed to remain
>> + *      unchanged between task_alloc call and task_free call, task_free users
>> + *      can use the address of @task for checking whether task_free is called
>> + *      without corresponding task_alloc call.
> Is it possible to add a comment on the state of the task: is it fully
> initialised? parly only? not initialised at all?

What does the state of @task mean? If you meant the state of security blob of
@task, it is initialized with 0 in the updated patch attached in the bottom.
If you really meant the state of @task, it is "partially initialized" because
security_task_alloc() is called from copy_process() which duplicates current
thread's "struct task_struct" and modifies it as @task.

>> @@ -1479,6 +1489,7 @@
>>       int (*file_open)(struct file *file, const struct cred *cred);
>>  
>>       int (*task_create)(unsigned long clone_flags);
>> +     int (*task_alloc)(struct task_struct *task);
> I suggest to add the 'clone_flags' as below
>
>      int (*task_alloc)(
>                     struct task_struct *task,
>                     unsigned long clone_flags);
>
> It would allow to treat CLONE_THREAD and/or CLONE_NEW... in a specific
> way.

OK. I added it. Now, I'm tempted to eliminate security_task_create() call.

Creating a new thread is unlikely prohibited by security policy, for
fork()/execve()/exit() is fundamental of how processes are managed in Unix.
If a program is known to create a new thread, it is likely that permission
to create a new thread is given to that program. Therefore, a situation
where security_task_create() returns an error is likely that the program was
exploited and lost control. Even if SELinux failed to check permission to
create a thread at security_task_create(), SELinux can later check it at
security_task_alloc(). Since the new thread is not yet visible from the rest
of the system, nobody can do bad things using the new thread. What we waste
will be limited to some initialization steps such as dup_task_struct(),
copy_creds() and audit_alloc() in copy_process(). I think we can tolerate
these overhead for unlikely situation. What do SELinux people think?

Casey Schaufler wrote:
> On 1/4/2017 3:00 AM, Tetsuo Handa wrote:
> >  include/linux/lsm_hooks.h | 14 +++++++++++++-
> >  include/linux/security.h  |  6 ++++++
> >  kernel/fork.c             |  4 ++++
> >  security/security.c       | 15 +++++++++++++++
> >  4 files changed, 38 insertions(+), 1 deletion(-)
> 
> I would expect to have at least one module that uses
> the revived hook included.

OK, but not in this post. If SELinux can use security_task_alloc()
instead of security_task_create(), SELinux will become one of modules
that use the revived hook. ;-)

Casey Schaufler wrote:
> On 1/4/2017 4:21 AM, Jose Bollo wrote:
> > Hashing is an interesting approach for low bandwidth requests.
> >
> > I was more thinking on a struct within task like below:
> >
> > #ifdef CONFIG_SECURITY
> >   struct {
> > #    ifdef CONFIG_SECURITY_APPARMOR
> >         void *apparmor;
> > #    endif
> > #    ifdef CONFIG_SECURITY_PTAGS
> >         void *ptags;
> > #    endif
> >   } security;
> > #endif
> >
> > It potential defines 2 "security" fields: security.apparmo and security.ptags
> 
> This is not going to be popular with distributions like Ubuntu
> that compile in all security modules. You're usually going to
> have wasted space in the task structure doing this.

Yes, this can become a problem for those who don't need it.

When commit be6d3e56a6b9b3a4 ("introduce new LSM hooks where vfsmount is
available.") was merged into Linux 2.6.29 in order to allow TOMOYO (which
was merged into Linux 2.6.30) to use "struct vfsmount *" argument for
calculating an absolute pathname of the file, CONFIG_SECURITY_PATH was
introduced because these hooks are needed by only TOMOYO. Although AppArmor
(which needs these hooks) was merged into Linux 2.6.36, CONFIG_SECURITY_PATH
was not removed because these hooks are considered as overhead for SELinux
and SMACK. Not calling unnecessary hooks and not wasting memory by unused
modules are important for those who don't need it.

In the updated patch attached in the bottom, I used simple array of
"unsigned long" where index == 0 remembers size of array and index > 0
are used by LSM modules. Modules which need small bytes occupy multiple
index numbers for sizeof("struct xxx") bytes, whereas modules which
need large bytes occupy only one index number for sizeof("struct yyy *")
bytes and allocate sizeof("struct yyy") bytes separately.

My question is whether SELinux and SMACK can tolerate always calling
security_task_alloc() hook and defining "struct task_struct"->t_security field
(which costs only sizeof(unsigned long *) bytes). Since Fedora/RHEL supports
only SELinux, my customers still cannot calculate absolute pathname when using
AKARI (an LKM based LSM module similar to TOMOYO) due to CONFIG_SECURITY_PATH=n.
If security_task_alloc() hook and "struct task_struct"->t_security field are
considered as overhead and enclosed with "#ifdef CONFIG_SECURITY_TASK" ...
"#endif" clause, the same thing will happen. ;-(

Below is an updated patch. It does not include changes for how to allocate
memory for initial thread's security blobs. We need to know how many bytes
needs to be allocated for initial thread's security blobs before security_init()
is called. But security_reserve_task_blob_index() which calculates amount of
needed bytes is called from security_init(). This is a chicken-or-egg syndrome.
We will need to split module registration into three steps. The first step is
call security_reserve_task_blob_index() on all modules which should be activated.
The second step is allocate memory for the initial thread's security blob.
The third step is actually activate all modules which should be activated.
The simplest way is to call registration hooks in security_init() twice, once
for the first step, once more for the third step.

>From 6eeb52d5b4f8ed22531ee8150808305b103cfe92 Mon Sep 17 00:00:00 2001
From: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Date: Fri, 6 Jan 2017 19:57:21 +0900
Subject: [PATCH] LSM: Revive security_task_alloc() hook and per "struct
 task_struct" security blob.

We switched from "struct task_struct"->security to "struct cred"->security
in Linux 2.6.29. But not all LSM modules were happy with that change.
TOMOYO LSM module is an example which want to use per "struct task_struct"
security blob, for TOMOYO's security context is defined based on "struct
task_struct" rather than "struct cred". AppArmor LSM module is another
example which want to use it, for AppArmor is currently abusing the cred
a little bit to store the change_hat and setexeccon info. Although
security_task_free() hook was revived in Linux 3.4 because Yama LSM module
wanted to release per "struct task_struct" security blob,
security_task_alloc() hook and "struct task_struct"->security field were
not revived. Nowadays, we are getting proposals of lightweight LSM modules
which want to use per "struct task_struct" security blob. PTAGS LSM module
and CaitSith LSM module which are currently under proposal for inclusion
also want to use it. Therefore, it will be time to revive
security_task_alloc() hook and "struct task_struct"->security field.

We are already allowing multiple concurrent LSM modules (up to one fully
armored module which uses "struct cred"->security field or exclusive hooks
like security_xfrm_state_pol_flow_match(), plus unlimited number of
lightweight modules which do not use "struct cred"->security nor exclusive
hooks) as long as they are built into the kernel. Since multiple LSM
modules might want to use "struct task_struct"->security field, we need to
somehow calculate and allocate enough size for that field. Since it is
also possible that none of activated LSM modules uses that field, it is
important that we do not waste too much. Therefore, this patch implements
variable length "struct task_struct"->security field using array of
"unsigned long". By using array of unsigned long, only sizeof(unsigned
long *) bytes in "struct task_struct" will be wasted when none of activated
LSM modules uses that field.

It would be possible to remember location in security_hook_heads.task_alloc
list and undo up to the corresponding security_hook_heads.task_free list
when task_alloc failed. But security_task_alloc() unlikely fails, Yama is
safe to call task_free even if security blob was not allocated, and LKM
based LSM modules will anyway have to be prepared for possibility of
calling task_free without corresponding task_alloc call. Therefore,
this patch calls security_task_free() even if security_task_alloc() failed.

Pointer to per "struct task_struct" security blobs can be fetched using
task_security() function. The "&& index && *p >= (unsigned long) index"
check in task_security() is for now only for catching buggy built-in LSM
modules who registers after non-initial threads are created, but it will
serve as a mechianism for LKM based LSM modules to detect task_free call
without corresponding task_alloc call when LKM based LSM modules becomes
legal (e.g. __init attribute is dropped and a simple lock for serializing
module registration is added).

Signed-off-by: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Cc: John Johansen <john.johansen@canonical.com>
Cc: Paul Moore <paul@paul-moore.com>
Cc: Stephen Smalley <sds@tycho.nsa.gov>
Cc: Eric Paris <eparis@parisplace.org>
Cc: Casey Schaufler <casey@schaufler-ca.com>
Cc: Kees Cook <keescook@chromium.org>
Cc: James Morris <james.l.morris@oracle.com>
Cc: Serge E. Hallyn <serge@hallyn.com>
Cc: Jose Bollo <jobol@nonadev.net>
---
 include/linux/init_task.h |  7 +++++++
 include/linux/lsm_hooks.h | 42 +++++++++++++++++++++++++++++++++++++++++-
 include/linux/sched.h     |  3 +++
 include/linux/security.h  |  7 +++++++
 kernel/fork.c             |  4 ++++
 security/security.c       | 33 +++++++++++++++++++++++++++++++++
 6 files changed, 95 insertions(+), 1 deletion(-)

LSM: Revive security_task_alloc() hook.

Commit Message

Comments

Patch