Re: [RFC PATCH ghak90 (was ghak32) V3 07/10] audit: add support for containerid to network namespaces

Tuesday, 24 July 2018

On 2018-07-20 18:14, Paul Moore wrote:
...
 On Wed, Jun 6, 2018 at 1:03 PM Richard Guy Briggs
<rgb(a)redhat.com&gt; wrote:
 > Audit events could happen in a network namespace outside of a task
 > context due to packets received from the net that trigger an auditing
 > rule prior to being associated with a running task.  The network
 > namespace could in use by multiple containers by association to the
 > tasks in that network namespace.  We still want a way to attribute
 > these events to any potential containers.  Keep a list per network
 > namespace to track these audit container identifiiers.
 >
 > Add/increment the audit container identifier on:
 > - initial setting of the audit container identifier via /proc
 > - clone/fork call that inherits an audit container identifier
 > - unshare call that inherits an audit container identifier
 > - setns call that inherits an audit container identifier
 > Delete/decrement the audit container identifier on:
 > - an inherited audit container identifier dropped when child set
 > - process exit
 > - unshare call that drops a net namespace
 > - setns call that drops a net namespace
 >
 > See: https://github.com/linux-audit/audit-kernel/issues/92
 > See: https://github.com/linux-audit/audit-testsuite/issues/64
 > See: https://github.com/linux-audit/audit-kernel/wiki/RFE-Audit-Container-ID
 > Signed-off-by: Richard Guy Briggs <rgb(a)redhat.com&gt;
 > ---
 >  include/linux/audit.h | 23 ++++++++++++++++
 >  kernel/audit.c        | 72 +++++++++++++++++++++++++++++++++++++++++++++++++++
 >  kernel/auditsc.c      |  5 ++++
 >  kernel/nsproxy.c      |  4 +++
 >  4 files changed, 104 insertions(+)

 ...

 > diff --git a/include/linux/audit.h b/include/linux/audit.h
 > index 1e37abf..7e2e51c 100644
 > --- a/include/linux/audit.h
 > +++ b/include/linux/audit.h
 > @@ -87,6 +88,12 @@ struct audit_field {
 >         u32                             op;
 >  };
 >
 > +struct audit_contid {
 > +       struct list_head        list;
 > +       u64                     id;
 > +       refcount_t              refcount;
 > +};

 Do we need to worry about locking the audit container ID list?  Does
 the network namespace code (or some other namespace code) ensure that
 add/deletes are serialized? 
Now that you mention it, I don't have any idea.  I'll need to look into this.

...
 > @@ -156,6 +163,10 @@ extern void audit_log_task_info(struct
audit_buffer *ab,
 >                                 struct task_struct *tsk);
 >  extern int audit_log_contid(struct audit_context *context,
 >                                      char *op, u64 contid);
 > +extern struct list_head *audit_get_contid_list(const struct net *net);
 > +extern void audit_contid_add(struct net *net, u64 contid);
 > +extern void audit_contid_del(struct net *net, u64 contid);

 I wonder if we should change these function names to indicate that
 they are managing the netns/cid list?  Right now there is no mention
 of networking other than the first parameter.

 Maybe audit_netns_contid_*()? 
I was going to protest that they should be more generally named
functions to deal with namespaces rather than specifically network
namespaces, but each type of namespace will need its own accessor
functions since each type of namespace has a different namespace type
pointer.  It is tempting to try to generalize it, but that could be an
excercise for the reader if another type of namespace needs this sort of
support.

...
 > +extern void audit_switch_task_namespaces(struct nsproxy *ns,
struct task_struct *p);
 >
 >  extern int                 audit_update_lsm_rules(void);
 >
 > @@ -209,6 +220,18 @@ static inline void audit_log_task_info(struct audit_buffer
*ab,
 >  static inline int audit_log_contid(struct audit_context *context,
 >                                             char *op, u64 contid)
 >  { }
 > +static inline struct list_head *audit_get_contid_list(const struct net *net)
 > +{
 > +       static LIST_HEAD(list);
 > +       return &list;
 > +}

 Why can't we just return NULL here like a normal dummy function?  It's
 only ever used inside audit.  Actually, why is this even in
 include/linux/audit.h, couldn't we put it in kernel/audit.h or even
 just make it a static in audit.c? 
You are right, static in kernel/audit.c is sufficient.

...
 > +static inline void audit_contid_add(struct net *net, u64
contid)
 > +{ }
 > +static inline void audit_contid_del(struct net *net, u64 contid)
 > +{ }
 > +static inline void audit_switch_task_namespaces(struct nsproxy *ns, struct
task_struct *p)
 > +{ }
 > +
 >  #define audit_enabled 0
 >  #endif /* CONFIG_AUDIT */
 >
 > diff --git a/kernel/audit.c b/kernel/audit.c
 > index ba304a8..ecd2de4 100644
 > --- a/kernel/audit.c
 > +++ b/kernel/audit.c
 > @@ -106,6 +106,7 @@
 >   */
 >  struct audit_net {
 >         struct sock *sk;
 > +       struct list_head contid_list;
 >  };
 >
 >  /**
 > @@ -311,6 +312,76 @@ static struct sock *audit_get_sk(const struct net *net)
 >         return aunet->sk;
 >  }
 >
 > +/**
 > + * audit_get_contid_list - Return the audit container ID list for the given network
namespace
 > + * @net: the destination network namespace
 > + *
 > + * Description:
 > + * Returns the list pointer if valid, NULL otherwise.  The caller must ensure
 > + * that a reference is held for the network namespace while the sock is in use.
 > + */
 > +struct list_head *audit_get_contid_list(const struct net *net)
 > +{
 > +       struct audit_net *aunet = net_generic(net, audit_net_id);
 > +
 > +       return &aunet->contid_list;
 > +}
 > +
 > +void audit_contid_add(struct net *net, u64 contid)
 > +{
 > +       struct list_head *contid_list = audit_get_contid_list(net);
 > +       struct audit_contid *cont;
 > +
 > +       if (!cid_valid(contid))
 > +               return;
 > +       if (!list_empty(contid_list))
 > +               list_for_each_entry(cont, contid_list, list)
 > +                       if (cont->id == contid) {
 > +                               refcount_inc(&cont->refcount);
 > +                               return;
 > +                       }

 <thinking out loud>I think this is fine for right now, but we may need
 to be a bit clever about how we store the IDs - walking an unsorted
 list with lots of entries may prove to be too painful.</thinking out
 loud> 
Ok, agreed, it may want a hash array list...  That should be a
straightforward optimization later.

...
 > +       cont = kmalloc(sizeof(struct audit_contid),
GFP_KERNEL);
 > +       if (!cont)
 > +               return;
 > +       INIT_LIST_HEAD(&cont->list);
 > +       cont->id = contid;
 > +       refcount_set(&cont->refcount, 1);
 > +       list_add(&cont->list, contid_list);
 > +}
 > +
 > +void audit_contid_del(struct net *net, u64 contid)
 > +{
 > +       struct list_head *contid_list = audit_get_contid_list(net);
 > +       struct audit_contid *cont = NULL;
 > +       int found = 0;
 > +
 > +       if (!cid_valid(contid))
 > +               return;
 > +       if (!list_empty(contid_list))
 > +               list_for_each_entry(cont, contid_list, list)
 > +                       if (cont->id == contid) {
 > +                               found = 1;
 > +                               break;

 You don't really need the found variable, you can just move all of the
 work you need to do up into the if statement and return from inside
 the if statement. 
Fine, sure.

...
 > +                       }
 > +       if (!found)
 > +               return;
 > +       list_del(&cont->list);
 > +       if (refcount_dec_and_test(&cont->refcount))
 > +               kfree(cont);

 Don't you want to dec_and_test first and only remove it from the list
 if there are no other references? 
I don't think so.  Let me try to describe it in prose to see if I
understood this properly and see if this makes more sense: I want to
remove this audit_contid list member from this net's audit_contid list
and decrement unconditionally this member's refcount so it knows there
is one less thing pointing at it and when there is no longer anything
pointing at it, free it.

...
 > +}
 >
 > +void audit_switch_task_namespaces(struct nsproxy *ns, struct task_struct *p)
 > +{
 > +       u64 contid = audit_get_contid(p);
 > +       struct nsproxy *new = p->nsproxy;
 > +
 > +       if (!cid_valid(contid))
 > +               return;
 > +       audit_contid_del(ns->net_ns, contid);
 > +       if (new)
 > +               audit_contid_add(new->net_ns, contid);
 > +}
 > +
 >  void audit_panic(const char *message)
 >  {
 >         switch (audit_failure) {
 > @@ -1550,6 +1621,7 @@ static int __net_init audit_net_init(struct net *net)
 >                 return -ENOMEM;
 >         }
 >         aunet->sk->sk_sndtimeo = MAX_SCHEDULE_TIMEOUT;
 > +       INIT_LIST_HEAD(&aunet->contid_list);
 >
 >         return 0;
 >  }
 > diff --git a/kernel/auditsc.c b/kernel/auditsc.c
 > index ea1ee35..6ab5e5e 100644
 > --- a/kernel/auditsc.c
 > +++ b/kernel/auditsc.c
 > @@ -75,6 +75,7 @@
 >  #include <linux/uaccess.h>
 >  #include <linux/fsnotify_backend.h>
 >  #include <uapi/linux/limits.h>
 > +#include <net/net_namespace.h>
 >
 >  #include "audit.h"
 >
 > @@ -2165,6 +2166,7 @@ int audit_set_contid(struct task_struct *task, u64 contid)
 >         uid_t uid;
 >         struct tty_struct *tty;
 >         char comm[sizeof(current->comm)];
 > +       struct net *net = task->nsproxy->net_ns;
 >
 >         /* Can't set if audit disabled */
 >         if (!task->audit)
 > @@ -2185,10 +2187,13 @@ int audit_set_contid(struct task_struct *task, u64 contid)
 >         else if (cid_valid(oldcontid) && !task->audit->inherited)
 >                 rc = -EEXIST;
 >         if (!rc) {
 > +               if (cid_valid(oldcontid))
 > +                       audit_contid_del(net, oldcontid);
 >                 task_lock(task);
 >                 task->audit->contid = contid;
 >                 task->audit->inherited = false;
 >                 task_unlock(task);
 > +               audit_contid_add(net, contid);
 >         }
 >
 >         if (!audit_enabled)
 > diff --git a/kernel/nsproxy.c b/kernel/nsproxy.c
 > index f6c5d33..dcb69fe 100644
 > --- a/kernel/nsproxy.c
 > +++ b/kernel/nsproxy.c
 > @@ -27,6 +27,7 @@
 >  #include <linux/syscalls.h>
 >  #include <linux/cgroup.h>
 >  #include <linux/perf_event.h>
 > +#include <linux/audit.h>
 >
 >  static struct kmem_cache *nsproxy_cachep;
 >
 > @@ -140,6 +141,7 @@ int copy_namespaces(unsigned long flags, struct task_struct
*tsk)
 >         struct nsproxy *old_ns = tsk->nsproxy;
 >         struct user_namespace *user_ns = task_cred_xxx(tsk, user_ns);
 >         struct nsproxy *new_ns;
 > +       u64 contid = audit_get_contid(tsk);
 >
 >         if (likely(!(flags & (CLONE_NEWNS | CLONE_NEWUTS | CLONE_NEWIPC |
 >                               CLONE_NEWPID | CLONE_NEWNET |
 > @@ -167,6 +169,7 @@ int copy_namespaces(unsigned long flags, struct task_struct
*tsk)
 >                 return  PTR_ERR(new_ns);
 >
 >         tsk->nsproxy = new_ns;
 > +       audit_contid_add(new_ns->net_ns, contid);
 >         return 0;
 >  }
 >
 > @@ -224,6 +227,7 @@ void switch_task_namespaces(struct task_struct *p, struct
nsproxy *new)
 >         ns = p->nsproxy;
 >         p->nsproxy = new;
 >         task_unlock(p);
 > +       audit_switch_task_namespaces(ns, p);
 >
 >         if (ns && atomic_dec_and_test(&ns->count))
 >                 free_nsproxy(ns);
 > --
 > 1.8.3.1
 >
 > --
 > Linux-audit mailing list
 > Linux-audit(a)redhat.com
 > https://www.redhat.com/mailman/listinfo/linux-audit

 --
 paul moore
 www.paul-moore.com 
- RGB

--
Richard Guy Briggs <rgb(a)redhat.com&gt;
Sr. S/W Engineer, Kernel Security, Base Operating Systems
Remote, Ottawa, Red Hat Canada
IRC: rgb, SunRaycer
Voice: +1.647.777.2635, Internal: (81) 32635

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

Re: [RFC PATCH ghak90 (was ghak32) V3 07/10] audit: add support for containerid to network namespaces