Paul Moore <paul(a)paul-moore.com> writes:
On Thu, Apr 16, 2020 at 4:36 PM Eric W. Biederman
<ebiederm(a)xmission.com> wrote:
> Paul Moore <paul(a)paul-moore.com> writes:
> > On Mon, Mar 30, 2020 at 1:49 PM Richard Guy Briggs <rgb(a)redhat.com>
wrote:
> >> On 2020-03-30 13:34, Paul Moore wrote:
> >> > On Mon, Mar 30, 2020 at 12:22 PM Richard Guy Briggs
<rgb(a)redhat.com> wrote:
> >> > > On 2020-03-30 10:26, Paul Moore wrote:
> >> > > > On Mon, Mar 30, 2020 at 9:47 AM Richard Guy Briggs
<rgb(a)redhat.com> wrote:
> >> > > > > On 2020-03-28 23:11, Paul Moore wrote:
> >> > > > > > On Tue, Mar 24, 2020 at 5:02 PM Richard Guy Briggs
<rgb(a)redhat.com> wrote:
> >> > > > > > > On 2020-03-23 20:16, Paul Moore wrote:
> >> > > > > > > > On Thu, Mar 19, 2020 at 6:03 PM Richard
Guy Briggs <rgb(a)redhat.com> wrote:
> >> > > > > > > > > On 2020-03-18 18:06, Paul Moore
wrote:
> >
> > ...
> >
> >> > > Well, every time a record gets generated, *any* record gets
generated,
> >> > > we'll need to check for which audit daemons this record is in
scope and
> >> > > generate a different one for each depending on the content and
whether
> >> > > or not the content is influenced by the scope.
> >> >
> >> > That's the problem right there - we don't want to have to
generate a
> >> > unique record for *each* auditd on *every* record. That is a recipe
> >> > for disaster.
> >> >
> >> > Solving this for all of the known audit records is not something we
> >> > need to worry about in depth at the moment (although giving it some
> >> > casual thought is not a bad thing), but solving this for the audit
> >> > container ID information *is* something we need to worry about right
> >> > now.
> >>
> >> If you think that a different nested contid value string per daemon is
> >> not acceptable, then we are back to issuing a record that has only *one*
> >> contid listed without any nesting information. This brings us back to
> >> the original problem of keeping *all* audit log history since the boot
> >> of the machine to be able to track the nesting of any particular contid.
> >
> > I'm not ruling anything out, except for the "let's just completely
> > regenerate every record for each auditd instance".
>
> Paul I am a bit confused about what you are referring to when you say
> regenerate every record.
>
> Are you saying that you don't want to repeat the sequence:
> audit_log_start(...);
> audit_log_format(...);
> audit_log_end(...);
> for every nested audit daemon?
If it can be avoided yes. Audit performance is already not-awesome,
this would make it even worse.
As far as I can see not repeating sequences like that is fundamental
for making this work at all. Just because only the audit subsystem
should know about one or multiple audit daemons. Nothing else should
care.
> Or are you saying that you would like to literraly want to send
the same
> skb to each of the nested audit daemons?
Ideally we would reuse the generated audit messages as much as
possible. Less work is better. That's really my main concern here,
let's make sure we aren't going to totally tank performance when we
have a bunch of nested audit daemons.
So I think there are two parts of this answer. Assuming we are talking
about nesting audit daemons in containers we will have different
rulesets and I expect most of the events for a nested audit daemon won't
be of interest to the outer audit daemon.
Beyond that it should be very straight forward to keep a pointer and
leave the buffer as a scatter gather list until audit_log_end
and translate pids, and rewrite ACIDs attributes in audit_log_end
when we build the final packet. Either through collaboration with
audit_log_format or a special audit_log command that carefully sets
up the handful of things that need that information.
Hmm. I am seeing that we send skbs to kauditd and then kauditd
sends those skbs to userspace. I presume that is primary so that
sending messages to userspace does not block the process being audited.
Plus a little bit so that the retry logic will work.
I think the naive implementation would be to simply have 1 kauditd
per auditd (strictly and audit context/namespace). Although that can be
optimized if that is a problem.
Beyond that I think we would need to look at profiles to really
understand where the bottlenecks are.
> Or are you thinking of something else?
As mentioned above, I'm not thinking of anything specific, other than
let's please not have to regenerate *all* of the audit record strings
for each instance of an audit daemon, that's going to be a killer.
Maybe we have to regenerate some, if we do, what would that look like
in code? How do we handle the regeneration aspect? I worry that is
going to be really ugly.
Maybe we finally burn down the audit_log_format(...) function and pass
structs/TLVs to the audit subsystem and the audit subsystem generates
the strings in the auditd connection thread. Some of the record
strings could likely be shared, others would need to be ACID/auditd
dependent.
I think we just a very limited amount of structs/TLVs for the cases that
matter and one-one auditd and kauditd implementations we should still
be able to do everything in audit_log_end. Plus doing as much work as
possible in audit_log_end where things are still cache hot is desirable.
I'm open to any ideas people may have. We have a problem,
let's solve
it.
It definitely makes sense to look ahead to having audit daemons running
in containers, but in the grand scheme of things that is a nice to have.
Probably something we will and should get to, but we have lived a long
time without auditd running in containers so I expect we can live a
while longer.
As I understand Richard patchset for the specific case of the ACID we
are only talking about taking a subset of an existing string, and one
string at that. Not hard at all. Especially when looking at the
fundamental fact that we will need to send a different skb to
userspace, for each audit daemon.
Eric