5.9 KiB

Raw Blame History

Filesystem Resolution 01

Author(s): cgwippern@google.com
Reviewers:
Date: 2020-02-12
Status: [Reviewed/Cancelled/Under implementation/Complete]

Here is a brief explanation of the Statuses

Reviewed: The proposal PR has been accepted, merged and ready for implementation.
Under implementation: An accepted proposal is being implemented by actual work. Note: The design might change in this phase based on issues during implementation.
Cancelled: During or before implementation the proposal was cancelled. It could be due to:
- other features added which made the current design proposal obsolete.
- No longer a priority.
Complete: This feature/change is implemented.

Background

Kaniko builds Docker image layers as overlay filesystem layers; specifically it creates a tar file which contains the entire content of a given layer in the overlay filesystem. Each overlay layer corresponds to one image layer.

Overlay filesystems should only contain the objects changed in each layer; meaning that if only one file changes between some layer A and some B, layer B would only contain a single file (the one that changed).

To accomplish this, Kaniko walks the entire filesystem to discover every object. Some of these objects may actually be a symlink to another object in the filesystem; in these cases we must consider both the link and the target object.

Kaniko also maintains a set of whitelisted (aka ignored) filepaths. Any object which matches one of these filepaths should be ignored by kaniko.

This results in a 3 dimensional search space

changed relative to previous layer
symlink
whitelisted

Kaniko must also track which objects are referred to by multiple stages; this functionality is out of scope for this proposal.

This search space is currently managed in an inconsistent and somewhat ad-hoc way; code that manages the various search dimensions is spread out and duplicated. There are also a number of edge cases which continue to cause bugs.

The search space dimensions cannot be reduced or substituted.

Currently there are a number of bugs around symlinks incorrectly resolved, whitelists not respected, and unchanged files added to layers.

Design

During snapshotting resolution of the filesystem and the objects it contains should be handled consistently and with a pair of APIs.

First API

The first API will generate a list of filepaths to resolve

The API should take a limited set of arguments

A filesystem root

The API should return only two arguments

A set of filepaths to resolve
error

The API will walk the filesystem similar to how Snapshotting of the full filesystem is currently handled.

It will optimize the walk by skipping any whitelisted directories.

It will not resolve symlinks.

Second API

The second API will resolve filepaths to be added to the layer

Callers of the first API can pass the output to the second API

Some callers of the second API will not use the first API (e.g. Copy commands)

Callers of this API should not be concerned with the type of object at a given filepath (e.g. symlink or not).
Callers of this API should not be concerned with whether a given path is whitelisted.
This API should return a set of filepaths which should be added to the layer without further processing.

The API should take a limited set of arguments

A list of absolute filepaths to scan
The whitelist

The API should return only two arguments

A set of filepaths which should be added to the layer without further processing
error

The API will iterate over the set of filepaths and for each item

check whether it is whitelisted; if it is, skip it
check whether it is a symlink; if it is, resolve the link ancestor and the target, add the link ancestor to the output, check whether the target is whitelisted and if not add the target to the output

Open Issues/Questions

Given some link /foo/link/bar whose target is a whitelisted path such as /var/run, should /foo/link/bar be added to the layer?

Resolution: Not Yet Resolved

According to this comment the ancestor directories (parent, grandparent, etc) must also be added to the layer to preserve the permissions on those directories. This brings into question whether any filtering needs to happen on these ancestors. IIUC the current whitelist logic it is possible for /some/dir to be whitelisted but not /some/dir/containing-a-file.txt. If filtering needs to be applied to these ancestors does it make most sense to handle this within the proposed filtering API?

Resolution: Not Yet Resolved

The proposal currently states that the list of files returned from the API should be immediately added to the layer, but this would imply that diff'ing existing files, finding newly created files, and handling deleted files would have already been done. It may be advantageous to handle these outside of the API in order to reduce scope and complexity. If these are handled outside of the API how can we decouple and encapsulate these two functions?

Resolution: Not Yet Resolved

Implementation plan

Write the new API
Write tests for the new API
Integrate the new API into existing code

Integration test plan

Add integration tests to the existing suite which cover the known bugs

Notes

Given some path /usr/lib/foo which is a link to /etc/foo/

And /etc/foo contains /etc/foo/bar.txt

Adding a link /usr/lib/foo/bar.txt => /etc/foo/bar.txt will break the image

In a linux shell this raises an error

$ ls /usr/lib/bar
=> /usr/lib/bar/foo.txt
$ ln -s /usr/lib/bar barlink
$ ln -s /usr/lib/bar/foo.txt barlink/foo.txt
=> ERROR

Given some path /usr/foo/bar which is a link to /dev/null, and /dev is whitelisted /dev/null should not be added to the image.

5.9 KiB Raw Blame History

Filesystem Resolution 01

Background

Design

First API

Second API

Open Issues/Questions

Implementation plan

Integration test plan

Notes

5.9 KiB

Raw Blame History