docs: add cache storage backend docs

This fleshes out the documentation to include information along with
examples and detailed use cases for the cache backends. The general
format and style follows from the style of the build driver docs.

Eventually, these docs will be included on docs.docker.com along with
the rest of the build docs.

Signed-off-by: Justin Chadwell <me@jedevc.com>

@ -0,0 +1,62 @@
# Azure Blob Storage cache storage
> **Warning**
>
> The `azblob` cache is currently unreleased. You can use it today by using
> the `moby/buildkit:master` image in your Buildx driver.
The `azblob` cache store uploads your resulting build cache to
[Azure's blob storage service](https://azure.microsoft.com/en-us/services/storage/blobs/).
> **Note**
>
> The `azblob` cache storage backend requires using a different driver than
> the default `docker` driver - see more information on selecting a driver
> [here](../drivers/index.md). To create a new docker-container driver (which
> can act as a simple drop-in replacement):
>
> ```console
> docker buildx create --use --driver=docker-container
> ```
To import and export your cache using the `azblob` storage backend, we use the
`--cache-to` and `--cache-from` flags and point them to our desired blob using
the required `account_url` and `name` parameters:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=azblob,account_url=https://myaccount.blob.core.windows.net,name=my_image \
--cache-from type=azblob,account_url=https://myaccount.blob.core.windows.net,name=my_image .
```
## Authentication
To authenticate to Azure to read from and write to the cache, the following
parameters are required:
* `secret_access_key`: specifies the primary or secondary [account key](https://docs.microsoft.com/en-us/azure/storage/common/storage-account-keys-manage)
  for your Azure Blob Storage account
While these can be manually provided, if left unspecified, then the credentials
for Azure will be pulled from the BuildKit server's environment following the
environment variables scheme for the [Azure Go SDK](https://docs.microsoft.com/en-us/azure/developer/go/azure-sdk-authentication).
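As a sketch, the variables read by that credential chain can be exported in the
BuildKit server's environment like so (the names follow the Azure Go SDK's
environment-based credentials; the values here are placeholders only):

```shell
# Placeholder service-principal credentials for the Azure Go SDK's
# environment-based credential chain; set these where buildkitd runs,
# not on the buildx client.
export AZURE_TENANT_ID="00000000-0000-0000-0000-000000000000"
export AZURE_CLIENT_ID="00000000-0000-0000-0000-000000000000"
export AZURE_CLIENT_SECRET="placeholder-secret"
```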
> **Warning**
>
> These environment variables **must** be specified on the BuildKit server, not
> the `buildx` client.
## Cache options
The `azblob` cache has lots of parameters to adjust its behavior.
### Cache mode
See [Registry - Cache mode](./registry.md#cache-mode) for more information.
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `azblob` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#azure-blob-storage-cache-experimental).

@ -0,0 +1,109 @@
# GitHub Actions cache storage
> **Warning**
>
> The `gha` cache is currently experimental. You can use it today with current
> releases of Buildx and BuildKit; however, the interface and behavior do not
> have any stability guarantees and may change in future releases.
The `gha` cache utilizes the [GitHub-provided Actions cache](https://github.com/actions/cache)
available from inside your CI execution environment. This is the recommended
cache to use inside your GitHub action pipelines, as long as your use case
falls within the [size and usage limits set by GitHub](https://docs.github.com/en/actions/using-workflows/caching-dependencies-to-speed-up-workflows#usage-limits-and-eviction-policy).
> **Note**
>
> The `gha` cache storage backend requires using a different driver than
> the default `docker` driver - see more information on selecting a driver
> [here](../drivers/index.md). To create a new docker-container driver (which
> can act as a simple drop-in replacement):
>
> ```console
> docker buildx create --use --driver=docker-container
> ```
>
> If you're using the official [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action),
> then this step will be automatically run for you.
To import and export your cache using the `gha` storage backend we use the
`--cache-to` and `--cache-from` flags configured with the appropriate
[Authentication](#authentication) parameters:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=gha,url=...,token=... \
--cache-from type=gha,url=...,token=... .
```
By default, caches are scoped by branch - this ensures a separate cache
environment for the main branch, as well as for each feature branch. However,
if you build multiple images as part of your build, then caching them both to
the same `gha` scope will overwrite all but the last build, leaving only the
final cache.
To prevent this, you can manually specify a cache scope name using the `scope`
parameter (in this case, including the branch name set [by GitHub](https://docs.github.com/en/actions/learn-github-actions/environment-variables#default-environment-variables)
to ensure each branch gets its own cache):
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=gha,url=...,token=...,scope=$GITHUB_REF_NAME-image \
--cache-from type=gha,url=...,token=...,scope=$GITHUB_REF_NAME-image .
$ docker buildx build --push -t <user>/<image2> \
--cache-to type=gha,url=...,token=...,scope=$GITHUB_REF_NAME-image2 \
--cache-from type=gha,url=...,token=...,scope=$GITHUB_REF_NAME-image2 .
```
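The scope naming above can also be computed in a small helper, which keeps
scopes consistent if you build many images in one pipeline. This is only a
sketch: the `scope_for` function is hypothetical, and `GITHUB_REF_NAME` is
stubbed so the example is self-contained (in a real workflow, GitHub sets it
for you):

```shell
# Stub the branch name; GitHub Actions normally provides GITHUB_REF_NAME.
GITHUB_REF_NAME="feature-x"

# Combine branch and image name so builds of different images on the same
# branch don't overwrite each other's cache.
scope_for() {
  echo "${GITHUB_REF_NAME}-$1"
}

# Print the build command rather than running it, since this is a sketch.
echo "docker buildx build --push -t user/image \
  --cache-to type=gha,scope=$(scope_for image) \
  --cache-from type=gha,scope=$(scope_for image) ."
```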
GitHub's [cache scoping rules](https://docs.github.com/en/actions/advanced-guides/caching-dependencies-to-speed-up-workflows#restrictions-for-accessing-a-cache)
still apply, with the cache only populated from the current branch, the base
branch, and the default branch for a run.
## Authentication
To authenticate against the [GitHub Actions Cache service API](https://github.com/tonistiigi/go-actions-cache/blob/master/api.md#authentication)
to read from and write to the cache, the following parameters are required:
* `url`: cache server URL (default `$ACTIONS_CACHE_URL`)
* `token`: access token (default `$ACTIONS_RUNTIME_TOKEN`)
If the parameters are not specified, then their values will be pulled from the
environment variables. If invoking the `docker buildx` command manually from an
inline step, then the variables must be manually exposed, for example, by using
[crazy-max/ghaction-github-runtime](https://github.com/crazy-max/ghaction-github-runtime)
as a workaround.
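For example, a minimal sketch of that workaround (assuming `v2` of the
`crazy-max/ghaction-github-runtime` action), where the first step exposes
`ACTIONS_CACHE_URL` and `ACTIONS_RUNTIME_TOKEN` to the manual `buildx`
invocation that follows:

```yaml
# Hypothetical workflow fragment; step and image names are placeholders.
- name: Expose GitHub runtime
  uses: crazy-max/ghaction-github-runtime@v2
- name: Build
  run: |
    docker buildx build --push -t user/app \
      --cache-to type=gha,mode=max \
      --cache-from type=gha .
```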
### With [docker/build-push-action](https://github.com/docker/build-push-action)
When using the [docker/build-push-action](https://github.com/docker/build-push-action),
the `url` and `token` parameters are automatically populated, with no need to
manually specify them, or include any additional workarounds.
For example:
```yaml
-
  name: Build and push
  uses: docker/build-push-action@v3
  with:
    context: .
    push: true
    tags: user/app:latest
    cache-from: type=gha
    cache-to: type=gha,mode=max
```
<!-- FIXME: cross-link to ci docs once docs.docker.com has them -->
## Cache options
The `gha` cache has lots of parameters to adjust its behavior.
### Cache mode
See [Registry - Cache mode](./registry.md#cache-mode) for more information.
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `gha` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#github-actions-cache-experimental).

@ -0,0 +1,87 @@
# Cache storage backends
To ensure that builds run as quickly as possible, BuildKit automatically caches
the build result in its own internal cache. However, in addition to this simple
cache, BuildKit also supports exporting cache for a specific build to an
external location to easily import into future builds.
This external cache becomes almost essential in CI/CD environments, where there
may be little-to-no persistence between runs, but it's still important to keep
the runtime of image builds as low as possible.
> **Warning**
>
> If you use secrets or credentials inside your build process, then ensure you
> manipulate them using the dedicated [--secret](../../reference/buildx_build.md#secret)
> functionality instead of using manually `COPY`d files or build `ARG`s. Using
> manually managed secrets like this with exported cache could lead to an
> information leak.
Currently, Buildx supports the following cache storage backends:
- `inline` image cache, that embeds the build cache into the image, and is pushed to
the same location as the main output result - note that this only works for the
`image` exporter.
([guide](./inline.md))
- `registry` image cache, that embeds the build cache into a separate image, and
pushes to a dedicated location separate from the main output.
([guide](./registry.md))
- `local` directory cache, that writes the build cache to a local directory on
the filesystem.
([guide](./local.md))
- `gha` GitHub Actions cache, that uploads the build cache to [GitHub](https://docs.github.com/en/rest/actions/cache)
(experimental).
([guide](./gha.md))
- `s3` AWS cache, that uploads the build cache to an [AWS S3 bucket](https://aws.amazon.com/s3/)
(unreleased).
([guide](./s3.md))
- `azblob` Azure cache, that uploads the build cache to [Azure Blob Storage](https://azure.microsoft.com/en-us/services/storage/blobs/)
(unreleased).
([guide](./azblob.md))
To use any of the above backends, you first need to specify it on build with
the [`--cache-to`](../../reference/buildx_build.md#cache-to) option to export
the cache to your storage backend of choice, then use the [`--cache-from`](../../reference/buildx_build.md#cache-from)
option to import the cache from the storage backend into the current build.
Unlike the local BuildKit cache (which is always enabled), **all** of the cache
storage backends have to be explicitly exported to and then explicitly imported
from. Note that all cache exporters except for the `inline` cache require
[selecting an alternative Buildx driver](../drivers/index.md).
For example, to perform a cache import and export using the [`registry` cache](./registry.md):
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image> \
--cache-from type=registry,ref=<user>/<cache-image> .
```
> **Warning**
>
> As a general rule, each cache writes to some location - no location can be
> written to twice without overwriting the previously cached data. If you want
> to maintain multiple separately scoped caches (e.g. a cache per git branch),
> then ensure that you specify different locations in your cache exporters.
While [currently](https://github.com/moby/buildkit/pull/3024) only a single
cache exporter is supported, you can import from as many remote caches as you
like. For example, a common pattern is to use the cache of both the current
branch as well as the main branch (again using the [`registry` cache](./registry.md)):
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image>:<branch> \
--cache-from type=registry,ref=<user>/<cache-image>:<branch> \
--cache-from type=registry,ref=<user>/<cache-image>:master .
```
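The same pattern can be scripted when the list of fallback branches grows. A
minimal sketch, assuming hypothetical `CACHE_IMAGE` and `BRANCH` values (note
that only one `--cache-to` is given, per the single-exporter limitation above):

```shell
# Hypothetical cache location and branch; substitute your own.
CACHE_IMAGE="user/cache-image"
BRANCH="feature-x"

# Build one --cache-from flag per candidate branch, so the feature branch
# falls back to the master branch's cache when its own is missing.
FROM_FLAGS=""
for ref in "$BRANCH" master; do
  FROM_FLAGS="$FROM_FLAGS --cache-from type=registry,ref=${CACHE_IMAGE}:${ref}"
done

# Print the resulting command rather than running it, since this is a sketch.
echo "docker buildx build --push -t user/image \
  --cache-to type=registry,ref=${CACHE_IMAGE}:${BRANCH}${FROM_FLAGS} ."
```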

@ -0,0 +1,45 @@
# Inline cache storage
The `inline` cache store is the simplest way to get an external cache and is
easy to get started with if you're already building and pushing an image.
However, it doesn't scale to multi-stage builds as well as the other
backends do, and it doesn't offer separation between your output artifacts and
your cache output. This means that if you're using a particularly complex build
flow, or not exporting your images directly to a registry, then you may want to
consider the [registry](./registry.md) cache.
To export your cache using `inline` storage, we can pass `type=inline` to the
`--cache-to` option:
```console
$ docker buildx build --push -t <user>/<image> --cache-to type=inline .
```
Alternatively, you can also export inline cache by setting the build-arg
`BUILDKIT_INLINE_CACHE`, instead of using the `--cache-to` flag:
```console
$ docker buildx build --push -t <user>/<image> --build-arg BUILDKIT_INLINE_CACHE=1 .
```
To import the resulting cache on a future build, we can pass `type=registry` to
`--cache-from` which lets us extract the cache from inside a docker image:
```console
$ docker buildx build --push -t <user>/<image> --cache-from type=registry,ref=<user>/<image> .
```
Most of the time, you'll want to have each build both import and export cache
from the cache store - to do this, specify both `--cache-to` and `--cache-from`:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=inline \
--cache-from type=registry,ref=<user>/<image> .
```
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `inline` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#inline-push-image-and-cache-together).

@ -0,0 +1,94 @@
# Local cache storage
The `local` cache store is a simple cache option that stores your cache as
files in a local directory on your filesystem (using an [OCI image layout](https://github.com/opencontainers/image-spec/blob/main/image-layout.md)
for the underlying directory structure). It's a good choice if you're just
testing locally, or want the flexibility to manage a shared storage option
yourself, by manually uploading it to a file server, or mounting it over an
NFS volume.
> **Note**
>
> The `local` cache storage backend requires using a different driver than
> the default `docker` driver - see more information on selecting a driver
> [here](../drivers/index.md). To create a new docker-container driver (which
> can act as a simple drop-in replacement):
>
> ```console
> docker buildx create --use --driver=docker-container
> ```
To import and export your cache using the `local` storage backend, we use the
`--cache-to` and `--cache-from` flags and point them to our desired local
directory using the `dest` and `src` parameters respectively:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=local,dest=path/to/local/dir \
--cache-from type=local,src=path/to/local/dir .
```
If the cache does not exist, then the cache import step will fail, but the
build will continue.
## Cache versioning
If you inspect the cache directory manually, you can see the resulting OCI
image layout:
```console
$ ls cache
blobs index.json ingest
$ cat cache/index.json | jq
{
  "schemaVersion": 2,
  "manifests": [
    {
      "mediaType": "application/vnd.oci.image.index.v1+json",
      "digest": "sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707",
      "size": 1560,
      "annotations": {
        "org.opencontainers.image.ref.name": "latest"
      }
    }
  ]
}
```
As with the other cache exporters, the cache is replaced on export by
overwriting the contents of the `index.json` file. However, previous caches
will still be available if the hash of the previous cache image index is known.
These old caches will be kept indefinitely, so the local directory will
continue to grow: see [moby/buildkit#1896](https://github.com/moby/buildkit/issues/1896)
for more information.
When importing cache using `--cache-from`, you can additionally specify the
`digest` parameter to force loading an older version of the cache, for example:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=local,dest=path/to/local/dir \
--cache-from type=local,src=path/to/local/dir,digest=sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707 .
```
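A digest like the one above can be recovered from the cache directory itself.
A sketch (recreating a minimal `index.json` so the snippet is self-contained;
on a real cache you would just read the existing file):

```shell
# Recreate a minimal index.json of the shape shown earlier.
mkdir -p cache
cat > cache/index.json <<'EOF'
{"schemaVersion": 2, "manifests": [{"digest": "sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707", "size": 1560}]}
EOF

# Pull out the first digest; this can then be passed to the digest parameter.
DIGEST="$(grep -o 'sha256:[0-9a-f]*' cache/index.json | head -n1)"
echo "$DIGEST"
```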
## Cache options
The `local` cache has lots of parameters to adjust its behavior.
### Cache mode
See [Registry - Cache mode](./registry.md#cache-mode) for more information.
### Cache compression
See [Registry - Cache compression](./registry.md#cache-compression) for more information.
### OCI media types
See [Registry - OCI Media Types](./registry.md#oci-media-types) for more information.
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `local` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#local-directory-1).

@ -0,0 +1,129 @@
# Registry cache storage
The `registry` cache store can be thought of as the natural extension to the
simple `inline` cache. Unlike the `inline` cache, the `registry` cache is
entirely separate from the image, which allows for more flexible usage -
`registry`-backed cache can do everything that the inline cache can do, and
more:
- It allows for separating the cache and resulting image artifacts so that
you can distribute your final image without the cache inside.
- It can efficiently cache multi-stage builds in `max` mode, instead of only
the final stage.
- It works with other exporters for more flexibility, instead of only the
`image` exporter.
> **Note**
>
> The `registry` cache storage backend requires using a different driver than
> the default `docker` driver - see more information on selecting a driver
> [here](../drivers/index.md). To create a new docker-container driver (which
> can act as a simple drop-in replacement):
>
> ```console
> docker buildx create --use --driver=docker-container
> ```
To import and export your cache using the `registry` storage backend, we use the
`--cache-to` and `--cache-from` flags and point them to our desired image target
using the `ref` parameter:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image> \
--cache-from type=registry,ref=<user>/<cache-image> .
```
You can choose any valid reference value for `ref`, as long as it's not the
same as the target location that you push your image to. You might choose
different tags (e.g. `foo/bar:latest` and `foo/bar:build-cache`), separate
image names (e.g. `foo/bar` and `foo/bar-cache`), or even different
repositories (e.g. `docker.io/foo/bar` and `ghcr.io/foo/bar`).
If the cache does not exist, then the cache import step will fail, but the
build will continue.
## Cache options
Unlike the simple `inline` cache, the `registry` cache has lots of parameters to
adjust its behavior.
### Cache mode
Build cache can be exported in one of two modes: `min` or `max`, with either
`mode=min` or `mode=max` respectively. For example, to build the cache with
`mode=max`:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image>,mode=max \
--cache-from type=registry,ref=<user>/<cache-image> .
```
Note that only `--cache-to` needs to be modified, as `--cache-from` will
automatically extract the relevant parameters from the resulting output.
In `min` cache mode (the default), only layers that are exported into the
resulting image are cached, while in `max` cache mode, all layers are cached,
even those of intermediate steps.
While `min` cache is typically smaller (which speeds up import/export times,
and reduces storage costs), `max` cache is more likely to get more cache hits.
Depending on the complexity and location of your build, you should experiment
with both parameters to find the results that work best for you.
### Cache compression
Since `registry` cache is exported separately from the main build result, you
can specify separate compression parameters for it (which are similar to the
options provided by the `image` exporter). While the defaults have been
selected to provide a good out-of-the-box experience, you may wish to tweak the
parameters to optimize for storage vs compute costs.
To select the compression algorithm, you can use the `compression=<uncompressed|gzip|estargz|zstd>`
option. For example, to build the cache with `compression=zstd`:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image>,compression=zstd \
--cache-from type=registry,ref=<user>/<cache-image> .
```
The `compression-level=<value>` option can be used alongside the `compression`
parameter to choose a compression level for the algorithms which support it
(from 0-9 for `gzip` and `estargz` and 0-22 for `zstd`). As a general rule, the
higher the number, the smaller the resulting file will be, but the longer the
compression will take to run.
The `force-compression=<bool>` option can be enabled with `true` (and disabled
with `false`) to force re-compressing layers that have been imported from a
previous cache if the requested compression algorithm is different from the
previous compression algorithm.
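Putting these options together, a sketch combining an algorithm, a compression
level, and forced re-compression might look like:

```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image>,compression=zstd,compression-level=10,force-compression=true \
--cache-from type=registry,ref=<user>/<cache-image> .
```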
> **Note**
>
> The `gzip` and `estargz` compression methods use the [`compress/gzip` package](https://pkg.go.dev/compress/gzip),
> while `zstd` uses the [`github.com/klauspost/compress/zstd` package](https://github.com/klauspost/compress/tree/master/zstd).
### OCI media types
Like the `image` exporter, the `registry` cache exporter supports creating
images with Docker media types or with OCI media types. To enable OCI Media
types, you can use the `oci-mediatypes` property:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=registry,ref=<user>/<cache-image>,oci-mediatypes=true \
--cache-from type=registry,ref=<user>/<cache-image> .
```
This property is only meaningful with the `--cache-to` flag. When fetching
cache, BuildKit will auto-detect the correct media types to use.
<!-- FIXME: link to image exporter guide when it's written -->
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `registry` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#registry-push-image-and-cache-separately).

@ -0,0 +1,64 @@
# AWS S3 cache storage
> **Warning**
>
> The `s3` cache is currently unreleased. You can use it today by using the
> `moby/buildkit:master` image in your Buildx driver.
The `s3` cache store uploads your resulting build cache to [AWS's S3 file storage service](https://aws.amazon.com/s3/),
into a bucket of your choice.
> **Note**
>
> The `s3` cache storage backend requires using a different driver than
> the default `docker` driver - see more information on selecting a driver
> [here](../drivers/index.md). To create a new docker-container driver (which
> can act as a simple drop-in replacement):
>
> ```console
> docker buildx create --use --driver=docker-container
> ```
To import and export your cache using the `s3` storage backend, we use the
`--cache-to` and `--cache-from` flags and point them to our desired bucket using
the required `region` and `bucket` parameters:
```console
$ docker buildx build --push -t <user>/<image> \
--cache-to type=s3,region=eu-west-1,bucket=my_bucket,name=my_image \
--cache-from type=s3,region=eu-west-1,bucket=my_bucket,name=my_image .
```
## Authentication
To authenticate to S3 to read from and write to the cache, the following
parameters are required:
* `access_key_id`: access key ID
* `secret_access_key`: secret access key
* `session_token`: session token
While these can be manually provided, if left unspecified, then the credentials
for S3 will be pulled from the BuildKit server's environment following the
environment variables scheme for the [AWS Go SDK](https://docs.aws.amazon.com/sdk-for-go/v1/developer-guide/configuring-sdk.html).
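As a sketch, the standard AWS SDK variables can be exported in the BuildKit
server's environment like so (the values are placeholder examples from AWS's
own documentation, not real credentials):

```shell
# Placeholder credentials; set these where buildkitd runs, not on the
# buildx client.
export AWS_ACCESS_KEY_ID="AKIAIOSFODNN7EXAMPLE"
export AWS_SECRET_ACCESS_KEY="wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY"
export AWS_REGION="eu-west-1"
```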
> **Warning**
>
> These environment variables **must** be specified on the BuildKit server, not
> the `buildx` client.
<!-- FIXME: update once https://github.com/docker/buildx/pull/1294 is released -->
## Cache options
The `s3` cache has lots of parameters to adjust its behavior.
### Cache mode
See [Registry - Cache mode](./registry.md#cache-mode) for more information.
## Further reading
For an introduction to caching see [Optimizing builds with cache management](https://docs.docker.com/build/building/cache).
For more information on the `s3` cache backend, see the [BuildKit README](https://github.com/moby/buildkit#s3-cache-experimental).