Install
ArchiveBox is primarily distributed as a Python package via `pip`, but it also depends on some system packages that can be installed manually or automatically with Docker. It usually takes less than 10 minutes to get ArchiveBox set up and running.
CPU Architectures: `amd64` (`x86_64`), `arm64` (`aarch64`), `arm7` (including 64-bit Intel/AMD, M1/M2/etc. Macs, Raspberry Pi >= 3)
macOS: >=10.12 (with `brew`)
Linux: Ubuntu (>= 18.04), Debian (>= 10), etc. (with `apt`)
BSD: FreeBSD, OpenBSD, NetBSD, etc. (with `pkg`)
Other systems are not officially supported but may work with degraded functionality:
Windows: Via [[Docker]], Docker in WSL2, or WSL2 without Docker (not recommended)
Other UNIX systems: Arch, Nix, Guix, Fedora, SUSE, CentOS, etc.
Note: On `arm7` the `playwright` package is not available, so `chromium` must be installed manually if needed.
You will also need at least 500MB of RAM (bare minimum); 2GB or more is recommended. You may be able to reduce the RAM requirements if you disable all the Chrome-based archiving methods with `USE_CHROME=False`.
It's also recommended to use a filesystem with compression and/or deduplication (e.g. ZFS or BTRFS) for maximum efficiency.
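The architecture and memory requirements above can be checked with a short script before installing. This is only a rough sketch: the alias table (which `platform.machine()` strings map to `amd64`/`arm64`/`arm7`), the 2048MB reading of "2GB", and the helper names are assumptions made for illustration, not part of ArchiveBox.

```python
import os
import platform

# Assumed mapping from common platform.machine() values to the
# architecture names used in the list above.
ARCH_ALIASES = {
    "x86_64": "amd64",
    "amd64": "amd64",
    "aarch64": "arm64",
    "arm64": "arm64",
    "armv7l": "arm7",
}

MIN_RAM_MB = 500            # bare minimum stated above
RECOMMENDED_RAM_MB = 2048   # "2GB or more", read here as 2048MB (assumption)


def normalize_arch(machine):
    """Return amd64/arm64/arm7, or None if the value is not in the alias table."""
    return ARCH_ALIASES.get(machine.lower())


def classify_ram(total_mb):
    """Classify a RAM size in MB against the thresholds above."""
    if total_mb < MIN_RAM_MB:
        return "insufficient"
    if total_mb < RECOMMENDED_RAM_MB:
        return "minimum"
    return "recommended"


def total_ram_mb():
    """Best-effort physical RAM in MB; None where os.sysconf cannot report it."""
    try:
        page_size = os.sysconf("SC_PAGE_SIZE")
        page_count = os.sysconf("SC_PHYS_PAGES")
    except (AttributeError, ValueError, OSError):
        return None
    return page_size * page_count // (1024 * 1024)


if __name__ == "__main__":
    arch = normalize_arch(platform.machine())
    print("architecture:", arch or "not on the supported list")
    ram = total_ram_mb()
    print("RAM:", "unknown" if ram is None else f"{ram} MB ({classify_ram(ram)})")
```

A result of `minimum` suggests the machine may only be comfortable with the Chrome-based methods disabled.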
Docker Compose is the recommended way to get ArchiveBox, as it includes all the extras out-of-the-box and provides the best security and upgrade UX.
If you don't already have docker installed, follow the official instructions to get Docker on Linux, macOS, or Windows: https://docs.docker.com/install/#supported-platforms ➡️
Then follow the Quickstart guide and read the [[Docker]] wiki page for next steps. ➡️
You can also run Dockerized ArchiveBox using UNRAID/TrueNAS/Proxmox/etc. or Kubernetes.
More info:
ArchiveBox Docker Quickstart + Usage + Configuration + Upgrading documentation
If you're on Linux with `apt`, FreeBSD with `pkg`, or macOS with `brew`, there is an optional auto-setup script provided (or scroll further down for manual install instructions).
The script explains what it installs beforehand, and will prompt for user confirmation before making any changes to your system. The script uses Docker if already installed, but you can decline and it will attempt to auto-install everything using `apt`/`brew`/`pkg` + `pip` instead.
After running the setup script, continue with the Quickstart guide... ➡️
See here for our thoughts on the inherent limitations of `curl | sh` as an install method...
If you'd rather not use Docker or our auto-install script, you can follow these manual setup instructions to install ArchiveBox and its dependencies using `pip` & your system package manager of choice (e.g. `apt`, `brew`, `pkg`, `nix`, etc.).
See our Dependencies documentation for the full list of dependencies and how they're used. Not all the dependencies are required for all modes: if you disable some archive methods, you can skip installing their dependencies. For example, if you set `FETCH_MEDIA=False` you don't need to install `yt-dlp`, and if you set `FETCH_[PDF,SCREENSHOT,DOM]=False` you don't need `chromium`.
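To make the toggle-to-dependency relationship concrete, here is a small sketch that works out which binaries are still needed for a given set of toggles. The mapping covers only the two examples named above, and `TOGGLE_TO_BINARY` and `required_binaries` are hypothetical helpers, not ArchiveBox APIs; the Dependencies documentation remains the authoritative list.

```python
# Hypothetical mapping built only from the two examples in the text:
# FETCH_MEDIA needs yt-dlp; FETCH_PDF/SCREENSHOT/DOM need chromium.
TOGGLE_TO_BINARY = {
    "FETCH_MEDIA": "yt-dlp",
    "FETCH_PDF": "chromium",
    "FETCH_SCREENSHOT": "chromium",
    "FETCH_DOM": "chromium",
}


def required_binaries(config):
    """Return the sorted binaries still needed.

    `config` maps toggle names to booleans; a toggle that is not
    mentioned is treated as enabled (an assumption of this sketch).
    """
    needed = set()
    for toggle, binary in TOGGLE_TO_BINARY.items():
        if config.get(toggle, True):
            needed.add(binary)
    return sorted(needed)


if __name__ == "__main__":
    print(required_binaries({"FETCH_MEDIA": False}))
```

Note that `chromium` only drops out once all three Chrome-based toggles are off, since any one of them still needs the browser.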
More info:
For help installing these, see the Manual Setup, [[Troubleshooting]] and [[Chromium Install]] pages.
To use specific binaries for dependencies, see the Configuration: Dependencies page.
To disable unwanted dependencies, see the Configuration: Archive Method Toggles page.
Be aware, you'll need to keep all these packages up-to-date yourself over time!
Make sure you have Homebrew installed first.
Make sure `apt` and `dpkg` are available on your system.
See the Quickstart instructions for other operating systems and release channels. ➡️
pip
It's recommended to `pip`-install ArchiveBox even if you already installed `archivebox` with one of our official `apt`/`brew` packages above (sometimes the `pip` version is newer). This step also ensures you have the latest `yt-dlp` and `playwright` versions.
archivebox setup
Finish installing the runtime JS dependencies that live inside your collection data dir (e.g. readability, singlefile, mercury).
Make sure the `pip`-installed version of `archivebox` is available in your `$PATH`.
(ensure the version shown is the most recent available from Releases)
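If both a system package and a `pip` install are present, it can be unclear which `archivebox` the shell will pick up. The sketch below lists every matching executable on `$PATH` in lookup order, using only the Python standard library; `find_all_on_path` is a hypothetical helper written for this example.

```python
import os
import shutil


def find_all_on_path(name, path=None):
    """List every executable called `name` on PATH, in lookup order."""
    if path is None:
        path = os.environ.get("PATH", "")
    matches = []
    for directory in path.split(os.pathsep):
        if not directory:
            continue  # skip empty PATH entries
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            matches.append(candidate)
    return matches


if __name__ == "__main__":
    # shutil.which returns the first match, i.e. the one the shell would run.
    print("active:", shutil.which("archivebox"))
    for candidate in find_all_on_path("archivebox"):
        print("found: ", candidate)
```

If the first entry is not the `pip`-installed copy, reordering `$PATH` or removing the older package is one way to resolve it.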
Make sure to run `archivebox` as an unprivileged user (i.e. without `sudo` / not logged in as `root`).
Make sure to run all commands, including `archivebox version`, `archivebox help`, etc., inside a data directory (or a new empty dir that will become a data dir).
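The two checks above (unprivileged user, correct working directory) can be sketched as a small pre-flight script. Treating a directory as a data dir when it contains `index.sqlite3` is an assumption of this example, as are the helper names; adjust the marker if your collection looks different.

```python
import os


def is_root():
    """True when running with an effective UID of 0 (always False on Windows)."""
    geteuid = getattr(os, "geteuid", None)  # os.geteuid does not exist on Windows
    return geteuid is not None and geteuid() == 0


def looks_like_data_dir(path, marker="index.sqlite3"):
    """True if `path` is empty (a future data dir) or contains the marker file.

    The marker filename is an assumption made for this sketch.
    """
    entries = os.listdir(path)
    return not entries or marker in entries


if __name__ == "__main__":
    if is_root():
        print("warning: running as root; switch to an unprivileged user")
    if not looks_like_data_dir(os.getcwd()):
        print("warning: current directory is neither empty nor a data dir")
```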
If you have issues getting Chromium / Google Chrome or other dependencies working with ArchiveBox, see the [[Chromium Install]] and [[Troubleshooting]] pages for more detailed instructions.
For guides on how to import URLs from different sources into ArchiveBox, check out Input Formats and Preparing URLs. ➡️
See our [[Usage]] Wiki documentation page for more examples.
Make sure all apt/brew/pkg/etc. dependencies from above are installed & up-to-date first.
Check our more detailed Upgrading documentation and Release Notes if you run into any problems. ➡️
Read [[Usage]] to learn how to use the ArchiveBox CLI and HTML output
Read [[Configuration]] to learn about the various archive method options
Read [[Scheduled Archiving]] to learn how to set up automatic daily archiving
Read [[Publishing Your Archive]] if you want to host your archive for others to access online
Read [[Troubleshooting]] if you encounter any problems