Conda-forge Autotick Bot and Dependency Graph
Autoticking Bot
Conda-forge is a community led collection of recipes, build
infrastructure and distributions for the conda package manager
which makes it easy for developers to build and upload packages to the
conda-forge channel. This allows users to install and start using packages
simply by running conda install <package_name>
. Conda-forge stores each
package’s recipe in a separate GitHub repository, called a feedstock. When a
recipe is updated and pushed to the feedstock, a build of the conda package
is triggered, allowing users to conda install the most recent version or build
of the package. Traditionally, updating the recipe on each release has been
the job of a group of recipe maintainers. Putting this responsibility solely
on recipe maintainers, however, results in some feedstocks being neglected and
left outdated, as maintainers may not remember to bump the feedstock version
when a new version of the source code is released or may not even be aware
of a new version since in many cases recipe maintainers are not developers
on the source code. To solve this, the conda-forge autotick bot
was created to track out-of-date feedstocks and issue pull requests with
updated recipes.
The bot tracks and updates out-of-date feedstocks in four steps:
- Find the names of all feedstocks on conda-forge.
- Compute the dependency graph of packages on conda-forge found in step 1.
- Find the most recent version of each feedstock’s source code.
- Open a PR into each out-of-date feedstock updating the meta.yaml for the most recent upstream release.
These steps are run on Travis CI from four different GitHub repos as daily
cron jobs, so a different worker runs every six hours. The dependency graph
is saved in the regro/cf-graph repository, and any changes made
to the graph while the bot runs are pushed to this repo using
doctr. While doctr is generally used to deploy docs from
Travis CI to GitHub pages, it is able to push the contents of any directory
to a specified repo. So, by including doctr deploy --token --built-docs .
--deploy-repo regro/cf-graph --deploy-branch-name master .
in the
.travis.yml of the worker repos, the bot can push the files it modified to
the master branch of the regro/cf-graph repository.
Dependency Graph
To keep track of the packages on conda-forge and determine if they need to be updated, the bot stores the dependency graph of conda-forge packages. The dependency graph is a directed graph in which each node represents a package and the edges represent dependencies between packages. For example, scipy requires numpy to run, so the dependency graph has an edge from numpy to scipy. Here is the dependency graph for all packages on conda-forge, visualized using HoloViews and Bokeh: