Provide a means of filtering repositories from which we gather metrics
We gather git commit metrics for project repositories.
By default, we collect metrics for all project repositories.
There are some cases, however, where we want to skip collecting metrics. The Eclipse Temurin project, for example, maintains a number of mirrors of OpenJDK content that is not Eclipse content and should be indexed for project metrics; it also has some binaries repositories that I'd like to skip. The Eclipse Oniro project does something similar with mirrors of content from Yocto.
My thinking is that the easiest way to do this would be to add a field (that allows us to specify regular expressions for exclusions.
e.g., for Eclipse Temurin, the dash process is hardcoded to skip repositories that match the following patterns:
^https:\/\/github\.com\/adoptium\/jdk
^https:\/\/github\.com\/adoptium\/alpine-jdk
^https:\/\/github\.com\/adoptium\/aarch32-jdk
^https:\/\/github\.com\/adoptium\/riscv\-port-jdk
^https:\/\/github\.com\/adoptium\/.+\-binaries$
If push comes to shove, I can express this as a single regular expression, but I'm thinking that I'd like to avoid making this any more complex than necessary. Further, I'm pretty sure that we can make this work with a single field (i.e., we can specify GitHub, GitLab, and Gerrit exclusions in one field).
This field's values need to be included in the API.