Configuration

ZenCrepes is configured through a file that can be shared between zqueue, zindexer and zapi, although each of them may also use its own. The full default configuration is available in zindexer/src/components/config/defaultConfig.ts

This part of the documentation breaks the configuration down by type. For simplicity (and compactness) it shows JSON-style snippets, but the actual configuration file is written in the equivalent YML.

Elasticsearch

  elasticsearch: {
    host: 'http://127.0.0.1:9200',
    sslCa: '',
    cloudId: '',
    username: '',
    password: '',
    sysIndices: {
      sources: 'sources', // this index is used to store sources data
      datasets: 'datasets', // this index is used to store data about available index types
      config: 'config', // this index is used to store zencrepes configuration
    },
    oneIndexPerSource: false,
    dataIndices: {
      githubRepos: 'gh_repos',
      githubIssues: 'gh_issues_',
      githubPullrequests: 'gh_prs_',
      githubVulnerabilities: 'gh_vulns_',
      githubStargazers: 'gh_stargazers_watchers_',
      githubWatchers: 'gh_stargazers_watchers_',
      githubProjects: 'gh_projects_',
      githubMilestones: 'gh_milestones_',
      githubLabels: 'gh_labels_',
      githubReleases: 'gh_releases_',
      jiraIssues: 'j_issues_',
      jiraProjects: 'j_projects_',
      circleciPipelines: 'cci_pipelines_',
      circleciEnvvars: 'cci_envvars_',
      circleciInsights: 'cci_insights_',
    },
  },

Connecting to Elasticsearch

You can connect to a local Elasticsearch instance or to an instance running in Elastic Cloud (by providing your cloudId and credentials). The details on how the connection is established are available here
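As a rough illustration of the two connection modes, the sketch below turns the connection keys of the elasticsearch section into client options. buildClientOptions and both interfaces are hypothetical names. The option names (node, cloud, auth, ssl) follow the 7.x official JavaScript client, and the rule that cloudId takes precedence over host is an assumption; the actual ZenCrepes connection code may differ.

```typescript
// Connection-related keys of the elasticsearch section shown above.
interface EsConnectionConfig {
  host: string;
  sslCa: string;
  cloudId: string;
  username: string;
  password: string;
}

// Subset of client options, named after the 7.x JavaScript client (assumption).
interface EsClientOptions {
  node?: string;
  cloud?: { id: string };
  auth?: { username: string; password: string };
  ssl?: { ca: string };
}

// Hypothetical helper: prefer Elastic Cloud when cloudId is set, else use host.
function buildClientOptions(cfg: EsConnectionConfig): EsClientOptions {
  const options: EsClientOptions = {};
  if (cfg.cloudId !== '') {
    options.cloud = { id: cfg.cloudId };
  } else {
    options.node = cfg.host;
  }
  if (cfg.username !== '' && cfg.password !== '') {
    options.auth = { username: cfg.username, password: cfg.password };
  }
  if (cfg.sslCa !== '') {
    // Passed through unchanged: whether sslCa holds a path or the
    // certificate content is not stated in this documentation.
    options.ssl = { ca: cfg.sslCa };
  }
  return options;
}
```

With the default values shown earlier, only node would be set, pointing at http://127.0.0.1:9200.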

System indices

ZenCrepes uses system indices to store some of the data it needs to operate.

sources: Used by zindexer to determine which data elements to fetch. For example, you could scan an entire organization but only enable data fetching for some of its repositories. This index contains the list of sources and whether each one is enabled.
datasets: Deprecated
config: Holds ZenCrepes configuration (which datasets are available, which facets to show, how to display and export the list view). This index is used by zindexer and zapi.

One Index per Source

Although this feature is currently available and operational, its usefulness is still a bit uncertain. The original idea was to store data in one index per source, which would later allow index-level permissions to be configured. Using this feature on an organization with a lot of repositories will generate a lot of indices.

Data indices

These are the indices used to store data from the different datasets. These must be identical between zindexer, zapi and zqueue configurations.
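Since the dataIndices values have to match across components, a small check can catch drift between two configuration files. This is a minimal sketch, not part of ZenCrepes: findIndexMismatches is a hypothetical helper that compares two dataIndices objects and lists the keys whose values differ or are missing on one side.

```typescript
// A dataIndices section: dataset key -> index name (or prefix).
type DataIndices = Record<string, string>;

// Hypothetical helper: returns the sorted keys whose index name differs
// (or is missing) between two configurations.
function findIndexMismatches(a: DataIndices, b: DataIndices): string[] {
  const mismatches: string[] = [];
  for (const key of Object.keys(a).concat(Object.keys(b))) {
    if (a[key] !== b[key] && mismatches.indexOf(key) === -1) {
      mismatches.push(key);
    }
  }
  return mismatches.sort();
}

// Example values: the second configuration uses a different labels prefix.
const zindexerIndices: DataIndices = { githubIssues: 'gh_issues_', githubLabels: 'gh_labels_' };
const zapiIndices: DataIndices = { githubIssues: 'gh_issues_', githubLabels: 'labels_' };
const mismatchedKeys = findIndexMismatches(zindexerIndices, zapiIndices);
```

An empty result means the two configurations agree on every data index.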

Redis

Use this configuration option to set up the Redis host, which zqueue uses for its queue mechanism.

GitHub

  github: {
    enabled: true,
    username: 'YOUR_USERNAME',
    token: 'YOUR_TOKEN',
    fetch: {
      maxNodes: 30,
      maxParallel: 1,
      delayBetweenCalls: 1000,
    },
    // Define a match between a points label and numbers
    storyPointsLabels: [
      { label: 'xx-small', points: 1 },
      { label: 'x-small', points: 2 },
      { label: 'small', points: 3 },
      { label: 'medium', points: 5 },
      { label: 'large', points: 8 },
      { label: 'x-large', points: 13 },
    ],
    // The webhook configuration is used by zqueue to determine next course of action
    webhook: {
      secret: 'PLEASE_CHANGE_ME',
      // The array of events matches an event name with an entity type as processed by ZenCrepes
      // You shouldn't need to change these values
      events: [
        { githubEvent: 'label', zencrepesEntity: 'labels' },
        { githubEvent: 'repository', zencrepesEntity: 'repos' },
        { githubEvent: 'pull_request', zencrepesEntity: 'pullrequests' },
        { githubEvent: 'issues', zencrepesEntity: 'issues' },
        { githubEvent: 'vulnerabilities', zencrepesEntity: 'vulnerabilities' },
        { githubEvent: 'stargazers', zencrepesEntity: 'star' },
        { githubEvent: 'watchers', zencrepesEntity: 'watch' },
        { githubEvent: 'project', zencrepesEntity: 'projects' },
        { githubEvent: 'milestone', zencrepesEntity: 'milestones' },
        { githubEvent: 'release', zencrepesEntity: 'releases' },
      ],
      // Save the raw webhook "as-received" in a timeline fashion (no overwrite)
      timelinePayload: {
        includeGithubEvents: ['*'],
        excludeGithubEvents: ['push', 'create'],
        esIndexPrefix: 'gh_webhook_timeline_',
      },
      // Save the node data contained in the webhook
      // Overwrite previous node state if the same node with same ID is received
      // One index per node type
      nodePayload: {
        includeGithubEvents: ['*'],
        excludeGithubEvents: ['push', 'create'],
        esIndexPrefix: 'gh_webhook_',
      },
      // Execute a call to GitHub to fetch the latest data in the same format as zindexer (using the same GraphQL query)
      // Data is fed into the indices specified in the elasticsearch section
      fetchNode: {
        includeGithubEvents: ['*'],
        excludeGithubEvents: [''],
      },
    },
  },

This section configures how ZenCrepes interacts with GitHub.

Fetch

zindexer and zqueue must abide by GitHub's rate limits and account for the performance implications of some calls.

maxNodes: Number of nodes to fetch at a time during bulk fetch operations. For example, when fetching issues, the system limits itself to 30 issues per call by default. GitHub supports a maximum of 100, but for the most complex GraphQL queries (or the largest repositories), 100 can result in errors. The goal is to balance the number of queries against the risk of errors; the sweet spot is usually in the 30-50 range.
maxParallel: Used by zqueue to limit the number of parallel threads processing the queue (1 in the default configuration).
delayBetweenCalls: Minimum delay to observe between consecutive calls to the GitHub API.
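To make the trade-off behind maxNodes concrete, the sketch below estimates how many calls a bulk fetch needs and how much delay accumulates between them. planBulkFetch is a hypothetical helper. It assumes calls are made one after the other and that delayBetweenCalls applies between each pair of consecutive calls; the result is expressed in whatever unit delayBetweenCalls uses.

```typescript
// The fetch section of the github configuration shown above.
interface FetchConfig {
  maxNodes: number;
  maxParallel: number;
  delayBetweenCalls: number;
}

// Hypothetical helper: number of calls and cumulative delay for a bulk fetch,
// assuming strictly sequential calls (maxParallel is not used here).
function planBulkFetch(totalNodes: number, fetch: FetchConfig): { calls: number; totalDelay: number } {
  const calls = Math.ceil(totalNodes / fetch.maxNodes);
  // A delay only sits between two consecutive calls, so there is one
  // fewer delay than there are calls.
  const totalDelay = Math.max(calls - 1, 0) * fetch.delayBetweenCalls;
  return { calls, totalDelay };
}

// 1000 issues with the default settings versus the GitHub maximum of 100.
const defaultPlan = planBulkFetch(1000, { maxNodes: 30, maxParallel: 1, delayBetweenCalls: 1000 });
const maxPlan = planBulkFetch(1000, { maxNodes: 100, maxParallel: 1, delayBetweenCalls: 1000 });
```

Here defaultPlan needs 34 calls while maxPlan needs 10, which is the saving that has to be weighed against the higher risk of errors on complex queries.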

Story Points

storyPointsLabels is an array of label and points pairs. When parsing an issue, zindexer and zqueue look for one of these labels and assign the corresponding points to the issue.
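The lookup described above can be sketched as follows. pointsForIssue is a hypothetical helper, not the ZenCrepes implementation; exact, case-sensitive matching and returning the first match are assumptions.

```typescript
// One entry of the storyPointsLabels array.
interface StoryPointsLabel {
  label: string;
  points: number;
}

// Same values as the default configuration shown above.
const storyPointsLabels: StoryPointsLabel[] = [
  { label: 'xx-small', points: 1 },
  { label: 'x-small', points: 2 },
  { label: 'small', points: 3 },
  { label: 'medium', points: 5 },
  { label: 'large', points: 8 },
  { label: 'x-large', points: 13 },
];

// Hypothetical helper: returns the points of the first issue label found in
// the mapping, or null when the issue carries no points label.
function pointsForIssue(issueLabels: string[], mapping: StoryPointsLabel[]): number | null {
  for (const name of issueLabels) {
    for (const entry of mapping) {
      if (entry.label === name) {
        return entry.points;
      }
    }
  }
  return null;
}

// An issue labelled 'bug' and 'medium' gets 5 points.
const issuePoints = pointsForIssue(['bug', 'medium'], storyPointsLabels);
```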

Webhook

The webhook section has three elements (timelinePayload, nodePayload and fetchNode) that define how the payload should be processed based on the received events. Each of them accepts the following settings:

includeGithubEvents: Specifies which events should be processed by the queue. Using '*' indicates that all events should be processed.
excludeGithubEvents: Specifies which events should not be processed by the queue.

Specifying '*' in includeGithubEvents and 'push' in excludeGithubEvents means the queue will process all events except push.
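The include/exclude rule can be expressed as a small predicate. This is a minimal sketch under the assumption that an exclusion always wins over an inclusion, which is consistent with the example above; shouldProcess is a hypothetical name, not a zqueue function.

```typescript
// The two filter settings shared by timelinePayload, nodePayload and fetchNode.
interface EventFilter {
  includeGithubEvents: string[];
  excludeGithubEvents: string[];
}

// Hypothetical helper: an event is processed when it is included
// (explicitly or through '*') and not excluded.
function shouldProcess(event: string, filter: EventFilter): boolean {
  const included =
    filter.includeGithubEvents.indexOf('*') !== -1 ||
    filter.includeGithubEvents.indexOf(event) !== -1;
  const excluded = filter.excludeGithubEvents.indexOf(event) !== -1;
  return included && !excluded;
}

// Default timelinePayload settings from the configuration above.
const timelineFilter: EventFilter = {
  includeGithubEvents: ['*'],
  excludeGithubEvents: ['push', 'create'],
};
const processIssues = shouldProcess('issues', timelineFilter);
const processPush = shouldProcess('push', timelineFilter);
```

With these settings an issues event is processed and a push event is skipped.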

CircleCI

  circleci: {
    enabled: true,
    token: 'YOUR_TOKEN',
  },

CircleCI configuration is straightforward: specify the token you generated in CircleCI.

Jira

  jira: [
    {
      name: 'SERVER_1',
      enabled: true,
      config: {
        username: 'username',
        password: 'password',
        host: 'https://jira.myhost.org',
        fields: {
          issues: [
            { jfield: 'issueType', zfield: 'issueType' },
            { jfield: 'parent', zfield: 'parent' },
            { jfield: 'project', zfield: 'project' },
            { jfield: 'fixVersions', zfield: 'fixVersions' },
            { jfield: 'resolution', zfield: 'resolution' },
            { jfield: 'resolutiondate', zfield: 'closedAt' },
            { jfield: 'watches', zfield: 'watches' },
            { jfield: 'created', zfield: 'createdAt' },
            { jfield: 'priority', zfield: 'priority' },
            { jfield: 'versions', zfield: 'versions' },
            { jfield: 'issuelinks', zfield: 'links' },
            { jfield: 'issuetype', zfield: 'type' },
            { jfield: 'assignee', zfield: 'assignee' },
            { jfield: 'resolution', zfield: 'resolution' },
            { jfield: 'updated', zfield: 'updatedAt' },
            { jfield: 'status', zfield: 'status' },
            { jfield: 'description', zfield: 'description' },
            { jfield: 'summary', zfield: 'summary' },
            { jfield: 'creator', zfield: 'creator' },
            { jfield: 'subtasks', zfield: 'subtasks' },
            { jfield: 'reporter', zfield: 'reporter' },
            { jfield: 'environment', zfield: 'environment' },
            { jfield: 'duedate', zfield: 'dueAt' },
            { jfield: 'customfield_10114', zfield: 'points' },
            {
              jfield: 'customfield_11115',
              zfield: 'originalPoints',
            },
            {
              jfield: 'customfield_11112',
              zfield: 'parentInitiative',
            },
            {
              jfield: 'customfield_10314',
              zfield: 'parentEpic',
            },
          ],
        },
        excludeDays: ['1900-01-01'],
        fetch: {
          maxNodes: 30,
        },
      },
    },
  ],

Jira configuration takes an array of Jira servers.
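The fields.issues array pairs a Jira field name (jfield) with the name used on the ZenCrepes side (zfield). How zindexer applies it is not described here; the sketch below assumes a plain rename in which unmapped Jira fields are dropped. mapIssueFields is a hypothetical helper.

```typescript
// One entry of the fields.issues array.
interface FieldMapping {
  jfield: string;
  zfield: string;
}

// Hypothetical helper: copies each mapped Jira field to its ZenCrepes name.
// Fields absent from the mapping or from the Jira payload are left out.
function mapIssueFields(
  jiraFields: Record<string, unknown>,
  mappings: FieldMapping[],
): Record<string, unknown> {
  const result: Record<string, unknown> = {};
  for (const { jfield, zfield } of mappings) {
    if (Object.prototype.hasOwnProperty.call(jiraFields, jfield)) {
      result[zfield] = jiraFields[jfield];
    }
  }
  return result;
}

// Three mappings taken from the configuration above, applied to a
// made-up Jira payload.
const sampleMappings: FieldMapping[] = [
  { jfield: 'created', zfield: 'createdAt' },
  { jfield: 'duedate', zfield: 'dueAt' },
  { jfield: 'customfield_10114', zfield: 'points' },
];
const mappedIssue = mapIssueFields(
  { created: '2020-01-01', customfield_10114: 5, votes: 3 },
  sampleMappings,
);
```

Custom field identifiers such as customfield_10114 are specific to each Jira server, so each entry of the jira array would typically carry its own mapping.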
