This repository was archived by the owner on Oct 30, 2025. It is now read-only.

Project health data collection #24

@jimthematrix

Overview

This issue tracks the discussions and the work for project health indicator data.

At the TSC meeting on 1/6/2022, the policy around project quarterly reports was discussed, including the question of how to collect data that accurately reflects a project's health.

One proposal was to pre-populate the project quarterly report template with project health data. This would reduce the burden on project teams, because each report would already come with useful data. Equally important, it would give TSC members a standard set of data dimensions to review when evaluating the health of each project.

Sources of Health Data

Currently, the following sources of data are available for Hyperledger projects:

  • Linux Foundation Insights: a custom-developed application maintained by LF. The app currently covers GitHub-based activity (commits, PRs, issues) and social engagement (Rocket.Chat).
  • Project GitHub contributor reports: data pulled using the GitHub APIs and assembled into PDF reports. Currently focused on contributor counts and status: total, new, core/regular/casual, active/inactive, etc.
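To make the contributor categories above concrete, here is a minimal sketch of how such counts could be derived from a list of commits. The function name `summarize_contributors`, the input shape, and the `core_min`/`regular_min` thresholds are all hypothetical; the actual reports may define core/regular/casual differently.

```python
from collections import Counter
from datetime import date


def summarize_contributors(commits, period_start, period_end,
                           core_min=10, regular_min=3):
    """Summarize contributor counts for one reporting period.

    commits: iterable of (author, commit_date) pairs (assumed input shape).
    core_min / regular_min: assumed thresholds on commits within the period.
    """
    first_seen = {}        # author -> date of earliest known commit
    in_period = Counter()  # author -> number of commits inside the period
    for author, day in commits:
        if day > period_end:
            continue  # ignore activity after the snapshot date
        if author not in first_seen or day < first_seen[author]:
            first_seen[author] = day
        if day >= period_start:
            in_period[author] += 1

    tiers = {"core": 0, "regular": 0, "casual": 0}
    for count in in_period.values():
        if count >= core_min:
            tiers["core"] += 1
        elif count >= regular_min:
            tiers["regular"] += 1
        else:
            tiers["casual"] += 1

    # "new" means the contributor's first commit falls inside the period
    new = sum(1 for author in in_period if first_seen[author] >= period_start)
    return {
        "total": len(first_seen),
        "active": len(in_period),
        "inactive": len(first_seen) - len(in_period),
        "new": new,
        **tiers,
    }
```

In practice the `commits` input would come from the GitHub APIs mentioned above; the sketch deliberately leaves that retrieval step out.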

Future Requirements

What can be done to allow project health data to be accurately identified and properly captured?

Stable APIs

Insights currently doesn't offer stable APIs that can be used outside of Insights' own dashboard UI. Having stable APIs would allow the information to be embedded in other places, such as the TSC wiki during review meetings.

It's important to capture point-in-time snapshots for the quarterly reports and for other places where such information is used, such as the Learning Materials Development Working Group.

This requirement for snapshots can be met in one of the following ways:

  • APIs that accept a date range and reliably produce the same results for the same range
  • APIs that render the data as a report or chart image, which can then be downloaded as a PDF or image file
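As an illustration of the first option, the sketch below shows one way a date-range query could be made reproducible: filter events on a half-open range and attach a content hash, so that two snapshots of the same range can be compared. This is not an Insights API; the `snapshot` function and the event fields (`kind`, `date`) are assumptions made for the example.

```python
import hashlib
import json
from datetime import date


def snapshot(events, start, end):
    """Count events per kind within the half-open range [start, end).

    events: iterable of dicts with 'kind' (str) and 'date' (datetime.date);
    this shape is a hypothetical stand-in for whatever an API would return.
    """
    counts = {}
    for event in events:
        if start <= event["date"] < end:
            counts[event["kind"]] = counts.get(event["kind"], 0) + 1

    body = {
        "start": start.isoformat(),
        "end": end.isoformat(),
        "counts": counts,
    }
    # Serialize with sorted keys so the hash does not depend on input order
    encoded = json.dumps(body, sort_keys=True).encode("utf-8")
    return {**body, "sha256": hashlib.sha256(encoded).hexdigest()}
```

A quarterly report could store the `sha256` value next to the figures, so a later re-run over the same range can be checked against the original snapshot.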

More data sources for Insights

Are there other data sources that would be useful to load into Insights?

Updated List (ver. 03/02/2022)

The following bullets capture the summary from the ongoing discussions.

  • community
    • growth: both in terms of newly interested individuals and their conversion to contributors. data that reflects this dimension:
      • the number of contributors to the code base (github PRs)
      • the number of contributors to design discussions (discord)
      • the number of contributors to requirements (github issues)
    • diversity: no single organization keeps the project alive. data that reflects this dimension:
      • the number of organizations contributing to the code base (github PRs)
    • retention: interesting/useful projects attract contributors, healthy projects retain them. data that reflects this dimension:
      • active contributor longevity (github PRs, discord)
    • maturity: I'm not able to properly articulate this one, maybe someone can help here?
    • responsiveness: how long until proposed changes (code, design, bug reports, etc.) are given attention? data that reflects this dimension:
      • time to resolve PRs and issues (github)
      • time to respond to questions (discord)
  • code
    • usefulness: is the project being adopted by customers and tire kickers? data that reflects this dimension:
      • usage information provided by customers and developers
      • number of questions from clients trying to use the code
      • docker pulls
      • release binary downloads
      • tagged online resources: case studies, presentations, mentorship programs
    • production-readiness: is the current code base coherent enough to be usable in a real-world scenario? data that reflects this dimension:
      • release number (latest is 1.0.0 or later?)
      • test coverage
      • performance and reliability testing data
      • user documentation
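Two of the dimensions above, responsiveness and diversity, reduce to simple calculations once the raw data is available. The following is a rough sketch under stated assumptions: the function names, the input shapes, and the choice of median and top-organization share as the summary statistics are illustrative, not something agreed in the discussion.

```python
from collections import Counter
from datetime import datetime
from statistics import median


def median_hours_to_close(items):
    """Responsiveness: median hours from opening to closing.

    items: iterable of (opened_at, closed_at) datetimes for PRs or issues;
    closed_at is None for items that are still open, which are skipped.
    """
    durations = [
        (closed - opened).total_seconds() / 3600
        for opened, closed in items
        if closed is not None
    ]
    return median(durations) if durations else None


def top_org_share(pr_orgs):
    """Diversity: fraction of PRs that come from the most active organization.

    pr_orgs: iterable with one organization name per PR (assumed input).
    A value close to 1.0 suggests a single organization carries the project.
    """
    counts = Counter(pr_orgs)
    if not counts:
        return None
    return counts.most_common(1)[0][1] / sum(counts.values())
```

The median is used here rather than the mean so that a few very old PRs or issues do not dominate the figure; either choice would need to be agreed before it is used in reports.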
