Triage github tickets #2612

mpenkov · 2019-09-29T06:46:34Z

We have a ton of issues on github (over 220) and for me personally, it feels a bit overwhelming. What do you think?

Realistically, that's more than we can ever hope to resolve given our current velocity. I think it's worth performing a bug triage: going through the issues and identifying:

Priority
Severity
Approximate time scope (now / weeks / months / years / never)

Here are some places to start (issue labels may overlap):

bug: 75 open
feature: 83 issues
wishlist: 42 issues

You can see the full list of our labels here.

@piskvorky @gojomo What's the best medium to do this? Perhaps a phone call? It does not have to happen now or even in the immediate future, but it should happen sometime.

piskvorky · 2019-09-29T09:32:05Z

For me, fixing bugs, improving docs and streamlining the existing workflows is the priority. "Wishlist" or new feature tickets we can (should) ignore: We don't have the time capacity to add major functionality ourselves, and I have little trust in outside contributions.

It's my impression many of the tickets will be trivial (user mistake), and/or obsolete. Identifying those tickets that are both non-trivial and urgent seems worthwhile. I suspect there won't be that many – yes, let's clear up which is which (for ourselves) in a call.

Then we can come up with a ticket-labeling scheme to better sort & organize the tickets.

mpenkov · 2019-09-29T09:36:38Z

For me, fixing bugs, improving docs and streamlining the existing workflows is the priority.

👍

"Wishlist" or new feature tickets we can (should) ignore: We don't have the time capacity to add major functionality ourselves, and I have little trust in outside contributions.

OK, so do we close all of the wishlist tickets, then?

It's my impression many of the tickets will be trivial (user mistake), and/or obsolete. Identifying those tickets that are both non-trivial and urgent seems worthwhile. I suspect there won't be that many – yes, let's clear up which is which (for ourselves) in a call.

Then we can come up with a ticket-labeling scheme to better sort & organize the tickets.

👍

piskvorky · 2019-09-29T09:47:33Z

OK, so do we close all of the wishlist tickets, then?

They don't bother me, I'd keep them open. But the more outlandish / impractical ones we can certainly close. Or would you close them all? We can go over this in our call too.

mpenkov · 2019-09-29T10:10:36Z

Let's go through them too, perhaps after the bugs are triaged.

gojomo · 2019-09-30T17:46:30Z

For a project like gensim, I'd prefer to keep marginal/speculative/longshot issues open, but use other labeling to help keep them from distracting people doing prioritized work.

Why? Closed issues can be harder to find, and "closing" can imply disinterest/rejection – when in fact many such issues are just awaiting the arrival of the right, interested volunteers to research/complete them. (Or, awaiting the addition of some additional report/insight that eventually helps connect them to a larger opportunity.) Keeping them open, but well-labeled as low-priority, lets them store & accumulate info for the future.

(This calculation changes for projects with more fixed budgets/timelines, and a smaller set of customer/manager/coworker collaborators. There, quick definitive prioritize-or-close processes can be important, and also reporters will know better how to find/escalate/revive closed issues. But here, even many fringe bugs/ideas could be promising, if they eventually attract a skills-matched, motivated contributor.)

piskvorky · 2019-10-08T09:56:41Z

We went over the first page of bug tickets with Misha today. Recording my impressions here:

Bugs are really bugs, need action.
- Very few nonsense tickets.
- Some minor ticket hijacking.
Largest classes of bugs:
- API design: the clusterfuck of Re-design "*2vec" implementations #1777 redesign. We didn't manage to revert that PR in time, and now there's a steady stream of broken API contracts, things that shouldn't work but do, things that should work but don't, unclear responsibilities.
- Scaling of LdaModel and LdaMulticore: multiple issues with numerical stability (esp. in combination with large corpora); training getting stuck; training resulting in zero vectors.
- I/O issues with fastText: loading from native fastText, RAM issues.
- Wrapper issues: sklearn, mallet, pandas, keras.
The rest are more case-by-case errors, no common pattern

The most severe errors to me are the first kind (API design). Rather than being a single fixable bug, they're compromising the core of Gensim's mission: topic modeling for humans. They're also the most embarrassing bugs, because they show a lack of engineering skill – a very bad sign for any library.

mpenkov · 2019-10-09T03:14:40Z

@piskvorky I couldn't articulate the difference between severity and priority during our call, but this article does a decent job: http://tryqa.com/what-is-the-difference-between-severity-and-priority/

Do you think we need to keep both severity and priority labels?

piskvorky · 2019-10-09T07:15:47Z

Thanks, that looks good to me. As long as we're clear about the difference between the various labels: the clearer we can articulate the purpose of each label ("would this ticket fit under this label?"), the better.

Can you make the label names and descriptions more explicit? Otherwise we'll forget again soon :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Triage github tickets #2612

Triage github tickets #2612

mpenkov commented Sep 29, 2019

piskvorky commented Sep 29, 2019 •

edited

Loading

mpenkov commented Sep 29, 2019 •

edited by piskvorky

Loading

piskvorky commented Sep 29, 2019 •

edited

Loading

mpenkov commented Sep 29, 2019

gojomo commented Sep 30, 2019

piskvorky commented Oct 8, 2019 •

edited

Loading

mpenkov commented Oct 9, 2019

piskvorky commented Oct 9, 2019 •

edited

Loading

Triage github tickets #2612

Triage github tickets #2612

Comments

mpenkov commented Sep 29, 2019

piskvorky commented Sep 29, 2019 • edited Loading

mpenkov commented Sep 29, 2019 • edited by piskvorky Loading

piskvorky commented Sep 29, 2019 • edited Loading

mpenkov commented Sep 29, 2019

gojomo commented Sep 30, 2019

piskvorky commented Oct 8, 2019 • edited Loading

mpenkov commented Oct 9, 2019

piskvorky commented Oct 9, 2019 • edited Loading

piskvorky commented Sep 29, 2019 •

edited

Loading

mpenkov commented Sep 29, 2019 •

edited by piskvorky

Loading

piskvorky commented Sep 29, 2019 •

edited

Loading

piskvorky commented Oct 8, 2019 •

edited

Loading

piskvorky commented Oct 9, 2019 •

edited

Loading