Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Are we tracking errors properly, are we monitoring? #6417

Open
bedeho opened this issue Jul 10, 2024 · 2 comments
Open

Are we tracking errors properly, are we monitoring? #6417

bedeho opened this issue Jul 10, 2024 · 2 comments
Assignees
Labels
idea Idea for a new feature pending-triage Requires triage before any work can begin

Comments

@bedeho
Copy link
Member

bedeho commented Jul 10, 2024

Background

I recently saw a video where someone was trying to sell creator tokens, and it consistently failed, they got the "somethign went wrong" dialogue. They were doing it from the portfolio screen, but I'm guessing the root cause was exact same issue as #6415. Given that this limitation has been in place since when creator tokens were first shipped, I can only imagine how many times people have seen this error, yet we have been totally unaware of it. The only way we ran into this is by directly experiencing it ourselves, or catching screencasts of others. That is really bad.

Proposal

We already have a sentry dashboard for tracking errors, but I logged in and I saw nothing about it. So either we are failing on actually tracking this error in Sentry, or we are failing to monitor Sentry daily to actually process the issue. We cant wait for 100s of bad user interactions before we incidentally see the error in action first hand.

@bedeho bedeho added idea Idea for a new feature pending-triage Requires triage before any work can begin labels Jul 10, 2024
@ikprk
Copy link
Collaborator

ikprk commented Jul 10, 2024

I'd say there are few layers to that.

  1. We track a lot of events, I'd say it make it harder to catch important one.
  2. Some issues are not necessary, for example, the failed to send network performance events issue is non-actionable and introduces more spam than value to Sentry.
  3. Some issues are impossible to reproduce just with Sentry error details.
  4. To find the desired issue you have to know its' message that is only in the code. So traversing the feed is tricky without codebase knowledge.

@bedeho
Copy link
Member Author

bedeho commented Jul 20, 2024

ok, thank you for the rundown, 1&2 was also my sense. I'm not so concerned about being able to reproduce, first and foremost detection is the key. If we see an error happen time and time again, we can escalate the effort to do more granular logging or error tracking once we know there is something we need to pin down. The real killer is when you don't even know there is a problem.

Can you prepare a plan to fix this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
idea Idea for a new feature pending-triage Requires triage before any work can begin
Projects
None yet
Development

No branches or pull requests

2 participants