Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Task Manager] Workload aggregation caps out at 10k tasks #98624

Closed
gmmorris opened this issue Apr 28, 2021 · 4 comments · Fixed by #99360
Closed

[Task Manager] Workload aggregation caps out at 10k tasks #98624

gmmorris opened this issue Apr 28, 2021 · 4 comments · Fixed by #99360
Assignees
Labels
bug Fixes for quality problems that affect the customer experience Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)

Comments

@gmmorris
Copy link
Contributor

Kibana version: 7.12.0

It seems the aggregation caps out at 10,000 tasks, which means that the aggregation is probably hitting a page and we'er not fetching it.

Task Types are definitely missing in the list when this happens, so we're not providing an accurate image of the workload.

@gmmorris gmmorris added the bug Fixes for quality problems that affect the customer experience label Apr 28, 2021
@botelastic botelastic bot added the needs-team Issues missing a team label label Apr 28, 2021
@gmmorris gmmorris added Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams) labels Apr 28, 2021
@elasticmachine
Copy link
Contributor

Pinging @elastic/kibana-alerting-services (Team:Alerting Services)

@botelastic botelastic bot removed the needs-team Issues missing a team label label Apr 28, 2021
@ymao1
Copy link
Contributor

ymao1 commented May 5, 2021

From the example that was provided:

{
 ... other task manager stuff
  "workload": {
    "count": 10000,
    "task_types": {
      "actions:.pagerduty": {
        "count": 91992,
        "status": {
          "failed": 91917,
          "running": 59,
          "idle": 16
        }
      },
     ... other task types
  }
}

It looks like the aggregation is running over all the data but the count that is returned is 10,000 which is the max number returned when track_total_hits is not set to true for a query. reference I believe the aggregation is running over all the data because the number of pagerduty actions is 91,992, which is much greater than 10,000

@gmmorris Should this issue just be to update the count value to reflect the actual number of documents?

@pmuellr
Copy link
Member

pmuellr commented May 5, 2021

Oh, my first thought when I saw mention of this was that the problem was the agg was only returning 10 terms (task types), rather than all of them. But I wasn't listening all that closely. I do remember hearing that there were task types we expected to see in the list, that we didn't, so ... somehow we're missing some buckets ...

Ah, just saw the linked PR and can see that is being addressed!

@gmmorris
Copy link
Contributor Author

gmmorris commented May 6, 2021

I do remember hearing that there were task types we expected to see in the list, that we didn't, so ... somehow we're missing some buckets ...

yeah, indeed.

Ah, just saw the linked PR and can see that is being addressed!

and yay :)

@kobelb kobelb added the needs-team Issues missing a team label label Jan 31, 2022
@botelastic botelastic bot removed the needs-team Issues missing a team label label Jan 31, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Fixes for quality problems that affect the customer experience Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants