Profiling of labkey.post

I've noticed that fetching a large table from labkey via `Rlabkey` is significantly slower than other API client like JavaScript, so I profiled the `labkey.selectRows` call for a large table (182779 rows) from DataSpace.

```R
profvis::profvis(Rlabkey::labkey.selectRows(
  baseUrl = "https://dataspace.cavd.org",
  folderPath = "/CAVD",
  schemaName = "study",
  queryName = "ICS" # 182779 rows
))
```
![image](https://user-images.githubusercontent.com/5667572/62651333-952d5400-b90d-11e9-8f68-918877ec5c83.png)

As we can see, actual fetching of data via `POST` only takes fraction of time in `labkey.SelectRows` call, and the majority of time is spent processing the response (`processResponse`) and creating a data.frame object (`makeDF`).

We can break it down in 5 steps:

1. [fetch raw data via `POST`](https://github.com/LabKey/labkey-api-r/blob/develop/Rlabkey/R/labkey.defaults.R#L210)
2. [parse json (with simplifying to data.frame) to a list to check status](https://github.com/LabKey/labkey-api-r/blob/develop/Rlabkey/R/labkey.defaults.R#L242)
3. [parse text from raw](https://github.com/LabKey/labkey-api-r/blob/develop/Rlabkey/R/labkey.defaults.R#L220)
4. [parse json (without simplifying to data.frame) to a list from text](https://github.com/LabKey/labkey-api-r/blob/develop/Rlabkey/R/makeDF.R#L19)
5. [make a data.frame from list via c++ code](https://github.com/LabKey/labkey-api-r/blob/develop/Rlabkey/R/makeDF.R#L108)

We can see that there are redundancies in this process.
- We are parsing parsing json twice (step 2 and step 4)
- We are creating data.frame twice (step 2 and step 5)

Another thing we should note is that `jsonlite::fromJSON(simplifyDataFrame=TRUE)` is more efficient in creating a data.frame than `Rlabkey:::listToMatrix`.

Can you please take a look into this and make changes accordingly?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Profiling of labkey.post #36

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Profiling of labkey.post #36

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions