[BUG] lazy loading regression #481

lmeyerov · 2023-05-03T04:27:32Z

after 0.29.0 (big pyg[ai] merge), looks like import is slow again, likely some top-level imports getting triggered:

for reference, pandas is just 600ms on same box -- so we can be targeting subsecond, but aren't

step #1 is to profile again: https://www.graphistry.com/blog/import-pygraphistry-from-10s-to-1s-by-python-import-lazy-loading-and-tuna

lmeyerov · 2023-05-03T04:30:48Z

as part of the fix, maybe we add a unit test? at least '1 out of 3 cold starts should be subsecond'

silkspace · 2023-05-03T04:32:51Z

@tanmoyio could this have to do with new inits?

lmeyerov · 2023-07-23T06:50:20Z

@tanmoyio @dcolinmorgan can we look as part of landing cu_cat?

lmeyerov · 2023-07-23T06:51:05Z

I documented how I tracked it down last time: https://www.graphistry.com/blog/import-pygraphistry-from-10s-to-1s-by-python-import-lazy-loading-and-tuna

dcolinmorgan · 2023-07-24T08:18:45Z

as in lazy loading cucat? we already lazy load cudf and cuml within #486 , or at least that was the whole idea, perhaps its not working correctly as 9s seems far too long

lmeyerov · 2023-07-24T20:22:49Z

i mean why is import graphistry slow -- is it pulling in some unexpected deps during import time that need to be lazy loaded?

lmeyerov · 2023-07-24T20:23:30Z

w/ lazy loading, cu_cat, cudf, etc shouldn't load at import graphistry, i'd expect at most just pandas

dcolinmorgan · 2023-07-27T10:55:23Z

yes i see the issue now on 29.3 -- on eks where cuda is available, cudf loads in 4.7s for total g. load time of 6.7s, whereas on my mac laptop entire graphistry 29.3 loads in 0.829s. seems lazy cudf is getting initiated before user request. So the question is to front load or break loadings into pieces.

re: #439 seems like there are 2 options:

if cudf is available, load it and run on full gpu by default via 'auto' flag. otherwise
wait to load cudf until user triggers with cu-cat/cuml flags

from your comment above, the second option seems expected and wanted -- should not be a hard fix. I suppose i never noticed since I've always loaded cudf before graphistry

lmeyerov · 2023-07-27T13:47:56Z

Option 3. Lazy load

Delay any cudf imports until the code using it (including "auto") is run. 'import graphistry' shouldn't need to import cudf.

Can you find the path triggering the import? It looks like plotter imports embed_utils, and embed_utils is importing cudf at import time somewhere instead of when its methods are used

dcolinmorgan · 2023-07-28T08:19:58Z

ok yeah i see the issue in embed_utils -- at some point we needed to typecheck between cudf and pd, and loaded cudf to do that, so i just swapped to using getmodule and loadtime back down to 1.8s all on pandas

lmeyerov added bug help wanted good-first-issue labels May 3, 2023

lmeyerov assigned dcolinmorgan and tanmoyio Jul 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] lazy loading regression #481

[BUG] lazy loading regression #481

lmeyerov commented May 3, 2023 •

edited

Loading

lmeyerov commented May 3, 2023

silkspace commented May 3, 2023

lmeyerov commented Jul 23, 2023

lmeyerov commented Jul 23, 2023

dcolinmorgan commented Jul 24, 2023 •

edited

Loading

lmeyerov commented Jul 24, 2023

lmeyerov commented Jul 24, 2023

dcolinmorgan commented Jul 27, 2023 •

edited

Loading

lmeyerov commented Jul 27, 2023 •

edited

Loading

dcolinmorgan commented Jul 28, 2023 •

edited

Loading

[BUG] lazy loading regression #481

[BUG] lazy loading regression #481

Comments

lmeyerov commented May 3, 2023 • edited Loading

lmeyerov commented May 3, 2023

silkspace commented May 3, 2023

lmeyerov commented Jul 23, 2023

lmeyerov commented Jul 23, 2023

dcolinmorgan commented Jul 24, 2023 • edited Loading

lmeyerov commented Jul 24, 2023

lmeyerov commented Jul 24, 2023

dcolinmorgan commented Jul 27, 2023 • edited Loading

lmeyerov commented Jul 27, 2023 • edited Loading

dcolinmorgan commented Jul 28, 2023 • edited Loading

lmeyerov commented May 3, 2023 •

edited

Loading

dcolinmorgan commented Jul 24, 2023 •

edited

Loading

dcolinmorgan commented Jul 27, 2023 •

edited

Loading

lmeyerov commented Jul 27, 2023 •

edited

Loading

dcolinmorgan commented Jul 28, 2023 •

edited

Loading