Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* mypy thinks mask is Series[int], hence wrapped in np.array(dtype=bool) * bind title fix * removes feature flags that are deprecated (similarity, categories, confidence) and adds docstrings in main featurize function call * type check * deletes search_index if it exists when mutating nodes. g.search will rebuilt if not there * makes safe search under the case where nodes are mutated via assertion * lint * resolves Leos requested changes * fix(temp fix on imputer so that it doesnt drop nan columns) * removes print * changes `impute` type and logic for more robust pipeline. removes deprecated flags * adds outlier poc code * switched to rgcn * typo * typo fix * works * predict mode * commented out * adds Ask-HackerNews demo tutorial, adds all recall nodes to subgraph, rather than just those with found edges in text_utils.py, and adds what should be on master in feature_utiils * adds helper model dict imports for featurization * adds easier model presets for users when using .featurize * adds cli * docs * adds update to README * adds update to README * adds update to README * rel + feat * better naming * bug * batch_size arg * adds corrections and args * typo * fixes errors and adds embeddings * bug * bug * save before merge with work Tanmoy and I did * adds scoring in main graphistry instance * adds working node priors from featurization * fixes nodes issues * adds state changing code that removes edges not in nodes and vice versa * lazy import returns modules * cherry picks other branches to add dgl fix and outliers lib * adds CyberSecurity CTU-13 dataset GNN pipeline demo * adds Jack Dorsey Social Good Pledge dataset * infer with [s, r] -> d * adds function methods like get_matrix etc * wip(logger): make uniform * handles missing nodes df * adds chavismo OSINT demo * no-node feat() fix * bigfix num_node * train_split bugfix * typo * eval * typo bug * typo * save before stash * reverting back * stable * merged * adds stable algo and namespace * breaks up training so that repeated .embed calls trains existing model * faster chaining if model and preprocessing has already occured * faster chaining if model and preprocessing has already occured * faster chaining if model and preprocessing has already occured * faster chaining if model and preprocessing has already occured * faster chaining if model and preprocessing has already occured * logger * better chaining * update default lr * adds evaluation flag * to device * fixes hard coded cuda and sets args * flake8, isort, black * basic unit tests * bug * adds .to device for outside features * docs(rgcn demos): infosec jupyterthon 2022 * adds query naming in g.search_graph so it shows up in hub with name * logger * node idx converted to pd.series * map * efficient predict_link * remap pred_links wrt dict * linters in networks.py * dummy numpy doc with annotations * lint * lint * more annotations * lint * some annotations * commit before merge * adds logic for chaining and when parameters change * adds passing tests, adds args for sample_size, num_steps in g_iterator * adds passing tests, adds args for sample_size, num_steps in g_iterator * more type hints * lint * mypy checks * mypy checks more * mypy checks more * lazy imports * trange * trial 7 none(s) * embed outside minimal test * unittest min dep required * unittest min dep required * small comments for later * fixes score issues over train_idx that were expand_dims in error prone way * lint * empty * typo * infra(adds ai-embed test hook into ci gha) * infra(adds bin/test-embed.sh) * infra(adds sphinx nitpick) * adds README and CHANGELOG * adds README and CHANGELOG * adds README and CHANGELOG * feat(adds `anomalous` flag to score low confidence edges, updates readme) * feat(adds default KG args to PlotterBase) * fix(removes pd.Series as it is not needed, lint) * docs(changelog): rgcn * refactor(mypy): reducing type: ignore count from 47 -> 19 * perf: scalable predict_links_all, some cleanup of old funcs, migrating gcn_node_embeddings to property * ci: adding tqdm-stubs to setup.py * feat(streamlines predict code) * feat: New inference api with targeted source, relation and destination arguments * fix: mypy checks in predict_links method * fix: predict_links input type changed from pd.Series -> list * feat(adds factory method for scoring triplets) * fix(adds test given refactor, and CHANGELOG public methods) * feat(adds RED team hunt UMAP notebook for simplified outlier detection and alert volume reduction) * feat(handles returning dataframe as flag) * feat(handles returning dataframe as flag) * feat(sorts scored triplets) * adds more README * updates readme * lint * fix: some mypy-pandas typecheck fix * fix: some mypy-pandas typecheck fix * fix: some mypy-pandas typecheck fix * fix: some mypy-pandas typecheck fix * Readme * adds tests, README * adds tests * removing some comments * fix: mypy fix List[str] -> List * fix: mypy fix * adds changes to demo given new api changes. Adds logging in networks * demo notebook change to reflect new api * changes name of notebook * updates networks.py from heteroembed branch (which passes lint) * black reformattingg * merges feature_utils from heteroembed, adds linting changes * lint * linting hyper_dask.py * lint * adds `get_features_by_cols`, updates CHANGELOG and README, and small change in features.py * lint * lint * feat(adds conditional prob): for some reason this was not on branch... * feat(adds separate mixin for conditional.py methods) * lint * lint * lint * lint * changes compute import in plotter.py * changelog.md * sphinx adds conditional.ConditionMixin * typo * feat(adds tests for conditional.py) * feat(adds tests for conditional.py) * lint * test * test * test * Update CHANGELOG.md * docs(ModelDict): main example * docs(hackernew) * docs(more hnews) * doc(ask hacker news demo) * doc(ask hacker news demo) * changes(keywords in setup, HackerNews demo) * Delete cyber-fraud-umap-demo.ipynb Renamed but it didn't delete on remote. * mypi * mypi * adds type ignore * adds type ignore * comments out test * removes test * adds working changes from ai_demos branch for single file * fix(tests): sso_login tests no longer tolerate unexpected exns * garden(sso): clearer unexpected exn msg * docs(changelog): sso fixes * fix(tests): reenable test_hyper_evil * garden(tests): print veresion of mypy, pandas, numpy * adds docstrings * adds docstrings * adds docstrings and lint * lint * lint * fix(ci): tolerate hypergraph evil warning * fix(ci): redo warning supression Co-authored-by: Alex <[email protected]> Co-authored-by: tanmoyio <[email protected]> Co-authored-by: Alex Morrise <[email protected]> Co-authored-by: Tanmoy Sarkar <[email protected]>
- Loading branch information