GRAPH-OF-WORDS file with edges and nodes labels #3

Matt-81 · 2023-04-05T13:00:03Z

Dear @GuillaumeDD,
thank you for this great work! I was trying gowpy.gow.miner and gowpy.gow.io for converting a corpus into a collection of graphs of words. I saw that in the exported file the graphs does not report the input text (e.g., a node like "foo", becomes "v 0 0").

I was wondering if it is possible to export it as "v 0 foo".

Thanks in advance for your help!

GuillaumeDD · 2023-04-07T09:26:11Z

Hi @Matt-81 ,
Thank you for your positive feedbacks on gowpy 🙏

Unfortunately, it is not possible to export a node as "v 0 foo". The reason is that frequent mining subgraph algorithms expect node/edge labels as non-negative integers, see https://github.com/Jokeren/gBolt#input-specification for instance.

However, the GoWMiner class keeps the mapping between these integers and their corresponding labels.
The easiest way to get back to the tokens is via GoWVectorizer initialized from the GoWMiner. There is an example to get back the feature names in the following notebook examples/classification-r8-frequent_subgraphs.ipynb in Section "GoW Vectorizer Example".

Hope this help!

Matt-81 · 2023-04-11T10:36:53Z

Hi @GuillaumeDD, great, thanks for the feedback! 🙏

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GRAPH-OF-WORDS file with edges and nodes labels #3

GRAPH-OF-WORDS file with edges and nodes labels #3

Matt-81 commented Apr 5, 2023

GuillaumeDD commented Apr 7, 2023

Matt-81 commented Apr 11, 2023

GRAPH-OF-WORDS file with edges and nodes labels #3

GRAPH-OF-WORDS file with edges and nodes labels #3

Comments

Matt-81 commented Apr 5, 2023

GuillaumeDD commented Apr 7, 2023

Matt-81 commented Apr 11, 2023