You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm working on #75 right now (the parsing functions have been updated, but I need to update the surrounding parts of the codebase). Right now (at least in this branch), LastGraph, GML, and DOT files containing duplicate edges are all supported. These filetypes' corresponding parsing functions return nx.MultiDiGraph objects, rather than just nx.DiGraph objects like before.
The GFA and FASTG parsing functions also return nx.MultiDiGraph objects, but these functions actually delegate the work of parsing to other libraries (GfaPy and pyfastg, respectively), both of which do not allow duplicate edges. This means that, although we could support duplicate edges for these filetypes, trying to run MetagenomeScope on them will lead to an error being raised by the library to which we delegate.
I don't think addressing this is an urgent or important issue, since I haven't seen many GFA or FASTG files containing duplicate edges (I'm not even sure how you'd express duplicate edges in SPAdes-dialect FASTG files). I'm just documenting this issue here, separately from #75, so that when #75 is addressed we will still have this thread open.
Support duplicate edges in GFA files (relatively feasible)
Support duplicate edges in FASTG files (uhhhh yeah idk about this one)
The text was updated successfully, but these errors were encountered:
... even though the fastg / gfa ones don't actually allow for parallel
edges in parsing (marbl#239). We just wanna *create* a multidigraph so that
in the later stages of pattern identification, etc. (marbl#202), we can
assume safely that the object we are working with is a multidigraph.
I'm working on #75 right now (the parsing functions have been updated, but I need to update the surrounding parts of the codebase). Right now (at least in this branch), LastGraph, GML, and DOT files containing duplicate edges are all supported. These filetypes' corresponding parsing functions return
nx.MultiDiGraph
objects, rather than justnx.DiGraph
objects like before.The GFA and FASTG parsing functions also return
nx.MultiDiGraph
objects, but these functions actually delegate the work of parsing to other libraries (GfaPy and pyfastg, respectively), both of which do not allow duplicate edges. This means that, although we could support duplicate edges for these filetypes, trying to run MetagenomeScope on them will lead to an error being raised by the library to which we delegate.I don't think addressing this is an urgent or important issue, since I haven't seen many GFA or FASTG files containing duplicate edges (I'm not even sure how you'd express duplicate edges in SPAdes-dialect FASTG files). I'm just documenting this issue here, separately from #75, so that when #75 is addressed we will still have this thread open.
The text was updated successfully, but these errors were encountered: