Class SimpleGraphMLImporter<V,​E>

  • Type Parameters:
    V - the graph vertex type
    E - the graph edge type
    All Implemented Interfaces:
    GraphImporter<V,​E>

    public class SimpleGraphMLImporter<V,​E>
    extends BaseEventDrivenImporter<V,​E>
    implements GraphImporter<V,​E>
    Imports a graph from a GraphML data source.

    This is a simple implementation with supports only a limited set of features of the GraphML specification. For a more rigorous parser use GraphMLImporter. This version is oriented towards parsing speed.

    The importer uses the graph suppliers (Graph.getVertexSupplier() and Graph.getEdgeSupplier()) in order to create new vertices and edges. Moreover, it notifies lazily and completely out-of-order for any additional vertex, edge or graph attributes in the input file. Users can register consumers for vertex, edge and graph attributes after construction of the importer. Finally, default attribute values are completely ignored.

    For a description of the format see http://en.wikipedia.org/wiki/ GraphML or the GraphML Primer.

    Below is small example of a graph in GraphML format.

     
     <?xml version="1.0" encoding="UTF-8"?>
     <graphml xmlns="http://graphml.graphdrawing.org/xmlns"  
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://graphml.graphdrawing.org/xmlns 
         http://graphml.graphdrawing.org/xmlns/1.0/graphml.xsd">
       <key id="d0" for="node" attr.name="color" attr.type="string" />
       <key id="d1" for="edge" attr.name="weight" attr.type="double"/>
       <graph id="G" edgedefault="undirected">
         <node id="n0">
           <data key="d0">green</data>
         </node>
         <node id="n1">
           <data key="d0">black</data>
         </node>     
         <node id="n2">
           <data key="d0">blue</data>
         </node>
         <node id="n3">
           <data key="d0">red</data>
         </node>
         <node id="n4">
           <data key="d0">white</data>
         </node>
         <node id="n5">
           <data key="d0">turquoise</data>
         </node>
         <edge id="e0" source="n0" target="n2">
           <data key="d1">1.0</data>
         </edge>
         <edge id="e1" source="n0" target="n1">
           <data key="d1">1.0</data>
         </edge>
         <edge id="e2" source="n1" target="n3">
           <data key="d1">2.0</data>
         </edge>
         <edge id="e3" source="n3" target="n2"/>
         <edge id="e4" source="n2" target="n4"/>
         <edge id="e5" source="n3" target="n5"/>
         <edge id="e6" source="n5" target="n4">
           <data key="d1">1.1</data>
         </edge>
       </graph>
     </graphml>
     
     

    The importer reads the input into a graph which is provided by the user. In case the graph is weighted and the corresponding edge key with attr.name="weight" is defined, the importer also reads edge weights. Otherwise edge weights are ignored. To test whether the graph is weighted, method Graph.getType() can be used.

    The provided graph object, where the imported graph will be stored, must be able to support the features of the graph that is read. For example if the GraphML file contains self-loops then the graph provided must also support self-loops. The same for multiple edges. Moreover, the parser completely ignores the attribute "edgedefault" which denotes whether an edge is directed or not. Whether edges are directed or not depends on the underlying implementation of the user provided graph object.

    The importer by default validates the input using the 1.0 GraphML Schema. The user can (not recommended) disable the validation by calling setSchemaValidation(boolean).

    The graph vertices and edges are build using the corresponding graph suppliers. The id of the vertices in the original dot file are reported as a vertex attribute named "ID". Thus, in case vertices in the dot file also contain an "ID" attribute, such an attribute will be reported multiple times.

    The default behavior of the importer is to use the graph vertex supplier in order to create vertices. The user can also bypass vertex creation by providing a custom vertex factory method using setVertexFactory(Function). The factory method is responsible to create a new graph vertex given the vertex identifier read from file.

    Author:
    Dimitrios Michail
    • Field Detail

      • DEFAULT_VERTEX_ID_KEY

        public static final String DEFAULT_VERTEX_ID_KEY
        Default key used for vertex ID.
        See Also:
        Constant Field Values
    • Constructor Detail

      • SimpleGraphMLImporter

        public SimpleGraphMLImporter()
        Constructs a new importer.
    • Method Detail

      • getEdgeWeightAttributeName

        public String getEdgeWeightAttributeName()
        Get the attribute name for edge weights
        Returns:
        the attribute name
      • setEdgeWeightAttributeName

        public void setEdgeWeightAttributeName​(String edgeWeightAttributeName)
        Set the attribute name to use for edge weights.
        Parameters:
        edgeWeightAttributeName - the attribute name
      • isSchemaValidation

        public boolean isSchemaValidation()
        Whether the importer validates the input
        Returns:
        true if the importer validates the input
      • setSchemaValidation

        public void setSchemaValidation​(boolean schemaValidation)
        Set whether the importer should validate the input
        Parameters:
        schemaValidation - value for schema validation
      • getVertexFactory

        public Function<String,​V> getVertexFactory()
        Get the user custom vertex factory. This is null by default and the graph supplier is used instead.
        Returns:
        the user custom vertex factory
      • setVertexFactory

        public void setVertexFactory​(Function<String,​V> vertexFactory)
        Set the user custom vertex factory. The default behavior is being null in which case the graph vertex supplier is used. If supplied the vertex factory is called every time a new vertex is encountered in the file. The method is called with parameter the vertex identifier from the file and should return the actual graph vertex to add to the graph.
        Parameters:
        vertexFactory - a vertex factory
      • importGraph

        public void importGraph​(Graph<V,​E> graph,
                                Reader input)
        Import a graph.

        The provided graph must be able to support the features of the graph that is read. For example if the GraphML file contains self-loops then the graph provided must also support self-loops. The same for multiple edges.

        Specified by:
        importGraph in interface GraphImporter<V,​E>
        Parameters:
        graph - the output graph
        input - the input reader
        Throws:
        ImportException - in case an error occurs, such as I/O or parse error