DGL

Datasets from DGL.

graph_datasets.datasets.dgl.load_dgl_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load DGL graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Raises:

NotImplementedError – Dataset unknown.

Note

Chameleon, Squirrel, Actor, Cornell, Texas and Wisconsin are from Geom-GCN, which may be slightly different from heterophilous settings.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

PyG

Datasets from PyG.

graph_datasets.datasets.pyg.load_pyg_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load pyG graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Raises:

NotImplementedError – Dataset unknown.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

Note

No row-normalization conducted.

OGB

Datasets from OGB.

graph_datasets.datasets.ogb.load_ogb_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load OGB graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Raises:

NotImplementedError – Dataset unknown.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

LINKX

Datasets from the paper LINKX.

graph_datasets.datasets.linkx.load_linkx_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load LINKX graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

CRITICAL

Datasets from the paper A Critical Look at the Evaluation of GNNs Under Heterophily: Are We Really Making Progress?.

graph_datasets.datasets.critical.load_critical_dataset(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load graphs from A Critical Look at the Evaluation of GNNs Under Heterophily: Are We Really Making Progress?

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

CoLA

Datasets from the paper CoLA.

graph_datasets.datasets.cola.load_cola_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load CoLA graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]

SDCN

Datasets from the paper SDCN.

graph_datasets.datasets.sdcn.load_sdcn_data(dataset_name: str, directory: str = './data', verbosity: int = 0) Tuple[DGLGraph, Tensor, int][source]

Load SDCN graphs.

Parameters:
  • dataset_name (str) – Dataset name.

  • directory (str, optional) – Raw dir for loading or saving. Defaults to DEFAULT_DATA_DIR=os.path.abspath(“./data”).

  • verbosity (int, optional) – Output debug information. The greater, the more detailed. Defaults to 0.

Note

The last node of DBLP is an isolated node.

Returns:

[graph, label, n_clusters]

Return type:

Tuple[dgl.DGLGraph, torch.Tensor, int]