COCO Graph Generation
As we show in our survey, there are many different ways that an image can be converted to a graph. Whilst there are an increasing number of image captioning projects that make use of graph representations, these graphs are loaded from various supplementary datasets and are assembled as graphs at train/inference time. This codebase aims to provide a single platform for researchers and developers to define a graph structure that matches their downstream model and precompute them. Doing so allows for faster model training and faster development iterations. The extendable pipeline allows for data to be pooled together into either a PyTorch Geometric Data
or HeteroData
object that is then saved as a .pt
file.