IEEE/CAA Journal of Automatica Sinica
Citation:  Xiaobin Hong, Tong Zhang, Zhen Cui, and Jian Yang, "Variational Gridded Graph Convolution Network for Node Classification," IEEE/CAA J. Autom. Sinica, vol. 8, no. 10, pp. 16971708, Oct. 2021. doi: 10.1109/JAS.2021.1004201 
[1] 
Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, et al., “Gradientbased learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998. doi: 10.1109/5.726791

[2] 
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, realtime object detection,” in Proc. Computer Vision and Pattern Recognition, 2016, pp. 779–788.

[3] 
S. Ren, K. He, R. Girshick, and J. Sun, “Faster RCNN: Towards realtime object detection with region proposal networks,” in Proc. Advances in Neural Information Processing Systems, 2015, pp. 91–99.

[4] 
T. Luong, H. Pham, and C. D. Manning, “Effective approaches to attentionbased neural machine translation,” in Proc. Empirical Methods in Natural Language Processing, 2015, pp. 1412–1421.

[5] 
G. Hinton, L. Deng, D. Yu, G. Dahl, A.r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, B. Kingsbury, et al., “Deep neural networks for acoustic modeling in speech recognition,” IEEE Signal Processing Magazine, vol. 29, 2012.

[6] 
F. Orsini, D. Baracchi, and P. Frasconi, “Shift aggregate extract networks,” Frontiers in Robotics and AI, p. 42, 2018.

[7] 
J. M. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, and A. S. Tomkins, “The web as a graph: Measurements, models, and methods,” in Proc. Int. Computing and Combinatorics Conf., Springer, 1999, pp. 1–17.

[8] 
D. Xu, Y. Zhu, C. B. Choy, and L. FeiFei, “Scene graph generation by iterative message passing,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2017, pp. 5410–5419.

[9] 
K. M. Borgwardt, H.P. Kriegel, S. Vishwanathan, and N. N. Schraudolph, “Graph kernels for disease outcome prediction from proteinprotein interaction networks,” in Biocomputing. World Scientific, 2007, pp. 4–15.

[10] 
X. Luo, H. Wu, H. Yuan, and M. Zhou, “Temporal patternaware QoS prediction via biased nonnegative latent factorization of tensors,” IEEE Trans. Cybernetics, vol. 50, no. 5, pp. 1798–1809, 2019.

[11] 
T. D. Pham, K. Wardell, A. Eklund, and G. Salerud, “Classification of short time series in early Parkinson’s disease with deep learning of fuzzy recurrence plots,” IEEE/CAA J. Autom. Sinica, vol. 6, no. 6, pp. 1306–1317, 2019. doi: 10.1109/JAS.2019.1911774

[12] 
L. Wei and E. Keogh, “Semisupervised time series classification,” in Proc. 12th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, 2006, pp. 748–753.

[13] 
D. Wu, X. Luo, M. Shang, Y. He, G. Wang, and X. Wu, “A datacharacteristicaware latent factor model for web services QoS prediction,” IEEE Trans. Knowledge and Data Engineering, 2020.

[14] 
X. Luo, M. Zhou, S. Li, Y. Xia, Z.H. You, Q. Zhu, and H. Leung, “Incorporation of efficient secondorder solvers into latent factor models for accurate prediction of missing QoS data,” IEEE Trans. Cybernetics, vol. 48, no. 4, pp. 1216–1228, 2017.

[15] 
J. Bruna, W. Zaremba, A. Szlam, and Y. LeCun, “Spectral networks and locally connected networks on graphs,” arXiv preprint arXiv: 1312.6203, 2013.

[16] 
M. Defferrard, X. Bresson, and P. Vandergheynst, “Convolutional neural networks on graphs with fast localized spectral filtering,” in Proc. Advances in Neural Information Processing Systems, 2016, pp. 3844– 3852.

[17] 
T. N. Kipf and M. Welling, “Semisupervised classification with graph convolutional networks,” in Proc. Int. Conf. Learning Representations, 2017.

[18] 
R. Li, S. Wang, F. Zhu, and J. Huang, “Adaptive graph convolutional neural networks,” in Proc. 32nd AAAI Conf. Artificial Intelligence, 2018.

[19] 
S. Fu, W. Liu, D. Tao, Y. Zhou, and L. Nie, “Hesgcn: Hessian graph convolutional networks for semisupervised classification,” Information Sciences, vol. 514, pp. 484–498, 2020. doi: 10.1016/j.ins.2019.11.019

[20] 
S. Fu, W. Liu, Y. Zhou, and L. Nie, “Hplapgcn: Hypergraph plaplacian graph convolutional networks,” Neurocomputing, vol. 362, pp. 166–174, 2019. doi: 10.1016/j.neucom.2019.06.068

[21] 
F. Sichao, L. Weifeng, L. Shuying, and Z. Yicong, “Twoorder graph convolutional networks for semisupervised classification,” IET Image Processing, vol. 13, no. 14, pp. 2763–2771, 2019. doi: 10.1049/ietipr.2018.6224

[22] 
B. Wu, Y. Liu, B. Lang, and L. Huang, “DGCNN: Disordered graph convolutional neural network based on the gaussian mixture model,” Neurocomputing, vol. 321, pp. 346–356, 2018. doi: 10.1016/j.neucom.2018.09.008

[23] 
S. Franco, G. Marco, T. Ah Chung, H. Markus, and M. Gabriele, “The graph neural network model,” IEEE Trans. Neural Networks, vol. 20, no. 1, Article No. 61, 2009. doi: 10.1109/TNN.2008.2005605

[24] 
W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” in Proc. Neural Information Processing Systems, 2017.

[25] 
J. Atwood and D. Towsley, “Diffusionconvolutional neural networks,” in Proc. Advances in Neural Information Processing Systems, 2016, pp. 1993–2001.

[26] 
P. W. Battaglia, J. B. Hamrick, V. Bapst, A. SanchezGonzalez, V. Zambaldi, M. Malinowski, A. Tacchetti, D. Raposo, A. Santoro, R. Faulkner, et al., “Relational inductive biases, deep learning, and graph networks,” Computing Research Repository, 2018.

[27] 
F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner, and G. Monfardini, “The graph neural network model,” IEEE Trans. Neural Networks, vol. 20, no. 1, pp. 61–80, 2008.

[28] 
Y. Li, D. Tarlow, M. Brockschmidt, and R. Zemel, “Gated graph sequence neural networks,” in Proc. Int. Conf. Learning Representations, 2016.

[29] 
H. Dai, Z. Kozareva, B. Dai, A. Smola, and L. Song, “Learning steadystates of iterative algorithms over graphs,” in Proc. Int. Conf. Machine Learning, 2018, pp. 1114–1122.

[30] 
K. Xu, W. Hu, J. Leskovec, and S. Jegelka, “How powerful are graph neural networks?” in Proc. Int. Conf. Learning Representations, 2019.

[31] 
P. Veličković, G. Cucurull, A. Casanova, A. Romero, P. Liò, and Y. Bengio, “Graph attention networks,” in Proc. Int. Conf. Learning Representations, 2018.

[32] 
L. Lovász, “Random walks on graphs: A survey, combinatorics, Paul Erdos is eighty,” Lecture Notes in Mathematics, vol. 2, no. 1, pp. 1–46, 1993. doi: 10.1007/BFb0077189

[33] 
X. Hong, T. Zhang, Z. Cui, C. Xu, L. Zhang, and J. Yang, “Fast hyperwalk gridded convolution on graph,” in Proc. Chinese Conf. Pattern Recognition and Computer Vision, Springer, 2020, pp. 197– 208.

[34] 
H. Kashima, K. Tsuda, and A. Inokuchi, “Marginalized kernels between labeled graphs,” in Proc. 20th Int. Conf. Machine Learning, 2003, pp. 321–328.

[35] 
N. Shervashidze, S. Vishwanathan, T. Petri, K. Mehlhorn, and K. Borgwardt, “Efficient graphlet kernels for large graph comparison,” in Proc. Artificial Intelligence and Statistics, 2009, pp. 488–495.

[36] 
P. Yanardag and S. Vishwanathan, “Deep graph kernels,” in Proc. 21st ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, 2015, pp. 1365–1374.

[37] 
C. Morris, K. Kersting, and P. Mutzel, “Glocalized weisfeilerlehman graph kernels: Globallocal feature maps of graphs,” in Proc. IEEE Int. Conf. Data Mining, 2017, pp. 327–336.

[38] 
M. Ceci, A. Appice, N. Barile, and D. Malerba, “Transductive learning from relational data,” in Proc. Int. Workshop on Machine Learning and Data Mining in Pattern Recognition, 2007, pp. 324–338.

[39] 
R. S. Michalski, “A theory and methodology of inductive learning,” in Machine Learning. Springer, 1983, pp. 83–134.

[40] 
S. Fortunato, “Community detection in graphs,” Physics Reports, vol. 486, no. 3–5, pp. 75–174, 2010. doi: 10.1016/j.physrep.2009.11.002

[41] 
A. Lancichinetti and S. Fortunato, “Community detection algorithms: A comparative analysis,” Physical Review E, vol. 80, no. 5, p. 056117, 2009.

[42] 
L. Bai, X. Cheng, J. Liang, and Y. Guo, “Fast graph clustering with a new description model for community detection,” Information Sciences, vol. 388, pp. 37–47, 2017.

[43] 
M. Schwartz, Telecommunication Networks: Protocols, Modeling and Analysis. AddisonWesley Longman Publishing Co., Inc., 1986.

[44] 
A. Chapanond, M. S. Krishnamoorthy, and B. Yener, “Graph theoretic and spectral analysis of enron email data,” Computational &Mathematical Organization Theory, vol. 11, no. 3, pp. 265–281, 2005.

[45] 
T. He, Y. Liu, T. H. Ko, K. C. Chan, and Y.S. Ong, “Contextual correlation preserving multiview featured graph clustering,” IEEE Trans. Cybernetics, vol. 50, no. 10, pp. 4318–4331, 2019.

[46] 
L. Hu, S. Yang, X. Luo, and M. Zhou, “An algorithm of inductively identifying clusters from attributed graphs,” IEEE Trans. Big Data, 2020. DOI: 10.1109/TBDATA.2020.2964544

[47] 
L. Hu, K. C. Chan, X. Yuan, and S. Xiong, “A variational Bayesian framework for cluster analysis in a complex network,” IEEE Trans. Knowledge and Data Engineering, vol. 32, no. 11, pp. 2115–2128, 2019.

[48] 
B. Perozzi, R. AlRfou, and S. Skiena, “Deepwalk: Online learning of social representations,” in Proc. 20th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, 2014, pp. 701–710.

[49] 
H. Zhang, X. Shang, H. Luan, M. Wang, and T.S. Chua, “Learning from collective intelligence: Feature learning using social images and tags,” ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 13, no. 1, p. 1, 2017.

[50] 
S. Pan, J. Wu, X. Zhu, C. Zhang, and Y. Wang, “Triparty deep network representation,” Network, vol. 11, no. 9, p. 12, 2016.

[51] 
A. Grover and J. Leskovec, “node2vec: Scalable feature learning for networks,” in Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, 2016, pp. 855– 864.

[52] 
L. Qiu, Y. Cao, Z. Nie, Y. Yu, and Y. Rui, “Learning word representation considering proximity and ambiguity,” in Proc. 28th AAAI Conf. Artificial Intelligence, 2014.

[53] 
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, “Distributed representations of words and phrases and their compositionality,” in Proc. Advances in Neural Information Processing Systems, 2013, pp. 3111–3119.

[54] 
C. J. Geyer, “Introduction to Markov chain Monte Carlo,” Handbook of Markov Chain Monte Carlo, vol. 20116022, p. 45, 2011.

[55] 
D. P. Kingma and M. Welling, “Autoencoding variational Bayes,” arXiv: Machine Learning, 2013.

[56] 
L. Devroye, “Samplebased nonuniform random variate generation,” in Proc. 18th Conf. Winter Simulation, 1986, pp. 260–265.

[57] 
C. Robert and G. Casella, Monte Carlo Statistical Methods. Springer Science & Business Media, 2013.

[58] 
P. Sen, G. Namata, M. Bilgic, L. Getoor, B. Galligher, and T. EliassiRad, “Collective classification in network data,” AI Magazine, vol. 29, no. 3, pp. 93–93, 2008. doi: 10.1609/aimag.v29i3.2157

[59] 
J. Klicpera, S. Weißenberger, and S. Günnemann, “Diffusion improves graph learning,” arXiv preprint arXiv: 1911.05485, 2019.

[60] 
Z. Yang, W. W. Cohen, and R. Salakhutdinov, “Revisiting semisupervised learning with graph embeddings,” in Proc. Int. Conf. Machine Learning, 2016.

[61] 
A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. Hruschka, and T. M. Mitchell, “Toward an architecture for neverending language learning,” in Proc. 24th AAAI Conf. Artificial Intelligence, 2010.

[62] 
B. Dalvi, A. Mishra, and W. W. Cohen, “Hierarchical semisupervised classification with incomplete class hierarchies,” in Proc. 9th ACM Int. Conf. Web Search and Data Mining, 2016, pp. 193–202.

[63] 
C. Zhuang and Q. Ma, “Dual graph convolutional networks for graphbased semisupervised classification,” in Proc. Int. World Wide Web Conf. Steering Committee, 2018, pp. 499–508.

[64] 
B. Jiang, Z. Zhang, D. Lin, and J. Tang, “Graph learningconvolutional networks,” in Proc. Computer Vision and Pattern Recognition, 2019.

[65] 
B. Jiang and D. Lin, “Graph laplacian regularized graph convolutional networks for semisupervised learning,” arXiv preprint arXiv: 1809.09839, 2018.

[66] 
Y. Feng, H. You, Z. Zhang, R. Ji, and Y. Gao, “Hypergraph neural networks,” in Proc. AAAI Conf. Artificial Intelligence, vol. 33, 2019, pp. 3558–3565.

[67] 
L. V. Der Maaten and G. E. Hinton, “Visualizing data using tSNE,” Journal of Machine Learning Research, vol. 9, pp. 2579–2605, 2008.

[68] 
P. J. Rousseeuw, “Silhouettes: A graphical aid to the interpretation and validation of cluster analysis,” Journal of Computational and Applied Mathematics, vol. 20, no. 1, pp. 53–65, 1987.
