Radviz is a radial visualization technique that maps multi-dimensional data onto a two-dimensional plane. Each dimension is represented by a point on the circumference of a circle, called a dimension anchor (DA), and the DAs can be reordered to reveal different patterns in the dataset. Increasing the number of dimensions gives more freedom in placing the DAs and therefore more opportunities to find meaningful visualizations. This paper describes a method that expands a single dimension into multiple new dimensions in Radviz in a principled way. The method first computes the probability distribution histogram of the dimension. The mean shift algorithm is then applied to locate the centers of probability density, which are used to segment the histogram, and the dimension is expanded into as many new dimensions as there are segments. The paper also proposes using Dunn's index and the accuracy rate to find the optimal placement of DAs, so that the visual clustering obtained after dimension expansion can be both improved and evaluated. Finally, the effectiveness of the approach is demonstrated on synthetic and real-world datasets.
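The pipeline summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the Gaussian kernel, the fixed `bandwidth`, the rule for merging converged seeds, and in particular the way a value is written into the new dimension of its segment are all assumptions made for illustration. Only the standard Radviz projection (a weighted average of anchor positions) is taken as given.

```python
import numpy as np

def mean_shift_modes_1d(values, bandwidth, n_bins=50, tol=1e-4, max_iter=200):
    """Locate density centers of one dimension by mean shift on its histogram."""
    counts, edges = np.histogram(values, bins=n_bins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    prob = counts / counts.sum()            # probability distribution histogram
    points = centers[prob > 0].copy()       # one seed per occupied bin
    for _ in range(max_iter):
        # Gaussian kernel weight of every bin for every seed, scaled by bin mass.
        diff = (points[:, None] - centers[None, :]) / bandwidth
        w = np.exp(-0.5 * diff ** 2) * prob
        shifted = (w @ centers) / w.sum(axis=1)
        converged = np.max(np.abs(shifted - points)) < tol
        points = shifted
        if converged:
            break
    # Merge seeds that ended up at (almost) the same center (assumed rule).
    modes = []
    for p in np.sort(points):
        if not modes or p - modes[-1] > bandwidth / 2:
            modes.append(p)
    return np.array(modes)

def expand_dimension(values, modes):
    """Split one dimension into len(modes) new dimensions (assumed scheme)."""
    cuts = 0.5 * (modes[:-1] + modes[1:])   # boundaries midway between centers
    segment = np.searchsorted(cuts, values) # segment index of each record
    lo, hi = values.min(), values.max()
    scaled = (values - lo) / (hi - lo) if hi > lo else np.zeros_like(values)
    expanded = np.zeros((len(values), len(modes)))
    # Assumption: a record keeps its normalized value in its own segment's
    # dimension and is zero in the other new dimensions.
    expanded[np.arange(len(values)), segment] = scaled
    return expanded, segment

def radviz(data, angles=None):
    """Standard Radviz projection of rows in [0, 1]^d onto the unit disc."""
    d = data.shape[1]
    if angles is None:
        angles = 2 * np.pi * np.arange(d) / d   # evenly spaced anchors
    anchors = np.column_stack([np.cos(angles), np.sin(angles)])
    total = data.sum(axis=1, keepdims=True)
    safe_total = np.where(total > 0, total, 1.0)  # all-zero rows map to origin
    return (data @ anchors) / safe_total

# Synthetic example: one dimension whose values form two well-separated groups.
rng = np.random.default_rng(0)
values = np.concatenate([rng.normal(0.2, 0.03, 200), rng.normal(0.8, 0.03, 200)])
modes = mean_shift_modes_1d(values, bandwidth=0.1)
expanded, segment = expand_dimension(values, modes)
xy = radviz(expanded)
```

Here the number of new dimensions equals the number of density centers found. In this sketch the bandwidth controls how many centers appear, so it would need to be chosen per dataset.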