南航暑期国际课程大数据可视化第4讲课件3.ppt

上传人(卖家):晟晟文业 文档编号:4106018 上传时间:2022-11-11 格式:PPT 页数:38 大小:3MB
下载 相关 举报
南航暑期国际课程大数据可视化第4讲课件3.ppt_第1页
第1页 / 共38页
南航暑期国际课程大数据可视化第4讲课件3.ppt_第2页
第2页 / 共38页
南航暑期国际课程大数据可视化第4讲课件3.ppt_第3页
第3页 / 共38页
南航暑期国际课程大数据可视化第4讲课件3.ppt_第4页
第4页 / 共38页
南航暑期国际课程大数据可视化第4讲课件3.ppt_第5页
第5页 / 共38页
点击查看更多>>
资源描述

1、Visualization andData MiningOutline Graphical excellence and lie factor Representing data in 1,2,and 3-D Representing data in 4+dimensions Parallel coordinates Scatterplots Stick figuresNapoleon Invasion of Russia,1812NapoleonMarley,1885 www.odt.org,from http:/www.odt.org/Pictures/minard.jpg,used by

2、 permissionSnows Cholera Map,1855Asia at nightSouth and North Korea at nightSeoul,South KoreaNorth KoreaNotice how darkit isVisualization RoleSupport interactive explorationHelp in result presentationDisadvantage:requires human eyesCan be misleading Bad Visualization:SpreadsheetYear Sales1999 2,1102

3、000 2,1052001 2,1202002 2,1212003 2,124Sales2095210021052110211521202125213019992000200120022003SalesWhat is wrong with this graph?Bad Visualization:Spreadsheet with misleading Y axisYear Sales1999 2,1102000 2,1052001 2,1202002 2,1212003 2,124Sales2095210021052110211521202125213019992000200120022003

4、SalesY-Axis scale gives WRONGimpression of big changeBetter VisualizationYear Sales1999 2,1102000 2,1052001 2,1202002 2,1212003 2,124Sales05001000150020002500300019992000200120022003SalesAxis from 0 to 2000 scale gives correct impression of small changeLie Factordataineffectofsizegraphicinshowneffec

5、tofsizeFactorLie8.14528.0833.718)0.185.27(6.0)6.03.5(Tufte requirement:0.95Lie Factor1.05Tuftes Principles of Graphical Excellence Give the viewer the greatest number of ideas in the shortest time with the least ink in the smallest space.Tell the truth about the data!Visualization MethodsVisualizing

6、 in 1-D,2-D and 3-D well-known visualization methodsVisualizing more dimensions Parallel Coordinates Other ideas1-D(Univariate)Data Representations7531020MeanlowhighMiddle 50%Tukey box plotHistogram2-D(Bivariate)Data Scatter plot,pricemileage3-D Data(projection)priceLie Factor=14.83-D image(requires

7、 3-D blue and red glasses)Taken by Mars Rover Spirit,Jan 2004Visualizing in 4+Dimensions Scatterplots Parallel Coordinates Chernoff faces Stick Figures Multiple ViewsGive each variable its own display A B C D E1 4 1 8 3 52 6 3 4 2 13 5 7 2 4 34 2 6 3 1 5A B C D E1234Problem:does not show correlation

8、sScatterplot MatrixRepresent each possiblepair of variables in theirown 2-D scatterplot(car data)Q:Useful for what?A:linear correlations (e.g.horsepower&weight)Q:Misses what?A:multivariate effectsParallel Coordinates Encode variables along a horizontal row Vertical line specifies valuesDataset in a

9、Cartesian coordinatesSame dataset in parallel coordinatesInvented by Alfred Inselberg while at IBM,1985Example:Visualizing Iris DataIris setosaIris versicolorIris virginicaFlower PartsPetal,a non-reproductive part of the flowerSepal,a non-reproductive part of the flowerParallel Coordinates Sepal Len

10、gth5.1Parallel Coordinates:2 DSepal Length5.1Sepal Width3.5Parallel Coordinates:4 DSepal Length5.1Sepal WidthPetal lengthPetal Width3.51.40.25.13.51.40.2Parallel Visualization of Iris dataParallel Visualization SummaryEach data point is a lineSimilar points correspond to similar linesLines crossing

11、over correspond to negatively correlated attributesInteractive exploration and clusteringProblems:order of axes,limit to 20 dimensionsChernoff FacesEncode different variables values in characteristicsof human facehttp:/www.cs.uchicago.edu/wiseman/chernoff/http:/ applets:Interactive FaceChernoff face

12、s,exampleStick FiguresTwo variables are mapped to X,Y axesOther variables are mapped to limb lengths and angles Texture patterns can show data characteristicsStick figures,examplecensus data showingage,income,sex,education,etc.Closed figures correspond to women and we can see more of them on the left.Note also a young woman with high incomeVisualization softwareFree and Open-sourceGgobiXmdvMany more-see www.KD SummaryMany methodsVisualization is possible in more than 3-DAim for graphical excellence

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 办公、行业 > 各类PPT课件(模板)
版权提示 | 免责声明

1,本文(南航暑期国际课程大数据可视化第4讲课件3.ppt)为本站会员(晟晟文业)主动上传,163文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。
2,用户下载本文档,所消耗的文币(积分)将全额增加到上传者的账号。
3, 若此文所含内容侵犯了您的版权或隐私,请立即通知163文库(发送邮件至3464097650@qq.com或直接QQ联系客服),我们立即给予删除!


侵权处理QQ:3464097650--上传资料QQ:3464097650

【声明】本站为“文档C2C交易模式”,即用户上传的文档直接卖给(下载)用户,本站只是网络空间服务平台,本站所有原创文档下载所得归上传人所有,如您发现上传作品侵犯了您的版权,请立刻联系我们并提供证据,我们将在3个工作日内予以改正。


163文库-Www.163Wenku.Com |网站地图|