机器视觉综述课件.ppt

上传人(卖家):晟晟文业 文档编号:3952679 上传时间:2022-10-29 格式:PPT 页数:76 大小:28.98MB
下载 相关 举报
机器视觉综述课件.ppt_第1页
第1页 / 共76页
机器视觉综述课件.ppt_第2页
第2页 / 共76页
机器视觉综述课件.ppt_第3页
第3页 / 共76页
机器视觉综述课件.ppt_第4页
第4页 / 共76页
机器视觉综述课件.ppt_第5页
第5页 / 共76页
点击查看更多>>
资源描述

1、Computer Vision(机器视觉)Image by Todays Talk What is Computer Vision?Why Study Computer Vision?How Vision is Used Now?Overview of Computer Vision Algorithm Challenges of Computer Vision Questions2What is computer vision?Terminator 23Terminator 5Every picture tells a story4Goal of computer vision is to

2、write computer programs that can interpret imagesCan computers match(or beat)human vision?5What is Computer Vision?Automatic understanding of images and video1.Computing properties of the 3D world from visual data(measurement)6 1.Vision for measurementReal-time stereoStructure from motionNASA Mars R

3、overPollefeys et al.Multi-view stereo forcommunity photo collectionsGoesele et al.Slide credit:L.Lazebnik7What is Computer Vision?Automatic understanding of images and video1.Computing properties of the 3D world from visual data(measurement)2.Algorithms and representations to allow a machine to reco

4、gnize objects,people,scenes,and activities.(perception and interpretation)82.Vision for perception,interpretationskywaterFerris wheelamusement parkCedar Point12 Etreetreetreecarouseldeckpeople waiting in linerideriderideumbrellaspedestriansmaxairbenchtreeLake Eriepeople sitting on rideObjectsActivit

5、iesScenesLocationsText/writingFacesGesturesMotionsEmotionsThe Wicked Twister9What is Computer Vision?Automatic understanding of images and video1.Computing properties of the 3D world from visual data(measurement)2.Algorithms and representations to allow a machine to recognize objects,people,scenes,a

6、nd activities.(perception and interpretation)3.Algorithms to mine,search,and interact with visual data(search and organization)103.Vision for search and organization11Components of a computer vision systemLightingSceneCameraComputer Scene InterpretationSrinivasa Narasimhans slide12Computer vision vs

7、 human visionWhat we seeWhat a computer sees13Vision is really hardVision is an amazing feat of natural intelligence Visual cortex occupies about 50%of brain More human brain devoted to vision than anything elseIs that a queen or a bishop?14Vision is multidisciplinary From wikiComputer GraphicsHCI15

8、Why computer vision mattersSafetyHealthSecurityComfortAccessFun16A little story about Computer VisionIn 1966,Marvin Minsky at MIT asked his undergraduate student Gerald Jay Sussman to“spend the summer linking a camera to acomputer and getting the computer to describe what it saw”.We now know that th

9、e problem is slightly more difficult than that.(Szeliski 2009,Computer Vision)17Ridiculously brief history of computer vision1966:Minsky assigns computer vision as an undergraduate summer project1960s:interpretation of synthetic worlds1970s:some progress on interpreting selected images1980s:ANNs com

10、e and go;shift toward geometry and increased mathematical rigor1990s:face recognition;statistical analysis in vogue2000s:broader recognition;large annotated datasets available;video processing starts2030s:robot uprising?Guzman 68Ohta Kanade 78Turk and Pentland 9119 Why study computer vision?Millions

11、 of images being captured all the timeLots of useful applicationsThe next slides show the current state of the artSource:S.Lazebnik Flickr01E+092E+093E+094E+095E+096E+092003/12/152004/12/152005/12/152006/12/152007/12/152008/12/152009/12/151 billion2 billion3 billion4 billion5 billion6 billion Other

12、photo sharing sites10 billion20 billion50 billion2003/12/152004/6/152004/12/152005/6/152005/12/152006/6/152006/12/152007/6/152007/12/152008/6/152008/12/152009/6/1530 billion40 billion and growingFlickr:1.7 million photos/dayFacebook:100 million photos/dayYouTube:35 hours of video every minute 57 bil

13、lion photos will be taken(US)in 2010http:/ of November 2010)(compare with 17 billion negatives exposed in 1996)(as of February 2010)How vision is used nowExamples of state-of-the-art241.Optical character recognition(OCR)Digit recognition,AT&T labshttp:/ to convert scanned docs to text If you have a

14、scanner,it probably came with OCR softwareLicense plate readershttp:/en.wikipedia.org/wiki/Automatic_number_plate_recognition252.Face detectionMany new digital cameras now detect facesCanon,Sony,Fuji,263.Smile detectionSony Cyber-shot T70 Digital Still Camera 274.3D from thousands of imagesBuilding

15、Rome in a Day:Agarwal et al.200928The old city of Dubrovnik,4,619 images,3,485,717 points5.Object recognition(in supermarkets)LaneHawk by EvolutionRobotics“A smart camera is flush-mounted in the checkout lane,continuously watching for items.When an item is detected and recognized,the cashier verifie

16、s the quantity of items that were found under the basket,and continues to close the transaction.The item can remain under the basket,and with LaneHawk,you are assured to get paid for it“296.Vision-based biometrics“How the Afghan Girl was Identified by Her Iris Patterns”National Geographic307.Forensi

17、csSource:Nayar and Nishino,“Eyes for Relighting”Source:Nayar and Nishino,“Eyes for Relighting”Source:Nayar and Nishino,“Eyes for Relighting”8.Login without a passwordFingerprint scanners on many new laptops,other devicesFace recognition systems now beginning to appear more widelyhttp:/ recognition(i

18、n mobile phones)Point&Find,NokiaGoogle Goggles3510.Vision in spaceVision systems(JPL)used for several tasks Panorama stitching 3D terrain modeling Obstacle detection,position tracking For more,read“Computer Vision on Mars”by Matthies et al.NASAS Mars Exploration Rover Spirit captured this westward v

19、iew from atop a low plateau where Spirit spent the closing months of 2007.3611.Industrial robotsVision-guided robots position nut runners on wheels3712.Mobile robotshttp:/www.robocup.org/NASAs Mars Spirit Roverhttp:/en.wikipedia.org/wiki/Spirit_roverSaxena et al.2008STAIR at Stanford38THANK YOUSUCCE

20、SS2022-10-29可编辑13.Medical imagingImage guided surgeryGrimson et al.,MIT3D imagingMRI,CT4014.Digital cosmetics4115.InpaintingBertalmio et al.SIGGRAPH 004216.DebluringFergus et al.SIGGRAPH 064317.SportsSportvision first down lineNice explanation on http:/ carsMobileye Vision systems currently in high-

21、end BMW,GM,Volvo models By 2010:70%of car manufacturers.4519.Google carsOct 9,2010.Google Cars Drive Themselves,in Traffic.The New York Times.John MarkoffJune 24,2011.Nevada state law paves the way for driverless cars.Financial Post.Christine DobbyAug 9,2011,Human error blamed after Googles driverle

22、ss car sparks five-vehicle crash.The Star(Toronto)4620.Interactive Games:KinectObject Recognition:http:/ Matrix movies,ESC Entertainment,XYZRGB,NRC21.Special effects:shape capture48Pirates of the Carribean,Industrial Light and Magic22.Special effects:motion capture49Computer Vision and Nearby Fields

23、l Computer Graphics:Models to Imagesl Comp.Photography:Images to Imagesl Computer Vision:Images to Models50Overview of Computer Vision Algorithm51So what do humans care about?Verification:is that a bus?slide by Fei Fei,Fergus&Torralba 52Detection:are there cars?slide by Fei Fei,Fergus&Torralba 53Ide

24、ntification:is that a picture of Mao?slide by Fei Fei,Fergus&Torralba 54Object categorizationskybuildingflagwallbannerbuscarsbusfacestreet lampslide by Fei Fei,Fergus&Torralba 55Scene and context categorization outdoor city traffic slide by Fei Fei,Fergus&Torralba 56Rough 3D layout,depth ordering57O

25、verview of Computer Vision Algorithml Image formationl Features l Grouping&fittingl Multi-view geometryl Recognition&learningl Motion&tracking581.Image formationHow does light in 3d world project to form 2d images?592.Features and filtersTransforming and describing images;textures,colors,edges603.Gr

26、ouping&fittingfig from Shi et alClustering,segmentation,fitting;what parts belong together?614.Multiple viewsHartley and ZissermanMulti-view geometry,matching,invariant features,stereo visionFei-Fei Li625.Recognition and learningRecognizing objects and categories,learning techniques636.Motion and tr

27、ackingTracking objects,video analysis,low level motion,optical flow64Challenges 1:view point variationMichelangelo 1475-156465Challenges 2:illuminationslide credit:S.Ullman66Challenges 3:occlusionMagritte,1957 67Challenges 4:scaleslide by Fei Fei,Fergus&Torralba 68Challenges 5:deformationXu,Beihong

28、194369Challenges 6:background clutterKlimt,191370Challenges 7:object intra-class variationslide by Fei-Fei,Fergus&Torralba 71Challenges 8:local ambiguityslide by Fei-Fei,Fergus&Torralba 72Challenges 9:the world behind the image 73Challenges 10:complexityThousands to millions of pixels in an image3,0

29、00-30,000 human recognizable object categories30+degrees of freedom in the pose of articulated objects(humans)Billions of images indexed by Google Image Search18 billion+prints produced from digital camera images in 2004295.5 million camera phones sold in 200574Keep Moving Ok,clearly the vision problem is deep and challenging time to give up?Active research area with exciting progress!75THANK YOUSUCCESS2022-10-29可编辑

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 办公、行业 > 各类PPT课件(模板)
版权提示 | 免责声明

1,本文(机器视觉综述课件.ppt)为本站会员(晟晟文业)主动上传,163文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。
2,用户下载本文档,所消耗的文币(积分)将全额增加到上传者的账号。
3, 若此文所含内容侵犯了您的版权或隐私,请立即通知163文库(发送邮件至3464097650@qq.com或直接QQ联系客服),我们立即给予删除!


侵权处理QQ:3464097650--上传资料QQ:3464097650

【声明】本站为“文档C2C交易模式”,即用户上传的文档直接卖给(下载)用户,本站只是网络空间服务平台,本站所有原创文档下载所得归上传人所有,如您发现上传作品侵犯了您的版权,请立刻联系我们并提供证据,我们将在3个工作日内予以改正。


163文库-Www.163Wenku.Com |网站地图|