楼主: oliyiyi
1119 1

7 Steps to Understanding Computer Vision [推广有奖]

版主

已卖:2996份资源

泰斗

1%

还不是VIP/贵宾

-

TA的文库  其他...

计量文库

威望
7
论坛币
47275 个
通用积分
31671.2767
学术水平
1454 点
热心指数
1573 点
信用等级
1364 点
经验
384134 点
帖子
9629
精华
66
在线时间
5508 小时
注册时间
2007-5-21
最后登录
2025-7-8

初级学术勋章 初级热心勋章 初级信用勋章 中级信用勋章 中级学术勋章 中级热心勋章 高级热心勋章 高级学术勋章 高级信用勋章 特级热心勋章 特级学术勋章 特级信用勋章

楼主
oliyiyi 发表于 2016-8-10 09:22:07 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币

By Pulkit Khandelwal, VIT University.

If We Want Machines to Think, We Need to Teach Them to See.

-Fei Fei Li, Director of Stanford AI Lab and Stanford Vision Lab

Learning and computation provides machine the ability to better understand the context of images and build visual systems which truly understand intelligence. The huge amount of image and video content urges the scientific community to make sense and identify patterns amongst it to reveal details which we aren’t aware of. Computer Vision generates mathematical models from images; Computer Graphics draws in images from models and lastly image processing takes image as an input and gives an image at the output.



Computer Vision is an overlapping field drawing on concepts from areas such as artificial intelligence, digital image processing, machine learning, deep learning, pattern recognition, probabilistic graphical models, scientific computing and a lot of mathematics. So, take this post as a starting point to dwell into this field. I will try to cover as much as possible in this post but still there will be a lot of advanced topics and some cools things which might be left out (maybe for later posts?).

Step 1 - Background Check

As usual get the basics right with an undergraduate course inprobability, statistics, linear algebra, calculus (both: differential andintegral). A brief introduction to matrix calculus should come in handy. Also, my experience says that if one has some idea of digital signal processing then it should be helpful to grasp concepts easily.

On the implementation side, I prefer one to have a background in both MATLAB and Python. Check sentdex (a YouTube channel) for everything you need for scientific programming in Python. Do keep in mind that Computer Vision is all about computational programming.

You might want to have a look to Probabilistic Graphical Models(though it is a very advanced subject). You can always return to it later.

Step 2 - Digital Image Processing

Watch the videos by Prof. Guillermo Sapiro of Duke University. The syllabus is very self contained and comes in with lot of exercises. You can find videos on Youtube or wait for the next session on Coursera starting September 2016.

Refer to the bookDigital Image Processing by Gonzalez and Woods. Go through the examples of the concepts as taught by this course on MATLAB.

Step 3 - Computer Vision

Once done with Digital Image Processing the next step is to understand the mathematical models underlying the formulations of variety of applications of image and video content. University of Florida’s Prof. Mubarak Shah’s course on Computer Vision act as good introductory course covering all the fundamental concepts required to build on advanced material.



Watch these videos and alongside implementing the learned concepts and algorithms by following GaTech Prof. James Hays’ projects of his Computer Vision class. These assignments are also on MATLAB. Do not skip these. You only get the deep understanding of the algorithms and equations once you implement them from scratch.

Step 4 - Advanced Computer Vision

Following the first three steps will now make you get going for the advanced material.

Coursera’s offering Discrete Inference in Artificial Vision gives you a probabilistic graphical models and mathematical overdose of Computer Vision. Although Coursera has removed this content from the website, you should be able to find that somewhere on the internet. Things now seem to look interesting and will definitely give you a feel of how complex yet simple models are built for machine vision systems. This course should also be a stepping stone to get going with academic papers.

Step 5 - Bring in Python and Open Source

Let’s get into Python.

There are many packages such as OpenCV, PIL, vlfeat and the likes. Now is the right time to use packages built by others into your projects. No need to implement everything from scratch.

You can find many good blogs and videos to get started withProgramming Computer Vision with Python. I would recommend this book; it should be more than enough. Go and have fun! See how MATLAB and Python get you to implement algorithms.

Step 6 - Machine Learning and CovNets

There are just too many posts on getting started with machine learning.

Check here, here and here.

From now on you are better off sticking with Python. Have a quick go through Building Machine Learning Systems with Python and Python Machine Learning.

With all the deep learning hype around, you now enter into the current research work in Computer Vision: the use of CovNets.Stanford’s CS231n: Convolutional Neural Networks for Visual Recognition is a comprehensive course on this. Although videos have been taken down from the official website, you can very easily find re-uploads on Youtube.

Step 7 - How should I explore more?

You might think that I have already overloaded you with so much of information. But, there is lot of stuff to explore.

One good approach should be to have a look at some of the graduate seminar courses by Sanja Fidler of University of Torontoand James Hays to get an idea of current research directions in Computer Vision through rich academic papers.

Another possible approach is to follow top papers from top conferences such as CVPR, ICCV, ECCV, BMVC. Alternatively you can follow blogs such as pyimagesearch.com orcomputervisionblog.com or aishack.in. Watch endless talks and lectures on Computer Vision and related fields at videolectures.net!

In a nutshell you have covered the history of computer vision right from filters, feature detectors and descriptors, camera models, trackers to tasks such as recognition, segmentation and the most recent advancements in neural nets and deep learning. In the next post I will give a list of top blogs to follow and in the subsequent post I will write about the top papers of all time to read related to Computer Vision.

Bio: Pulkit Khandelwal is an incoming Computer Science Master’s student at McGill University. His interests lie in Computer Vision and Machine Learning.


二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Understand Computer compute VISION Comput Vision

已有 1 人评分经验 论坛币 收起 理由
Nicolle + 50 + 50 精彩帖子

总评分: 经验 + 50  论坛币 + 50   查看全部评分

缺少币币的网友请访问有奖回帖集合
https://bbs.pinggu.org/thread-3990750-1-1.html

沙发
h2h2 发表于 2016-8-11 04:46:23
谢谢分享

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2026-1-11 14:30