Raspberry Pi Computer Vision Programming

#1 (original post)

ReneeBK posted on 2017-6-5 03:41:04


Attachment: Raspberry Pi Computer Vision Programming.pdf (8.85 MB, requires 5 forum coins)


#2

ReneeBK posted on 2017-6-5 03:45:16

Retrieving image properties
We can retrieve and use many properties of an image through OpenCV. Have a look at the following code:
import cv2
# Flag 1 loads the image in color (BGR) mode
img = cv2.imread('/home/pi/book/test_set/lena_color_512.tif', 1)
print(img.shape)  # (rows, columns, channels)
print(img.size)   # total number of array elements
print(img.dtype)  # data type of each element
The img.shape attribute returns the shape of an image, that is, its dimensions and the number of color channels. The output of the preceding code is as follows:
pi@pi02 ~/book/code/chapter03 $ python prog1.py
(512, 512, 3)
786432
uint8
If the image is in color, img.shape returns a triplet containing the number of rows, columns, and channels in the image. Usually, the number of channels is three, representing the blue, green, and red channels (OpenCV stores them in BGR order). If the image is grayscale, img.shape only returns the number of rows and columns. Try modifying the preceding code to read the image in grayscale mode and observe the output of img.shape.
The img.size attribute returns the total number of array elements (rows × columns × channels, here 512 × 512 × 3 = 786432), and img.dtype returns the image data type.
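To see how these three properties relate without needing the test image, here is a minimal sketch that uses synthetic NumPy arrays in place of a loaded image. The arrays are made up for illustration; cv2.imread() returns NumPy arrays of the same kind.

```python
import numpy as np

# Synthetic stand-ins for loaded images (hypothetical data, not the book's test set)
color = np.zeros((512, 512, 3), dtype=np.uint8)   # rows, columns, channels
gray = np.zeros((512, 512), dtype=np.uint8)       # rows, columns only

rows, cols, channels = color.shape
# size counts every array element, so it equals rows * columns * channels
assert color.size == rows * cols * channels

print(color.shape, color.size, color.dtype)
print(gray.shape, gray.size, gray.dtype)
```

For the grayscale array, shape has only two entries and size is simply rows × columns.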

#3

ReneeBK posted on 2017-6-5 03:45:50

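import cv2
# Load two color images; they must have the same size for the arithmetic below
img1 = cv2.imread('/home/pi/book/test_set/4.2.03.tiff', 1)
img2 = cv2.imread('/home/pi/book/test_set/4.2.04.tiff', 1)
cv2.imshow('Image1', img1)
cv2.waitKey(0)
cv2.imshow('Image2', img2)
cv2.waitKey(0)
# cv2.add() and cv2.subtract() saturate: results are clipped to the range 0 to 255
cv2.imshow('Addition', cv2.add(img1, img2))
cv2.waitKey(0)
# Subtraction is not commutative, so both orders are shown
cv2.imshow('Image1-Image2', cv2.subtract(img1, img2))
cv2.waitKey(0)
cv2.imshow('Image2-Image1', cv2.subtract(img2, img1))
cv2.waitKey(0)
cv2.destroyAllWindows()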
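One detail worth knowing when reading the output of this program: cv2.add() and cv2.subtract() perform saturated arithmetic on 8-bit images, so results are clipped to the range 0 to 255, whereas the plain NumPy + and - operators wrap around modulo 256. The following sketch emulates the saturated behaviour with NumPy alone on made-up pixel values. The helpers saturated_add and saturated_subtract are illustrative names, not OpenCV functions.

```python
import numpy as np

def saturated_add(a, b):
    """Emulate saturated uint8 addition: widen, add, clip to 0-255."""
    total = a.astype(np.int16) + b.astype(np.int16)
    return np.clip(total, 0, 255).astype(np.uint8)

def saturated_subtract(a, b):
    """Emulate saturated uint8 subtraction: widen, subtract, clip to 0-255."""
    diff = a.astype(np.int16) - b.astype(np.int16)
    return np.clip(diff, 0, 255).astype(np.uint8)

# Hypothetical pixel values chosen to overflow in both directions
x = np.array([250, 100, 10], dtype=np.uint8)
y = np.array([10, 100, 250], dtype=np.uint8)

print(saturated_add(x, y))       # sums above 255 are clipped to 255
print(saturated_subtract(x, y))  # negative differences are clipped to 0
print(x + y)                     # plain NumPy addition wraps around modulo 256
```

Because negative differences are clipped to 0, the 'Image1-Image2' and 'Image2-Image1' windows generally show different pictures.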

#4

ReneeBK posted on 2017-6-5 03:46:24

import cv2
import numpy as np
import time
img1 = cv2.imread('/home/pi/book/test_set/4.2.03.tiff', 1)
img2 = cv2.imread('/home/pi/book/test_set/4.2.04.tiff', 1)
# Step alpha from 0 to 1 so the display blends from img2 into img1
for alpha in np.linspace(0, 1, 40):
    beta = 1 - alpha
    print('ALPHA =' + str(alpha) + ' BETA =' + str(beta))
    cv2.imshow('Image Transition', cv2.addWeighted(img1, alpha, img2, beta, 0))
    time.sleep(0.05)
    # Stop early if the Esc key (code 27) is pressed
    if cv2.waitKey(1) == 27:
        break
cv2.destroyAllWindows()
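The blend computed by cv2.addWeighted(src1, alpha, src2, beta, gamma) is the weighted sum src1*alpha + src2*beta + gamma, saturated to the image data type. A minimal NumPy sketch of that formula follows, assuming 8-bit images. The function name blend and the pixel values are made up for illustration, and the rounding is only meant to approximate OpenCV's.

```python
import numpy as np

def blend(src1, alpha, src2, beta, gamma=0.0):
    """NumPy sketch of src1*alpha + src2*beta + gamma, clipped to uint8."""
    mixed = src1.astype(np.float64) * alpha + src2.astype(np.float64) * beta + gamma
    return np.clip(np.rint(mixed), 0, 255).astype(np.uint8)

# Two flat hypothetical images: one bright, one darker
a = np.full((2, 2), 200, dtype=np.uint8)
b = np.full((2, 2), 100, dtype=np.uint8)

# With beta = 1 - alpha the result moves from b (alpha = 0) to a (alpha = 1)
for alpha in (0.0, 0.5, 1.0):
    print(alpha, blend(a, alpha, b, 1 - alpha)[0, 0])
```

Choosing beta = 1 - alpha keeps the weights summing to 1, so the overall brightness stays between that of the two inputs during the transition.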

#5

ReneeBK posted on 2017-6-5 03:48:14

Splitting and merging image colour channels
On several occasions, we may be interested in working separately with the red, green, and blue channels. For example, we might want to build a histogram for every channel of an image.
Note
We will work separately with the different channels in Chapter 8, Histograms, Contours, Morphological Transformations, and Performance Measurement.
Here, cv2.split() is used to split an image into separate intensity arrays, one per color channel, whereas cv2.merge() is used to merge such arrays back into a single multi-channel array, that is, a color image.
The following example demonstrates this:
import cv2
img = cv2.imread('/home/pi/book/test_set/4.2.03.tiff', 1)
# OpenCV stores the channels in BGR order
b, g, r = cv2.split(img)
# Each channel is a single-channel array, so it is displayed in grayscale
cv2.imshow('Blue Channel', b)
cv2.imshow('Green Channel', g)
cv2.imshow('Red Channel', r)
# Merging the channels in the same order restores the original image
img = cv2.merge((b, g, r))
cv2.imshow('Merged Output', img)
cv2.waitKey(0)
cv2.destroyAllWindows()
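Since an OpenCV image is a NumPy array, the same idea can be sketched with plain indexing and stacking. This is an illustrative NumPy equivalent on a tiny made-up image, not a replacement for cv2.split() and cv2.merge().

```python
import numpy as np

# Hypothetical 2x2 BGR image: every pixel is (blue=10, green=20, red=30)
img = np.zeros((2, 2, 3), dtype=np.uint8)
img[:, :] = (10, 20, 30)

# Splitting: take each channel plane along the last axis
b, g, r = img[:, :, 0], img[:, :, 1], img[:, :, 2]

# Merging: stack the planes back along a new last axis
merged = np.dstack((b, g, r))

print(b.shape, merged.shape)
print(np.array_equal(merged, img))
```

Note that NumPy indexing returns views of the original array, so modifying b in place would also change img.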

#6

ReneeBK posted on 2017-6-5 03:48:43

Creating a negative of an image
In mathematical terms, the negative of an image is the inversion of its colors. For a grayscale image, it is even simpler: the negative is just the intensity inversion, obtained by taking the complement of each intensity from 255. A pixel value ranges from 0 to 255, so negation means subtracting the pixel value from the maximum value, 255. The code is as follows:
import cv2
img = cv2.imread('/home/pi/book/test_set/4.2.07.tiff')
grayscale = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
# Subtract each intensity from 255; the result always stays within 0 to 255, so abs() is not needed
negative = 255 - grayscale
cv2.imshow('Original', img)
cv2.imshow('Grayscale', grayscale)
cv2.imshow('Negative', negative)
cv2.waitKey(0)
cv2.destroyAllWindows()
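The arithmetic can be checked on a handful of values. The sketch below uses a small made-up grayscale array and also shows that, for 8-bit data, the complement from 255 is the same as flipping every bit.

```python
import numpy as np

# Hypothetical 2x2 grayscale image covering dark, mid, and bright intensities
gray = np.array([[0, 64], [128, 255]], dtype=np.uint8)

# Complement of each intensity from 255
negative = 255 - gray
print(negative)

# For uint8 data this is the same as flipping every bit
print(np.array_equal(negative, np.bitwise_not(gray)))

# Applying the negative twice restores the original
print(np.array_equal(255 - negative, gray))
```

Black (0) becomes white (255) and vice versa, while mid-gray values stay close to mid-gray.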

#7

ReneeBK posted on 2017-6-5 03:50:00

Logical operations on images
OpenCV provides bitwise logical operation functions for images. We will have a look at the functions that provide the bitwise logical AND, OR, XOR (exclusive OR), and NOT (inversion) functionality. These functions can be better demonstrated visually with grayscale images. I am going to use barcode images in horizontal and vertical orientation for demonstration. Let's have a look at the following code:
import cv2
import matplotlib.pyplot as plt
# Flag 0 loads both images in grayscale mode
img1 = cv2.imread('/home/pi/book/test_set/Barcode_Hor.png', 0)
img2 = cv2.imread('/home/pi/book/test_set/Barcode_Ver.png', 0)
not_out = cv2.bitwise_not(img1)
and_out = cv2.bitwise_and(img1, img2)
or_out = cv2.bitwise_or(img1, img2)
xor_out = cv2.bitwise_xor(img1, img2)
titles = ['Image 1', 'Image 2', 'Image 1 NOT', 'AND', 'OR', 'XOR']
images = [img1, img2, not_out, and_out, or_out, xor_out]
# Show the six images in a 2x3 grid without axis ticks
for i in range(6):
    plt.subplot(2, 3, i + 1)
    plt.imshow(images[i], cmap='gray')
    plt.title(titles[i])
    plt.xticks([])
    plt.yticks([])
plt.show()
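To make the results easy to verify by hand, the following sketch applies the same four operations with NumPy to two tiny made-up black-and-white arrays, standing in for the horizontal and vertical barcode images.

```python
import numpy as np

# Two tiny binary "images": 255 is white, 0 is black (hypothetical data)
horizontal = np.array([[255, 255], [0, 0]], dtype=np.uint8)
vertical = np.array([[255, 0], [255, 0]], dtype=np.uint8)

not_out = np.bitwise_not(horizontal)           # white and black swap
and_out = np.bitwise_and(horizontal, vertical)  # white only where both are white
or_out = np.bitwise_or(horizontal, vertical)    # white where either is white
xor_out = np.bitwise_xor(horizontal, vertical)  # white where exactly one is white

for name, result in (('NOT', not_out), ('AND', and_out), ('OR', or_out), ('XOR', xor_out)):
    print(name)
    print(result)
```

With pure black-and-white inputs the operations behave like set operations on the white regions: AND is the intersection, OR the union, and XOR the symmetric difference.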

#8

ReneeBK posted on 2017-6-5 04:04:02

Colorspaces and conversions
A colorspace is a mathematical model used to represent colors. Usually, colorspaces are used to represent colors in numerical form and to perform mathematical and logical operations on them. In this book, the colorspaces we mostly use are BGR (OpenCV's default colorspace), RGB, HSV, and grayscale. BGR stands for blue, green, and red. HSV represents colors in Hue, Saturation, and Value format. OpenCV has a function, cv2.cvtColor(img, conv_flag), that allows us to change the colorspace of an image (img); the source and target colorspaces are indicated by the conv_flag parameter.
If you remember, in Chapter 2, Working with Images, Webcams, and GUI, we discovered that OpenCV loads images in BGR format and matplotlib uses the RGB format for images. So, before displaying an image with matplotlib, we need to convert an image from BGR to RGB colorspace.
Take a look at the following code. The program reads the image in color mode using cv2.imread(), which imports the image in the BGR colorspace. Then, it converts it to RGB using cv2.cvtColor(), and finally, it uses matplotlib to display the image:
import cv2
import matplotlib.pyplot as plt
# cv2.imread() returns the image in BGR order
img = cv2.imread('/home/pi/book/test_set/4.2.07.tiff', 1)
# matplotlib expects RGB, so convert before displaying
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
plt.imshow(img)
plt.title('COLOR IMAGE')
plt.xticks([])
plt.yticks([])
plt.show()
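For a three-channel image, converting between BGR and RGB amounts to swapping the first and third channels. The sketch below illustrates this with NumPy on a single made-up pixel. It only demonstrates the channel swap and is not how cv2.cvtColor() is implemented.

```python
import numpy as np

# One pure-blue pixel in BGR order (hypothetical data)
bgr = np.array([[[255, 0, 0]]], dtype=np.uint8)

# Reversing the last axis swaps the first and third channels
rgb = bgr[:, :, ::-1]

print(bgr[0, 0], rgb[0, 0])

# Swapping twice returns the original channel order
print(np.array_equal(rgb[:, :, ::-1], bgr))
```

If the BGR array were passed to matplotlib unconverted, this blue pixel would be interpreted as (red=255, green=0, blue=0) and shown as red.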

#9

ReneeBK posted on 2017-6-5 04:07:05

Tracking in real time based on color
Let's study a real-life application of this concept. In HSV format, it's much easier to recognize a color range. To track an object of a specific color, we define a color range in HSV, convert the captured image to HSV, and then check which parts of that image fall within the HSV range of interest. We can use the cv2.inRange() function to achieve this. This function takes an image and the lower and upper bounds of the color range, and checks the range criteria for each pixel. If the pixel value falls within the given color range, the corresponding pixel in the output image is 255; otherwise it is 0, thus creating a binary mask.
We can then use bitwise_and() with this binary mask to extract the color range we're interested in. Take a look at the following code to understand this concept:
import numpy as np
import cv2
cam = cv2.VideoCapture(0)
while True:
    ret, frame = cam.read()
    # Stop if the camera did not return a frame
    if not ret:
        break
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
    # Mask is 255 where the HSV pixel lies within the bounds (roughly green hues here), else 0
    image_mask = cv2.inRange(hsv, np.array([40, 50, 50]), np.array([80, 255, 255]))
    # Keep only the pixels of the frame that the mask selects
    output = cv2.bitwise_and(frame, frame, mask=image_mask)
    cv2.imshow('Original', frame)
    cv2.imshow('Output', output)
    # Exit when the Esc key (code 27) is pressed
    if cv2.waitKey(1) == 27:
        break
cv2.destroyAllWindows()
cam.release()
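The masking step can be tried without a camera. The sketch below is a NumPy emulation of the per-pixel range test on three made-up HSV pixels, using the same bounds as the program above. The helper in_range is an illustrative name, not an OpenCV function, and for brevity the mask is applied to the HSV array itself rather than to a BGR frame.

```python
import numpy as np

def in_range(img, lower, upper):
    """NumPy sketch of a per-pixel range test: 255 where every channel is within bounds."""
    inside = np.all((img >= lower) & (img <= upper), axis=-1)
    return inside.astype(np.uint8) * 255

# Hypothetical 1x3 HSV image: a vivid green pixel, a red-hued pixel, a washed-out green pixel
hsv = np.array([[[60, 200, 200], [0, 200, 200], [60, 20, 200]]], dtype=np.uint8)
lower = np.array([40, 50, 50])
upper = np.array([80, 255, 255])

mask = in_range(hsv, lower, upper)
print(mask)

# Zero out every pixel the mask rejects, similar to bitwise_and() with mask=
selected = np.where(mask[..., None] == 255, hsv, 0).astype(np.uint8)
print(selected[0])
```

Only the first pixel survives: the second fails the hue test and the third fails the saturation test, which shows that all three channels must lie within the bounds.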

#10

小陆家嘴 posted on 2017-6-5 07:49:38

thanks for sharing
