Introduction to Computer Vision

Jul 28 ·4min read

A basic explanation of Computer Vision beyond all the media noise and glamour.

“I do not fear computers. I fear lack of them.” — Isaac Asimov

Computer vision has been a popular reoccurring term for the past decade, although its popularity has oscillated over time from an unheard of subject to hot news. As a result of becoming a trending topic in recent years, the understanding of what Computer vision entails has been somewhat noisy. Therefore the purpose of this article is to break down the term Computer vision and analyse its component, thereby providing a baseline understanding of what Computer vision is.

To expand on the topic of Computer vision, we first we need to analyse the components of the term (‘Computer’ and ‘Vision’) and define them.

A computer can be defined as an electronic machine capable of performing various processes, calculation, and operations from sets of instructions directed by software or hardware.

So vision, more specifically, the visual perception through sight can be defined as the understanding of the local environment through the illumination of objects within that environment via the visible light spectrum.

fy2u63a.jpg!web

Combining the two-term definitions, rudimentary explanation would be that Computer vision is how machines try to understand what they see to achieve a goal.

6J3Ibui.png!web

We can expand on the definitions above by stating that Computer Vision is the process by which a machine or a system generates an understanding of visual information by invoking one or more algorithms acting on the information provided. The understanding are then translated into decisions, classifications, pattern observation, and many more.

Translation of collated understanding is then utilised in applications, hardware, and software and can take many forms such as the following:

Object detection: Identifying objects of interest (cats, dogs, cars) in digital images or even videos
Optical character recognition: Translating images of text that are written or typed into a machine-encoded format
Fingerprint recognition: Using pattern information of the human fingerprint to make a comparison between a fingerprint source and a fingerprint that is a target.

BR7nymb.png!web

jEvmUbV.png!web

nyEZbei.jpg!web

When introducing Computer vision to the masses, we need to include the approaches that have been taken in order to enable systems to reason about developing understanding and as a result, developing useful applications of Computer vision.

There are two main strategies a Computer vision system can apply to derive understanding from the information its has been provided with. They are namely: Bottom-Up and Top-Down approaches.

The bottom-up approach involves using the understanding of information accumulated to conduct further understanding of some arbitrary observation; eventually, all the accumulated understanding leads to a solution or general understanding of the entirety of the observation object.

An example of where a Bottom-Up approach is used in a Computer vision application is in Automatic number plate recognition. This type of application seems to be a reasonable component of a traffic speeding camera.

Automatic number plate recognition(ANPR) via the bottom-up approach work by passing the visual information (number plate) into our computer vision system. The system proceeds to make some form of understanding by identifying the edges in the numbers on the plates. From the edge information, we proceed one level up and start to identify lines by joining up edges (here we can see the transfer of understanding from one level to another). After that, the lines can be joined up to form shapes, and then finally, we observe characters by identifying areas where lines meet, and edges close.

The top-down approach applies background knowledge to generate an understanding from an observation. The background knowledge acts as a referential guide for the selection of parameters that fits a model (similar to the approach of Deep Learning techniques). This approach can be briefly summarized as the process by which an image is broken down into sub-components, and an understanding of the fragmented information is obtained to present an understanding of the entirety of the image.

M3AzaaY.png!web

From this article, we understand the definition of Computer Vision and its components. Also mentioned are some areas where computer vision techniques are applied. Besides this, we have explored approaches computer vision techniques within these systems can take when trying to derive an understanding from the information.

From here on, you can look into some basic low-level image processing techniques such as edge detection, noise reduction, image sharpening, etc.

Introduction to Computer Vision

Introduction to Computer Vision

Recommend

Graph Design UI by React - Graphical Model Lab

HTTPS 运行原理解析

「走进k8s」Docker 基本命令（二）

使用 Docker 运行 MySQL 实例

马斯克想造个娱乐中心？他表示特斯拉可以用来追剧了

如何让AI画一幅水墨画？

你相信10000小时定律？运动场上未必如此

QQ、微信等账号可以继承吗？看完你就知道了！

TFBOYS六周年演唱会门票都去哪了？

go 泛型出炉，看起来还是不错的

About Joyk