Project 1(Computer Vision): Images of the Russian Empire

阅读量：

Background

Sergei Mikhailovich Prokudin-Gorskii (1863-1944) was a pioneering photographer whose work anticipated technological advancements in the field. He envisioned color photography as the next big thing and devised an innovative method involving capturing black-and-white images through red, green, and blue filters before projecting them in color to reconstruct vibrant images. At that time, color printing was fraught with challenges. Prokudin-Gorskii's greatest achievement came when he became famous for producing monochrome negatives using red, green, and blue filters. His artistic contributions were further highlighted when he received royal approval and financial backing to traverse vast landscapes across Russia, documenting them with his groundbreaking multicolored photographs. The Russian Empire’s cultural significance inspired him to seek recognition for his work. He obtained royal approval and financial backing to traverse vast landscapes across Russia, documenting them with his groundbreaking multicolored photographs. His RGB glass plate negatives were purchased in 1948 by the Library of Congress. Their digital archiving has made these negatives accessible online.

Photo Processing

Split and Merge

Since the images are arranged from top to bottom in BGR order, I divided each image into three segments and merged them to produce the colorized image.

Aligning

In order to overlay the 3 BGR images in alignment, there are some ways.

Sum of squared differences (SSD) between the intensities of the two features and the latter computes the correlation between the intensity values of the two features.
Mean squares method and the normalized correlation （NCC）
Enhanced Correlation Coefficient (ECC)

我采用ECC算法来对齐图像。该算法基于OpenCV 3中的ECC图像对齐方法，并参考了Evangelidis和Psarakis于2008年发表的一篇论文Parametric Image Alignment using Enhanced Correlation Coefficient Maximization。他们提出了一种新的相似性度量——Enhanced Correlation Coefficient（简称ECC），用于估计运动模型的参数。这种方法具有两个主要优势。

Unlike conventional pixel-based similarity measures that rely on differences in pixel intensities, ECC demonstrates remarkable robustness against variations in image brightness and contrast.
Despite the objective function being a complex nonlinear function of the parameters, their developed iterative method for solving the optimization problem remains linear. Meaningwise, they transformed what initially seemed like a computationally intensive challenge into an approach that can be solved more efficiently through iteration.

Automatic Cropping

Canny Edge Detector

Canny是一种非常有效的方法用于从不同视觉物体中提取有用的结构信息。
我应用它来获取边缘，在检测边缘时只关注边缘区域。
目的是裁剪白色、黑色或其他颜色的边框。
相反地，我将图像分为四个部分：顶部、底部、左侧和右侧。
每个部分仅包含Step2图像高度或宽度的10%。

(left)

(right)

Probabilistic Hough Transform

The Hough technique is used for identifying shapes. If a shape can be represented by a mathematical expression, then it can be detected in an image using the Hough Transform. Straight lines represent the simplest case; I employ the Probabilistic Hough Transform because it provides a more efficient approach compared to traditional Hough Transform for detecting curves.

By enhancing its intuitiveness for users to comprehend and utilize effectively. By first identifying all line segments across the entire Canny edge map and subsequently projecting these line segments onto a colorized version of the original image. However, in our project implementation, we divided these line segments into four distinct regions: top, bottom, left, and right as previously outlined.

Cropping

Once detected, I select specific regions from each image section. Specifically, I choose the lowest line from the upper section, and then select corresponding lines from other sections such as top, bottom, right and left. After that, I trim along these key edges to achieve uniformity.

Image Pyramid

When I try to run the above algorithm on a large image, the procedure was broken down. However, for a 500 X 500 image, it cost no more than 1 second. Thus, the idea is to resize the image by a factor of 0.5 recursively until the size of height is small than 500px and record the recurvise times N. If I resize the image 1 time by factor 0.5, the height and width [x, y] will become [0.5x, 0.5y]

Following this step, I obtain a small image and apply the aforementioned algorithm to it. Particularly, in ECC alignment, while obtaining a warp_matrix from the small image, I multiply this matrix by [[1 1 2]; [1 1 2]] for N iterations and then apply it to the large image to thereby expand the X,Y offsets.

Similarly, as I apply the Canny algorithm to small images, it necessitates multiplying 2^N times on either the X or Y axis of lines when cropping the image.

(partial)

There are some results:

Code:

https://gitee.com/sengo/Russian-Empire

Reference:

1.https://www.learnopencv.com/image-alignment-ecc-in-opencv-c-python/

Python OpenCV 教程面向Python的OpenCV官方指南中文版 https://github.com/makelove/OpenCV-Python-Tutorial/blob/master/OpenCV-Python-Tutorial-中文版.pdf
http://robertpless.com/classes/ComputerVision/project1/description.html

4.https://en.wikipedia.org/wiki/Canny_edge_detector

5.https://en.wikipedia.org/wiki/Hough_transform

6.http://crcv.ucf.edu/videos/lectures/2012.php

7.http://www.jb51.net/article/33274.htm

全部评论 (0)

还没有任何评论哟~

Project 1(Computer Vision): Images of the Russian Empire

Background SergeiMikhailovichProkudinGorskii18631944wasaphotographeraheadofhistime.Hesawcolorphotogr...

Project 2 (Computer Vision) : Hybrid Images

Overview ThegoalofthispartoftheassignmentistocreatehybridimagesusingtheapproachdescribedintheSIGGRAP...

Mastering the Advanced Techniques of Computer Vision

作者：禅与计算机程序设计艺术 1.背景介绍在过去的几年里，无论是在科技还是商业领域都发生了翻天覆地的变化，计算机视觉技术也从应用落到了基础科研上。深度学习、图像处理、机器学习等技术都成为各行各业的重...

[Computer Vision 3] The Plenoptic Function and the Elements of Early Vision

ThePlenopticFunction Theimageinformationavailablefromasingleviewingposition,isdefinedbythepencilofli...

Project 1: The Game of Hog

Problem0 Problem1 Problem2 Problem3 Problem4 Problem5 Problem6 Problem8 Problem9 Problem10 Problem11

Arch2030 A Vision of Computer Architecture Research over the Next 15 Years

Arch2030:AVisionofComputerArchitectureResearchovertheNext15Years2016,MarkD.Hillreport 1\.新的挑战：如何为信息技...

CS61A Project 1: The Game of Hog

项目简介：为骰子游戏Hog开发一个模拟器和多种策略，需要同时使用控制语句和高阶函数（Lecture14） Phase1:Simulator 按照HintVideo里给的Pseudocode转成代码（注...

Bag-of-words model in computer vision

一、Bagofwordsmodel Bagofwordsmodel是用于自然语言处理和信息检索中的一种简单的文档表示方法。通过这一模型，一篇文档可以通过统计所有word的数目来表示，这种方法不考虑语法...

(Inceptionv3)Rethinking the Inception Architecture for Computer Vision

翻译论文汇总：https://github.com/SnailTyan/deeplearningpaperstranslation RethinkingtheInceptionArchitecture...

CS 61A 2020 Fall Project 1: The Game of Hog

CS61APresentsTheGameofHog. fromdiceimportsixsided,foursided,maketestdice fromucbimportmain,trace,int...

是否确定退出登录?

Project 1(Computer Vision): Images of the Russian Empire

Background

全部评论 (0)

相关文章推荐

Project 1(Computer Vision): Images of the Russian Empire

Project 2 (Computer Vision) : Hybrid Images

Mastering the Advanced Techniques of Computer Vision

[Computer Vision 3] The Plenoptic Function and the Elements of Early Vision

Project 1: The Game of Hog

Arch2030 A Vision of Computer Architecture Research over the Next 15 Years

CS61A Project 1: The Game of Hog

Bag-of-words model in computer vision

(Inceptionv3)Rethinking the Inception Architecture for Computer Vision

CS 61A 2020 Fall Project 1: The Game of Hog