2023年全國(guó)碩士研究生考試考研英語(yǔ)一試題真題(含答案詳解+作文范文)_第1頁(yè)
已閱讀1頁(yè),還剩5頁(yè)未讀, 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說(shuō)明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡(jiǎn)介

1、Pseudo-polar based estimation of large translations rotations and scalings in imagesYosi Keller Amir Averbuch Moshe IsraeliDepartment of Mathematics Department of Computer Science Department of Computer Science Yale Univ

2、rsity Tel-Aviv University Technion Institute of Technology New Haven, CT, USA Tel-Aviv, Israel Haifa, Israel yosi.keller@yale.eduAbstractOne of the major challenges related to image registration is the estimation of larg

3、e motions without prior knowledge. This paper presents a Fourier based approach that estimates large translation, scale and rotation motions. The algorithm uses the pseudo-polar transform to achieve substantial im- prove

4、d approximations of the polar and log-polar Fourier transforms of an image. Thus, rotation and scale changes are reduced to translations which are estimated using phase correlation. By utilizing the pseudo-polar grid we

5、increase the performance (accuracy, speed, robustness) of the reg- istration algorithms. Scales up to 4 and arbitrary rotation angles can be robustly recovered, compared to a maximum scaling of 2 recovered by the current

6、 state-of-the-art algo- rithms. The algorithm utilizes only 1D FFT calculations whose overall complexity is significantly lower than prior works. Experimental results demonstrate the applicability of these algorithms.1 I

7、ntroductionImage registration plays a vital role in many image pro- cessing applications such as video compression [1], video enhancement [2] and scene representation [3] to name a few. This problem was analyzed using va

8、rious computa- tional techniques, such as pixel domain Gradient methods [2], correlation techniques [15] and discrete Fourier (DFT) domain algorithms [6, 11]. Gradient methods based image registration algorithms are cons

9、idered to be the state-of-the- art. They may fail unless the two images are misaligned by only a moderate motion. Fourier based schemes, which are able to estimate relatively large rotation, scaling and transla- tion, ar

10、e often used as bootstrap for more accurate gradient methods. The basic notion related to Fourier based schemes is the shift property [18] of the Fourier transform which allows robust estimation of translations using the

11、 normal- ized phase-correlation algorithm [6, 9, 10]. Hence, in or-der to account for rotations and scaling, the image is trans- formed into a polar or log-polar Fourier grid (referred to as the Fourier-Mellin transform)

12、. Rotations and scaling are reduced to translations in these representations and can be estimated using phase-correlation. In this paper we propose to iteratively estimate the po- lar and log-polar DFT using the pseudo-p

13、olar FFT (PPFFT) [19]. The resulting algorithm is able to robustly register im- ages rotated by arbitrary angles and scaled up to a factor of 4. It should be noted that the maximum scale factor re- covered in [11] and [1

14、6] was 2.0 and 1.8, respectively. In particular, the proposed algorithm does not result to inter- polation in either spatial or Fourier domain. Only 1D FFT operations are used, making it much faster and especially suited

15、 for real-time applications. The rest of paper is organized as follows: Prior results related to FFT based image registration are given in Section 2, while the proposed algorithm, is presented in Section 3. Experimental

16、results are discussed in Section 4 and final conclusions are given in Section 5.2 Previous related work2.1 Translation estimationThe basis of the Fourier based motion estimation is the shift property [18] of the Fourier

17、transform. Denote byF ff (x, y)g , b f (ωx, ωy) (1)the Fourier transform of f (x, y). Then,F ff (x + ¢x, y + ¢y)g = b f (ωx, ωy) ej(ωx¢x+ωy¢y). (2) Equation 2 can be used for the estimation of image t

18、ransla- tion [6, 10]. Assume the images I1 (x, y) and I2 (x, y) have some overlap thatI1 (x + ¢x, y + ¢y) = I2 (x, y) . (3)Proceedings of the IEEE Workshop on Motion and Video Computing (WACV/MOTION’05) 0-7695

19、-2271-8/05 $ 20.00 IEEE rotation and translation estimation algorithm operates as follows:1. Let (m1, l1) and (m2, l2) be the sizes of I1 (i, j) and I2 (i, j) , respectively. Then, at iteration n = 0, I1 (i, j) and I2 (i

20、, j) are zero padded such thatm1 = l1 = m2 = l2 = 2k, k 2 Z. (12)2. The magnitudes MP P 1 ¡ θi, rj ¢ and MP P 2 ¡ θi, rj ¢ ofthe PPFFTs of I(n) 1 (i, j) and I2 (i, j) are calculated, respectively.3. T

21、he polar DFTs ,magnitudes c MPolar 1 ¡ θi, rj ¢ and c MPolar 2 ¡ θi, rj ¢ of I(n) 1 (i, j) and I2 (i, j) are substi-tuted by MPP 1 ¡θi, rj ¢ and MP P 2 ¡θi, rj ¢ respectively.4. Th

22、e translation along the ?! θ axis of MPP 1 ¡ θi, rj ¢and MP P 2 ¡θi, rj ¢ is estimated using phase correla- tion. The result is denoted by ¢θn.5. Let θn be the accumulated rotation angle estimate

23、d at iteration nθn ,n Xi=0 ¢θi = θn? 1 + ¢θn.Then, the input image I1 (i, j) is rotated by θn (around the center of the image) using the FFT based image ro- tation algorithm described in [4]. This rotation sche

24、me is accurate and fast since only 1D FFT operations are usedI(n+1) 1 (θ, r) = I(0) 1 (θ + θn, r) , n = 1, . . .The rotation can be conducted around any pixel. We recommend to use the central pixel of I1 (i, j) such that

25、 the bounding rectangular of the rotated image will be as small as possible.6. Steps 2-5 are reiterated until the angular refinement term ¢θn is smaller than a predefined threshold εθ, i.e. j¢θnj 0). The polar

26、 axis is approximated using the same procedure as in section 3.1, while the radial axis is approximated using nearest-neighbor interpolation.4. The relative translation between c MLog? Polar 1 (i,j) and c MLog? P olar 2

27、(i,j) is recovered by a 2D phase correla-tion on the ?! θ and ?! r axes.5. Let ¢θn and ¢rn be the rotation angle and the scaling value estimated at iteration n, respectively. Then, the input image I1 (x, y) is

28、rotated (around the center of the image) [4] and then scaled using DFT domain zero paddingI(n+1) 1 (θ, r) = I(0) 1 (θ + θn, r ¢ rn) (16)whereθn =n Xi=0 ¢θi = θn? 1 + ¢θnrn =n Yi=0 ¢ri = rn? 1 ¢ &

溫馨提示

  • 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 眾賞文庫(kù)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

最新文檔

評(píng)論

0/150

提交評(píng)論