TY - JOUR
T1 - Robust region-of-interest determination based on user attention model through visual rhythm analysis
AU - Chi, Ming Chieh
AU - Yeh, Chia Hung
AU - Chen, Mei Juan
N1 - Funding Information:
Manuscript received October 26, 2007; revised April 4, 2008 and July 4, 2008. First version published May 12, 2009; current version published July 22, 2009. This work was supported in part by the National Science Council of the Republic of China, under Grants NSC 96-2221-E-259-011 and NSC 96-2628-E-110-020-MY2. This paper was recommended by Associate Editor W. Gao.
PY - 2009/7
Y1 - 2009/7
N2 - Region-of-interest (ROI) determination is very important for video processing and it is desirable to find a simple method to identify the ROI. Along this direction, this paper investigates a user attention model based on visual rhythm analysis for automatic determination of ROI in a video. The visual rhythm, which is an abstraction of a video, is a thumbnail version of a video by a 2-D image that captures the temporal information of a video sequence. Four sampling lines, including diagonal, anti-diagonal, vertical, and horizontal lines, are employed to obtain four visual rhythm maps in order to analyze the location of the ROI from video data. Via the variation on visual rhythms, object and camera motions can be efficiently distinguished. As for hardware design consideration, the proposed scheme can accurately extract ROI with very low computational complexity for real-time applications. The promising results from the experiments demonstrate that the moving object is effectively and efficiently extracted. Finally, we present a way to use flexible macroblock ordering in combination with ROI determination as a preprocessing step for H.264/AVC video coding, and experimental results show the quality of ROI regions is significantly enhanced.
AB - Region-of-interest (ROI) determination is very important for video processing and it is desirable to find a simple method to identify the ROI. Along this direction, this paper investigates a user attention model based on visual rhythm analysis for automatic determination of ROI in a video. The visual rhythm, which is an abstraction of a video, is a thumbnail version of a video by a 2-D image that captures the temporal information of a video sequence. Four sampling lines, including diagonal, anti-diagonal, vertical, and horizontal lines, are employed to obtain four visual rhythm maps in order to analyze the location of the ROI from video data. Via the variation on visual rhythms, object and camera motions can be efficiently distinguished. As for hardware design consideration, the proposed scheme can accurately extract ROI with very low computational complexity for real-time applications. The promising results from the experiments demonstrate that the moving object is effectively and efficiently extracted. Finally, we present a way to use flexible macroblock ordering in combination with ROI determination as a preprocessing step for H.264/AVC video coding, and experimental results show the quality of ROI regions is significantly enhanced.
KW - Content analysis
KW - Feature extraction
KW - Region-of-interest (ROI)
KW - User attention
KW - Visual rhythm
UR - http://www.scopus.com/inward/record.url?scp=68249112310&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=68249112310&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2009.2022822
DO - 10.1109/TCSVT.2009.2022822
M3 - Article
AN - SCOPUS:68249112310
SN - 1051-8215
VL - 19
SP - 1025
EP - 1038
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 7
M1 - 4914872
ER -