OpenIMAJ: Open Intelligent Multimedia Analysis

If you use OpenIMAJ in an academic publication, please cite the following paper:

Jonathon S. Hare, Sina Samangooei, and David P. Dupplaw. 2011. OpenIMAJ and ImageTerrier: Java libraries and tools for scalable multimedia analysis and indexing of images. In Proceedings of the 19th ACM international conference on Multimedia (MM ’11). ACM, New York, NY, USA, 691-694. DOI=10.1145/2072298.2072421 http://doi.acm.org/10.1145/2072298.2072421

The following is a list of the papers from which OpenIMAJ implements various algorithms and techniques. This list is also available in BibTeX here.

S. E. and M. Slaney. Construction And Evaluation Of A Robust Multifeature Speech/music Discriminator. Proc. ICASSP-97, Munich.. 1997.

R. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, ISBN: 0521540518. 2004.

H. Sp"ath. Fitting affine and orthogonal transformations between two sets of points.. Mathematical Communications. Croatian Mathematical Society, Division Osijek, Osijek; Faculty of Electrical Engineering, University of Osijek, Osijek. pp27-34. 2004.

C. Steger. On the Calculation of Moments of Polygons. August, 1996. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.29.8765&rep=rep1&type=pdf

P. J. Rousseeuw. Least Median of Squares Regression. Journal of the American Statistical Association. pp871, , 880. December, 1984. http://www.jstor.org/stable/2288718

A. Hyvrinen, J. Hurri and P. O. Hoyer. Natural Image Statistics: A Probabilistic Approach to Early Computational Vision.. Springer Publishing Company, Incorporated. 2009.

Z. Zhang, R. Deriche, O. Faugeras and Q. Luong. A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry . Artificial Intelligence . pp87 - 119. 1995. http://www.sciencedirect.com/science/article/pii/0004370295000224

C. J. Taylor, D. H. Cooper and J. Graham. Training models of shape from sets of examples. Proc. BMVC92, Springer-Verlag. pp9, , 18. 1992.

A. J. Bell and T. J. Sejnowski. The ‘Independent Components’ of Natural Scenes are Edge Filters.. VISION RESEARCH. pp3327, , 3338. 1997.

T. F. Cootes and C. J. Taylor. Statistical Models of Appearance for Computer Vision. October, 2001. http://isbe.man.ac.uk/~bim/Models/app_model.ps.gz

B. K. P. Horn, H. Hilden and S. Negahdaripour. Closed-Form Solution of Absolute Orientation using Orthonormal Matrices. JOURNAL OF THE OPTICAL SOCIETY AMERICA. pp1127-1135. 1988.

R. A. Fisher. The use of multiple measurements in taxonomic problems. Annals Eugen.. pp179, , 188. 1936.

J. Hare, S. Samangooei and D. Dupplaw. OpenIMAJ and ImageTerrier: Java Libraries and Tools for Scalable Multimedia Analysis and Indexing of Images. ACM Multimedia 2011. ACM. pp691-694. November, 2011. http://eprints.soton.ac.uk/273040/

A. A. Efros and T. K. Leung. Texture Synthesis by Non-Parametric Sampling. Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2. IEEE Computer Society. p1033. 1999. http://dl.acm.org/citation.cfm?id=850924.851569

W. Dong, Z. Wang and K. Li. High-Confidence Near-Duplicate Image Detection. ACM International Conference on Multimedia Retrieval. 2012.

Z. Zhang. A flexible new technique for camera calibration. Pattern Analysis and Machine Intelligence, IEEE Transactions on. pp1330-1334. Nov, 2000.

M. Everingham, J. Sivic and A. Zisserman. Hello! My name is... Buffy - Automatic naming of characters in TV video. In BMVC. 2006.

J. M. Saragih, S. Lucey and J. F. Cohn. Face alignment through subspace constrained mean-shifts. IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27 - October 4, 2009. IEEE. pp1034-1041. 2009.

M. Turk and A. Pentland. Face recognition using eigenfaces. Computer Vision and Pattern Recognition, 1991. Proceedings CVPR ’91., IEEE Computer Society Conference on. pp586 -591. jun, 1991.

F. Samaria and A. Harter. Parameterisation of a stochastic model for human face identification. Applications of Computer Vision, 1994., Proceedings of the Second IEEE Workshop on. pp138 -142. dec, 1994.

P. N. Belhumeur, J. P. Hespanha and D. J. Kriegman. Fisherfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection. IEEE Trans. Pattern Anal. Mach. Intell.. IEEE Computer Society. pp711, , 720. July, 1997. http://dx.doi.org/10.1109/34.598228

D. Ozkan and P. Duygulu. Finding people frequently appearing in news. Proceedings of the 5th international conference on Image and Video Retrieval. Springer-Verlag. pp173, , 182. 2006. http://dx.doi.org/10.1007/11788034_18

P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on. pp I, 511 , I, 518 vol.1. 2001.

X. Tan and B. Triggs. Enhanced local texture feature sets for face recognition under difficult lighting conditions. Trans. Img. Proc.. IEEE Press. pp1635-1650. June, 2010. http://dx.doi.org/10.1109/TIP.2010.2042645

K. Sandeep and A. N. Rajagopalan. Human Face Detection in Cluttered Color Images Using Skin Color and Edge Information. Electrical Engineering. Citeseer. 2002. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.12.730&rep=rep1&type=pdf

X. Tan and B. Triggs. Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions. Image Processing, IEEE Transactions on. pp1635 -1650. june , 2010.

B. Loni, M. Menendez, M. Georgescu, L. Galli, C. Massari, I. S. Altingovde, D. Martinenghi, M. Melenhorst, R. Vliegendhart and M. Larson. Fashion-focused creative commons social dataset. Proceedings of the 4th ACM Multimedia Systems Conference. ACM. pp72, , 77. 2013. http://doi.acm.org/10.1145/2483977.2483984

A. Krizhevsky and G. Hinton. Learning multiple layers of features from tiny images. Master’s thesis, Department of Computer Science, University of Toronto. Citeseer. 2009.

C. Yeh, Y. Ho, B. A. Barsky and M. Ouhyoung. Personalized Photograph Ranking and Selection System. Proceedings of ACM Multimedia. pp211-220. October, 2010.

X. Liu and J. Samarabandu. Multiscale Edge-Based Text Extraction from Complex Images. Multimedia and Expo, 2006 IEEE International Conference on. pp1721 -1724. july, 2006.

X. Tan and B. Triggs. Enhanced local texture feature sets for face recognition under difficult lighting conditions. Trans. Img. Proc.. IEEE Press. pp1635, , 1650. June, 2010. http://dx.doi.org/10.1109/TIP.2010.2042645

R. Achanta, S. Hemami, F. Estrada and S. S"usstrunk. Frequency-tuned Salient Region Detection. IEEE International Conference on Computer Vision and Pattern Recognition (CVPR). 2009. http://infoscience.epfl.ch/record/135217/files/1708.pdf

J. S. Pedro and S. Siersdorfer. Ranking and Classifying Attractiveness of Photos in Folksonomies. 18th International World Wide Web Conference. pp771, , 771. April, 2009. http://www2009.eprints.org/78/

N. Dalal and B. Triggs. Histograms of Oriented Gradients for Human Detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) - Volume 1 - Volume 01. IEEE Computer Society. pp886, , 893. 2005. http://dx.doi.org/10.1109/CVPR.2005.177

Y. Luo and X. Tang. Photo and Video Quality Evaluation: Focusing on the Subject. Proceedings of the 10th European Conference on Computer Vision: Part III. Springer-Verlag. pp386, , 399. 2008. http://dx.doi.org/10.1007/978-3-540-88690-7_29

T. Ojala, M. Pietikainen and D. Harwood. A Comparative Study of Texture Measures with Classification Based on Feature Distributions. Pattern Recognition. pp51-59. January, 1996.

Y. H. B. A. B. M. O. Che-Hua Yeh. Personalized Photograph Ranking and Selection System. Proceedings of ACM Multimedia. pp211-220. October, 2010.

P. N. Belhumeur, J. P. Hespanha and D. J. Kriegman. Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection. IEEE Trans. Pattern Anal. Mach. Intell.. IEEE Computer Society. pp711, , 720. July, 1997. http://dx.doi.org/10.1109/34.598228

P. F. Felzenszwalb and D. P. Huttenlocher. Efficient Graph-Based Image Segmentation. Int. J. Comput. Vision. Kluwer Academic Publishers. pp167-181. September, 2004. http://dx.doi.org/10.1023/B:VISI.0000022288.19776.77

A. Oliva and A. Torralba. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope. Int. J. Comput. Vision. Kluwer Academic Publishers. pp145, , 175. May, 2001. http://dx.doi.org/10.1023/A:1011139631724

X. Liu and J. Samarabandu. An edge-based text region extraction algorithm for indoor mobile robot navigation. Mechatronics and Automation, 2005 IEEE International Conference. pp 701 - 706 Vol. 2. July-1 Aug., 2005.

D. Hasler and S. Süsstrunk. Measuring Colourfulness in Natural Images. Proc. {IS}&{T}/{SPIE} {E}lectronic {I}maging 2003: {H}uman {V}ision and {E}lectronic {I}maging {VIII}. pp87, , 95. 2003. http://infoscience.epfl.ch/record/33994/files/HaslerS03.pdf?version=1

Y. Luo and X. Tang. Photo and Video Quality Evaluation: Focusing on the Subject. Proceedings of the 10th European Conference on Computer Vision: Part III. Springer-Verlag. pp386-399. 2008. http://dx.doi.org/10.1007/978-3-540-88690-7_29

K. Huang, Q. Wang and Z. Wu. Natural color image enhancement and evaluation algorithm based on human visual system. Comput. Vis. Image Underst.. Elsevier Science Inc.. pp52, , 63. jul, 2006. http://dx.doi.org/10.1016/j.cviu.2006.02.007

T. Ojala, M. Pietik"ainen and T. M"aenp"a"a. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Trans. Pattern Anal. Mach. Intell.. IEEE Computer Society. pp971, , 987. July, 2002. http://dx.doi.org/10.1109/TPAMI.2002.1017623

T. F. Cootes and C. J. Taylor. Statistical Models of Appearance for Computer Vision. October, 2001. http://isbe.man.ac.uk/~bim/Models/app_model.ps.gz

T. F. Cootes, C. J. Taylor and A. Lanitis. Active shape models: Evaluation of a multi-resolution method for improving image search. Proc British Machine Vision Conference. BMVA Press. pp327, , 336. 1994. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.141.4937&rep=rep1&type=pdf

M. B. Stegmann, B. K. Ersbøll and R. Larsen. FAME – A Flexible Appearance Modelling Environment. IEEE Trans. on Medical Imaging. IEEE. pp1319-1331. 2003.

Y. Ke, X. Tang and F. Jing. The Design of High-Level Features for Photo Quality Assessment. Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1. IEEE Computer Society. pp419, , 426. 2006. http://dx.doi.org/10.1109/CVPR.2006.303

B. Epshtein, E. Ofek and Y. Wexler. Detecting text in natural scenes with stroke width transform. Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. pp2963-2970. 2010.

S. Suzuki and K. Abe. Topological Structural Analysis of Digitized Binary Image by Border Following. Computer Vision, Graphics and Image Processing. pp32-46. January, 1985.

T. Ojala, M. Pietikainen and T. Maenpaa. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. Pattern Analysis and Machine Intelligence, IEEE Transactions on. pp971 -987. jul, 2002.

A. Bosch, A. Zisserman and X. Munoz. Representing shape with a spatial pyramid kernel. Proceedings of the 6th ACM international conference on Image and video retrieval. ACM. pp401, , 408. 2007. http://doi.acm.org/10.1145/1282280.1282340

T. F. Cootes and C. J. Taylor. Active Shape Models. in Proceedings of the British Machine Vision Conference. 1992.

M. Turk and A. Pentland. Face recognition using eigenfaces. Computer Vision and Pattern Recognition, 1991. Proceedings CVPR ’91., IEEE Computer Society Conference on. pp586 -591. jun, 1991.

F. Perronnin, J. S’anchez and T. Mensink. Improving the Fisher Kernel for Large-scale Image Classification. Proceedings of the 11th European Conference on Computer Vision: Part IV. Springer-Verlag. pp143, , 156. 2010. http://dl.acm.org/citation.cfm?id=1888089.1888101

J. Morel and G. Yu. ASIFT: A New Framework for Fully Affine Invariant Image Comparison. SIAM J. Img. Sci.. Society for Industrial and Applied Mathematics. 2009.

D. Lowe. Object recognition from local scale-invariant features. Proc. of the International Conference on Computer Vision ICCV. pp1150-1157. 1999.

H. Jegou, M. Douze, C. Schmid and P. Perez. Aggregating local descriptors into a compact image representation. Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. pp3304 -3311. june, 2010.

J. Hare, S. Samangooei and P. Lewis. Efficient clustering and quantisation of SIFT features: Exploiting characteristics of the SIFT descriptor and interest region detectors under image inversion. The ACM International Conference on Multimedia Retrieval (ICMR 2011). ACM Press. April, 2011.

M. Brown and D. Lowe. Invariant Features from Interest Point Groups. BMVC 2002: 13th British Machine Vision Conference. pp253, , 262. September, 2002.

J. Matas, O. Chum, M. Urban and T. Pajdla. Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing. pp761 - 767. 2004. http://www.sciencedirect.com/science/article/pii/S0262885604000435

W. Dong, Z. Wang and K. Li. High-Confidence Near-Duplicate Image Detection. ACM International Conference on Multimedia Retrieval. 2012.

G. J. Burghouts and J. Geusebroek. Performance evaluation of local colour invariants. Comput. Vis. Image Underst.. Elsevier Science Inc.. pp48, , 62. jan, 2009. http://dx.doi.org/10.1016/j.cviu.2008.07.003

K. E. A. van de Sande, T. Gevers and C. G. M. Snoek. Evaluating Color Descriptors for Object and Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence. pp1582, , 1596. 2010. http://www.science.uva.nl/research/publications/2010/vandeSandeTPAMI2010

Y. Cui, N. Hasler, T. Thorm"ahlen and H. Seidel. Scale Invariant Feature Transform with Irregular Orientation Histogram Binning. Proceedings of the 6th International Conference on Image Analysis and Recognition. Springer-Verlag. pp258, , 267. 2009.

F. Perronnin and C. Dance. Fisher Kernels on Visual Vocabularies for Image Categorization. Computer Vision and Pattern Recognition, 2007. CVPR ’07. IEEE Conference on. pp1-8. 2007.

D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV. pp91-110. January, 2004.

S. M. Smith. A new class of corner finder. Proc. 3rd British Machine Vision Conference. p139-148. 1992. http://users.fmrib.ox.ac.uk/~steve/susan/susan/node4.html

A. Telea. An Image Inpainting Technique Based on the Fast Marching Method.. J. Graphics, GPU, & Game Tools. pp23-34. 2004. http://dblp.uni-trier.de/db/journals/jgtools/jgtools9.html#Telea04

K. I. Laws. Rapid Texture Identification. Proc. SPIE Conf. Image Processing for Missile Guidance. pp376, , 380. 1980.

R. B. Blackman and J. W. Tukey. Particular Pairs of Windows. In The Measurement of Power Spectra, From the Point of View of Communications Engineering. Dover. pp98-99. 1959.

A. Telea. An Image Inpainting Technique Based on the Fast Marching Method.. J. Graphics, GPU, & Game Tools. pp23-34. 2004. http://dblp.uni-trier.de/db/journals/jgtools/jgtools9.html#Telea04

B. Epshtein, E. Ofek and Y. Wexler. Detecting text in natural scenes with stroke width transform. Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. pp2963-2970. 2010.

P. Perona and J. Malik. Scale-Space and Edge Detection Using Anisotropic Diffusion. IEEE Trans. Pattern Anal. Mach. Intell.. IEEE Computer Society. pp629, , 639. July, 1990. http://dx.doi.org/10.1109/34.56205

D. Schumacher. Graphics Gems III. Academic Press Professional, Inc.. pp8, , 16. 1992. http://dl.acm.org/citation.cfm?id=130745.130747

J. Morel and G. Yu. ASIFT: A New Framework for Fully Affine Invariant Image Comparison. SIAM J. Img. Sci.. Society for Industrial and Applied Mathematics. 2009.

P. Kovesi. Fast Almost-Gaussian Filtering. Digital Image Computing: Techniques and Applications (DICTA), 2010 International Conference on. pp121-125. Dec, 2010.

J. A. Sethian. A Fast Marching Level Set Method for Monotonically Advancing Fronts. Proc. Nat. Acad. Sci. pp1591, , 1595. 1995.

N. Otsu. A Threshold Selection Method from Gray-Level Histograms. Systems, Man and Cybernetics, IEEE Transactions on. pp62-66. 1979.

Y. Cai, W. Tong, L. Yang and A. G. Hauptmann. Constrained Keypoint Quantization: Towards Better Bag-of-Words Model for Large-scale Multimedia Retrieval. ACM International Conference on Multimedia Retrieval. 2012.

A. Ramanan and M. Niranjan. Resource-Allocating Codebook for Patch-based Face Recognition. IIS. 2009. http://eprints.ecs.soton.ac.uk/21401/

A. Kumar, P. Rai and H. D. III. Co-regularized Multi-view Spectral Clustering. Advances in Neural Information Processing Systems 24. pp1413, , 1421. 2011.

D. Nist’er and H. Stew’enius. Scalable Recognition with a Vocabulary Tree. CVPR. pp2161, , 2168. 2006.

F. Moosmann, E. Nowak and F. Jurie. Randomized Clustering Forests for Image Classification. IEEE PAMI. 2008. http://dx.doi.org/10.1109/TPAMI.2007.70822

A. Vedaldi and A. Zisserman. Efficient Additive Kernels via Explicit Feature Maps. Pattern Analysis and Machine Intelligence, IEEE Transactions on. pp480-492. 2012.

J. Hare and P. Lewis. Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlation and Semantic Spaces. Imaging and Printing in a Web 2.0 World; and Multimedia Content Access: Algorithms and Systems IV. SPIE. January, 2010. http://eprints.soton.ac.uk/268496/

A. Vedaldi and A. Zisserman. Efficient Additive Kernels via Explicit Feature Maps. Proceedings of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). 2010.

R. Fan, K. Chang, C. Hsieh, X. Wang and C. Lin. LIBLINEAR: A Library for Large Linear Classification. J. Mach. Learn. Res.. JMLR.org. pp1871, , 1874. june, 2008. http://dl.acm.org/citation.cfm?id=1390681.1442794

F. Orabona, C. Castellini, B. Caputo, L. Jie and G. Sandini. On-line independent support vector machines. Pattern Recognition. pp1402-1412. 2010.

A. F. T. Martins. The Geometry of Constrained Structured Prediction: Applications to Inference and Learning of Natural Language Syntax. 2012.

H. Jegou, M. Douze and C. Schmid. Product Quantization for Nearest Neighbor Search. IEEE Trans. Pattern Anal. Mach. Intell.. IEEE Computer Society. pp117, , 128. January, 2011. http://dx.doi.org/10.1109/TPAMI.2010.57

M. S. Charikar. Similarity estimation techniques from rounding algorithms. Proceedings of the thiry-fourth annual ACM symposium on Theory of computing. ACM. pp380, , 388. 2002. http://doi.acm.org/10.1145/509907.509965

P. Indyk and R. Motwani. Approximate nearest neighbors: towards removing the curse of dimensionality. Proceedings of the thirtieth annual ACM symposium on Theory of computing. ACM. pp604, , 613. 1998. http://doi.acm.org/10.1145/276698.276876

Q. Lv, M. Charikar and K. Li. Image similarity search with compact data structures. Proceedings of the thirteenth ACM international conference on Information and knowledge management. ACM. pp208, , 217. 2004. http://doi.acm.org/10.1145/1031171.1031213

M. Datar, N. Immorlica, P. Indyk and V. S. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. Proceedings of the twentieth annual symposium on Computational geometry. ACM. pp253, , 262. 2004. http://doi.acm.org/10.1145/997817.997857

M. Muja and D. G. Lowe. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration. International Conference on Computer Vision Theory and Application VISSAPP’09). INSTICC Press. pp331-340. 2009.

M. Lui and T. Baldwin. Cross-domain Feature Selection for Language Identification. in Proceedings of 5th International Joint Conference on Natural Language Processing. 2011.

J. Wiebe, T. Wilson and C. Cardie. Annotating expressions of opinions and emotions in language. . 2005.

T. Steiner, R. Verborgh, J. Gabarr’o Vall’es, R. Troncy, M. Hausenblas, R. Van de Walle and A. Brousseau. Enabling on-the-fly video shot detection on YouTube. WWW 2012, 21st International World Wide Web Conference Developer’s Track, April 16-20, 2012, Lyon, France. 04, 2012. http://www.eurecom.fr/publication/3676