Contents

-

Efficient (δ,γ)-Pattern-Matching with Don’t Cares

Yoan José Pinzén Ardila1, Manolis Christodoulakis2, Costas S. Tliopoulos2, Manal Mohamed2
1National University of Colombia Department of System Engineering and Industrial Engineering Ciudad Universitaria – Bogoté D.C.- Colombia Avenida Carrera 30 No 45-03
2King’s College London, Department of Computer Science, London WC2R 2LS, UK

Abstract

Here we consider string matching problems that arise naturally in applications to music retrieval. The δ-Matching problem calculates, for a given text T1..n and a pattern P1..m on an alphabet of integers, the list of all indices Iδ={1inm+1:maxj=1m{|PjTi+j1|δ}}. The γ-Matching problem computes, for given T and P, the list of all indices Iγ={1inm+1:j=1m|PjTi+j1|γ}. In this paper, we extend the current result on the different matching problems to handle the presence of “\emph{don’t care}” symbols. We present efficient algorithms that calculate Iδ, Iγ, and I(δ,γ)=IδIγ for pattern P with occurrences of “don’t cares”.

Keywords: 5-matching; -y-matching; wildcard matching; music in- formation retrieval.