Counting distinct squares in partial words

  • Francine Blanchet-Sadri
  • Robert Mercaş
  • Geoffrey Scott

Abstract

A well known result of Fraenkel and Simpson states that the number of distinct squares in a word of length n is bounded by 2n since at each position there are at most two distinct squares whose last occurrence start. In this paper, we investigate the problem of counting distinct squares in partial words, or sequences over a finite alphabet that may have some "do not know" symbols or "holes" (a (full) word is just a partial word without holes). A square in a partial word over a given alphabet has the form uu' where u is compatible with u, and consequently, such square is compatible with a number of full words over the alphabet that are squares. We consider the number of distinct full squares compatible with factors in a partial word with h holes of length n over a k-letter alphabet, and show that this number increases polynomially with respect to k in contrast with full words, and give bounds in a number of cases. For partial words with one hole, it turns out that there may be more than two squares that have their last occurrence starting at the same position. We prove that if such is the case, then the hole is in the shortest square. We also construct a partial word with one hole over a k-letter alphabet that has more than k squares whose last occurrence start at position zero.

Downloads

Download data is not yet available.
Published
2009-01-01
How to Cite
Blanchet-Sadri, F., Mercaş, R., & Scott, G. (2009). Counting distinct squares in partial words. Acta Cybernetica, 19(2), 465-477. Retrieved from https://cyber.bibl.u-szeged.hu/index.php/actcybern/article/view/3777
Section
Regular articles