Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    Efficient algorithms for two extensions of LPF table: the power of suffix arrays

    Access Status
    Fulltext not available
    Authors
    Crochemore, M.
    Iliopoulos, Costas
    Kubica, M.
    Rytter, W.
    Walen, T.
    Date
    2010
    Type
    Conference Paper
    
    Metadata
    Show full item record
    Citation
    Crochemore, M. and Iliopoulos, C. and Kubica, M. and Rytter, W. and Walen, T. 2010. Efficient algorithms for two extensions of LPF table: the power of suffix arrays, in Jan v Leeuwen, Anca Muscholl, David Peleg, Jaroslav Pokorny and Bernhard Rumpe (ed), 36th Conference on Current Trends in Theory and Practice of Computer Science,, Jan 23 2010, pp. 296-307. Spindleruv Mlyn, Czech Republic: Springer.
    Source Title
    Lecture notes in computer science, volume 5091: theory and practice of computer science - SOFSEM 2010
    Source Conference
    36th Conference on Current Trends in Theory and Practice of Computer Science,
    DOI
    10.1007/978-3-642-11266-9_25
    Additional URLs
    http://www.springerlink.com/content/5177t4t8k4m66112
    ISBN
    978-364205005-3
    School
    Digital Ecosystems and Business Intelligence Institute (DEBII)
    URI
    http://hdl.handle.net/20.500.11937/19459
    Collection
    • Curtin Research Publications
    Abstract

    Su?x arrays provide a powerful data structure to solve several questions related to the structure of all the factors of a string. We show how they can be used to compute e?ciently two new tables storing di?erent types of previous factors (past segments) of a string. The concept of a longest previous factor is inherent to Ziv-Lempel factorization of strings in text compression, as well as in statistics of repetitions and symmetries. The longest previous reverse factor for a given position i is the longest factor starting at i, such that its reverse copy occurs before, while the longest previous non-overlapping factor is the longest factor v starting at i which has an exact copy occurring before. The previous copies of the factors are required to occur in the pre?x ending at position i -1. We design algorithms computing the table of longest previous reverse factors (LPrF table) and the table of longest previous nonoverlapping factors (LPnF table). The latter table is useful to computerepetitions while the former is a useful tool for extracting symmetries. These tables are computed, using two previously computed read-only arrays (SUF and LCP) composing the su?x array, in linear time on anyinteger alphabet. The tables have not been explicitly considered before, but they have several applications and they are natural extensions of the LPF table which has been studied thoroughly before. Our results improve on the previous ones in several ways. The running time of the computation no longer depends on the size of the alphabet, which drops a log factor. Moreover the newly introduced tables store additional information on the structure of the string, helpful to improve, for example, gapped palindrome detection and text compression using reverse factors. computing their primitive roots. Applications of runs, despite their importance, are underrepresented in existing literature (approximately one page in the paper of Kolpakov & Kucherov, 1999). In this paper we attempt to ?ll in this gap. We use Lyndon words and introduce the Lyndon structure of runs as a useful tool when computing powers. In problems related to periods we use some versions of the Manhattan skyline problem.

    Related items

    Showing items related by title, author, creator and subject.

    • Lempel-Ziv factorization using less time & space
      Chen, G.; Puglisi, Simon; Smyth, B. (2008)
      For 30 years the Lempel-Ziv factorization LZ x of a string x = x[1..n] has been a fundamental data structure of string processing, especially valuable for string compression and for computing all the repetitions (runs) ...
    • Fast, Practical Algorithms for Computing All the Repeats in a String
      Puglisi, Simon; Smyth, William; Yusufu, M. (2010)
      Given a string x = x[1..n] on an alphabet of size α, and a threshold p min ≥ 1, we describe four variants of an algorithm PSY1 that, using a suffix array, computes all the complete nonextendible repeats in x of length p ...
    • Efficient prefix updates for IP router using lexicographic ordering and updateable address set
      Soh, Sieteng; Hiryanto, L.; Rai, S. (2008)
      Dynamic IP router table schemes, which have recently been proposed in the literature, perform an IP lookup or an online prefix update in O(log2|T|) memory accesses (MAs). In terms of lookup time, they are still slower ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.