Curtin University Homepage
  • Library
  • Help
    • Admin

    espace - Curtin’s institutional repository

    JavaScript is disabled for your browser. Some features of this site may not work without it.
    View Item 
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item
    • espace Home
    • espace
    • Curtin Research Publications
    • View Item

    An efficient sampling scheme for approximate processing of decision support queries

    194734_101202_ICEIS_2012_144.pdf (1.011Mb)
    Access Status
    Open access
    Authors
    Rudra, Amit
    Gopalan, Raj
    Achuthan, Narasimaha
    Date
    2012
    Type
    Conference Paper
    
    Metadata
    Show full item record
    Citation
    Rudra, Amit and Gopalan, Raj P. and Achuthan, N.R. 2012. An efficient sampling scheme for approximate processing of decision support queries, in Cordeiro, J., Maciaszek, L., Cuzzocrea, A. (ed), 14th International Conference on Enterprise Information Systems, Jun 28 2012, pp. 16-26. Wroclaw, Poland: INSTICC.
    Source Title
    Proceedings of ICEIS
    Source Conference
    14th International Conference on Enterprise Information Systems
    Remarks

    Publisher: SciTePress. (2012). ISBN: 9789898565105

    URI
    http://hdl.handle.net/20.500.11937/28648
    Collection
    • Curtin Research Publications
    Abstract

    Decision support queries usually involve accessing enormous amount of data requiring significant retrieval time. Faster retrieval of query results can often save precious time for the decision maker. Pre-computation of materialised views and sampling are two ways of achieving significant speed up. However, drawing random samples for queries on range restricted attributes has two problems: small random samples may miss relevant records and drawing larger samples from disk can be inefficient due to the large number of disk accesses required. In this paper, we propose an efficient indexing scheme for quickly drawing relevant samples for data warehouse queries as well as propose the concepts of database and sample relevancy ratios. We describe a method for estimating query results for range restricted queries using this index and experimentally evaluate the scheme using a relatively large real dataset. Further, we compute the confidence intervals for the estimates to investigate whether the results can be guaranteed to be within the desired level of confidence. Our experiments on data from a retail data warehouse show promising results. We also report the levels of accuracy achieved for various types of aggregate queries and relate them to the database relevancy ratios of the queries.

    Related items

    Showing items related by title, author, creator and subject.

    • Selecting adequate samples for approximate decision support queries
      Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2013)
      For highly selective queries, a simple random sample of records drawn from a large data warehouse may not contain sufficient number of records that satisfy the query conditions. Efficient sampling schemes for such queries ...
    • Picking adequate samples for approximate decision support queries using inverse SRSWOR
      Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2012)
      A simple random sample of records from a large data warehouse may not contain sufficient number of records that satisfy highly selective queries. Efficient sampling schemes for such queries involve using innovative ...
    • Approximate Query Processing on High Dimensionality Database Tables Using Multidimensional Cluster Sampling View
      Inoue, T.; Krishna, Aneesh; Gopalan, Raj (2016)
      Approximate query processing based on random sampling is one of the most useful methods for the efficient computation of large quantities of data kept in databases. However, small samples obtained through random sampling ...
    Advanced search

    Browse

    Communities & CollectionsIssue DateAuthorTitleSubjectDocument TypeThis CollectionIssue DateAuthorTitleSubjectDocument Type

    My Account

    Admin

    Statistics

    Most Popular ItemsStatistics by CountryMost Popular Authors

    Follow Curtin

    • 
    • 
    • 
    • 
    • 

    CRICOS Provider Code: 00301JABN: 99 143 842 569TEQSA: PRV12158

    Copyright | Disclaimer | Privacy statement | Accessibility

    Curtin would like to pay respect to the Aboriginal and Torres Strait Islander members of our community by acknowledging the traditional owners of the land on which the Perth campus is located, the Whadjuk people of the Nyungar Nation; and on our Kalgoorlie campus, the Wongutha people of the North-Eastern Goldfields.