Estimating Sufficient Sample Sizes for Approximate Decision Support Queries
Access Status
Fulltext not available
Authors
Rudra, Amit
Gopalan, Raj
Achuthan, Narasimaha
Date
2014Type
Book Chapter
Metadata
Show full item recordCitation
Rudra, A. and Gopalan, R. and Achuthan, N. 2014. Estimating Sufficient Sample Sizes for Approximate Decision Support Queries. In S. Hammoudi, J. Cordeiro, L.A. Maciaszek, J. Filipe (eds), Enterprise Information Systems, 85-99. Switzerland: Springer.
Source Title
Enterprise Information Systems
ISBN
School
School of Information Systems
Collection
Abstract
Sampling schemes for approximate processing of highly selective decision support queries need to retrieve sufficient number of records that can provide reliable results within acceptable error limits. The k-MDI tree is an innovative index structure that supports drawing rich samples of relevant records for a given set of dimensional attribute ranges. This paper describes a method for estimating sufficient sample sizes for decision support queries based on inverse simple random sampling without replacement (SRSWOR). Combined with a k-MDI tree index, this method is shown to offer a reliable approach to approximate query processing for decision support.
Related items
Showing items related by title, author, creator and subject.
-
Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2012)Decision support queries usually involve accessing enormous amount of data requiring significant retrieval time. Faster retrieval of query results can often save precious time for the decision maker. Pre-computation of ...
-
Inoue, T.; Krishna, Aneesh; Gopalan, Raj (2016)Approximate query processing based on random sampling is one of the most useful methods for the efficient computation of large quantities of data kept in databases. However, small samples obtained through random sampling ...
-
Inoue, Tomohiro (2015)This dissertation studies efficient and effective approximate query processing for decision support systems. A novel method that enables fast query processing and reliable approximation even in highly selective queries ...