Selecting adequate samples for approximate decision support queries
dc.contributor.author | Rudra, Amit | |
dc.contributor.author | Gopalan, Raj | |
dc.contributor.author | Achuthan, Narasimaha | |
dc.contributor.editor | Salimane Hammoudi | |
dc.contributor.editor | Leszek Maciaszek | |
dc.contributor.editor | Jose Cordeiro | |
dc.contributor.editor | Jan Dietz | |
dc.date.accessioned | 2017-01-30T13:38:07Z | |
dc.date.available | 2017-01-30T13:38:07Z | |
dc.date.created | 2013-09-09T20:00:36Z | |
dc.date.issued | 2013 | |
dc.identifier.citation | Rudra, Amit and Gopalan, Raj and Achuthan, Narasimaha. 2013. Selecting adequate samples for approximate decision support queries, in Hammoudi, S. and Maciaszek, L. and Cordeiro. J. and Dietz, J. (ed), ICEIS 2013, Jul 4-7 2013, pp. 46-55. Angers, France: Science and Technology Publications. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/33606 | |
dc.description.abstract |
For highly selective queries, a simple random sample of records drawn from a large data warehouse may not contain sufficient number of records that satisfy the query conditions. Efficient sampling schemes for such queries require innovative techniques that can access records that are relevant to each specific query. In drawing the sample, it is advantageous to know what would be an adequate sample size for a given query. This paper proposes methods for picking adequate samples that ensure approximate query results with a desired level of accuracy. A special index based on a structure known as the k-MDI Tree is used to draw samples. An unbiased estimator named inverse simple random sampling without replacement is adapted to estimate adequate sample sizes for queries. The methods are evaluated experimentally on a large real life data set. The results of evaluation show that adequate sample sizes can be determined such that errors in outputs of most queries are wtihin the acceptable limit of 5%. | |
dc.publisher | Science and Technology Publications | |
dc.subject | Data Warehousing | |
dc.subject | Inverse Simple Random Sample without Replacement (SRSWOR) | |
dc.subject | Approximate Query Processing | |
dc.subject | Sampling | |
dc.title | Selecting adequate samples for approximate decision support queries | |
dc.type | Conference Paper | |
dcterms.source.startPage | 46 | |
dcterms.source.endPage | 55 | |
dcterms.source.title | Proceedings of the 15th International Conference on Enterprise Information Systems | |
dcterms.source.series | Proceedings of the 15th International Conference on Enterprise Information Systems | |
dcterms.source.isbn | 9789898565594 | |
dcterms.source.conference | ICEIS 2013 | |
dcterms.source.conference-start-date | Jul 4 2013 | |
dcterms.source.conferencelocation | Angers, France | |
dcterms.source.place | France | |
curtin.department | ||
curtin.accessStatus | Open access |