Large Scale Data Analytics with Language Integrated Query
Access Status
Open access
Authors
Cho, Chung Yik
Date
2018Supervisor
Veeramani Shanmugam
Type
Thesis
Award
MPhil
Metadata
Show full item recordFaculty
Science and Engineering
School
Electrical Engineering, Computer and Math Science (EECMS)
Collection
Abstract
Databases can easily reach petabytes (1,048,576 gigabytes) in scale. A system to enable users to efficiently retrieve or query data from multiple databases simultaneously is needed. This research introduces a new, cloud-based query framework, designed and built using Language Integrated Query, to query existing data sources without the need to integrate or restructure existing databases. Protein data obtained through the query framework proves its feasibility and cost effectiveness.
Related items
Showing items related by title, author, creator and subject.
-
Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2012)Decision support queries usually involve accessing enormous amount of data requiring significant retrieval time. Faster retrieval of query results can often save precious time for the decision maker. Pre-computation of ...
-
Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2013)For highly selective queries, a simple random sample of records drawn from a large data warehouse may not contain sufficient number of records that satisfy the query conditions. Efficient sampling schemes for such queries ...
-
Rudra, Amit; Gopalan, Raj; Achuthan, Narasimaha (2012)A simple random sample of records from a large data warehouse may not contain sufficient number of records that satisfy highly selective queries. Efficient sampling schemes for such queries involve using innovative ...