Wavelets

Information about Wavelets

Published on November 21, 2007

Author: Jade

Source: authorstream.com

Content

Wavelets and Ranking of database query results:  Wavelets and Ranking of database query results Prepared by -Archana vijayalakshmanan Contents:  Contents Wavelets Haar wavelets Haar wavelet coefficients Examples Comparison with sampling AQP methods Ranking Similarity function Examples Wavelets:  Wavelets In signal processing community, wavelets are used to break the complicated signal into single component. Similarly in this context wavelets are used to break the dataset into simple component. Haar wavelet - simple wavelet, easy to understand Slide4:  Haar Wavelet Resolution Averages Detail Coefficients [2, 2, 0, 2, 3, 5, 4, 4] [2, 1, 4, 4] [0, -1, -1, 0] ---- 3 2 1 0 wavelet decomposition(wavelet transform): [2.75, -1.25, 0.5, 0, 0, -1, -1, 0] Haar Wavelet Coefficients :  Haar Wavelet Coefficients Using wavelet coefficients one can pull the raw data Keep only the large wavelet coefficients and pretend other coefficients to be 0. [2.75, -1.25, 0.5, 0, 0, 0, 0, 0]-synopsis of the data The elimination of small coefficients introduces only small error when reconstructing the original data Example -1:  Example -1 Query: SELECTsalary FROM employee WHERE empid=5 Employee Result: By using the synopsis [2.75,- 1.25, 0.5, 0, 0, 0, 0, 0] and constructing the tree on fly, salary=4 will be returned.Whereas the correct result is salary=3.This error is due to truncation of wavelength Example-2 on range query:  Example-2 on range query SELECT avg(salary) FROM Employee WHERE 3 < empid <7 Keep the original data in form of cumulative dataset Find the Haar wavelet transformation and construct the tree Comparison with sampling:  Comparison with sampling For Haar wavelet transformation all the data must be numeric. In the example, even empid must be numeric and must be sorted Multidimension Haar wavelet transformation Sampling gives the probabilistic error measure whereas Haar wavelet does not provide any Haar wavelet is more robust than sampling. The final average gives the average of all data values. Hence all the tuples are involved. AQP methods:  AQP methods Sampling- Most promising, robust (works on categorical and unsorted data too. Histogram Wavelet- have to reconstruct the tree Join distributive model Bayesian network Ranking:  Ranking Example 1- In Google, one gives the query (or search phrase) to database and a tuple (or page ) matching the search phrase is displayed Example 2- In library catalog, one gives the partial information on a book, the top 10 books are ranked according to the matching requirements and are displayed Query, reviews, reliability plays important role in ranking Similarity function:  Similarity function Similarity function S(Q,t)= [0,1] where Q-query t-tuple. Bigger number means more appropriate to query Using similarity function, one can execute query quickly Apart from ranking ,summarization of the results can also be done. Eg. On typing jaguar in Google, website of jaguar cars and website of jaguar animal can be displayed Example for ranking:  Example for ranking Consider a search data engine to display top restaurants. Let cuisine, price and location be the attributes or the criteria under which restaurants are ranked Assume the database to be numeric. Let Score(t)= 3.5 price+2.8 location+8.3 cuisine be the ranking function Apply the ranking function to all tuples, find the score, sort the top score and print the result. References:  References maids.ncsa.uiuc.edu/documents/readings/rastogi02.ppt

Related presentations


Other presentations created by Jade

TRB JPR 2007
28. 02. 2008
0 views

TRB JPR 2007

communism
23. 12. 2007
0 views

communism

1stTuesday2003
13. 04. 2008
0 views

1stTuesday2003

solar sys overview
07. 04. 2008
0 views

solar sys overview

file42877
30. 03. 2008
0 views

file42877

Franchise Launch Presntation Rom
27. 03. 2008
0 views

Franchise Launch Presntation Rom

UK Becta
21. 03. 2008
0 views

UK Becta

AlexandraBuchler
18. 03. 2008
0 views

AlexandraBuchler

SARS Arjun
11. 03. 2008
0 views

SARS Arjun

att165
06. 03. 2008
0 views

att165

Block 2 digLA GoodPractices
03. 10. 2007
0 views

Block 2 digLA GoodPractices

Pathogens in Raw Bulk Tank Milk
09. 11. 2007
0 views

Pathogens in Raw Bulk Tank Milk

19188
15. 11. 2007
0 views

19188

Autostudie 2007 Febiac
16. 11. 2007
0 views

Autostudie 2007 Febiac

Loftus Aqua USGS SFWMD 2003
20. 11. 2007
0 views

Loftus Aqua USGS SFWMD 2003

EarlierSymposia
03. 10. 2007
0 views

EarlierSymposia

Elvis
14. 11. 2007
0 views

Elvis

astro
30. 12. 2007
0 views

astro

karst hydrology
01. 01. 2008
0 views

karst hydrology

zoon
23. 11. 2007
0 views

zoon

teachers
07. 12. 2007
0 views

teachers

Nutrition and Your Athletes2
04. 03. 2008
0 views

Nutrition and Your Athletes2

fundamentos reabilitacao
29. 12. 2007
0 views

fundamentos reabilitacao

92E931 C05
11. 10. 2007
0 views

92E931 C05

2006 presentation part 3
07. 11. 2007
0 views

2006 presentation part 3

DAnCE OMG RTWS
23. 11. 2007
0 views

DAnCE OMG RTWS

SDY06a
07. 01. 2008
0 views

SDY06a

NGG MOEW
10. 12. 2007
0 views

NGG MOEW

Nature Area Trees
03. 01. 2008
0 views

Nature Area Trees

WriteThinkLearn
16. 11. 2007
0 views

WriteThinkLearn

vicki ict exam
29. 12. 2007
0 views

vicki ict exam

TitanOcean
04. 01. 2008
0 views

TitanOcean