chi00

Information about chi00

Published on November 19, 2007

Author: Dabby

Source: authorstream.com

Content

Bringing Order to the Web: Automatically Categorizing Search Results
  Hao Chen, CS Division, UC Berkeley
  Susan Dumais, Microsoft Research
  ACM CHI, April 4, 2000

Organizing Search Results
  Query: jaguar

Outline
  Background
    Using category structure to organize information
  SWISH System (Searching With Information Structured Hierarchically)
    Text classification
    User interface
  User Study
  Future Work

Using Category Structure
  To Organize Information
    Superbook, Cat-a-Cone, etc.
  To Help Web Search
    Yahoo!, Northern Light
  What’s New in SWISH?
    Automatic categorization of new documents
    User interface that tightly couples hierarchical category structure with search results
    User study for the new user interface

SWISH System
  Combines the Advantages of
    Manually crafted & easily understood directory structure
    Broad coverage from search engines
  System Components
    Text classification models
    User interface

Text Classification
  Text Classification
    Assign documents to one or more of a predefined set of categories
    E.g., news feeds, email (spam/no-spam), Web data
    Manually vs. automatically
  Inductive Learning for Classification
    Training set: manually classify a set of documents
    Learning: learn classification models
    Classification: use the model to automatically classify new documents

Training Set: LookSmart Web Directory
  Category Structure (spring 99)
    13 top-level categories
    150 second-level categories
  Training Set
    ~50k web pages; chosen randomly from all categories
  Top-level Categories

Learning & Classification
  Support Vector Machine (SVM)
    Accurate and efficient for text classification (Dumais et al., Joachims)
    Model = weighted vector of words
    “Automobile” = motorcycle, vehicle, parts, automobile, harley, car, auto, honda, porsche ...
    “Computers & Internet” = rfc, software, provider, windows, user, users, pc, hosting, os, downloads ...
  Hierarchical Models
    1 model for N top-level categories
    N models for second-level categories
    Very useful in conjunction with user interaction

SWISH Architecture

Interface Characteristics
  Problems
    Large amount of information to display
      Search results
      Category structure
    Limited screen real estate
  Solutions
    Information overlay
    Distilled information display

Information Overlay
  Use tooltips to show
    Summaries of web pages
    Category hierarchy

Expansion of Category Structure

Expansion of Web Page List

User Study - Conditions

User Study

User Study
  Participants: 18 intermediate Web users
  Tasks
    30 search tasks, e.g., “Find home page for Seattle Art Museum”
    Search terms are fixed for each task
  Experimental Design
    Category/List - within subjects
    15 search tasks with each interface
    Order (Category/List First) - counterbalanced between subjects
  Both Subjective and Objective Measures

Subjective Results
  7-point rating scale (1=disagree; 7=agree)
  Questions:

Use of Interface Features
  Average Number of Uses of Feature per Task

Search Time
  Category: 56 secs
  List: 85 secs
  p < .002
  50% faster with Category interface

Search Time by Query Difficulty
  Top20: 57 secs
  NotTop20: 98 secs
  No reliable interaction between query difficulty and interface condition
  Category interface is helpful for both easy and difficult queries

Summary
  Text Classification
    Organize search results
    Use hierarchical category models
    Classify new web pages on-the-fly
  User Interface
    Tightly couple search results with category structure
    Allow manipulation of presentation of category structure
  User Study
    Suggests strong preference and performance advantages for categorically organized presentation of search results

Open Issues
  Improve Accuracy of Classification Algorithms
  Enhance User Interface
    Heuristics for selecting categories and pages to display
      Query_Match: rank of page, and sometimes match score
      Categ_Match: p(category for each page)
    Integration with non-content information
  Conduct End-to-end User Study
  More info: http://research.microsoft.com/~sdumais

Searching With Information Structured Hierarchically
  SWISH
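
The "Model = weighted vector of words" and "Hierarchical Models" points from the Learning & Classification slide can be illustrated with a rough sketch. The version below is only an approximation under stated assumptions: the word weights are made up for illustration (in SWISH they would come from trained SVMs), the second-level category names are hypothetical rather than taken from LookSmart, and scoring is a plain weighted word count instead of a real SVM decision function.

```python
from collections import Counter

# Hypothetical word weights standing in for trained linear models.
# One model set chooses among the top-level categories.
TOP_LEVEL = {
    "Automobile": {"car": 1.0, "auto": 0.8, "porsche": 0.9, "vehicle": 0.7},
    "Computers & Internet": {"software": 1.0, "windows": 0.8, "pc": 0.7, "downloads": 0.6},
}

# One model set per top-level category chooses among its second-level
# categories. The second-level names here are invented for the example.
SECOND_LEVEL = {
    "Automobile": {
        "Sports Cars": {"porsche": 1.0, "jaguar": 0.9},
        "Motorcycles": {"harley": 1.0, "motorcycle": 0.9},
    },
    "Computers & Internet": {
        "Software": {"software": 1.0, "downloads": 0.8},
        "Hosting": {"hosting": 1.0, "provider": 0.9},
    },
}


def score(weights, text):
    """Sum of (word weight x word count) over the words the model knows."""
    counts = Counter(text.lower().split())
    return sum(weight * counts[word] for word, weight in weights.items())


def best(models, text):
    """Name of the category whose weight vector scores the text highest."""
    return max(models, key=lambda name: score(models[name], text))


def classify(text):
    """Pick a top-level category first, then a second-level category within it."""
    top = best(TOP_LEVEL, text)
    second = best(SECOND_LEVEL[top], text)
    return top, second


if __name__ == "__main__":
    print(classify("jaguar porsche car dealer"))
    print(classify("windows software downloads for pc"))
```

Deciding the top level first means only the second-level models under the chosen category have to be evaluated for each page, which fits the slides' goal of classifying new web pages on the fly.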
