About
The importance of named entities in information retrieval and knowledge management has recently drawn interest in characterizing the semantic relationships between them. In this paper, we propose a method for measuring semantic similarity, an important type of semantic relationship, between entities. The method is based on Google Directory, a search interface to the Open Directory Project. Using the search engine, we locate the web pages relevant to an entity and automatically build a profile of the entity from the directory assignments of those pages, which capture various features of the entity. Using these profiles, the semantic similarity between entities can be measured along different dimensions. We apply the similarity measure to two knowledge acquisition tasks: thesaurus construction for entities and fine-grained categorization of entities. Our experiments demonstrate that the proposed method works effectively in both tasks.
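The profile idea described above can be illustrated with a minimal Python sketch. The abstract does not say how profiles are weighted or compared, so the ancestor-counting scheme and the cosine similarity used here are assumptions, not the paper's actual method. The function names and the hard-coded category paths are hypothetical stand-ins for what a directory search might return for an entity's web pages.

```python
from collections import Counter
from math import sqrt


def build_profile(category_paths):
    """Build an entity profile from directory category paths.

    Assumption: each page's category also credits all of its ancestors,
    so entities filed under sibling categories still share some weight.
    """
    profile = Counter()
    for path in category_paths:
        parts = path.split("/")
        for depth in range(1, len(parts) + 1):
            profile["/".join(parts[:depth])] += 1
    return profile


def cosine_similarity(a, b):
    """Cosine similarity between two profiles (an assumed measure)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical directory assignments for the pages found for two entities.
entity_x = build_profile(["Arts/Music/Bands", "Arts/Music/Styles/Rock"])
entity_y = build_profile(["Arts/Music/Bands", "Arts/Movies"])

print(cosine_similarity(entity_x, entity_y))
```

Restricting both profiles to keys under a single top-level branch before comparing them would be one way to approximate measuring similarity along a particular dimension.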
Citation
Jiahui Liu, Larry Birnbaum, "Measuring Semantic Similarity between Named Entities by Searching the Web Directory", 2007 IEEE/WIC/ACM International Conference on Web Intelligence (WI'07), 2007
Related Papers
MakeMyPage: Social Media Meets Automatic Content Generation
Rich Interfaces for Reading News on the Web
Categorizing Blogger’s Interests Based on Short Snippets of Blog Posts
Pivot: Automatically Offering Information and Services to Real-World Shoppers
LocalSavvy: Aggregating Local Points of View about News Issues
What Do They Think? Aggregating Local Views about News Events and Topics
Compare&Contrast: Using the Web to Discover Comparable Cases for News Stories
Collaborative Reasoning and Collaborative Ontology Development in CRAFT
Between Ontology and Folksonomy: A Study of Collaborative and Implicit Ontology Evolution
Reasoning Through Search: A Novel Approach to Sentiment Classification
Learning to Gesture: Applying Animations to Spoken Text
TagAssist: Automatic Tag Suggestion for Blog Posts
Context Transformations for Just-in-time Retrieval: Adapting the Watson System to User Needs
Buzz: Telling Compelling Stories
Using Explicit Semantic Models to Track Situations across News Articles
Creating Polite Agents: 5 Heuristics for User Experience Design
Computational Support for Compelling Story Telling
Believable Performance Agents for Interactive Conversations
Domain Specific Affective Classification of Documents
Network Arts: Defining Emotional Interaction in Media Arts
MusicStory: a Personalized Music Video Creator
Analogy, Intelligent IR, and Knowledge Integration for Intelligence Analysis
Affective Behaviors for Theatrical Agents
Concept Maps Applied to Mars Exploration Public Outreach
Using Web Frequency Within Multimedia Exhibitions
The Association Engine: A Free Associative Digital Improviser
Between Now and the Semantic Web
Imagination Environment: Using the web as a source of popular culture
TextPool: Visualizing Live Text Streams
Keyless Media Delivery and Security: Location-aware computing through Interaction Design
Network Arts: Exposing Cultural Reality
Low-Fidelity Location Based Information Systems
Context-Aware Keyless Computing
Beyond Broadcast
Beyond Broadcast: a demo
Towards a Non-Linear Narrative Construction
Clustering for Opportunistic Communication
Automatically Indexing Documents: Content vs. Reference
Java Settlers: A Research Environment for Studying Multi-Agent Negotiation
Supporting Online Resource Discovery in the Context of Ongoing Tasks with Proactive Software Assistants
Flytrap: Intelligent Group Music Recommendation
Using Citations to Facilitate Precise Indexing and Automatic Index Creation in Collections of Research Papers
XLibris: An Automated Library Research Assistant
Facilitating Opportunistic Communication by Tracking the Documents People Use
Information access in context
Beyond Similarity
Jabberwocky: You don't have to be a rocket scientist to change slides for a hydrogen combustion lecture
Guiding People to Information: Providing an Interface to a Digital Library Using Reference as a Basis for Indexing
Improving Human Computer Interaction in a Classroom Environment using Computer Vision
User Interactions with Everyday Applications as Context for Just-in-time Information Access
Mining Navigation History for Recommendation
Constructing Indices from Citations in Collections of Research Papers
CBR Textuality
Q&A: A System for the Capture, Organization and Reuse of Expertise
Selecting Task-Relevant Sources for Just-in-Time Retrieval
Automatically Indexing Research Papers Using Text Surrounding Citations
InfoLab Package
Beyond "Next slide, please": The use of content and speech in multi-modal control
Learning for Question Answering and Text Classification: Integrating Knowledge-Based and Statistical Techniques
All gadget and no representation makes Jack a dull environment
Cooperating with people: The Intelligent Classroom
Gargoyle: Vision in the Intelligent Classroom
Integrating Range and Object Data for Robot Navigation
Happy Patrons Make Better Tippers: Creating a Robot Waiter Using Perseus and the Animate Agent Architecture
Machine-Generated Multimedia Content
From Generating to Mining: Automatically Scripting Conversations Using Existing Online Sources
News at Seven: The Future of the Future.
Classifying Paintings by Artistic Genre: An Analysis of Features & Classifiers



