<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Associative Search and the Semantic Web: The Next Step Beyond Natural Language Search</title>
	<atom:link href="http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search/feed" rel="self" type="application/rss+xml" />
	<link>http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search</link>
	<description>The Future of the Web, Search Technology, and the Global Brain</description>
	<lastBuildDate>Tue, 07 Sep 2010 12:35:00 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Ben Toth</title>
		<link>http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search/comment-page-1#comment-4611</link>
		<dc:creator>Ben Toth</dc:creator>
		<pubDate>Fri, 16 May 2008 09:23:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.novaspivack.com/?p=55#comment-4611</guid>
		<description>A couple of points in response.
1 The problem with theoretical illustrations of why a new technology is necessary is that they tend to be constructed to illustrate the benefits of the technology, and don&#039;t necessarily reflect real use cases. I once had a new search engine offered to me on the basis that it could distinguish between blind venetians and venetian blinds. We didn&#039;t buy it, but the point is that the use case was completely fanciful. Just how many real world searchers are interested in &#039;all products by Sony&#039;?
2 I often have a problem with remembering the names of people in relation to a project . I have a few scraps of asociated information, some of which turn out to be wrong. So I search my e-mail, or my hard disk, and sometimes it takes a few searches. Generally I get there. The point is that I use excisting technology and it doesn&#039;t require the sort of overhead implied above.
The overall point is that the semantic and AI approaches carry a significant overhead, and the illustrations above don&#039;t justify this overhead. I am in favour of better search, and even better metadata (I am a librarian) - but better metadata driven by real use cases.</description>
		<content:encoded><![CDATA[<p>A couple of points in response.<br />
1 The problem with theoretical illustrations of why a new technology is necessary is that they tend to be constructed to illustrate the benefits of the technology, and don&#8217;t necessarily reflect real use cases. I once had a new search engine offered to me on the basis that it could distinguish between blind venetians and venetian blinds. We didn&#8217;t buy it, but the point is that the use case was completely fanciful. Just how many real world searchers are interested in &#8216;all products by Sony&#8217;?<br />
2 I often have a problem with remembering the names of people in relation to a project . I have a few scraps of asociated information, some of which turn out to be wrong. So I search my e-mail, or my hard disk, and sometimes it takes a few searches. Generally I get there. The point is that I use excisting technology and it doesn&#8217;t require the sort of overhead implied above.<br />
The overall point is that the semantic and AI approaches carry a significant overhead, and the illustrations above don&#8217;t justify this overhead. I am in favour of better search, and even better metadata (I am a librarian) &#8211; but better metadata driven by real use cases.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sherman Monroe</title>
		<link>http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search/comment-page-1#comment-4610</link>
		<dc:creator>Sherman Monroe</dc:creator>
		<pubDate>Wed, 14 May 2008 14:30:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.novaspivack.com/?p=55#comment-4610</guid>
		<description>This is an excellent piece, and important rebuttal to the discussions around Powerset/Hakia/True Knowledge, especially coming from one as yourself who is a proponent of NLP. Lately, I have been attempting to &lt;a href=&quot;http://sdmonroe.blogspot.com/2008/05/poweset-hype-and-norvig-pooh-poohs-on.html&quot; rel=&quot;nofollow&quot;&gt;attempting to draw the distinction&lt;/a&gt; between what is possible and what is useful in NLP, because there is a difference. As you have observed, NLP is not a solution which provides enhancements in all cases. This is especially true in current web document search. I believe NLP will find its niche as an interface control for accessing and updating data in structured databased, such as RDF repositories, where users are averse to technical query languages. NLP will provide the user a way to use verbs, nouns, adjectives as controls and operators for expressing complex select and update queries. In this way, NLP does for the semantic web, what the GUI did for the command line.
But, again as you correctly identify, as hard as NLP is to fully realize in software, there are still aspects to information retrieval which it is not equipped to address. I believe science fiction has had a lot to do with it, but for whatever reason, people equate the ability to understand natural language as indicating human intelligence in a complete sense, when in fact, a system can be created which correctly parses the Wall Street Journal into formal logical statements, and still be unable to determine whether a particular joke is &quot;funny&quot;. Thus a query for &quot;funny statements made by George Bush&quot; would still be unsolvable (if there is no explicit statement asserting that something he said was funny). The associative aspect of information is one of these very important aspects to point out. The NLP world must be prepared to 1) identify and classify the various modes of information search and retrieval, and in which of those modes can NLP add tremendous value, and 2) focus on providing additional support for user that go a step beyond the answers provided by NLP technology, including research insensitive features such as result set refinement, filtering and faceting.
Thanks so much for the insight.</description>
		<content:encoded><![CDATA[<p>This is an excellent piece, and important rebuttal to the discussions around Powerset/Hakia/True Knowledge, especially coming from one as yourself who is a proponent of NLP. Lately, I have been attempting to <a href="http://sdmonroe.blogspot.com/2008/05/poweset-hype-and-norvig-pooh-poohs-on.html" rel="nofollow">attempting to draw the distinction</a> between what is possible and what is useful in NLP, because there is a difference. As you have observed, NLP is not a solution which provides enhancements in all cases. This is especially true in current web document search. I believe NLP will find its niche as an interface control for accessing and updating data in structured databased, such as RDF repositories, where users are averse to technical query languages. NLP will provide the user a way to use verbs, nouns, adjectives as controls and operators for expressing complex select and update queries. In this way, NLP does for the semantic web, what the GUI did for the command line.<br />
But, again as you correctly identify, as hard as NLP is to fully realize in software, there are still aspects to information retrieval which it is not equipped to address. I believe science fiction has had a lot to do with it, but for whatever reason, people equate the ability to understand natural language as indicating human intelligence in a complete sense, when in fact, a system can be created which correctly parses the Wall Street Journal into formal logical statements, and still be unable to determine whether a particular joke is &#8220;funny&#8221;. Thus a query for &#8220;funny statements made by George Bush&#8221; would still be unsolvable (if there is no explicit statement asserting that something he said was funny). The associative aspect of information is one of these very important aspects to point out. The NLP world must be prepared to 1) identify and classify the various modes of information search and retrieval, and in which of those modes can NLP add tremendous value, and 2) focus on providing additional support for user that go a step beyond the answers provided by NLP technology, including research insensitive features such as result set refinement, filtering and faceting.<br />
Thanks so much for the insight.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lars Ludwig</title>
		<link>http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search/comment-page-1#comment-4609</link>
		<dc:creator>Lars Ludwig</dc:creator>
		<pubDate>Wed, 14 May 2008 10:10:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.novaspivack.com/?p=55#comment-4609</guid>
		<description>The differnece between remembering by association and remembering by keyword is similar to the difference between recognition-based search and recall-based search. The SERP offers recognition-based search after initial recall-based search. The suggest box to some degree enables recognition-based search. In my concept of &#039;artificial memory&#039; I found that the personal &#039;association&#039; network is the best starting point for any recall-based searching, because it encloses or points to personal recognizable concepts/things. Semantic search thus would highly benefit from personal knowledge bases.</description>
		<content:encoded><![CDATA[<p>The differnece between remembering by association and remembering by keyword is similar to the difference between recognition-based search and recall-based search. The SERP offers recognition-based search after initial recall-based search. The suggest box to some degree enables recognition-based search. In my concept of &#8216;artificial memory&#8217; I found that the personal &#8216;association&#8217; network is the best starting point for any recall-based searching, because it encloses or points to personal recognizable concepts/things. Semantic search thus would highly benefit from personal knowledge bases.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kingsley Idehen</title>
		<link>http://www.novaspivack.com/technology/associative-search-and-the-semantic-web-the-next-step-beyond-natural-language-search/comment-page-1#comment-4608</link>
		<dc:creator>Kingsley Idehen</dc:creator>
		<pubDate>Tue, 13 May 2008 20:43:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.novaspivack.com/?p=55#comment-4608</guid>
		<description>Nova,
Well articulated!
My only quibble is that 15 man years away is too far out. Thus, I am hoping your unit of measure is Internet Time :-)
The Linked Data substrate is already growing organically, the injection of complimentary Lookup oriented Linked Data Spaces is also taking shape as per the UMBEL project.
For example, a corpus of Named Entity Dictionaries + Subject matter Concept schemes + Ontolology level cross mappings gives us a sustrate where &quot;Context&quot; attainment via Subject/Topic aided &quot;Disambiguation&quot; blooms. This also implies that NLP challenges of yore are much reduced.
Historically:
NLP atop SQL just wasn&#039;t/isn&#039;t feasible.
NLP over Unstructured Text (what many are grapping with today over the Web via &quot;Semantic inside&quot; solutions) lacks Contextual richeness, ultimately.
NLP over Structured Linked Data is a whole different ball game.  Especially, when you mesh NLP with the Extraction  Disambiguation prowess that Linked Data accords. This is basically what&#039;s taking shape now in the Linked Data realm.</description>
		<content:encoded><![CDATA[<p>Nova,<br />
Well articulated!<br />
My only quibble is that 15 man years away is too far out. Thus, I am hoping your unit of measure is Internet Time <img src='http://www.novaspivack.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /><br />
The Linked Data substrate is already growing organically, the injection of complimentary Lookup oriented Linked Data Spaces is also taking shape as per the UMBEL project.<br />
For example, a corpus of Named Entity Dictionaries + Subject matter Concept schemes + Ontolology level cross mappings gives us a sustrate where &#8220;Context&#8221; attainment via Subject/Topic aided &#8220;Disambiguation&#8221; blooms. This also implies that NLP challenges of yore are much reduced.<br />
Historically:<br />
NLP atop SQL just wasn&#8217;t/isn&#8217;t feasible.<br />
NLP over Unstructured Text (what many are grapping with today over the Web via &#8220;Semantic inside&#8221; solutions) lacks Contextual richeness, ultimately.<br />
NLP over Structured Linked Data is a whole different ball game.  Especially, when you mesh NLP with the Extraction  Disambiguation prowess that Linked Data accords. This is basically what&#8217;s taking shape now in the Linked Data realm.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk (enhanced) (user agent is rejected)
Database Caching 1/11 queries in 0.007 seconds using memcached

Served from: www.novaspivack.com @ 2010-09-09 07:02:24 -->