<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MicahLogic &#187; search</title>
	<atom:link href="http://dubinko.info/blog/tags/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://dubinko.info/blog</link>
	<description>From an XML geek, a reader, a writer, a connector, a man of the people (says keep hope alive)</description>
	<lastBuildDate>Thu, 02 Feb 2012 06:43:33 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Eulogy for SearchMonkey</title>
		<link>http://dubinko.info/blog/2010/08/22/eulogy-for-searchmonkey/</link>
		<comments>http://dubinko.info/blog/2010/08/22/eulogy-for-searchmonkey/#comments</comments>
		<pubDate>Mon, 23 Aug 2010 06:07:18 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[microformats]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[yahoo]]></category>
		<category><![CDATA[bing]]></category>
		<category><![CDATA[RIP]]></category>
		<category><![CDATA[searchmonkey]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=887</guid>
		<description><![CDATA[This is indeed a sad day for all of us, for on October 1, a great app will be gone. Though we hardly had enough time during his short life to get to know him, like the grass that withers and fades, this monkey will finish his earthly course. I know he left many things [...]]]></description>
			<content:encoded><![CDATA[<p>This is indeed a sad day for all of us, for on October 1, a great app will be <a href="http://www.ysearchblog.com/2010/08/17/news-about-our-searchmonkey-program/">gone</a>. Though we hardly had enough time during his short life to get to know him, like the grass that withers and fades, this monkey will finish his earthly course.</p>
<div class="wp-caption alignright" style="width: 190px"><a title="Updated SearchMonkey logo by mdubinko, on Flickr" href="http://www.flickr.com/photos/mdubinko/4911814062/"><img title="SearchMonkey updated logo" src="http://farm5.static.flickr.com/4101/4911814062_c7dd2a2c17_m.jpg" alt="Updated SearchMonkey logo" width="180" height="240" /></a><p class="wp-caption-text">Photo by Micah</p></div>
<p>I know he left many things undone, for example only enhancing 60% of the delivered result pages. He never got a chance to finish his life&#8217;s ambition of promoting RDFa and microformats to the masses or to be the killer app of the (lower-case) semantic web. You could say he will live on as &#8220;some of this structured data processing will be supported natively by the Microsoft platform&#8221;. Part of the monkey we loved will live on as enhanced results continue to flow forth from the Yahoo/Bing alliance.</p>
<p>The SearchMonkey Alumni group on LinkedIn is filled with wonderful mourners. Micah Alpern wrote there</p>
<p style="padding-left: 30px;">I miss the team, the <a href="http://vimeo.com/3288386  ">songs</a>, and the aspiration to solve a hard problem. Everything else is just code.</p>
<p>Isaac Asimov was reported to have said &#8220;<em>If my doctor told me I had only six minutes to live, I wouldn&#8217;t brood. I&#8217;d type a little faster.</em>&#8221; Today we can identify with that sentiment. Keep typing.</p>
<p>-m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2010/08/22/eulogy-for-searchmonkey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Command Lines on the frontier of user interface</title>
		<link>http://dubinko.info/blog/2010/06/12/command-lines-on-the-frontier-of-user-interface/</link>
		<comments>http://dubinko.info/blog/2010/06/12/command-lines-on-the-frontier-of-user-interface/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 05:13:53 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[stuff]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[donnorman]]></category>
		<category><![CDATA[ui]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=847</guid>
		<description><![CDATA[This came from a comment on the prior post, and it&#8217;s worth a shout of its own. Don Norman on the importance of command lines, including the ubiquitous search box, in modern UI. -m]]></description>
			<content:encoded><![CDATA[<p>This came from a comment on the prior post, and it&#8217;s worth a shout of its own. Don Norman on the importance of <a href="http://www.jnd.org/dn.mss/ui_breakthroughcomma.html">command lines</a>, including the ubiquitous search box, in modern UI. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2010/06/12/command-lines-on-the-frontier-of-user-interface/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>&#8220;Google syntax&#8221; for semantic queries?</title>
		<link>http://dubinko.info/blog/2010/06/09/google-syntax-for-semantic-queries/</link>
		<comments>http://dubinko.info/blog/2010/06/09/google-syntax-for-semantic-queries/#comments</comments>
		<pubDate>Wed, 09 Jun 2010 07:14:42 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[intentional web]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[stuff]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[rdf]]></category>
		<category><![CDATA[searchmonkey]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=831</guid>
		<description><![CDATA[Thought experiment: are there any commonly-expressed semantic queries&#8211;the kind of queries you&#8217;d run over a triple store, or perhaps a SearchMonkey-annotated web site&#8211;expressible in common type-in-a-searchbox query grammar? As a refresher, here&#8217;s some things that Google and other search engines can handle. The square brackets represent the search box into which the queries are typed, [...]]]></description>
			<content:encoded><![CDATA[<p>Thought experiment: are there any commonly-expressed semantic queries&#8211;the kind of queries you&#8217;d run over a triple store, or perhaps a <a href="http://developer.yahoo.com/searchmonkey/">SearchMonkey</a>-annotated web site&#8211;expressible in common type-in-a-searchbox query grammar?</p>
<p>As a refresher, here&#8217;s some things that Google and other search engines can handle. The square brackets represent the search box into which the queries are typed, not part of the queries themselves.</p>
<p>[term]</p>
<p>[term -butnotthis]</p>
<p>[term1 OR term2]</p>
<p>["phrase term"]</p>
<p>[tem1 OR term2 -"but not this" site:dubinko.info filetype:html]</p>
<p>So what kind of semantic queries would be usefully expressed in a similar way, avoiding SPARQL and the like? For example, maybe [by:"Micah Dubinko"] could map to a document containing a triple like &lt;this document&gt; &lt;dc:author&gt; &#8220;Micah Dubinko&#8221;. What other kinds of graph queries are interesting, common, and simple to express like this? Comments welcome.</p>
<p>-m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2010/06/09/google-syntax-for-semantic-queries/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>US Federal Register in XML</title>
		<link>http://dubinko.info/blog/2009/10/07/us-federal-register-in-xml/</link>
		<comments>http://dubinko.info/blog/2009/10/07/us-federal-register-in-xml/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 05:07:33 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[IPR]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[fedthread]]></category>
		<category><![CDATA[syndication]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=664</guid>
		<description><![CDATA[Fed Thread is a front end for the newly XMLified Federal Register. Why is this a big deal? It&#8217;s a daily publication of the goings-on of the US government. It&#8217;s a primary source for all kinds of things that normally only get rehashed through news organizations. And it is bulky&#8211;nobody can read through it on [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.fedthread.org/">Fed Thread</a> is a front end for the newly XMLified Federal Register. Why is this a big deal? It&#8217;s a daily publication of the goings-on of the US government. It&#8217;s a primary source for all kinds of things that normally only get rehashed through news organizations. And it is bulky&#8211;nobody can read through it on a regular basis. A yearly subscription (printed) would cost nearly $1000 and fill over 80,000 pages.</p>
<p>Having it in XML enables all kinds of searching, syndication, and annotation via flexible front ends like this one. Yay for transparency. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2009/10/07/us-federal-register-in-xml/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google Rich Snippets powered by RDFa</title>
		<link>http://dubinko.info/blog/2009/05/12/google-rich-snippets-powered-by-rdfa/</link>
		<comments>http://dubinko.info/blog/2009/05/12/google-rich-snippets-powered-by-rdfa/#comments</comments>
		<pubDate>Wed, 13 May 2009 04:43:09 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[commercialism]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[intentional web]]></category>
		<category><![CDATA[languages]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[microformats]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[yahoo]]></category>
		<category><![CDATA[rdfa]]></category>
		<category><![CDATA[rich]]></category>
		<category><![CDATA[searchmonkey]]></category>
		<category><![CDATA[snippets]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=532</guid>
		<description><![CDATA[The new feature called rich snippets shows that SearchMonkey has caught the eye of the 800 pound gorilla. Many of the same microformats and RDF vocabularies are supported. It seems increasingly inevitable that RDFa will catch on, no matter what the HTML5 group thinks. -m]]></description>
			<content:encoded><![CDATA[<p>The new feature called <a href="http://googlewebmastercentral.blogspot.com/2009/05/introducing-rich-snippets.html">rich snippets</a> shows that SearchMonkey has caught the eye of the 800 pound gorilla. Many of the same microformats and RDF vocabularies are supported. It seems increasingly inevitable that RDFa will catch on, no matter what the HTML5 group thinks. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2009/05/12/google-rich-snippets-powered-by-rdfa/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Playing with Wolfram Alpha</title>
		<link>http://dubinko.info/blog/2009/05/03/playing-with-wolfram-alpha/</link>
		<comments>http://dubinko.info/blog/2009/05/03/playing-with-wolfram-alpha/#comments</comments>
		<pubDate>Mon, 04 May 2009 02:14:19 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[commercialism]]></category>
		<category><![CDATA[intentional web]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[alpha]]></category>
		<category><![CDATA[inference]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[wolfram]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=517</guid>
		<description><![CDATA[I&#8217;ve been experimenting with the preview version of Wolfram Alpha. It&#8217;s not like any current search engine because it&#8217;s not a search engine at all. Others have already written more eloquent things about it. The key feature of it is that it doesn&#8217;t just find information, it infers it on the fly. Take for exmple [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been experimenting with the preview version of <a href="http://www.wolframalpha.com">Wolfram Alpha</a>. It&#8217;s not like any current search engine because it&#8217;s not a search engine at all. Others have already <a href="http://searchengineland.com/wolfram-alpha-fact-engine-18431">written</a> more eloquent things about it.</p>
<p>The key feature of it is that it doesn&#8217;t just find information, it infers it on the fly. Take for exmple the query</p>
<p style="padding-left: 30px;">next solar eclipse in Sunnyvale</p>
<p>AFAIK, nobody has ever written a regular web page describing this important (to me) topic. Try it in <a href="http://search.yahoo.com/search?p=next+solar+eclipse+in+Sunnyvale">Yahoo!</a> or <a href="http://www.google.com/search?q=next+solar+eclipse+in+Sunnyvale">Google</a> and see for yourself. There are a few potentially interesting links based on the abstracts, but they turn out to be spammy. Wolfram Alpha figures out that I&#8217;m talking about the combination of a concept (&#8220;solar eclipse&#8221;) and a place (&#8220;Sunnyvale, CA&#8221;, but with an offer to switch to Sunnyvale, TX) and combines the two. The result is a simple answer&#8211;4:52 pm PDT  |  Sunday, May 20, 2012 (3.049 years from now). Hey, that&#8217;s sooner than I thought! Besides the date, there&#8217;s many related facts and a cool map.</p>
<p>This is in contrast to SearchMonkey, which I helped create, in two main areas:</p>
<ol>
<li>Wolfram Alpha uses metadata to produce the result, then renders it through a set of pre-arranged renderers. The response is facts, not web pages.</li>
<li>SearchMonkey focuses on sites providing their own metadata, while Wolfram Alpha focuses on hand-curation.</li>
</ol>
<p>Search engines have been striving to do a better job at fact-queries. Wolfram&#8217;s approach shows that an approach disjoint from finding web pages from an index can be hugely useful.</p>
<p>The engineers working on this have a sense of humor too. The query</p>
<p style="padding-left: 30px;">1.21GW</p>
<p>returns a page that includes the text &#8220;power required to operate the flux capacitor in the DeLorean DMC-12 time machine&#8221; as well as a useful comparison (~ 0.1 x the power of space shuttle at launch).</p>
<p>Yahoo! and Google do various kinds of internal &#8220;query rewriting&#8221;, but usually don&#8217;t let you know other than in the broadest terms (&#8220;did you mean &#8230;&#8221;). Wolfram Alpha shows a diagram of what it understood the query to be. The diagrams make it evident that <em>something like</em> the RDF model is in use, but without peeking under the hood, it&#8217;s hard to say something definitive.</p>
<p>One thing I wonder about is whether Wolfram Alpha creates dynamic (as was a major goal of SearchMonkey) of giving web authors a reason to put more metadata in their sites&#8211;a killer app if you will. It&#8217;s not clear at this early date how much web crawling or site metadata extraction (say RDFa) plays into the curation process.</p>
<p>In any case Wolfram Alpha is something to watch. It&#8217;s set to launch publicly this month. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2009/05/03/playing-with-wolfram-alpha/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Online etymology database</title>
		<link>http://dubinko.info/blog/2008/10/24/online-etymology-database/</link>
		<comments>http://dubinko.info/blog/2008/10/24/online-etymology-database/#comments</comments>
		<pubDate>Sat, 25 Oct 2008 03:55:30 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[languages]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[carboy]]></category>
		<category><![CDATA[etymology]]></category>
		<category><![CDATA[history]]></category>
		<category><![CDATA[langauge]]></category>
		<category><![CDATA[word]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=372</guid>
		<description><![CDATA[I&#8217;ve been playing lately with this site, and it&#8217;s a fantastic resource. The word carboy probably comes from Persian qarabah &#8220;large flagon.&#8221; Who knew? -m]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been playing lately with <a href="http://www.etymonline.com/">this site</a>, and it&#8217;s a fantastic resource. The word carboy probably comes from Persian <span class="foreign">qarabah</span> &#8220;large flagon.&#8221; Who knew? -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2008/10/24/online-etymology-database/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Micahpedia</title>
		<link>http://dubinko.info/blog/2008/05/01/micahpedia/</link>
		<comments>http://dubinko.info/blog/2008/05/01/micahpedia/#comments</comments>
		<pubDate>Thu, 01 May 2008 07:12:50 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[intentional web]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[stuff]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[micahpedia]]></category>
		<category><![CDATA[persona]]></category>
		<category><![CDATA[push button paradise]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[XForms]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=247</guid>
		<description><![CDATA[Today happens to mark the 6th anniversary of my blog. To celebrate going into year seven I&#8217;m refocusing it, including a new name: Micahpedia. Blogging is an important skill, a subset of the overall skill of managing your online persona, so it&#8217;s worth devoting some attention to. The ego-burst doesn&#8217;t hurt either. My concrete goal [...]]]></description>
			<content:encoded><![CDATA[<p>Today happens to mark the 6th anniversary of my blog. To celebrate going into year seven I&#8217;m refocusing it, including a new name: Micahpedia.</p>
<p>Blogging is an important skill, a subset of the overall skill of managing your online persona, so it&#8217;s worth devoting some attention to. The ego-burst doesn&#8217;t hurt either. My concrete goal is to get in the top 10 search results for the query [Micah], though I face some stiff competition including the prophet.</p>
<p>From an SEO perspective, &#8220;Push Button Paradise&#8221; wasn&#8217;t the greatest choice of name. It suffers from the common SEO mistake of being excessively clever and/or cute reflection of what I happened to be working on at the moment, namely <a href="http://xformsinstitute.com/">XForms</a>. If you see the old name standalone, or in a blogroll, or in an RSS reader, you still don&#8217;t have much of an idea what it&#8217;s about or who&#8217;s behind it. True I get pretty good ranking on the exact phrase, but nobody searches for that&#8230;</p>
<p>I will continue SEO tweaks on this site as time goes on and welcome any advice from any of my 7 readers.</p>
<p>In short, Micahpedia is about what I&#8217;m reading, writing, thinking about, and working on. I have plenty to say about these things. :-) The best is yet to come. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2008/05/01/micahpedia/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is there an inverse to the Innovator&#8217;s Dilemma?</title>
		<link>http://dubinko.info/blog/2008/04/27/is-there-an-inverse-to-the-innovators-dilemma/</link>
		<comments>http://dubinko.info/blog/2008/04/27/is-there-an-inverse-to-the-innovators-dilemma/#comments</comments>
		<pubDate>Mon, 28 Apr 2008 05:40:37 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[commercialism]]></category>
		<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[trends]]></category>
		<category><![CDATA[development]]></category>
		<category><![CDATA[dilemma]]></category>
		<category><![CDATA[features]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[products]]></category>
		<category><![CDATA[simplicity]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/?p=190</guid>
		<description><![CDATA[Roughly speaking, the innovator&#8217;s dilemma happens when a product progressively gets more and more advanced features, to the point that it misses out (by listening to customers) on an entire new opportunity. At that point, a simpler, competing product can come into play and make large gains. But what happens when a company is generally [...]]]></description>
			<content:encoded><![CDATA[<p>Roughly speaking, the innovator&#8217;s dilemma happens when a product progressively gets more and more advanced features, to the point that it misses out (by listening to customers) on an entire new opportunity. At that point, a simpler, competing product can come into play and make large gains.</p>
<p>But what happens when a company is generally aware of the Innovator&#8217;s Dilemma and tries to compensate? It seems like second order effects might come into their own. A product widely known for being (and remaining) minimalist is exposed to attacks from deliberate enhancements and related complexificaiton of competitive products. As the market gets more mature, the steadfastly-simple market leader gets left behind. In a sense, it&#8217;s a role reversal from what <a href="http://www.amazon.com/exec/obidos/ASIN/0875845851/dubinkoinfo-20">Clayton Christensen describes</a>. But can it work out the same in the end? Please comment. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2008/04/27/is-there-an-inverse-to-the-innovators-dilemma/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The (lowercase) semantic web goes mainstream</title>
		<link>http://dubinko.info/blog/2008/03/13/the-lowercase-semantic-web-goes-mainstream/</link>
		<comments>http://dubinko.info/blog/2008/03/13/the-lowercase-semantic-web-goes-mainstream/#comments</comments>
		<pubDate>Fri, 14 Mar 2008 02:22:44 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[intentional web]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[microformats]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[trends]]></category>
		<category><![CDATA[web20]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/2008/03/13/the-lowercase-semantic-web-goes-mainstream/</guid>
		<description><![CDATA[So today Yahoo! announced a major facet of what I&#8217;ve been working on lately: making the web more meaningful. Lots of fantastic coverage, including TechCrunch and ReadWriteWeb (and others, please link in the comments), and supportive responses and blog posts across the board. It&#8217;s been a while since I&#8217;ve felt this good about being a [...]]]></description>
			<content:encoded><![CDATA[<p>So today <span class="yshortcuts" id="lw_1205468064_0">Yahoo</span>! <a href="http://www.ysearchblog.com/archives/000527.html" title="aka ">announced</a> a major facet of what I&#8217;ve been working on lately: making the web more meaningful. Lots of fantastic coverage, including <a href="http://www.techcrunch.com/2008/03/13/yahoo-embraces-the-semantic-web-expect-the-web-to-organize-itself-in-a-hurry/">TechCrunch</a> and <a href="http://www.readwriteweb.com/archives/yahoo_supports_semantic_web.php">ReadWriteWeb</a> (and others, please link in the comments), and supportive responses and blog posts across the board. It&#8217;s been a while since I&#8217;ve felt this good about being a Yahoo.</p>
<p>So what exactly is it?</p>
<p>A few months ago I went through the pages on this very blog and <a href="http://dubinko.info/blog/2007/10/05/playing-with-microformats/">added hAtom markup</a>. As a result of this change&#8230;well, nothing happened. I had a good experience learning about exactly what is involved in retrofitting an existing site with microformats, but I didn&#8217;t get any tangible benefit. With the &#8220;SearchMonkey&#8221; platform, any site using microformats, or RDFa or eRDF, is exposed to developers who can enhance search results. An enhanced result won&#8217;t directly make my my site rank higher in search, it it most certainly make it prone to more clicks, and ultimately more readership, more inlinks, and better organic ranking.</p>
<p>How about some questions and answers:</p>
<p>Q: Is this <span class="yshortcuts" id="lw_1205468064_1">Tim Berners-Lee</span>&#8216;s vision of the Semantic Web finally getting fulfilled?</p>
<p>A: No.</p>
<p>Q: Does this presuppose everybody rushing to change their sites to include microformats, RDF, etc?</p>
<p>A: No. After all, there is a developer platform. Naturally, developers will have an easier time with sites that use official and community standards for structuring data, but there is no obligation for any site to make changes in order to participate and benefit.</p>
<p>Q: Why would a site want to expose all its precious data in an easily-extractable way?</p>
<p>A: Because within a healthy ecosystem it results in a measurable increase in traffic and customer satisfaction. Data on the public web is already extractable, given enough eyeballs. An openness strategy pays off (of which SearchMonkey is an existence proof).</p>
<p>Q: What about <a href="http://www.well.com/~doctorow/metacrap.htm" title="A Doctorow classic">metacrap</a>? We can never trust sites to provide honest metadata.</p>
<p>A: The system does have significant spam deterrents built in, of which I won&#8217;t say more. But perhaps more importantly, the plugin nature of the platform uses the power of the community to shape itself. A spammy plugin won&#8217;t get installed by users. A site that mixes in fraudulent RDFa metadata with real content will get exposed as fraudulent, and users will abandon ship.</p>
<p>Q: Didn&#8217;t <a href="http://ask.com/" target="_blank"><span class="yshortcuts" id="lw_1205468064_2">ask.com</span></a> prove that having a better user interface doesn&#8217;t help gain search market share?</p>
<p>A: Perhaps. But this isn&#8217;t about user interface&#8211;it&#8217;s about data (which enables a much better interface.)</p>
<p>Q: Won&#8217;t (Google|Microsoft|some startup) just immediately clone this idea and take advantage of all the new metadata out there?</p>
<p>A: I&#8217;m sure these guys will have some kind of response, and it&#8217;s true that a rising tide lifts all boats. But I don&#8217;t see anyone else cloning this exactly. The way it&#8217;s implemented has a distinctly <span class="yshortcuts" id="lw_1205468064_3">Yahoo</span>! appeal to it. Nobody has cloned <span class="yshortcuts" id="lw_1205468064_4">Yahoo! Answers</span> yet, either. In some ways, this is a return to roots, since Yahoo! started off as a human-guided directory. SearchMonkey is similar, except a much broader group of people can now participate. And there are some specific human, technical and financial reasons why as well, but I suggest inviting me out for beers if you want specifics. :-)</p>
<p>Disclaimer: as always, I&#8217;m not speaking for my employer. See the <a href="http://dubinko.info/blog/about/">standard disclaimer</a>. -m</p>
<p>Update: more Q and A</p>
<p>Q: How is SearchMonkey related to the recently announced <a href="http://www.yr-bcn.es/demos/microsearch/">Yahoo! Microsearch</a>?</p>
<p>A: In brief, Microsearch is a research project (and a very cool one) with far-reaching goals, while SearchMonkey is targeted as imminently shipping software. I frequently talk to and compare notes with Peter Mika, the lead researcher for Microsearch.</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2008/03/13/the-lowercase-semantic-web-goes-mainstream/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo! Announces Open Search Platform</title>
		<link>http://dubinko.info/blog/2008/02/26/yahoo-announces-open-search-platform/</link>
		<comments>http://dubinko.info/blog/2008/02/26/yahoo-announces-open-search-platform/#comments</comments>
		<pubDate>Tue, 26 Feb 2008 07:24:18 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[stuff]]></category>
		<category><![CDATA[trends]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/2008/02/26/yahoo-announces-open-search-platform/</guid>
		<description><![CDATA[As spotted on TechCrunch, full article. This is a game-changer folks. Check out the comments attached to the article. -m]]></description>
			<content:encoded><![CDATA[<p>As spotted on <span class="yshortcuts" id="lw_1204010539_0">TechCrunch</span>, <a href="http://www.techcrunch.com/2008/02/25/yahoo-announces-open-search-platform/">full article</a>. This is a game-changer folks. Check out the comments attached to the article. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2008/02/26/yahoo-announces-open-search-platform/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>XML 2007 buzz: Hadoop</title>
		<link>http://dubinko.info/blog/2007/12/21/xml-2007-buzz-hadoop/</link>
		<comments>http://dubinko.info/blog/2007/12/21/xml-2007-buzz-hadoop/#comments</comments>
		<pubDate>Fri, 21 Dec 2007 17:56:30 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[xml]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/2007/12/21/xml-2007-buzz-hadoop/</guid>
		<description><![CDATA[OK, the majority of the buzz came from my talk, where I strongly encouraged folks to take a look at Hadoop. This article seems to be saying much the same things. If you&#8217;re curious about the future of distributed computation and storage, it&#8217;s worth a look. -m]]></description>
			<content:encoded><![CDATA[<p>OK, the majority of the buzz came from my talk, where I strongly encouraged folks to take a look at Hadoop. This article seems to be <a href="http://www.businessweek.com/magazine/content/07_52/b4064000281756.htm">saying much the same things</a>. If you&#8217;re curious about the future of distributed computation and storage, it&#8217;s worth a look. -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2007/12/21/xml-2007-buzz-hadoop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search On</title>
		<link>http://dubinko.info/blog/2007/06/03/search-on/</link>
		<comments>http://dubinko.info/blog/2007/06/03/search-on/#comments</comments>
		<pubDate>Mon, 04 Jun 2007 04:31:47 +0000</pubDate>
		<dc:creator>mdubinko</dc:creator>
				<category><![CDATA[announcement]]></category>
		<category><![CDATA[everythingismiscellaneous]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://dubinko.info/blog/2007/06/03/search-on/</guid>
		<description><![CDATA[The approximately seven readers of this blog have probably already heard this, but just in case: I have a new role at Yahoo!&#8211;working on next generation search. Lots of details are still falling into place. For now I describe it: &#8220;Imagining, specifying, prototyping, developing, and evangelizing next-generation web search experiences leveraging the full and unique [...]]]></description>
			<content:encoded><![CDATA[<p>The approximately seven readers of this blog have probably already heard this, but just in case: I have a new role at Yahoo!&#8211;working on next generation search.</p>
<p>Lots of details are still falling into place. For now I describe it: &#8220;Imagining, specifying, prototyping, developing, and evangelizing next-generation web search experiences leveraging the full and unique capabilities available within Yahoo!&#8221;</p>
<p>In many ways, this is a logical stepping stone after <a href="http://m.yahoo.com">oneSearch</a>, and I&#8217;ll be dealing with lowercase semantic web issues more now. Expect the focus of this blog to shift accordingly (though I&#8217;m still interested in mobile and will make note of important happenings.)</p>
<p>Search On! -m</p>
]]></content:encoded>
			<wfw:commentRss>http://dubinko.info/blog/2007/06/03/search-on/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

