<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>TheContentGuy &#187; semantic web</title>
	<atom:link href="http://thecontentguy.net/blog/tag/semantic-web/feed/" rel="self" type="application/rss+xml" />
	<link>http://thecontentguy.net</link>
	<description>all things unstructured</description>
	<lastBuildDate>Sat, 05 Mar 2011 06:00:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
		<item>
		<title>Weekly Digest for 2009-12-05</title>
		<link>http://thecontentguy.net/blog/2009/12/06/weekly-digest-for-2009-12-05/</link>
		<comments>http://thecontentguy.net/blog/2009/12/06/weekly-digest-for-2009-12-05/#comments</comments>
		<pubDate>Sun, 06 Dec 2009 14:08:21 +0000</pubDate>
		<dc:creator>paulwlodarczyk</dc:creator>
				<category><![CDATA[Digest]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[ECM]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[SharePoint]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/?p=773</guid>
		<description><![CDATA[This week's twitter digest from TheContentGuy.]]></description>
			<content:encoded><![CDATA[<ul>
<li>[ECM] Bearingpoint on SharePoint governance and risk <a href="http://bit.ly/4xFXdc" target="_blank">http://bit.ly/4xFXdc</a> <a class="twitter-hashtag" href="http://search.twitter.com/search?q=%23ecm" target="_blank">#ecm</a> <a class="twitter-hashtag" href="http://search.twitter.com/search?q=%23bearingpoint" target="_blank">#bearingpoint</a> <a class="twitter-hashtag" href="http://search.twitter.com/search?q=%23sharepoint" target="_blank">#sharepoint</a> RT <a class="twitter-user" href="http://twitter.com/jmancini77" target="_blank">@jmancini77</a></li>
<li>[ECM] If buying cars were like buying ECM (very funny and insightful) <a href="http://bit.ly/6F3Q0N" target="_blank">http://bit.ly/6F3Q0N</a> <a class="twitter-hashtag" href="http://search.twitter.com/search?q=%23ecm" target="_blank">#ecm</a> RT <a class="twitter-user" href="http://twitter.com/ldallasBMOC" target="_blank">@ldallasBMOC</a></li>
<li>[semantic web] Matt McAlister says Socially Linked Data is here today &#8211; you&#8217;re using it right now <a href="http://ow.ly/HtbX" target="_blank">http://ow.ly/HtbX</a></li>
<li>[cloud] AFP: IBM builds Blue Insight BI platform for employees; model for Smart Analytics Cloud offering <a href="http://j.mp/22xuez" target="_blank">http://j.mp/22xuez</a> via <a class="twitter-user" href="http://twitter.com/dcarli" target="_blank">@dcarli</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/12/06/weekly-digest-for-2009-12-05/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Are you on Glue? (Maybe you should be&#8230;)</title>
		<link>http://thecontentguy.net/blog/2009/04/06/are-you-on-glue-maybe-you-should-be/</link>
		<comments>http://thecontentguy.net/blog/2009/04/06/are-you-on-glue-maybe-you-should-be/#comments</comments>
		<pubDate>Mon, 06 Apr 2009 17:17:08 +0000</pubDate>
		<dc:creator>paulwlodarczyk</dc:creator>
				<category><![CDATA[semantic technology]]></category>
		<category><![CDATA[social technology]]></category>
		<category><![CDATA[Glue]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=159</guid>
		<description><![CDATA[Glue is a &#8220;contextual social networking&#8221; browser plug-in from Adaptive Blue.  Glue works automatically as you browse popular sites about books, music, movies, wines, restaurants, gadgets, stocks, actors, TV shows, and other web content.  The Glue bar appears in your browser and lists friends who have browsed the same content and their comments.  This week in Read Write Web, [...]]]></description>
			<content:encoded><![CDATA[<p><a title="Get Glue" href="http://www.getglue.com/" target="_blank">Glue</a> is a &#8220;contextual social networking&#8221; browser plug-in from <a title="Adaptive Blue" href="http://www.adaptiveblue.com/about.php" target="_blank">Adaptive Blue</a>.  Glue works automatically as you browse popular sites about books, music, movies, wines, restaurants, gadgets, stocks, actors, TV shows, and other web content.  The Glue bar appears in your browser and lists friends who have browsed the same content and their comments. </p>
<p>This week in Read Write Web, Phil Glockner writes about a personal test drive of the latest version of Glue with the founder of Adaptive Blue, Alex Iskold. <br />
<span id="more-159"></span><br />
Below is an excerpt about two of the new features on Glue – connected conversations (which transcend sites), and Smart Recommendations.</p>
<blockquote><p><strong>Connected Conversations<br />
</strong><img class="alignright" title="Glue Conversations" src="http://www.readwriteweb.com/images/glue-conversation-apr09.jpg" alt="" width="300" height="325" /><br />
Building on the concept of being able to share thoughts and opinions on things with your friends on Glue, regardless of the site those things are found on, is taken to the logical next step with the addition of conversations. Now, if you see that someone has commented on something that you are looking at, or have an opinion on, you can add a comment to their opinion. In turn they can comment back, or others can join in on the conversation. Through these interactions, you will be exposed to new people who perhaps came to the conversation from a completely different web site, Wikipedia for instance, instead of Amazon, but are using Glue to transcend the social boundaries of these sites</p>
<p><strong>Smart Recommendations</strong></p>
<p>Being a contextual network that uses semantic technology to gather information and trends, Glue now aggregates this data and can present what books, movies and music your friends like the most instantly. Creating this recommendation data is done automatically as people use the Glue application by indicating what they like. The lesson here is, the more you use Glue, the better a resource you become to your friends who also use the service.</p></blockquote>
<p>Read the full article <a title="Read Write Web" href="http://www.readwriteweb.com/archives/glue_gets_stickier_with_conversations_and_recommen.php" target="_blank">here.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/04/06/are-you-on-glue-maybe-you-should-be/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Connecting the dots: How XML authoring enables the Semantic Web</title>
		<link>http://thecontentguy.net/blog/2008/08/15/connecting-the-dots-how-xml-authoring-enables-the-semantic-web/</link>
		<comments>http://thecontentguy.net/blog/2008/08/15/connecting-the-dots-how-xml-authoring-enables-the-semantic-web/#comments</comments>
		<pubDate>Fri, 15 Aug 2008 20:10:17 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[DITA]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[semantic technology]]></category>
		<category><![CDATA[Calais]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[markup]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search Monkey]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[web services]]></category>

		<guid isPermaLink="false">http://paulwlodarczyk.wordpress.com/?p=4</guid>
		<description><![CDATA[What if we start combining semantic web technologies and semantic document technologies?]]></description>
			<content:encoded><![CDATA[<p><a title="New, Improved *Semantic* Web!" href="http://flickr.com/photos/14829735@N00/303503677"><img class="alignleft" style="margin: 2px;" src="http://farm1.static.flickr.com/105/303503677_e83d70118f_m.jpg" alt="" width="193" height="240" /></a>I recently attended the <a title="Linked Data Planet" href="http://www.linkeddataplanet.com/" target="_blank">Linked Data Planet </a>conference where a number of pioneers in the field of Semantic Web shared their perspectives on the state of the art – and business – of helping the world tag their web pages for meaning.  For those of you in the dark about semantic mark-up, it lets authors annotate their web pages with metadata (HTML attributes that don’t get displayed in the document) that describe what those pages are about. <br />
<span id="more-7"></span><br />
So for example, when I say “New York” in an HTML document it&#8217;s ambiguous – do I mean the city, the state, the Yankees, the Mets, the Giants, the Jets, the song, the steak, the state of mind – you get the idea.  Words are ambiguous – except in the context of the language in which they occur.  So if I am writing about a sporting event <strong>you</strong> know from the context of the article that I mean the team, but the typical search engine does not.  To a search engine, New York is just a string that occurs in the document with some frequency. </p>
<p>There are two ways to make sense out of words in a document.  One is semantic analysis (I&#8217;ll leave that topic to another day).  The other is semantic tagging &#8211; adding metadata to a document.<br />
With metadata, I can define things precisely.  I can state that this document is about the sports team, not the steak.  I can do this by tagging the named entities in the document – the people, places, things, events, and facts – in an unambiguous way.  I can also set those entities into relationships with each other.  For example, a piece of text may refer to two companies involved in a merger.  So I can tag the document being about <strong>Company A</strong> (thing number one) and <strong>Company B</strong> (thing number two) involved in a <strong>merger</strong> (an event, but also a relationship between the two named entities). </p>
<p>So semantic tagging adds meaning to documents that goes beyond the text, and it does it in an unambiguous way, which is handy.  But it has traditionally faced two large hurdles: (1) it’s been relatively expensive to add semantic markup (either with investments in labor or technology) and (2) there has been little mass market for consuming this markup.  Both of those hurdles are rapidly falling away. </p>
<p>Let’s address the second point first.  Yahoo has introduced <a title="Yahoo! Search Monkey" href="http://developer.yahoo.com/searchmonkey/" target="_blank">Search Monkey</a> – a new technology that rates web pages not on the keywords and number of links to the page (the “wisdom of crowds”) but on the semantic markup that is embedded in the page (the wisdom of the author).  This creates a substantial motive for adding the markup: Search Engine Optimization.  Semantic markup makes your content more likely to be found and more relevant to the searcher.</p>
<p>Great, so how do you add semantic markup?  For legacy content, you need to use some combination of people and automation to add markup to what you already wrote.  Using people to tag content requires specialized skills that are not in good supply.  Natural language processing technologies for auto-tagging content have been around since the late 90s in lab settings; auto-tagging products are emerging in new and interesting forms in the marketplace today. Thomson-Reuter’s <a title="Thomson-Reuters Calais" href="http://www.opencalais.com/">Calais</a> open source project is a great example.  For a demo <a title="Calais Viewer Demo" href="http://sws.clearforest.com/calaisviewer/" target="_blank">click here</a> and try pasting some <a title="Terms of use" href="http://www.opencalais.com/terms" target="_blank">non-proprietary</a> text that describes what your company does (for example, I tried the “About Our Company” page we used in proposals at JustSystems and it accurately tagged all of the named companies, legal entities, products, technologies, countries, cities, and correctly identified JustSystems’s acquisition of XMetaL from Blast Radius as a business event).</p>
<p>Adding semantic markup to new web content as it is created &#8211; making it available as data &#8211; is the way to go.  But what about other types of unstructured content, like documents, that might be published to the web and other channels?  We’ve been doing this with XML and SGML documents all along, using semantic tags to unambiguously flag specific pieces of text for future discovery.  This has ranged from tagging part numbers in a service manual (which could automate adding hyperlinks or improve search relevance), to tagging financial reports with XBRL to find specific facts within the MD&amp;A or footnotes of an annual report (which could prevent another Enron).  But the important concept here is this: when content is tagged, it can be treated as data</p>
<p>More recent XML standards like <a title="DITA.XML.ORG" href="http://dita.xml.org/" target="_blank">DITA</a> help authors focus on creating granular content – primarily for content reuse.  But our customers are finding that DITA and other topic-oriented XML approaches are helping them break out of the document model – where loads of facts are locked-up within documents.  Think of a lengthy Policies and Procedures manual.  The historical reason it’s all bound in one book is for the convenience of publishing.  Today – with electronic publishing on the web, intranets, and portals – you really only want to publish a single policy or procedure as it is added or revised.  The book itself is obsolete when you can publish a procedure at a time. </p>
<p>In a DITA world, because of its granular nature, a single document (like a Policy manual that was one very large document in your document management system) may instead be managed as a collection of hundreds of DITA topics in your CMS or XML object store.  The document would no longer exist, it becomes a collection of topics, more like records in a database.  To effectively manage large collections of DITA topics, you <strong>need</strong> to specify metadata for each topic – just so that you can find any given topic again.  So a typical DITA project would define the CMS metadata scheme and the taxonomy for classifying the DITA topics.  For those of us in the XML document world, this is old hat.</p>
<p>So all this makes me ask:</p>
<ul>
<li>What if we start combining semantic web technologies and semantic document technologies?</li>
<li>What if we combine technologies that auto-tag named entities with granular authoring approaches like DITA?</li>
<li>What if you could automatically tag named entities within the DITA topic you are creating, tagging as you type? </li>
<li>What if a web service could automatically provide the CMS metadata when you go to check-in a new topic?</li>
<li>What if the publishing tools that transform your DITA to HTML could automatically add the semantic markup to your HTML pages that are published from your DITA content?</li>
<li>How would that change how you publish business documents like policies and procedures to your employees?</li>
<li>How would it change how you create marketing content for your web site?</li>
<li>How would it change the way you create and manage your product technical content?</li>
</ul>
<p>Could the secret to the semantic web be right under our nose?</p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2008/08/15/connecting-the-dots-how-xml-authoring-enables-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

