<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>TheContentGuy &#187; content management</title>
	<atom:link href="http://thecontentguy.net/blog/tag/content-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://thecontentguy.net</link>
	<description>all things unstructured</description>
	<lastBuildDate>Sat, 12 May 2012 05:00:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
		<item>
		<title>Webinar: Content Management and Social Media for the Insurance Industry</title>
		<link>http://thecontentguy.net/blog/2009/09/17/webinar-content-management-and-social-media-for-the-insurance-industry/</link>
		<comments>http://thecontentguy.net/blog/2009/09/17/webinar-content-management-and-social-media-for-the-insurance-industry/#comments</comments>
		<pubDate>Thu, 17 Sep 2009 18:50:38 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ECM]]></category>
		<category><![CDATA[social technology]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[Earley & Associates]]></category>
		<category><![CDATA[insurance]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[social networks]]></category>
		<category><![CDATA[webcast]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=575</guid>
		<description><![CDATA[September 24, 2009 1:00-2:30 p.m. EDT Price: Free John Greene, Managing Director, Guy Carpenter &#38; Company, LLC Seth Earley, President &#38; CEO , Earley &#38; Associates Mike Axelrod, Senior Consultant , Earley &#38; Associates What is the role of Web 2.0 and Social Media in the insurance industry? How do these new technologies and approaches [...]]]></description>
			<content:encoded><![CDATA[<p><strong><img class="alignright size-full wp-image-362" title="Earley &amp; Associates" src="http://thecontentguy.net/wp-content/uploads/2009/06/earleysmall.png" alt="Earley &amp; Associates" width="120" height="126" />September 24, 2009<br />
1:00-2:30 p.m. EDT</strong><br />
Price: <strong>Free</strong></p>
<p><strong>John Greene</strong>, Managing Director, Guy Carpenter &amp; Company, LLC<br />
<strong>Seth Earley</strong>, President &amp; CEO , Earley &amp; Associates<br />
<strong>Mike Axelrod</strong>, Senior Consultant , Earley &amp; Associates</p>
<p>What is the role of Web 2.0 and Social Media in the insurance industry? How do these new technologies and approaches for creating content fit in with more structured content processes?</p>
<p>In this first session we&#8217;ll survey the core processes of insurance operations, show what types of content typically supports such processes, and talk about how some of the newer approaches to collaboration and user generated content fit in to the big picture. Learn how Guy Carpenter &amp; Company, LLC, the world’s leading risk and reinsurance specialist, utilizes social media from John Green, Managing Director in Global Marketing &amp; Communications.</p>
<p>We’ll provide tangible examples of how improvements can provide bottom line benefits and how services can be better performed, products can be more accurately priced, and customer experiences can be improved by well designed content management approaches. We’ll discuss types of challenges in each of these areas and approaches for remediation.</p>
<p>Topics include:</p>
<ul>
<li>Overview of content intensive processes</li>
<li>The use of social media in insurance</li>
<li>How well architected content can improve knowledge flow</li>
<li>Classes of content technology in support of insurance processes</li>
</ul>
<p><a title="Earley.com - free webinar info and registration" href="http://www.earley.com/webinars/jumpstarts/insurance-and-content-management/insurance-processes-and-CM" target="_blank">Visit earley.com for more information or to <strong>register</strong></a></p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/09/17/webinar-content-management-and-social-media-for-the-insurance-industry/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>AIIM Digital Landfill: 50+ ECM Blogs to Follow</title>
		<link>http://thecontentguy.net/blog/2009/09/14/aiim-digital-landfill-50-ecm-blogs-to-follow/</link>
		<comments>http://thecontentguy.net/blog/2009/09/14/aiim-digital-landfill-50-ecm-blogs-to-follow/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 19:37:23 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ECM]]></category>
		<category><![CDATA[social technology]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[ERM]]></category>
		<category><![CDATA[WCM]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=568</guid>
		<description><![CDATA[John Mancini (Mr. &#8220;8 Things&#8221;) put a request out on Twitter last week for people to help him build a list of blogs they follow on ECM, ERM, WCM, and Enterprise 2.0., and published the list today on his blog, Digital Landfill. I like that John used some of my favorite social technology to compile [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" title="8 Things..." src="http://aiim.typepad.com/.a/6a00d834520bef69e201157128ddf7970c-120wi" alt="" width="72" height="99" />John Mancini (Mr. &#8220;8 Things&#8221;) put a request out on Twitter last week for people to help him <a title="AIIM: 50+ ECM Blogs to Follow" href="http://aiim.typepad.com/aiim_blog/2009/09/53-ecm-erm-e20-and-wcm-blogs-to-watch-from-twitter-followers.html" target="_blank">build a list of blogs they follow</a> on ECM, ERM, WCM, and Enterprise 2.0., and published the list today on his blog, <a title="Digital Landfill" href="http://aiim.typepad.com/aiim_blog/" target="_blank">Digital Landfill</a>. I like that John used some of my favorite social technology to compile the list: Twitter, GoogleDocs, and blogs. We&#8217;re on the list, alongside many of our friends.  Thanks John!</p>
<p>You can <a title="Add your favs to the list" href="http://spreadsheets.google.com/ccc?key=0AvALkWyfdej-dFRCQmlCdFlqTktOQlVpV2ZIY1o0SXc&amp;hl=en" target="_blank">add your own favorite blogs to the list</a> on GoogleDocs.</p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/09/14/aiim-digital-landfill-50-ecm-blogs-to-follow/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Weekly Digest for 2009-09-11</title>
		<link>http://thecontentguy.net/blog/2009/09/11/weekly-digest-for-2009-09-11/</link>
		<comments>http://thecontentguy.net/blog/2009/09/11/weekly-digest-for-2009-09-11/#comments</comments>
		<pubDate>Fri, 11 Sep 2009 10:00:00 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Digest]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[DITA]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/2009/09/11/weekly-digest-for-2009-09-11/</guid>
		<description><![CDATA[[twitter] MT @codestripper: MT @Shoq INTRODUCING the &#8220;Modified Tweet (MT)&#8221;: for when it&#8217;s more than a retweet. This rocks! http://j.mp/bhgLF [ECM] Challenge: Explain ECM in &#60;= 1 min: Can you do it? http://j.mp/17rgpc via @jmancini77 #AIIM #ECM #video #contest [blog] Success! thecontentguy.net and blog are now relocated to BlueHost.com. I love it when stuff WORKS. [...]]]></description>
			<content:encoded><![CDATA[<ul class="aktt_tweet_digest">
<li>[twitter] MT @codestripper: MT @<a class="aktt_username" href="http://twitter.com/Shoq">Shoq</a> INTRODUCING the &#8220;Modified Tweet (MT)&#8221;: for when it&#8217;s more than a retweet. This rocks! <a rel="nofollow" href="http://j.mp/bhgLF">http://j.mp/bhgLF</a></li>
<li>[ECM] Challenge: Explain ECM in &lt;= 1 min: Can you do it? <a rel="nofollow" href="http://j.mp/17rgpc">http://j.mp/17rgpc</a> via @<a class="aktt_username" href="http://twitter.com/jmancini77">jmancini77</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23AIIM">AIIM</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23ECM">ECM</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23video">video</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23contest">contest</a></li>
<li>[blog] Success! thecontentguy.net and blog are now relocated to BlueHost.com. I love it when stuff WORKS.</li>
<li>[blog] Moving thecontentguy.net domain to BlueHost.com; bear with us&#8230;.</li>
<li>[twitter] Ping.fm discusses yesterday&#8217;s massive outage: Q: What does it all mean? A: All new users in last 5 days lost! <a rel="nofollow" href="http://j.mp/vGnTJ">http://j.mp/vGnTJ</a></li>
<li>[ECM] @<a class="aktt_username" href="http://twitter.com/amcafee">amcafee</a> speaking at #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23AIIM">AIIM</a> ATM: &#8220;Internet is like the largest library in the world, but all the books are on the floor&#8221; RT @<a class="aktt_username" href="http://twitter.com/jmancini77">jmancini77</a></li>
<li>[site down] host webspaceexchange.com still dark. thecontentguy.net blog, email down 36+ hours. Reading-up on website migration&#8230;</li>
<li>[metadata] The role of metadata, taxonomies, classification, schemas &amp; thesauri <a rel="nofollow" href="http://ur.lc/7eo">http://ur.lc/7eo</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23erm">erm</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23metadata">metadata</a> RT @<a class="aktt_username" href="http://twitter.com/jmancini77">jmancini77</a> @<a class="aktt_username" href="http://twitter.com/vladenache">vladenache</a></li>
<li>[social tech] Infographic: the hierarchy of digital distractions <a rel="nofollow" href="http://ow.ly/ozr0">http://ow.ly/ozr0</a> &#8211; validated the past hour (via @<a class="aktt_username" href="http://twitter.com/coryripley">coryripley</a>) RT @<a class="aktt_username" href="http://twitter.com/swissmiss">swissmiss</a></li>
<li>[DITA] via Scott Abel: (new book out soon) DITA Specialization, by Zarella Rendon <a rel="nofollow" href="http://ping.fm/scw5t">http://ping.fm/scw5t</a> RT @<a class="aktt_username" href="http://twitter.com/scottabel">scottabel</a></li>
<li>[ECM] oldie but goodie via SlideShare: &#8220;Cool Uniforms and Flying Cars&#8221; 2008 AIIM ECM survey results <a rel="nofollow" href="http://j.mp/4b54nR">http://j.mp/4b54nR</a></li>
<li>[hey jude] ReadWriteWeb: Yoko Ono: Beatle&#8217;s Catalogue on iTunes Tomorrow <a rel="nofollow" href="http://bit.ly/CmAyq">http://bit.ly/CmAyq</a></li>
<li>[green] @<a class="aktt_username" href="http://twitter.com/dcarli">dcarli</a> when does the math favor train vs truck freight? <a class="aktt_tweet_reply" href="http://twitter.com/dcarli/statuses/3850638768">in reply to dcarli</a></li>
<li>[bad hosting] OK webspaceexchange.com, we know you&#8217;re in Maui, and it IS Mercury Retrograde, but we&#8217;re going on a 20+ hr outage, what gives?</li>
<li>[WCM] AIIM Digital Landfill: 8 Reasons Why You Blame Your Web CMS Vendor, But Shouldn’t <a rel="nofollow" href="http://j.mp/12keIo">http://j.mp/12keIo</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23metadata">metadata</a> #<a class="aktt_hashtag" href="http://search.twitter.com/search?q=%23wcm">wcm</a></li>
<li>[blogging] RT @siftonpark: Great talk on SEO by @<a class="aktt_username" href="http://twitter.com/mattcutts">mattcutts</a> for bloggers and webmasters &#8211; <a rel="nofollow" href="http://cli.gs/15hUXD">http://cli.gs/15hUXD</a></li>
<li>[tech horoscope] Why is everything crashing today? It&#8217;s Mercury Retrograde! Started yesterday, of course! <a rel="nofollow" href="http://j.mp/oUBV">http://j.mp/oUBV</a></li>
<li>[twitter] trying pixelpipe.com as a relay service since Ping.fm is still down for the count. Hope this gets thru!</li>
<li>[MAJOR bummer] now ping.fm is DOWN and not forwarding ANYTHING. Geez, no website, no email, no tweets &#8211; what next?</li>
<li>[robot apocalypse] via Engadget: this is how it starts: robots using aps. Here a French Nao is using MSFT Surface(TM) <a rel="nofollow" href="http://ping.fm/H4uUx">http://ping.fm/H4uUx</a></li>
<li>[robotics] via Engadget: Man builds master-slave robot, but which is the master? <a rel="nofollow" href="http://ping.fm/yTURj">http://ping.fm/yTURj</a></li>
<li>[major bummer] 12 hours later and my hosting company webspaceexchange.com is STILL crashed&#8230; not a fan anymore, next time I get an SLA</li>
<li>[science] via Space.com: New theory on origins of life relies on more UV: &#8220;Zinc and Zap&#8221; <a rel="nofollow" href="http://ping.fm/c2yXU">http://ping.fm/c2yXU</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/09/11/weekly-digest-for-2009-09-11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Taxonomy CoP Webinar: Metadata Maturity Survey Findings</title>
		<link>http://thecontentguy.net/blog/2009/07/10/taxonomy-cop-webinar-metadata-maturity-survey-findings/</link>
		<comments>http://thecontentguy.net/blog/2009/07/10/taxonomy-cop-webinar-metadata-maturity-survey-findings/#comments</comments>
		<pubDate>Fri, 10 Jul 2009 16:47:59 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ECM]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[Earley & Associates]]></category>
		<category><![CDATA[maturity]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[webcast]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=410</guid>
		<description><![CDATA[Paul Wlodarczyk, Earley &#38; Associates Ron Daniel, Taxonomy Strategies LLC Date: Wednesday, July 15, 2009 Time: 1:00 &#8211; 2:00 Eastern Time Cost: $50 (Survey respondents need not register and will receive an invitation to this call at no cost) Register here. Four years ago, a benchmark survey was conducted to understand how organizations understood and [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-362" title="Earley &amp; Associates" src="http://thecontentguy.net/wp-content/uploads/2009/06/earleysmall.png" alt="Earley &amp; Associates" width="120" height="126" /><strong>Paul Wlodarczyk, Earley &amp; Associates<br />
Ron Daniel, Taxonomy Strategies LLC</strong></p>
<p><strong>Date: Wednesday, July 15, 2009<br />
Time: 1:00 &#8211; 2:00 Eastern Time<br />
Cost: $50</strong> (Survey respondents need not register and will receive an invitation to this call at no cost)</p>
<p><a title="Register: Taxonomy CoP Webinar" href="http://www.earley.com/webinars/metadata-maturity-survey-findings" target="_blank">Register here.</a></p>
<p><span id="more-410"></span>Four years ago, a benchmark survey was conducted to understand how organizations understood and applied metadata to content assets and what business benefits they were realizing. Earley &amp; Associates along with Taxonomy Strategies, launched a project to update this research. The goal of the study was to measure industry progress on a Metadata Maturity Model scale, a road map designed to help practitioners get started &#8211; and continuously improve &#8211; content tagging within organizations.</p>
<p>Preliminary results have shown that organizations that are more mature in metadata and taxonomy best practices outperform less mature organizations, with more mature organizations reporting findability and content management problems 10-15% less often. In addition to presenting results around maturity levels, this call will benchmark current search, taxonomy, and metadata practices against the 2005 survey and report on some surprising key findings around information access and best practices adoption.</p>
<p><a title="Register: Taxonomy CoP Webinar" href="http://www.earley.com/webinars/metadata-maturity-survey-findings" target="_blank">Register here.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/07/10/taxonomy-cop-webinar-metadata-maturity-survey-findings/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Turn Tagging into Cash: Take the Metadata Best Practices Survey</title>
		<link>http://thecontentguy.net/blog/2009/05/26/how-to-turn-tagging-into-cash-take-the-metadata-best-practices-survey/</link>
		<comments>http://thecontentguy.net/blog/2009/05/26/how-to-turn-tagging-into-cash-take-the-metadata-best-practices-survey/#comments</comments>
		<pubDate>Tue, 26 May 2009 15:07:21 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[DITA]]></category>
		<category><![CDATA[ECM]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[semantic technology]]></category>
		<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[Earley & Associates]]></category>
		<category><![CDATA[markup]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[semantic search]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=310</guid>
		<description><![CDATA[We tag stuff to add meaning, and so that we and others – especially information systems – can find it.  But is your approach to tagging business content effective?  Find out - take the Metadata Best Practices Benchmarking Survey from Earley &#038; Associates and Taxonomy Strategies.]]></description>
			<content:encoded><![CDATA[<p>If you couldn’t tell by now, one of my particular interests is tagging, a.k.a. content classification, a.k.a. metadata.  We tag stuff to add meaning, and so that we and others – especially information systems – can find it.  But is your approach to tagging business content effective?  Find out &#8211; take the <strong><a title="Metadata Best Practices Benchmarking Survey" href="http://www.surveymonkey.com/s.aspx?sm=TEtPrAKwkiKIXhkey6revA_3d_3d" target="_blank">Metadata Best Practices Benchmarking Survey</a></strong> from Earley &amp; Associates and Taxonomy Strategies.</p>
<p><strong><span style="color: black; font-size: 14pt;  mso-bidi-font-size: 11.0pt; "><a title="Metadata Best Practices Benchmarking Survey" href="http://www.surveymonkey.com/s.aspx?sm=TEtPrAKwkiKIXhkey6revA_3d_3d" target="_blank"><span style="color: blue;"><span style="font-family: Calibri;">Take the Survey</span></span></a></span></strong></p>
<p><span id="more-310"></span>Depending upon context, “tagging” can mean one of three different things: tagging a document, tagging within a document, or tagging a content object.</p>
<p style="PADDING-LEFT: 30px"><strong>Tagging documents.</strong>  These days most of us think of tagging as the keywords we put on our documents – like our photos and websites – so that others can find them when they search.  User tags are fine for finding photos in flickr, but for tagging to be effective in business we need to make it systematic, so that we avoid ambiguity and improve search recall and relevance.  So we’re increasingly “mature” in our approaches to tagging: We use taxonomy to organize our terms into classes and to manage the relationships between terms.  We develop thesauri and foreign language equivalents.  We integrate taxonomies and thesauri into search indexes for ECM and site search and SEO.</p>
<p style="PADDING-LEFT: 30px"><strong>Tagging within a document.</strong>  I got interested in tagging in the early days of XML (back when we spelled it &#8220;S-G-M-L&#8221;), when we were tagging within documents.  By tagging unstructured content inside documents we could do really sophisticated things – not just multi-channel output.  For example, knowing that a paragraph in a document was a step in a service procedure or that a string of gibberish was a part number let us bring life to that content when we transformed it from markup into an interactive electronic technical manual.  <strong>Tagging let us turn books into diagnostic software.</strong></p>
<p style="PADDING-LEFT: 30px"><strong>Tagging reusable content objects.</strong> As content reuse matured with standards like DITA, organizations had more reusable components, with more people creating them in more departments.  Tagging reusable content objects became essential to actually reusing them – if you couldn’t find it, you’d never reuse it.  If you had a single service manual with 100 procedures, now you have at least 100 reusable content objects, so the search scope increased by two orders of magnitude.  At IBM, colleagues report having over a million DITA topics in more than six repositories, with over a dozen departments sharing content across thousands of publications.  <strong>Searching for content objects is like trying to find a needle in a haystack, except you’re trying to find the right needle, and you have more and smaller needles to search amongst, in more and increasingly bigger haystacks.</strong></p>
<p><strong>Measuring Metadata Maturity.</strong>  Each type of tagging can have measurable benefits on your business.  Five years ago, <a title="Earley &amp; Associates" href="www.earley.com" target="_blank">Earley &amp; Associates</a> and <a title="Taxonomy Strategies" href="www.taxonomystrategies.com" target="_blank">Taxonomy Strategies</a> developed a survey to understand metadata maturity for various types of businesses.  Earley is conducting an updated survey to see how organizations have moved up the learning curve.  Since we have a baseline of responses from five years ago, we’ll be able to describe how metadata and taxonomy practices have matured over time.  Also, the original survey was focused on the impact of metadata best practices on knowledge management and e-commerce search.  We now recognize that metadata is also used by technical communicators – especially those that use XML and other technologies to create, manage, and multichannel publish reusable content.  We want to hear from you all for the first time.</p>
<p>The survey is pretty detailed, so you might want to grab your favorite caffeinated beverage before you dig in.  As compensation for your time (about 15 minutes) Earley &amp; Associates is offering these nifty incentives:</p>
<ul type="disc">
<li style="line-height: 14.25pt; margin: 0in 0in 10pt; color: black; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1; tab-stops: list .5in"><strong>A free pass to any future Earley &amp; Associates Community of Practice conference call</strong> (a $50 value).  These are monthly, and the next one is Wednesday June 2<sup>nd</sup> on <a title="Taxonomy Community of Practice - June 2009" href="http://www.earley.com/_June2009.asp" target="_blank">Taxonomy for Portals</a> featuring Giovanni Piazza, Chief Knowledge Officer of Ernst &amp; Young, and Ralph Poole of Earley &amp; Associates.</li>
<li style="line-height: 14.25pt; margin: 0in 0in 10pt; color: black; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1; tab-stops: list .5in;"><strong>A $200 discount on registration to the <a title="Henry Stewart Digital Asset Management Conference" href="http://www.damusers.com/" target="_blank">Henry Stewart conference</a></strong> on digital asset management, June 1-2 in NYC.  Seth Earley will be there presenting preliminary results.</li>
<li style="line-height: 14.25pt; margin: 0in 0in 10pt; color: black; mso-margin-top-alt: auto; mso-margin-bottom-alt: auto; mso-list: l0 level1 lfo1; tab-stops: list .5in;"><strong>Free participation</strong> in a webcast reviewing the results of the survey (date TBA).</li>
</ul>
<p class="MsoNormal" style="line-height: 14.25pt; margin: 0in 0in 0pt;"><strong><span style="color: black; font-size: 14pt; mso-fareast-font-family: 'Times New Roman'; mso-bidi-font-family: 'Times New Roman'; mso-bidi-font-size: 11.0pt; mso-ascii-font-family: Calibri; mso-hansi-font-family: Calibri;"><a title="Metadata Best Practices Benchmarking Survey" href="http://www.surveymonkey.com/s.aspx?sm=TEtPrAKwkiKIXhkey6revA_3d_3d" target="_blank"><span style="color: blue;"><span style="font-family: Calibri;">Take the Survey</span></span></a></span></strong></p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/05/26/how-to-turn-tagging-into-cash-take-the-metadata-best-practices-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>UIMA &#8211; First Standard for Accessing Unstructured Information</title>
		<link>http://thecontentguy.net/blog/2009/04/07/uima-first-standard-for-accessing-unstructured-information/</link>
		<comments>http://thecontentguy.net/blog/2009/04/07/uima-first-standard-for-accessing-unstructured-information/#comments</comments>
		<pubDate>Tue, 07 Apr 2009 16:14:24 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ECM]]></category>
		<category><![CDATA[semantic technology]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[OASIS]]></category>
		<category><![CDATA[unstructured content]]></category>

		<guid isPermaLink="false">http://thecontentguy.net/blog/?p=184</guid>
		<description><![CDATA[I posted earlier today at Not Otherwise Categorized&#8230; about the announcement last month of OASIS&#8217;s new standard for the Unstructured Information Management Architecture, Version 1.0.  Read all about it here:  OASIS Approves UIMA &#8211; the first standard for accessing Unstructured Information]]></description>
			<content:encoded><![CDATA[<p>I posted earlier today at <a title="Earley &amp; Associates blog" href="http://sethearley.wordpress.com" target="_blank">Not Otherwise Categorized&#8230;</a> about the announcement last month of OASIS&#8217;s new standard for the Unstructured Information Management Architecture, Version 1.0.  Read all about it here:  <a title="OASIS Approves UIMA - the first standard for accessing Unstructured Information" rel="bookmark" href="http://sethearley.wordpress.com/2009/04/07/oasis-approves-uima-the-first-standard-for-accessing-unstructured-information/" target="_blank"><span style="color: #105cb6;">OASIS Approves UIMA &#8211; the first standard for accessing Unstructured Information</span></a></p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2009/04/07/uima-first-standard-for-accessing-unstructured-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Automating CMS metadata &#8211; could that work?  How?</title>
		<link>http://thecontentguy.net/blog/2008/08/19/automating-cms-metadata-could-that-work-how/</link>
		<comments>http://thecontentguy.net/blog/2008/08/19/automating-cms-metadata-could-that-work-how/#comments</comments>
		<pubDate>Tue, 19 Aug 2008 20:40:21 +0000</pubDate>
		<dc:creator>paulwlodarczyk</dc:creator>
				<category><![CDATA[DITA]]></category>
		<category><![CDATA[ECM]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[semantic technology]]></category>
		<category><![CDATA[auto-tagging]]></category>
		<category><![CDATA[categorization technology]]></category>
		<category><![CDATA[content classification]]></category>
		<category><![CDATA[content lifecycle]]></category>
		<category><![CDATA[content management]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[named entity analysis]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[taxonomy]]></category>
		<category><![CDATA[web services]]></category>

		<guid isPermaLink="false">http://paulwlodarczyk.wordpress.com/?p=16</guid>
		<description><![CDATA[In a previous post I asked the question, "What if a web service could automatically provide the CMS metadata when you go to check-in a new topic?"  In this post I'll discuss why you would want to do that, some of the candidate technologies, and what is necessary to make it real.]]></description>
			<content:encoded><![CDATA[<p>In a previous post I asked the question, &#8220;What if a web service could automatically provide the CMS metadata when you go to check-in a new topic?&#8221;  In this post I&#8217;ll discuss why you would want to do that, some of the candidate technologies, and what is necessary to make it real.<br />
<span id="more-13"></span><br />
<strong><img class="alignleft" title="Paper Tag" src="http://thecontentguy.net/wp-content/uploads/2008/08/tag-image.jpg" alt="" width="170" height="251" />Check-in, metadata, and taxonomies.</strong> Anyone who&#8217;s worked with a content or document management system knows this scenario:  You&#8217;re going to check-in newly authored content, and a dialog box comes up asking you to enter some keywords to describe the content.  This is metadata – data about your data. It’s important because if you fill it in properly, other people (and you, too) can find your content. If you leave it blank, then other users will need to rely on a full-text search of some machine indexing to find your content.</p>
<p>Many organizations have a formal system for classifying content called a taxonomy. Think of it like the naming of the sections in the yellow pages directory – it provides consistent category names. This avoids the problem I call “the Yellow Pages Problem,” where some people call those guys who represent you in court “lawyers” while other people call them “attorneys at law” (or worse things). When an organization uses a taxonomy, everyone uses consistent category names – that is, if they actually use it.</p>
<p><strong>Compliance blues.</strong> Taxonomies can be configured into the CMS, so that category names are able to be selected on the check-in dialog box. While that saves the author guesswork of remembering category names and avoids mistyping, it still requires an author to take action – and to get it right. This is a point of failure of many ECM initiatives: authors either fail to classify content at check-in time, or they accept the default settings, or the author applies the wrong category (or even too few categories when the content really crosses genres).</p>
<p>The problem is worse when the author isn’t a fulltime writer, but instead a business contributor who’s creating content as they serve their role in some business process. In these cases the author lacks the time, talent, or motivation to tag the content with the appropriate metadata. They may not see it as part of their job.</p>
<p><strong>Cure for the blues?</strong> So can this process be automated? Absolutely. Technologies have existed for some years now to analyze unstructured content. Algorithms involve some combination of statistical, linguistic, and structural analysis.</p>
<ul>
<li><em>Statistical methods</em> look at the document as a “bag of words” – words or phrases that occur more frequently, or that are “improbable” statistically are more important. Amazon uses SIPs – Statistically Improbable Phrases – to pull keywords out of books. This is purely statistical – the system doesn’t know what the words mean, just that they are “odd” so probably meaningful.</li>
<li><em>Linguistic methods</em> actually analyze the natural language in the document. If you know what the subject, verb, and object are in a sentence, then you know what it is about. Linguistic methods have gotten better with improvements in algorithms and increases in computing power.</li>
<li><em>Structural methods</em> leverage underlying markup in documents, like XML structural tags or even styling or text flow (e.g. recognizing terms in headers).</li>
</ul>
<p>These methods not only provide automated metadata tagging (document categorization), they can also determine what type of document is being analyzed (document classification). They can also be used to identify Named Entities – named people, places, things, and events. It’s one thing to say this document is a Legal Brief (document type or class). It’s another to say that Legal Brief is about Patent Infringement (a category). It’s another thing still to say that it’s a case between Palm and Xerox (named companies) about handwriting recognition (a named technology). Named entities can be extracted and listed in metadata. They can also be tagged in-line in an XML document (this is often called &#8220;auto-tagging&#8221; &#8211; a post for another day).</p>
<p>Named entities are not addressed by taxomonies, rather by lists or directories of named entities.  A number of these named entity directories are available as web services. Several are kept evergreen by using Wikipedia to drive the ever growing list of named entities.</p>
<p><strong>Making it real.</strong> So given this technology, how do you implement such a system?  My preferred method is to customize the authoring environment so that the “Save” dialog box in the editor of choice presents the ECM system’s check-in dialog.  This way the author does not take extra steps to check content in.</p>
<p>Also at check-in time, in the background, the customized editor performs a temporary save to the local file system, and automatically sends a copy of the document to a categorizer web service. This is a content categorizer application running on a server.  That categorizer service would apply the organization’s standard taxonomy to the document, using some classification algorithm to define one or more categories for the document. The results can be applied in either of two ways:</p>
<ul>
<li>Classify the document automatically with no user intervention. This can be done completely in the background with no user interface, even as part of an automated check-in workflow.</li>
<li>Classify the document automatically and have the user verify the results. This requires exposing the proposed metadata tags in the check-in dialog.</li>
</ul>
<p>Categorizers often provide some scoring of the certainty of a given tag; this score can be used to make the call about whether the automatic tag is applied, or whether it needs (or allows) end user verification or editing. Business requirements determine what the best approach or best combination is.</p>
<p><strong>What are the barriers?</strong> The reason this technique isn’t used more often is the integration required between the authoring tools, the ECM solution, and the categorization technology. In today’s market these technologies are typically provided by independent software vendors, who have few incentives for bundling tightly integrated solutions (and wish to remain “vendor neutral” with their own technology). As the ECM marketplace continues to consolidate vertically we may see some content lifecycle vendors with more complete solutions (watch IBM and EMC). Services firms specializing in unstructured content and ECM can be one source for prepackaged solutions that combine these ECM, authoring tools, and content classification into a seamless user experience – which is the key to success in deploying an automated solution.</p>
<p>At the end of the day, consideration of the needs and behavior of content authors and contributors (who are very often change-averse) is the most important step in adoption of a content lifecycle solution. Making content classification and categorization a “no brainer” through automation and a seamless user experience improves the likelihood of success.</p>
]]></content:encoded>
			<wfw:commentRss>http://thecontentguy.net/blog/2008/08/19/automating-cms-metadata-could-that-work-how/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

