<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Fern Halper's data makes the world go 'round</title>
	<atom:link href="http://fbhalper.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://fbhalper.wordpress.com</link>
	<description>the big wide world of data and information management and analysis</description>
	<pubDate>Tue, 08 Jul 2008 15:11:38 +0000</pubDate>
	<generator>http://wordpress.org/?v=MU</generator>
	<language>en</language>
			<item>
		<title>Syndicating Text Analytics</title>
		<link>http://fbhalper.wordpress.com/2008/07/08/syndicating-text-analytics/</link>
		<comments>http://fbhalper.wordpress.com/2008/07/08/syndicating-text-analytics/#comments</comments>
		<pubDate>Tue, 08 Jul 2008 13:38:59 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Syndicated Services]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[Add new tag]]></category>

		<category><![CDATA[Elsivier]]></category>

		<category><![CDATA[Expert System]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[Hurwitz &amp; Associates]]></category>

		<category><![CDATA[Netbase]]></category>

		<category><![CDATA[semantics]]></category>

		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=57</guid>
		<description><![CDATA[Over the past several weeks, I’ve been briefed by a number of text analytics vendors and companies in partnership with text analytics vendors about syndicated services that make use of text analytics. Of course, syndicated services such as brand monitoring and news services that make use this technology to some degree have been around for [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p class="MsoNormal"><span style="font-family:Arial;">Over the past several weeks, I’ve been briefed by a number of text analytics vendors and companies in partnership with text analytics vendors about syndicated services that make use of text analytics.<span> </span>Of course, syndicated services such as brand monitoring and news services that make use this technology to some degree have been around for a while.<span> </span>But, how about some of the newer services?</span></p>
<p class="MsoNormal"><span style="font-family:Arial;"><strong> illumin8</strong></span></p>
<p class="MsoNormal"><span style="font-family:Arial;">An interesting example of this is <a href="http://www.illumin8.com">illumin8</a>, which is being offered by Elsevier, in partnership with <a href="http://www.netbase.com">Netbase</a>.<span> </span>The service is targeted at R&amp;D knowledge workers looking to solve technical and business problems.<span> </span>According to Elsevier, knowledge workers spend more time per week trying to discover relevant content relating to a particular problem area than analyzing that information (5.5 hours/week accessing vs. 4.7 hours/week analyzing).<span> </span>These workers are usually using a google-like search engine.<span> </span>I think everyone can agree that the google-like search engine is not ideal for research purposes, so I won’t belabor the point here.<span> </span>In the case of the R&amp;D knowledge worker, often one goal is to gather information relating to a particular problem, finding products that solve that problem, as well as understanding the approach used to solve the problem.<span> </span></span></p>
<p class="MsoNormal"><span style="font-family:Arial;">Elsevier has aggregated 5 billion business sources, 3 million full text articles, 33 million scientific records, and 21 million patents as the source of information for this service.<span> </span>Using the Netbase semantic index, Elsevier crawls through the information and extracts solutions that solve a problem and the approaches used to address a specific issue.<span> </span>In this way, R&amp;D can help answer the following questions:</span></p>
<ul style="margin-top:0;" type="disc">
<li class="MsoNormal"><span style="font-family:Arial;">Solutions that exist to solve a problem</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">New applications and processes that might exist to help solve a problem</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Information about what competitors are doing in the particular problem space</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">What the experts are saying about a particular problem area</span></li>
</ul>
<p class="MsoNormal"><span style="font-family:Arial;">Here is a screen shot of what an end-user might see using this service.<span> </span>In this example, the user is interested in solving the problem of fuel efficiency in boats.<span> </span>He or she wants to see what products and approaches are out on the market to address this problem and what companies are providing these solutions.<span> </span>The user enters the topic (boats) and the benefit (fuel efficiency) in the search box and gets back information that is organized in a logical way. In this example, you can see that query returns information about products that address the problem as well as the companies that make the products, organizations that deal with energy, as well approaches to solving the problem (drag, stroke, etc).<span> </span>These are ranked.<span> </span>Users can then drill down on any of these areas to get snippets (and full text) associated with areas that he/she is interested in analyzing.</span></p>
<p class="MsoNormal"><a href="http://fbhalper.files.wordpress.com/2008/07/image00119.jpg"><img class="alignnone size-full wp-image-60" src="http://fbhalper.files.wordpress.com/2008/07/image00119.jpg?w=544&h=311" alt="" width="544" height="311" /></a></p>
<p class="MsoNormal"><span style="font-family:Arial;">During the demo, I asked to see what would happen if we input “text analytics” as the problem space in the search box.<span> </span>I was actually impressed that what was returned was a good set of information about the players, organizations dealing with text analytics and other information about it.<span> </span>The service is not inexpensive, but it does cull a lot of information.</span></p>
<p><strong>Syndicated Services</strong></p>
<p>I believe that the number of syndicated services using text analytics will continue to grow. We&#8217;re certainly seeing action in the brand monitoring space on this front. Vendors are also getting into the act. <a href="http://www.expertsystem.net">Expert System</a>, for example, has its own service that is targeted at the auto industry. I believe that other vendors may get into the act if they determine that the financial benefits of offering syndicated services (as opposed to SaaS offerings) makes sense.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/57/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/57/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/57/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/57/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/57/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/57/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/57/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/57/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/57/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/57/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/57/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/57/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=57&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/07/08/syndicating-text-analytics/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2008/07/image00119.jpg" medium="image" />
	</item>
		<item>
		<title>My two cents on the 2008 Text Analytics Summit</title>
		<link>http://fbhalper.wordpress.com/2008/06/20/my-two-cents-on-the-2008-text-analytics-summit/</link>
		<comments>http://fbhalper.wordpress.com/2008/06/20/my-two-cents-on-the-2008-text-analytics-summit/#comments</comments>
		<pubDate>Fri, 20 Jun 2008 22:51:11 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[Content Management]]></category>

		<category><![CDATA[ECM]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[Seth Grimes]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=56</guid>
		<description><![CDATA[I add my thoughts to others who have blogged on the Summit.  See the Seth Grimes posting, for all of the other comments.

I too agree that it was great to see a large number of end users at the Summit, this year.  I was especially interested in the fact that a number of [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I add my thoughts to others who have blogged on the Summit.  See <a href="http://www.intelligententerprise.com/blog/archives/2008/06/news_surprises.html">the Seth Grimes posting,</a> for all of the other comments.</p>
<ul>
<li>I too agree that it was great to see a large number of end users at the Summit, this year.  I was especially interested in the fact that a number of them were in the investigation phase.  And, what were many investigating?  You got it - Voice of the Customer and the closely intertwined area of sentiment analysis.</li>
<li>VoC was a major theme.  In fact,  I was overwhelmed by the number of talks in the area.  I thought that the presentation about what Gaylord Hotels is doing with text analytics and VoC was extremely interesting.   Tony Bodoh took the audience through a journey beginning with the fact that before text analytics it would take the company weeks to even see customer comments.  He said that the working pilot the company did in conjunction with Clarabridge took only 10 weeks. He reported benefits in process improvements, value-oriented marketing, and facility improvements.  He even told us about reticular activating systems! Go look that one up.</li>
<li>I also met a number of people from new start-ups in the sentiment space, each taking a slightly different angle. I wonder if sentiment analysis will become a confusing space in the near future.</li>
<li>There was some discussion at the Summit about text analytics and Web 2.0.  I would have like to hear more about this, as text analytics will be important in Web 2.0</li>
<li>And speaking of important, another interesting concept was brought up several times - the idea that text analytics will morph to become part of something bigger.   I don&#8217;t want to say component, although others were.</li>
<li>I was hoping to hear more about text analytics and content management.  At one point, during the expert panel, I had the chance to ask the audience if anyone was deploying their text analytics in conjunction with their content management systems.  A handful responded affirmatively.  Unfortunately, I didn&#8217;t catch up with them.  If you&#8217;re reading this and are deploying text analytics with your ECM system, I&#8217;d like to hear from you.</li>
</ul>
<p>All in all though, I was impressed with the Summit.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/56/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/56/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/56/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/56/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/56/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/56/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/56/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/56/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/56/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/56/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/56/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/56/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=56&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/06/20/my-two-cents-on-the-2008-text-analytics-summit/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>Text Analytics and the Predictive Enterprise</title>
		<link>http://fbhalper.wordpress.com/2008/06/12/text-analytics-and-the-predictive-enterprise/</link>
		<comments>http://fbhalper.wordpress.com/2008/06/12/text-analytics-and-the-predictive-enterprise/#comments</comments>
		<pubDate>Thu, 12 Jun 2008 12:42:52 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[Hurwitz &amp; Associates]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[predictive modeling]]></category>

		<category><![CDATA[SPSS]]></category>

		<category><![CDATA[churn]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=55</guid>
		<description><![CDATA[I recently had the chance to get an update on what SPSS is up to in text analytics.  It was an interesting conversation for several reasons:
 

First, it highlighted an important point about text analytics – which we know but is worth repeating – which is that the analysis of unstructured data can be more useful, [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">I recently had the chance to get an update on what <a href="http://www.spss.com">SPSS</a> is up to in text analytics.<span>  </span>It was an interesting conversation for several reasons:</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<ul style="margin-top:0;" type="disc">
<li class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;">First, it highlighted an important point about text analytics – which we know but is worth repeating – which is that the analysis of unstructured data can be more useful, in many scenarios, when accompanied by structured data.</span></span></li>
<li class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;">Second, it got me thinking more about social media/network analysis, which prompted the question on the recent “<a href="http://fbhalper.wordpress.com">four questions about innovations in analysis</a><span style="text-decoration:underline;">”</span> blog I recently posted. </span></span></li>
</ul>
<p class="MsoNormal" style="margin:0 0 0 0.25in;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">A few words of background.<span>   </span>SPSS’s goal is help its customers analyze everything about data associated with people – behavior, attitudes, and so on to help an organization understand anyone it interacts with.<span>  </span>In fact, Olivier Jouve, VP of Corporate Development at SPSS was quite clear that SPSS is not a BI company.<span>  </span>Rather, SPSS software helps to enable what SPSS refers to as the “Predictive Enterprise”.<span>   </span>The Predictive Enterprise makes use of analytics (not simply reports) to help manage multiple dimensions across the enterprise including customer intimacy, product placement, and even operational issues such as fraud.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoBodyText" style="margin:0;"><span style="font-size:small;"><span class="A4"><span style="font-family:Arial;">SPSS offers a suite of text-mining products that is based on 25 years of research in the application of natural language processing (NLP) technologies. In 2002, SPSS bought LexiQuest™, a linguistics-based text-mining company, intending to combine LexiQuest’s extraction capabilities with SPSS’s data-mining capabilities in order to strengthen the company’s position in predictive analytics. All of SPSS’s text-analytics products now share this same core linguistic functionality. </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">It’s not just about text</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="color:#000000;font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoBodyText" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">While the market for text analytics has moved out of the early adopter stage, depending on what type of analysis you’re trying to accomplish, it often is not just about the text.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="display:none;color:#000000;font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="color:#000000;font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="display:none;color:#000000;font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">For example, consider the following churn scenario:<span>  </span>A telecommunications company is concerned about churn.<span>  </span>The company realizes that it has a wealth of information at its disposal to help predict churn.<span>  </span>On the structured data side it has collected demographic information, usage information, trouble ticket, and product information about each of its customers.<span>  </span>On the unstructured side, it also has collected call center notes, emails, and customer satisfaction surveys.<span>  </span>The company decides to invest in text analytics software that can sift through its call center notes, emails, and survey notes. At the end of the exercise, the company has some great insight into customer complaints that it can certainly act on.<span>  </span>However, it has not exactly gotten the information it might need to solve the churn problem.<span>  </span>In order to do this, it is probably more useful to marry the unstructured information from the call centers and surveys and emails to an actual customer and all of the structured information about that customer.<span>  </span>This way, using some predictive modeling the company can train its system to zero in on those customers that are likely to drop its service and make the right decisions to help retain them.<span>   </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoBodyText" style="margin:0;"><span style="font-size:small;"><span class="A4"><span style="font-family:Arial;">According to SPSS many of its customers have seen upwards of a 50% reduction in churn by combining data mining with text mining. </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Social media is becoming an important source of information for companies</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">What about other forms of media such as blogs, message threads, etc.?<span>  </span>SPSS is also moving into social network/media analysis because as Olivier said, “The number of people participating in Web 2.0 activities is growing rapidly across all age groups, and businesses are using the direct influence they have traditionally had over customers’ decisions about their products.<span>  </span>Peer to Peer networks are now a trusted source of insight and information.”<span>   </span>This is quite true.<span>  </span>Our recent Hurwitz &amp; Associates survey confirmed that companies do plan to make use of the information found in various kinds of social networks, even if they don’t think they are making use of text analytics.<span>  </span>One interesting point on this front is that blogs, message boards, etc. do provide a great source of information of customer sentiment, opinions, etc. The challenge will be mapping this kind of information back to the other information that a company keeps about its customers, and making sense of the behaviors.<span>  </span>I’ll look forward to hearing more about what SPSS is doing to help solve this problem.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/55/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/55/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/55/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/55/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/55/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/55/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/55/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/55/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/55/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/55/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/55/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/55/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=55&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/06/12/text-analytics-and-the-predictive-enterprise/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>Four Questions about Innovations in Analysis</title>
		<link>http://fbhalper.wordpress.com/2008/06/12/four-questions-about-innovations-in-analysis/</link>
		<comments>http://fbhalper.wordpress.com/2008/06/12/four-questions-about-innovations-in-analysis/#comments</comments>
		<pubDate>Thu, 12 Jun 2008 12:29:03 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[BI innovation]]></category>

		<category><![CDATA[Business Intelligence]]></category>

		<category><![CDATA[Data visualization]]></category>

		<category><![CDATA[Information Management]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[Uncategorized]]></category>

		<category><![CDATA[Visual Analytics]]></category>

		<category><![CDATA[data analysis]]></category>

		<category><![CDATA[BI]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[Hurwitz &amp; Assoicates]]></category>

		<category><![CDATA[innovations]]></category>

		<category><![CDATA[performance management]]></category>

		<category><![CDATA[predictive modeling]]></category>

		<category><![CDATA[survey]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=47</guid>
		<description><![CDATA[Several weeks ago, Hurwitz &#38; Associates deployed a short survey entitled, “Four questions about innovations in analysis”.  Well, the results and they are quite interesting!
 
THE SURVEY
 
First, a few words about the survey itself and who responded to the survey. 
 

We wanted to make the survey short and sweet.  We were interested in what  kinds of [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"><a href="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog2.jpg"></a><a href="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog1.jpg"></a>Several weeks ago, Hurwitz &amp; Associates deployed a short survey entitled, “Four questions about innovations in analysis”.<span>  </span>Well, the results and they are quite interesting!</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"><strong>THE SURVEY</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">First, a few words about the survey itself and who responded to the survey. </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<ol style="margin-top:0;" type="1">
<li class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;">We wanted to make the survey short and sweet.<span>  </span>We were interested in what<span>  </span>kinds of analytical technology companies thought were important and specifically how companies were using text analytics to analyze unstructured information.<span>  </span>Finally, since there has been a lot of buzz about analyzing social media we asked about this, as well</span></span></li>
<li class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;">Let me say up front that given the nature of our list, I would categorize <em>most</em> of the respondents to the survey as fairly technology savvy.<span>  </span>In all,<span>  </span>61 people responded to the survey, 32% of these respondents were from high technology companies.<span>  </span>The verticals included professional services, followed by manufacturing, financial/insurance, healthcare and pharmaceutical. There were also some responses from governmental agencies, telecommunications and energy companies.<span>  </span>So, while the results are unscientific in terms of a random sample across all companies, they probably do reflect the intentions of potential early adopters, although not in a statistically significant manner. </span></span></li>
<li class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;">In analyzing the results, I first looked at the overall picture and then examined individual verticals as well as filtered the results by other attributes (such as those using text analytics vs. those not using the technology) to get a feel for what these companies were thinking about and whether one group was different from another.<span>  </span>These subgroups are of course, quite small and the results should be viewed accordingly.</span></span></li>
</ol>
<p class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;"><strong><span style="font-family:Arial;">THE RESULTS</span></strong></span></span></p>
<p class="MsoNormal">
<div class="MsoNormal"><span style="font-family:Arial;"><span style="font-size:small;"><span style="font-family:Arial;"><strong>The importance of innovative technologies</strong></span></span></span></div>
<div></div>
<div><span style="font-family:Arial;"></span></div>
<p><span style="font-family:Arial;"><span style="font-size:small;"></p>
<p class="MsoNormal" style="margin:0;"><strong><span style="font-family:Arial;"> </span></strong><span style="font-family:Arial;">We first asked all of the respondents to rate a number of technologies in terms of importance to their companies.<span>  </span>Figure 1 shows the results.<span>  </span>Overall, most of these technologies were at least somewhat important to this technology savvy group, with query and reporting leading the pack.<span>  </span>This isn’t surprising.<span>  </span>Interestingly, OLAP data cubes appeared to be the least important analytical technology – at least with this group of respondents.<span>  </span>Other technologies, such as performance management, predictive modeling, and visualization ranked fairly high, as well.<span>  </span>Again not surprisingly, text analytics ranked lower than some of the other technologies probably since it is just moving out of the early adopter stage.<span>  </span>Some of the respondents, from smaller firms, had no idea what any of these technologies were.<span>  </span>And, in terms of text analytics, one company commented, ” yeekes, this must be big time company kind of stuff. Way up in the clouds here, come down to earth.” They, no doubt, are still using Excel and Access for their analytical needs.<span>  </span>Other smaller companies were very interested in “non-cube” technologies such as some of the visualization products on the market today.</span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> <a href="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog.jpg"></a></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> <a href="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog3.jpg"><img class="aligncenter size-full wp-image-52" src="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog3.jpg?w=544&h=408" alt="" width="544" height="408" /></a></span></p>
<p class="MsoNormal">
<p class="MsoNormal" style="margin:0;"> <span id="more-47"></span><!--more--></p>
<div class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"><span style="font-family:Arial;">Those respondents whose companies consider text analytics or other innovative analytical technologies such as predictive modeling or even performance management to be very important appear to value analytic technologies, in general. The respondents stated that most of the technologies listed were very important.<span>   </span>For example, when I examined those companies that viewed text analytics as very important, the average rating for all of the technologies listed above was a good percentage point higher than those that stated that it wasn’t at all important. Granted, the groups were small, but the pattern was clear. </span></span></span></div>
<div><span style="font-family:Arial;"></span></div>
<p><span style="font-family:Arial;"><span style="font-size:small;"></p>
<p class="MsoNormal">
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;">What does this mean?<span>  </span>I would speculate that it means that companies that have found value in perhaps one or more analytical technologies are more apt to try out others.<span>  </span>Success breeds success.<span>  </span>We’ve seen this trend in the text analytics space.<span>  </span>When a company deploys the technology and finds success, other parts of the organization are quick to jump on board.<span>  </span>These technology savvy companies no doubt are gaining benefits from these technologies and are willing to invest in them because they provide significant value to the company.<span>  </span>Those that either haven’t invested in the technologies, or are caught up in one problem –say getting their data into a warehouse- haven’t had the opportunity to experience the benefits of these other technologies, for budgetary or other reasons and so they are not that important to their companies right now.</span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"><strong>Text Analytics</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;">Question 2 asked which statement best describes your company’s use of text analytics?<span>  </span>The results here indicate a surprisingly high percent – 32.7%- actually using the technology today.<span>  </span>Again, the reason behind this is no doubt due to the fact that the companies polled are probably very technology savvy.<span>  </span>I believe this is significantly higher than the overall adoption rate of text analytics, which Hurwitz &amp; Associates estimated at 10-15% of large enterprises in 2007. On the other hand, almost 45% either didn’t know what the technology was or didn’t plan to use it in the next 12-18 months.<span>  </span>It is important to note that these results were industry independent.</span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> <a href="http://fbhalper.files.wordpress.com/2008/06/figure-2-for-blog2.jpg"><img class="aligncenter size-full wp-image-54" src="http://fbhalper.files.wordpress.com/2008/06/figure-2-for-blog2.jpg?w=544&h=408" alt="" width="544" height="408" /></a></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;">We asked respondents that were using text analytics or planning to use it. what they were planning to use it for.<span>  </span>An important point is that the majority of respondents were planning to use the technology in more than one application area.<span>  </span>Not surprisingly, Voice of the Customer and Competitive Intelligence/Brand Intelligence led the space with over 60% of the respondents cited these two areas.<span>  </span>Next up on the application front was e-Discovery, an area that is starting to gain a lot of attention.<span>  </span></span></p>
<p class="MsoNormal">
<p class="MsoNormal">
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span><span style="font-family:Arial;">Let’s slice and dice this a bit&#8230;.</span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span><span style="font-family:Arial;">First up, those respondents deploying text analytics.<span>  </span>These respondents came from a range of industries.<span>  </span>In this group, companies are using text analytics for (in order of importance)<span>  </span>Competitive/Brand intelligence, Voice of the Customer, e-discovery, quality and early warning and fraud.<span>  </span>The respondents also cited other areas their companies are using text analytics for including spend analysis, analyzing company metrics, product lifecycle management, and mining business rules. Analysis of social media was very important to this group.<span>  </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;">Those who were planning to deploy text analytics in the next 12-18 months also viewed query and reporting as most important to their company.<span>  </span>Interestingly, the top application this group is considering is Voice of the Customer followed by competitive intelligence/brand image.<span>   </span>This tracks with the primary research study we performed last year, indicating that Voice of the Customer should continue to be a hot area for text analytics implementations for at least the next year or so. Analysis of social networks and media was of high interest to this group.</span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;">Even those who did not have plans for text analytics were still interested in social networks – indicating that companies are starting to think about using unstructured information to gain insight about customers.<span>  </span>These companies may be considering brand monitoring companies to serve its needs, or perhaps haven’t even gotten that far in the thought process, yet. </span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"><strong>The Upshot?</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<ul style="margin-top:0;" type="disc">
<li class="MsoNormal"><span style="font-family:Arial;">Broadly speaking, companies that believe that innovative analysis technologies such as predictive modeling, text analytics, and even performance management are very important, generally rated all analytical technologies as having a higher value than those companies that did not believe they were very important.<span>  </span></span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Text analytics continues to grow in importance, at least with this group of respondents.<span>  </span>Companies deploying the technology today are doing so across several application areas.<span>  </span>Companies that plan to deploy the technology are also looking at several application areas.<span>  </span>Voice of the Customer will continue to be a hot area for the foreseeable future.<span>  </span>E-discovery is also making noise.</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Companies across the board expressed interest in making use of social media and social networks.</span></li>
</ul>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"> </span></p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<div></div>
<p><span style="font-family:Arial;"></p>
<p class="MsoNormal"> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p></span></p>
<p class="MsoNormal"> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p> </p>
<p></span></span></p>
<p></span></span></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/47/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/47/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/47/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=47&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/06/12/four-questions-about-innovations-in-analysis/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2008/06/figure-1-for-blog3.jpg" medium="image" />

		<media:content url="http://fbhalper.files.wordpress.com/2008/06/figure-2-for-blog2.jpg" medium="image" />
	</item>
		<item>
		<title>Four questions about BI Innovation</title>
		<link>http://fbhalper.wordpress.com/2008/05/22/four-questions-about-bi-innovation/</link>
		<comments>http://fbhalper.wordpress.com/2008/05/22/four-questions-about-bi-innovation/#comments</comments>
		<pubDate>Thu, 22 May 2008 21:03:20 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[BI innovation]]></category>

		<category><![CDATA[Business Intelligence]]></category>

		<category><![CDATA[Data visualization]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[data analysis]]></category>

		<category><![CDATA[BI]]></category>

		<category><![CDATA[social networks]]></category>

		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=46</guid>
		<description><![CDATA[

As many of you know, I have been spending a great deal of time researching the area of innovations in BI. Yesterday, I posted a short four question survey regarding how companies might be using some of the analysis technologies that are out in the market today. I&#8217;m starting to get some interesting responses!





I&#8217;d be [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><div>
<div>
<div style="font-size:12pt;font-family:'Times New Roman',serif;margin:0;"><span style="font-size:10pt;color:#000000;font-family:Arial,sans-serif;">As many of you know, I have been spending a great deal of time researching the area of innovations in BI. Yesterday, I posted a short four question survey regarding how companies might be using some of the analysis technologies that are out in the market today. I&#8217;m starting to get some interesting responses!<br />
</span></div>
</div>
</div>
<div>
<div>
<div style="font-size:12pt;font-family:'Times New Roman',serif;margin:0;"><span style="font-size:10pt;color:#000000;font-family:Arial,sans-serif;">I&#8217;d be interested in hearing your thoughts.</span></div>
</div>
</div>
<div>
<div>
<div style="font-size:12pt;font-family:'Times New Roman',serif;margin:0;"><span style="font-size:10pt;color:#000000;font-family:Arial,sans-serif;">Simply click on the link below and answer a few questions. It should take no more than 30 seconds.</span></div>
<div style="font-size:12pt;font-family:'Times New Roman',serif;margin:0;"><a href="http://http://survey.constantcontact.com/survey/a07e2atkh65fggv88jg/start"></a></div>
<div style="font-size:12pt;font-family:'Times New Roman',serif;margin:0;"><span style="font-size:x-small;"><a href="http://survey.constantcontact.com/survey/a07e2atkh65fggv88jg/a013mfgjmy4vn/questions">http://survey.constantcontact.com/survey/a07e2atkh65fggv88jg/a013mfgjmy4vn/questions</a><a href="http://http://survey.constantcontact.com/survey/a07e2atkh65fggv88jg/start"></a>  </p>
<p></span></div>
</div>
</div>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/46/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/46/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/46/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/46/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/46/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/46/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/46/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/46/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/46/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/46/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/46/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/46/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=46&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/05/22/four-questions-about-bi-innovation/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>Customer Experience Intelligence and Text Analytics</title>
		<link>http://fbhalper.wordpress.com/2008/05/15/customer-experience-intelligence-and-text-analytics/</link>
		<comments>http://fbhalper.wordpress.com/2008/05/15/customer-experience-intelligence-and-text-analytics/#comments</comments>
		<pubDate>Thu, 15 May 2008 15:50:41 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[BI innovation]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=44</guid>
		<description><![CDATA[Several months ago, I wrote a piece about Voice of the Customer (VoC) and how text analytics companies are getting good traction from this application.  I recently got an update from Sid Banerjee, CEO of Clarabridge that further supports this observation.  Clarabridge, an independent text analytics vendor is garnering much success with what it is [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Several months ago, I wrote a piece about Voice of the Customer (VoC) and how text analytics companies are getting good traction from this application.<span>  </span>I recently got an update from Sid Banerjee, CEO of <a href="http://www.clarabridge.com">Clarabridge </a>that further supports this observation.<span>  </span>Clarabridge, an independent text analytics vendor is garnering much success with what it is referring to as Customer Experience Intelligence (CEI).<span>  </span>This is really another term for a Voice of the Customer with an emphasis on analyzing the experience – from the customer’s point of view.</span></span></p>
<p class="MsoNormal" style="margin:0;"><strong><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></strong></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">The Chief Customer Officer wants to know</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">It goes without saying that customer experience matters to companies. Companies want to know what their customers are thinking about their products and services.<span>  </span>New positions, such as the Chief Customer Officer have even been developed at some larger companies to address this issue.<span>  </span>The traditional method of manually reading through surveys, emails, etc to answer questions about customer experience is time consuming and not effective.<span>  </span>I remember reading through this kind of information myself to try to determine why customers were dropping a service and I’ll be the first to say that the process was not pretty.<span>  </span>Text analytics enables companies to sift through this massive amount of information much more efficiently and effectively and derive valuable insight from it.<span>   </span>Clarabridge is getting good traction from groups concerned with enterprise feedback management, CRM including call centers, tech support and the Chief Customer officer her/himself, to help analyze structured and unstructured information to understand customer experience.<span>  </span>These types of deployments have helped fuel the company’s rapid growth.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Clarabridge Navigator</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">While text analytics technology helps companies gain valuable insight about their customers, companies deploying the technology have faced challenges, regardless of the vendor they have chosen to work with.<span>  </span>Our Hurwitz &amp; Associates research last year highlighted a number of challenges including taxonomy and rule development.<span>  </span>End-users complained that taxonomy development was often a very frustrating experience.<span>   </span>End-users also stated that rule development was often difficult and time consuming.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Earlier this year, Clarabridge released Clarabridge Navigator.<span>  </span>Navigator is an interface for business users to help them build taxonomies and categories more effectively.<span>   </span>The interface enables end-users to iteratively explore categories that they build.<span>  </span>The example Sid used in our discussion utilized hotel related data.<span>  </span>Suppose I’ve created a category (in Navigator) called bed comfort.<span>  </span>I’ve keyed off of words such as comfort and bed and, using Navigator, I can see the kinds of text that gets returned off of these words.<span>  </span>Perhaps when I am exploring this I see that people are talking about other words such as queen and king.<span>  </span>I can then incorporate the concept of bed size into my categorization.<span>  </span>In this way the end-user can interactively and iteratively explore text data and develop a better categorization scheme.<span>  </span>Navigator provides the same functionality for sentiment.<span>  </span>The Clarabridge product comes with over ten thousand words that are sentiment related.<span>  </span>However, if the user wants to explore the data and change the defaults, it can be done via Navigator.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">If the categories are correct, this makes rule development more efficient.<span>  </span>Instead of running the data through the rules and hoping it makes sense, this gives end-users a more effective way to develop rules.<span>  </span>This doesn’t negate the need for people involvement in the taxonomy and rule development process, but it is one step in making the process easier – and enabling text analytics to become more mainstream.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/44/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/44/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/44/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/44/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/44/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/44/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/44/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/44/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/44/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/44/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/44/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/44/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=44&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/05/15/customer-experience-intelligence-and-text-analytics/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>What&#8217;s Next For Text Analytics?</title>
		<link>http://fbhalper.wordpress.com/2008/05/07/whats-next-for-text-analytics/</link>
		<comments>http://fbhalper.wordpress.com/2008/05/07/whats-next-for-text-analytics/#comments</comments>
		<pubDate>Wed, 07 May 2008 16:50:21 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[data analysis]]></category>

		<category><![CDATA[data mining]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[Hurwitz &amp; Associates]]></category>

		<category><![CDATA[research study]]></category>

		<category><![CDATA[text]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[unstructured data]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=42</guid>
		<description><![CDATA[  

 





When I first started to research the text analytics industry about three years ago, people didn’t understand why I was so interested in this nascent marketplace.  I had always anticipated how important it would be.  I remember working at Bell Laboratories, in the 1980s, analyzing structured data using advanced data mining techniques, trying to predict [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><div><span style="font-size:small;"><span style="font-family:Times New Roman;">  </p>
<p></span></span></div>
<div><span style="font-size:small;"><span style="font-family:Times New Roman;"> </p>
<p></span></span></div>
<div><span style="font-size:small;"><span style="font-family:Times New Roman;"></p>
<div></div>
<p></span></span></div>
<p><span style="font-size:small;"><span style="font-family:Times New Roman;"><span></p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"><span style="font-family:Arial;">When I first started to research the text analytics industry about three years ago, people didn’t understand why I was so interested in this nascent marketplace.<span>  </span>I had always anticipated how important it would be.<span>  </span>I remember working at Bell Laboratories, in the 1980s, analyzing structured data using advanced data mining techniques, trying to predict customer actions.<span>  </span>I knew then, that this structured data was only a piece of the puzzle and that unstructured data needed to be part of the mix.<span>  </span>Since that time technology and computing power have obviously come a long way and text analytics has become an increasingly mature market.</span></p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"> </p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"><span style="font-family:Arial;"> </span><span style="font-family:Arial;">Over the past several years, I’ve gotten to know many of the vendors in the space, talked to many customers deploying the technology, and in 2007 published a benchmark study entitled, “Text Analytics: The Road to Understanding Your Company’s Unstructured Data.” Three key findings from our research have been borne out in the market:</span></p>
<p class="MsoNormal" style="text-indent:-0.25in;line-height:150%;margin:0 0 0 1in;"><span style="font-family:Symbol;">·<span style="font:7pt;">        </span></span><span style="font-family:Arial;">Voice of the Customer is a key application area.<span>  </span>Our research indicated that end-users placed a high priority on customer focused applications, with75% of respondents planning to deploy customer care related solutions.<span>   </span>Today, many of the vendors in this space have gained great traction in this area.</span></p>
<p class="MsoNormal" style="text-indent:-0.25in;line-height:150%;margin:0 0 0 1in;"><span style="font-family:Symbol;">·<span style="font:7pt;">        </span></span><span style="font-family:Arial;">Text analytics moves beyond business intelligence.<span>  </span>While early adopters of the technology deployed text analytics together with data mining and BI, and this is still a very popular means of deploying the technology, our study indicated that end-users were interested in deploying text analytics in conjunction with both BI and content management.<span>  </span>Since then, I’ve seen more evidence in the market for this kind of deployment. </span></p>
<p class="MsoNormal" style="text-indent:-0.25in;line-height:150%;margin:0 0 0 1in;"><span style="font-family:Symbol;">·<span style="font:7pt;">        </span></span><span style="font-family:Arial;">The market will continue to consolidate. At the time the report was published, ClearForest and Inxight had recently been acquired. I predicted that additional market consolidation would take place; and it has. </span></p>
<p class="MsoNormal" style="line-height:150%;margin:0 0 0 0.5in;"><span style="font-family:Arial;"> </span></p>
<p class="MsoNormal" style="line-height:150%;margin:0;"><span style="font-family:Arial;">Since the study was published in August, 2007, and due to the factors mentioned above, the market has reached the next level of maturity.<span>  </span>For these reasons, I believe it is time to provide another benchmark.<span>  </span>I am looking for a few sponsors who would like to partner with our firm in bringing valuable market research to the end-user community.<span>  </span>If you are interested, please contact Carol Caliendo at <a href="mailto:carol.caliendo@hurwitz.com">carol.caliendo@hurwitz.com</a> or 617-454-1030.</span></p>
<p class="MsoNormal" style="line-height:150%;margin:0;"><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span></p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"> </p>
<div></div>
<p></span></span><span style="font-size:small;"></p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"> </p>
<p> </p>
<p> </p>
<p> </p>
<p></span></span></p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"> </p>
<p class="MsoBodyTextIndent" style="line-height:150%;margin:0;"> </p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/42/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/42/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/42/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/42/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/42/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/42/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/42/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/42/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=42&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/05/07/whats-next-for-text-analytics/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>A different way to search?</title>
		<link>http://fbhalper.wordpress.com/2008/04/25/a-different-way-to-search/</link>
		<comments>http://fbhalper.wordpress.com/2008/04/25/a-different-way-to-search/#comments</comments>
		<pubDate>Fri, 25 Apr 2008 12:34:49 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Business Intelligence]]></category>

		<category><![CDATA[Content Management]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[classification]]></category>

		<category><![CDATA[search]]></category>

		<category><![CDATA[Attensity]]></category>

		<category><![CDATA[Brainware]]></category>

		<category><![CDATA[Fash]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[GlobalBrain]]></category>

		<category><![CDATA[Google]]></category>

		<category><![CDATA[Hurwitz &amp; Associates]]></category>

		<category><![CDATA[neural networks]]></category>

		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=41</guid>
		<description><![CDATA[I recently had an interesting conversation about classification and search with James Zubok, CFO of Brainware, Inc. Brainware is a Virginia based company that was once part of SER Systems AG, a former German ECM company.  Brainware provides products that help companies extract information from unstructured and semi-structured documents, such as invoices, order forms, contracts, [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">I recently had an interesting conversation about classification and search with James Zubok, CFO of <a href="http://www.brainware.com">Brainware, Inc</a>. Brainware is a Virginia based company that was once part of SER Systems AG, a former German ECM company.<span>  </span>Brainware provides products that help companies extract information from unstructured and semi-structured documents, such as invoices, order forms, contracts, etc. without using templates.<span>  </span>The company also offers some interesting classification and search technology and this is what our conversation focused on. </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">We discussed two different, but interrelated technologies that Brainware has developed; one a search engine based on n-grams and another,<span>  </span>a classification engine that uses neural networks. Brainware offers both enterprise and desktop editions of each.<span>  </span>I received a demo of the desktop version of the products and now have both running on my laptop.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">A Search example</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">On the desktop search side, the product, called Globalbrain Personal Edition, differs from many other search products on the market in that it does not make use of keyword search.<span>  </span>Rather, it searches are natural language based, using a patented n-gram approach.<span>  </span>When indexing a word, the word is parsed into three parts and then a vector is created.<span>  </span>For example, the word sample would be parsed as sam, amp, mpl, etc.<span>  </span>According to Brainware, this three-letter snippet approach makes the search engine language independent.<span>   </span>The capability provided by Brainware lets users search, not simply on key words, but on whole paragraphs. For example, I have many documents (in various formats) on my desktop that deal with all of the companies I speak with.<span>  </span>Say, I want to find some documents relating to specific challenges companies faced in deploying their text analytics solutions.<span>  </span>Rather than simply inputting “text analytics” and “challenges”, I can type in a phrase or even a paragraph with the wording I’m looking for.<span>  </span>This returns a much more targeted set of documents.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">A Classification example</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">On the desktop classification front, the product is very easy to use.<span>  </span>I simply loaded the software which provided me a user interface where I could develop classes and then train my system to automatically classify documents based on a few training examples. As I mentioned, I have many documents on my desktop that deal with various technology areas and I might want to classify them in an intelligent manner for some research I’m planning.<span>  </span>So, I created several classes: text analytics, visualization, and MDM. I simply created these classes and then dragged documents that I felt fell into each category onto those classes.<span>  </span>I trained the system on these examples.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Brainware provides a visual interface that lets me view how “good” the learned set is via a series of points in three-dimensional space.<span>  </span>The closer together the points (from the same class) are on the plot, the better the classification will be.<span>  </span>Also, the more separate the various class points are, the better the classification.<span>  </span>In my classification test, the visualization and the MDM documents were tightly clustered, while the text analytics information was not.<span>  </span>In any event, I then ran the classifier over the rest of my documents (supplying a few parameters) and the system automatically classified what it could.<span>  </span>It also gave me a list of documents that it couldn’t classify, but suggested the appropriate categories. I could then just drag those documents into the appropriate categories and run the classifier again. I should add that it did a good job of suggesting the right class for the documents it put in the unclassified category.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<h1 style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Brainware on an enterprise level</span></span></h1>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">The enterprise edition of the product combines the search and classification capabilities and lets users search and classify over 400 different document types.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">Now, Brainware isn’t planning to compete with Google, Yahoo!, Fast, etc.<span>  </span>Rather, the company sees its search as a complement to these inverted index approaches.<span>  </span>The idea would be to embed its search into other applications that deal with archiving, document management, or e-discovery, to name a few.<span>  </span>The classification piece could also be embedded into the appropriate applications.<span>  </span>I asked if the company was in discussions with content management providers and service providers that store emails and documents.<span>  </span>It would seem to me that this software would be a natural complement to some of these systems. My understanding is that the company is looking for partnerships in the area.<span>  </span>Brainware currently has a partnership with Attensity, a text analytics provider, to help classify and search documents as part of the text analytics process. </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;">I’m interested to see what will develop with this company.</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family:Arial;"><span style="font-size:small;"> </span></span></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/41/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/41/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/41/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/41/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/41/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/41/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/41/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/41/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/41/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/41/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/41/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/41/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=41&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/04/25/a-different-way-to-search/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>SAS Purchases Teragram</title>
		<link>http://fbhalper.wordpress.com/2008/03/20/sas-purchases-teragram/</link>
		<comments>http://fbhalper.wordpress.com/2008/03/20/sas-purchases-teragram/#comments</comments>
		<pubDate>Thu, 20 Mar 2008 21:55:03 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[BI innovation]]></category>

		<category><![CDATA[Text Analytics]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[Business Objects]]></category>

		<category><![CDATA[Fern Halper]]></category>

		<category><![CDATA[Hurwitz &amp; Associates]]></category>

		<category><![CDATA[Inxight]]></category>

		<category><![CDATA[Natural Language Processing]]></category>

		<category><![CDATA[NLP]]></category>

		<category><![CDATA[SAS]]></category>

		<category><![CDATA[Teragram]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=40</guid>
		<description><![CDATA[Monday, SAS announced that it had purchased Teragram, a privately held natural language processing (NLP) company, for an undisclosed sum.  Teragram is now a SAS company meaning that the Teragram brand will be maintained.  Its solutions and OEM business will be retained. 
A good move for SAS
This acquisition is a good move for SAS for a [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><span style="font-family:Arial;">Monday, <a href="http://www.sas.com">SAS </a>announced that it had purchased <a href="http://www.teragram.com">Teragram</a>, a privately held natural language processing (NLP) company, for an undisclosed sum.<span>  </span>Teragram is now a SAS company meaning that the Teragram brand will be maintained.<span>  </span>Its solutions and OEM business will be retained.</span><b><span style="font-family:Arial;"> </span></b></p>
<p><b><span style="font-family:Arial;"></span></b><b><span style="font-family:Arial;">A good move for SAS</span></b></p>
<p><span style="font-family:Arial;">This acquisition is a good move for SAS for a number of reasons.<span>  </span>First, SAS had partnered with Inxight to supply text analytics software components for its text mining solution and, of course, Inxight has been acquired by Business Objects (and subsequently SAP).<span>  </span>It was just a matter of time before SAS would have to replace these capabilities and Teragram is a logical choice because of its NLP technology.<span>   </span>More importantly, in my discussion with SAS and Teragram about the acquisition, it was clear that the purchase is more than just a move to replace technology components.<span>  </span>The purchase is actually quite strategic in nature.<span>  </span>Teragram technology can be used to enhance rather than simply replace existing capabilities. </span><b><span style="font-family:Arial;"> </span></b></p>
<p><b><span style="font-family:Arial;"></span></b><b><span style="font-family:Arial;">Structured and Unstructured Data are Not Separate Domains</span></b></p>
<p><span class="A4"><span><span><font face="Times New Roman"> </font></span></span></span><span style="font-family:Arial;">SAS’s strategy is to use both structured and unstructured data in analysis and to integrate it for descriptive and predictive modeling.<span>  </span>The company’s aim is to provide users with a seamless deployment of predictive model results and improve the consistency and accuracy of enterprise intelligence.</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">The folks at SAS believe that structured and unstructured data have typically been viewed as two separate silos that can be joined together and then analyzed. Traditionally, unstructured data is the realm content management systems; structured data is the realm of BI.<span>  </span>SAS believes that both of these data sources should be brought together earlier in the analysis process and utilized as a joint asset.<span>  </span>Unstructured information can be categorized and indexed and even extracted and integrated more intelligently.<span>  </span>This approach makes a lot of sense to me and Teragram has technology to help make this happen.</span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">While a primary focus of the acquisition is to more seamlessly integrate structured and unstructured data into BI, SAS also mentioned a few other exciting concepts during our discussion.</span></p>
<ul type="disc" style="margin-top:0;">
<li class="MsoNormal"><span style="font-family:Arial;">SAS hopes to use Teragram’s NLP capabilities to make BI more pervasive.<span>  </span>For example, by combining “</span><span style="font-family:Arial;">SAS business intelligence, data integration and advanced analytics with Teragram’s NLP technologies to deliver answers to search queries in seconds”.<span>  </span>This appears to be an extension of the Teragram Direct Answers solution.</span><span style="font-family:Arial;"></span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Mobile BI</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Real time alerting</span></li>
</ul>
<p><span class="A4"><span style="font-size:12pt;font-family:Arial;"> </span></span><span style="font-family:Arial;">And we shouldn’t forget that Teragram will also provide SAS with a much needed search capability.<span>  </span>I’m definitely looking forward to hearing more about integration and roll-out plans in the near future.</span></p>
<p><font face="Times New Roman"> </font><font face="Times New Roman"> </font></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/40/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/40/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/40/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=40&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/03/20/sas-purchases-teragram/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>Database Activity Monitoring</title>
		<link>http://fbhalper.wordpress.com/2008/03/17/database-activity-monitoring/</link>
		<comments>http://fbhalper.wordpress.com/2008/03/17/database-activity-monitoring/#comments</comments>
		<pubDate>Mon, 17 Mar 2008 00:21:24 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
		
		<category><![CDATA[Database Activity Monitoring]]></category>

		<category><![CDATA[activity monitoring]]></category>

		<category><![CDATA[Database]]></category>

		<category><![CDATA[Microsoft]]></category>

		<category><![CDATA[Oracle]]></category>

		<category><![CDATA[Sentrigo; Hurwitz &amp; Associates]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=36</guid>
		<description><![CDATA[Data makes the world go ‘round and a lot of this critical data resides in databases.  This is why it is important to monitor database activity in order to prevent attacks and invasions.  
I recently had an interesting discussion with Rani Osnat, VP of Marketing at Sentrigo Software about its database monitoring product called Hedgehog [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><font face="Times New Roman"><span style="font-family:Arial;">Data makes the world go ‘round and a lot of this critical data resides in databases.<span>  </span>This is why it is important to monitor database activity in order to prevent attacks and invasions.<span>  </span></span></font></p>
<p><font face="Times New Roman"><span style="font-family:Arial;">I recently had an interesting discussion with Rani Osnat, VP of Marketing at <a href="http://www.sentrigo.com">Sentrigo </a>Software about its database monitoring product called Hedgehog and its recently announced 2.0 release.<span>  </span>For those of you who aren’t familiar with Sentrigo, it is an Israeli/US based company, founded in 2006, that focuses on real time database activity monitoring and intrusion prevention.<span>  </span>The company released its first product, Hedgehog 1.0 in 2007.</span><span style="font-family:Arial;"> </span></font><font face="Times New Roman"> </font><font face="Times New Roman"></font><font face="Times New Roman"></font><font face="Times New Roman"></font><font face="Times New Roman"></p>
<h1><span style="font-family:Arial;"><font size="3">Here’s how it works</font></span></h1>
<p><span style="font-family:Arial;"> </span><span style="font-family:Arial;">Hedgehog samples and monitors transactions using data that in the database cache. It uses the cache to minimize performance issues.<span>  </span>The software utilizes rules that act on certain parameters such as time of day, day of week, IP addresses, etc.<span>  </span>All of the parameters can be combined in Boolean conditions and operators such as “includes”, “does not include”, “between” (for a range of values), as well as AND, OR, NOT and nested expressions using parentheses.<span>  </span>For example, a simple rule might be something like this:</span><span style="font-family:Arial;"> </span></p>
<p align="center"><span style="font-family:Arial;"></span><span style="font-family:Arial;">If<span>  </span>&lt;certain user type&gt; accesses &lt;Credit Card Table&gt; at &lt;non-working hours&gt; then terminate.</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Rules can be developed by administrators using a wizard-based interface.<span>  </span>Hedgehog also comes with a series of built in rules called Virtual Patches, that provide rules to detect and prevent vulnerabilities in databases as they become known.<span>  </span>Sentrigo maintains a team of security professionals that research and track these issues and provide the patches that act as a band-aid until the database vendor issues their own patches.<span>  </span>These updates are distributed continually.</span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">In release 2.0, the company provides: </span></p>
<ul type="disc" style="margin-top:0;">
<li class="MsoNormal"><span style="font-family:Arial;">Support for Microsoft SQL Server in addition to already supported Oracle (with Sybase and DB2 to follow).<span>  </span></span></li>
<li class="MsoNormal"><span style="font-family:Arial;"><span></span></span><span style="font-family:Arial;">Hedgehog IDentifier – a patent-pending technology that allows positive user identification in n-tier environments by attaching tags with IDs for every transaction.<span>  </span>This means, if 1000 people are using SAP finance in a pooled manner, but connect through a super-user in database, Hedgehog can identify the individual user.</span></li>
<li class="MsoNormal"><span style="font-family:Arial;">Compliance templates- which guide users through the process of translating requirements into rules for monitoring compliance in support of PCI DSS, Sarbannes Oxley, and SAS 70.<span>  </span></span></li>
</ul>
<h1><span style="font-family:Arial;"><font size="3">Get Proactive</font></span></h1>
<p><span style="font-family:Arial;"> </span><span style="font-family:Arial;">I have had a number of conversations with database administrators in the past and I was frankly surprised that some of these administrators seemed to be more reactive than proactive in regard to monitoring their database(s).<span>    </span>Many seem to be driven by compliance mandates, rather than taking a proactive approach to intrusion detection and prevention.<span>  </span>Companies need to think through their entire data security strategy which includes attacks on data from inside and outside the organization.<span>  </span>Perhaps attacks such as those that we’ve all read about in the past year and the addition of new mandates have opened their eyes.<span>  </span></span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span></p>
<p></font></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/fbhalper.wordpress.com/36/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/fbhalper.wordpress.com/36/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/36/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&blog=2214430&post=36&subd=fbhalper&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2008/03/17/database-activity-monitoring/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/fbhalper-128.jpg" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
	</channel>
</rss>