<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Fern Halper's data makes the world go 'round</title>
	<atom:link href="http://fbhalper.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://fbhalper.wordpress.com</link>
	<description>the big wide world of data and information management and analysis</description>
	<lastBuildDate>Tue, 24 Jan 2012 00:09:46 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='fbhalper.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Fern Halper's data makes the world go 'round</title>
		<link>http://fbhalper.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://fbhalper.wordpress.com/osd.xml" title="Fern Halper&#039;s data makes the world go &#039;round" />
	<atom:link rel='hub' href='http://fbhalper.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Four Vendor Views on Big Data and Big Data Analytics Part 2- SAS</title>
		<link>http://fbhalper.wordpress.com/2011/12/14/four-vendor-views-on-big-data-and-big-data-analytics-part-2-sas/</link>
		<comments>http://fbhalper.wordpress.com/2011/12/14/four-vendor-views-on-big-data-and-big-data-analytics-part-2-sas/#comments</comments>
		<pubDate>Wed, 14 Dec 2011 13:00:02 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[big data appliances]]></category>
		<category><![CDATA[data streams]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[SAS]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=648</guid>
		<description><![CDATA[Next up in my discussion on big data providers is SAS.  What’s interesting about SAS is that, in many ways, big data analytics is really just an evolution for the company.  One of the company’s goals has always been to support complex analytical problem solving.  It is well respected by its customers for its ability [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=648&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Next up in my discussion on big data providers is <a href="http://www.sas.com">SAS</a>.  What’s interesting about SAS is that, in many ways, big data analytics is really just an evolution for the company.  One of the company’s goals has always been to support complex analytical problem solving.  It is well respected by its customers for its ability to analyze data at scale.  It is also well regarded for its ETL capabilities.  SAS has had parallel processing capabilities for quite some time.  Recently, the company has been pushing analytics into databases and appliances.  So, in many ways big data is an extension of what SAS has been doing for quite a while.</p>
<p>At SAS, big data goes hand in hand with big data analytics.  The company is focused on analyzing big data to make decisions.  SAS defines big data as follows, “When volume, velocity and variety of data exceeds an organization’s storage or compute capacity for accurate and timely decision-making.”   However, SAS also includes another attribute when discussing big data which is <em>relevance</em> in terms of analysis.  In other words, big data analytics is not simply about analyzing large volumes of disparate data types in real time.  It is also about helping companies to analyze relevant data.</p>
<p>SAS can support several different big data analytics scenarios.  It can deal with complete datasets.   It can also deal with situations where it is not technically feasible to utilize an entire big data set or where the entire set is not relevant to the analysis.  In fact, SAS supports what it terms a “stream it, store it, score it” paradigm to deal with big data relevance.   It likens this to an email spam filter that determines what emails are relevant for a person.  Only appropriate emails go to the person to be read.  Likewise, only relevant data for a particular kind of analysis might be analyzed using SAS statistical and data mining technologies.</p>
<p>The specific solutions that support the “stream it, store it, score it” model include:</p>
<ul>
<li><strong>Data reduction of very large data volumes using stream processing</strong>.  This occurs at the data preparation stage.  SAS Information Management capabilities are leveraged to interface with various data sources that can be streamed into the platform and filtered based on analytical models built from what it terms “organizational knowledge” using products like SAS Enterprise Miner, SAS Text Miner and SAS Social Network Analytics. SAS Information Management (SAS DI Studio, DI Server, which includes DataFlux capabilities) provides the high speed filtering and data enrichment (with additional meta-data that is used to build more indices that makes the downstream analytics process more efficient).  In other words, it utilizes analytics and data management to prioritize, categorize, and normalize data while it is determining relevance.  This means that massive amounts of data does not have to be stored in an appliance or data warehouse.</li>
</ul>
<ul>
<li><strong>SAS High Performance Computing (HPC)</strong>. SAS HPC includes a combination of grid, in-memory and in-database technologies. It is appliance ready software built on specifically configured hardware from SAS database partners.  In addition to the technology, SAS provides pre-packaged solutions that are using the in-memory architecture approach.</li>
<li><strong>SAS Business Analytics.  </strong>SAS offerings include a combination of reporting, BI, and other advanced analytics functionality (including text analytics, forecasting, operations research, model management and deployment) using some of the same tools (SAS Enterprise Miner, etc) as listed above.  SAS also includes support for mobile devices.</li>
</ul>
<p>Of course, this same set of products can be used to handle a complete data set.</p>
<p>Additionally, SAS supports a Hadoop implementation to enable its customers to push data into Hadoop and be able to manage it.  SAS analytics software can be used to run against Hadoop for analysis.  The company is working to utilize SAS within Hadoop so that data does not have to be brought out to SAS software.</p>
<p>SAS has utilized its software to help clients solve big data problems in a number of areas including:</p>
<ul>
<li>Retail:  Analyzing data in real time at check-out to determine store coupons at big box stores; Markdown optimization at point of sale; Assortment planning</li>
<li>Finance: Scoring transactional data in real time for credit card fraud prevention and detection; Risk modeling: e.g. moving from looking at loan risk modeling as one single model to  running multiple models against a complete data set that is segmented.</li>
<li>Customer Intelligence: using social media information and social network analysis</li>
</ul>
<p>For example, one large U.S. insurance company is scoring over 600,000 records per second on a multi node parallel set of processors.</p>
<p>What is a differentiator about the SAS approach is that since the company has been growing its big data capabilities through time, all of the technologies are delivered or supported based on a common framework or platform.  While newer vendors may try to down play SAS by saying that its technology has been around for thirty years, why is that a bad thing?  This has given the company time to grow its analytics arsenal and to put together a cohesive solution that is architected so that the piece parts can work together.  Some of the newer big data analytics vendors don’t have nearly the analytics capability of SAS.   Experience matters.  Enough said for now.</p>
<div></div>
<div>Next Up:  IBM</div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/648/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/648/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/648/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=648&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/12/14/four-vendor-views-on-big-data-and-big-data-analytics-part-2-sas/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>SAP moves to social media analysis with NetBase partnership</title>
		<link>http://fbhalper.wordpress.com/2011/12/12/sap-moves-to-social-media-analysis-with-netbase-partnership/</link>
		<comments>http://fbhalper.wordpress.com/2011/12/12/sap-moves-to-social-media-analysis-with-netbase-partnership/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 17:10:08 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[ConsumerBase]]></category>
		<category><![CDATA[Netbase]]></category>
		<category><![CDATA[SAP]]></category>
		<category><![CDATA[social media analytics]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=656</guid>
		<description><![CDATA[Today, SAP and NetBase announced that SAP will resell NetBase solutions as the SAP® Social Media Analytics application by NetBase. What does this mean?  According to the announcement: “SAP Social Media Analytics is a cloud-based solution that is able to process more than 95 million social media posts per day. It uses an advanced natural [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=656&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p align="left">Today, <a href="http://www.sap.com">SAP</a> and <a href="http://www.netbase.com">NetBase</a> announced that SAP will resell NetBase solutions as the <strong>S</strong>AP® Social Media Analytics application by NetBase.</p>
<p align="left">What does this mean?  According to the announcement:</p>
<p align="center">“<em>SAP Social Media Analytics is a cloud-based solution that is able to process more than 95 million social media posts per day. It uses an advanced natural language processing (NLP) engine to read and categorize each one of these posts according to the opinions, emotions and behaviors that the market is expressing</em>. “</p>
<p>NetBase is a SaaS social media insight and analytics platform that contains one year of social media data.  This data consists of blogs, tweets, newsfeeds, and other Web content.  NetBase combines deep Natural Language Processing (NLP) analytics with a content aggregation service and a reporting capability.  The product provides analysis around likes/dislikes, emotions, reasons why, and behaviors. For example, whereas some social media services might interpret the sentence, “Listerine kills germs because it hurts” as either a negative or neutral statement, the NetBase technology uses a semantic data model to understand not only that this is a positive statement, but also the reason it is positive.</p>
<p align="left">The platform is currently used by hundreds of corporate customers, and was developed in partnership with five of the top 10 consumer packaged goods companies, including Coca-Cola and Kraft.  I have used NetBase for competitive intelligence, most notably when I was putting together the Victory Index for Predictive Analytics.  The platform is quite robust and easy to use.</p>
<p align="left"> The idea is that an end-user could do his or her social media analysis in the NetBase solution and then, using an API provided with the solution, export data into Business Objects to further analyze it.  Here are a few screen shots I pulled from the company’s online demo that illustrate this:</p>
<p align="left">Step 1:  In this simple example, say an end-user is trying to understand the buzz around a specific product (in this case a PC product).  He or she utilizes the NetBase system to understand some of the key opinions, passions, and sentiment regarding this brand.</p>
<p align="left">Step 2:  Once the end user has done some analysis, he or she can export the results of the analysis to SAP Business Objects.  The illustration below shows the kind of data that is exported.  In this case, there is information about attributes and emotions about the product.  These values also have a sentiment indicator associated with them.</p>
<p style="text-align:left;" align="left"> <a href="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-06-pm.png"><img class="size-full wp-image-657" title="Screen shot 2011-12-11 at 12.04.06 PM" src="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-06-pm.png?w=468&#038;h=260" alt="" width="468" height="260" /></a></p>
<p align="left">This data can then be visually displayed and analyzed in SAP Business Objects.   In the example below, the insights are displayed on an IPad.</p>
<p align="left"><a href="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-46-pm.png"><img class="alignleft size-full wp-image-658" title="Screen shot 2011-12-11 at 12.04.46 PM" src="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-46-pm.png?w=468&#038;h=347" alt="" width="468" height="347" /></a></p>
<p align="left">In addition to simply displaying information in SAP Business Objects, the plan moving forward is to be able to operationalize this data throughout workflows that are part of an enterprise business process.  I imagine that SAP HANA will enter the picture too at some point.</p>
<p align="left">I am glad to see that SAP is partnering with NetBase on this solution.  It is a good time for SAP to incorporate social media analysis into its products.  As social media analysis becomes more mainstream, SAP customers are, no doubt, asking for a solution that can work with SAP products.  While SAP bought Inxight, a text analytics vendor, a number of years ago, it does not have the social media historical data or the SaaS business around it.  This partnership seems like a good solution in the short term.  I will be interested to learn more about how SAP will incorporate social media analysis into enterprise workflows.   Certainly NetBase will benefit from the huge SAP installed base.  I suspect that SAP customers will be intrigued by this new partnership.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/656/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/656/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/656/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=656&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/12/12/sap-moves-to-social-media-analysis-with-netbase-partnership/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-06-pm.png" medium="image">
			<media:title type="html">Screen shot 2011-12-11 at 12.04.06 PM</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2011/12/screen-shot-2011-12-11-at-12-04-46-pm.png" medium="image">
			<media:title type="html">Screen shot 2011-12-11 at 12.04.46 PM</media:title>
		</media:content>
	</item>
		<item>
		<title>Four Vendor Views on Big Data and Big Data Analytics       Part 1: Attensity</title>
		<link>http://fbhalper.wordpress.com/2011/11/16/four-vendor-views-on-big-data-and-big-data-analytics-part-1-attensity/</link>
		<comments>http://fbhalper.wordpress.com/2011/11/16/four-vendor-views-on-big-data-and-big-data-analytics-part-1-attensity/#comments</comments>
		<pubDate>Wed, 16 Nov 2011 17:53:22 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[content analytics]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Attensity]]></category>
		<category><![CDATA[Big Data Analytics]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=639</guid>
		<description><![CDATA[I am often asked whether it is the vendors or the end users who are driving the Big Data market. I usually reply that both are. There are early adopters of any technology that push the vendors to evolve their own products and services. The vendors then show other companies what can be done with [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=639&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I am often asked whether it is the vendors or the end users who are driving the Big Data market. I usually reply that both are. There are early adopters of any technology that push the vendors to evolve their own products and services. The vendors then show other companies what can be done with this new and improved technology.</p>
<p>Big Data and Big Data Analytics are hot topics right now. Different vendors of course, come at it from their own point of view. Here’s a look at how four vendors (<a href="http://www.attensity.com">Attensity</a>, <a href="http://www-01.ibm.com/software/data/bigdata/">IBM</a>, <a href="http://www.sas.com">SAS</a>, and <a href="http://www.sap.com">SAP</a>) are positioning around this space, some of their product offerings, and use cases for Big Data Analytics.</p>
<p>In Attensity’s world Big Data is all about high volume customer conversations. Attensity text analytics solutions can be used to analyze both internal and external data sources to better understand the customer experience. For example, it can analyze sources such as call center notes, emails, survey verbatim and other documents to understand customer behavior. With its recent acquisition of Biz360 the company can combine social media from 75 million sources and analyze this content to understand the customer experience. Since industry estimates put the structured/unstructured data ratio at 20%/80%, this kind of data needs to be addressed. While vendors with Big Data appliances have talked about integrating and analyzing unstructured data as part of the Big Data equation, most of what has been done to date has dealt primarily with structured data. This is changing, but it is good to see a text analytics vendor address this issue head on.</p>
<p>Attensity already has a partnership with Teradata so it can marry information extracted from its unstructured data (from internal conversations) together with structured data stored in the Teradata Warehouse. Recently, Attensity extended this partnership to Aster data, which was acquired by Teradata. Aster Data provides a platform for Big Data Analytics. The Aster MapReduce Platform is a massively parallel software solution that embeds MapReduce analytic processing with data stores for big data analytics on what the company terms “multistructured data sources and types.” Attensity can now be embedded as a runtime SQL in the Aster Data library to enable the real time analysis of social media streams. Aster Data will also act as long term archival and analytics platform for the Attensity real-time Command Center platform for social media feeds and iterative exploratory analytics. By mid 2012 the plan is for complete integration to the Attensity Analyze application.</p>
<p>Attensity describes several use cases for the real time analysis of social streams:</p>
<p>1. Voice of the Customer Command Center: the ability to semantically annotate real-time social data streams and combine that with multi-channel customer conversation data in a Command Center view that gives companies a real-time view of what customers are saying about their company, products and brands.<br />
2. Hotspotting: the ability to analyze customer conversations to identify emerging trends. Unlike common keyword based approaches, Hotspot reports identify issues that a company might not already know about, as they emerge, by measuring the “significance” of change in probability for a data value between a historical period and the current period. Attensity then assigns a “temperature” value to mark the degree of difference between the two probabilities. Hot means significantly trending upward in the current period vs. historical. Cold means significantly trending downward in the current period vs. historical.<br />
3. Customer service: the ability to analyze conversations to identify top complaints and issues and prioritize incoming calls, emails or social requests accordingly.</p>
<p>Next Up:SAS</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/639/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/639/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/639/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=639&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/11/16/four-vendor-views-on-big-data-and-big-data-analytics-part-1-attensity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>The Inaugural Hurwitz &amp; Associates Predictive Analytics Victory Index is complete!</title>
		<link>http://fbhalper.wordpress.com/2011/09/28/the-inaugural-hurwitz-associates-predictive-analytics-victory-index-is-complete/</link>
		<comments>http://fbhalper.wordpress.com/2011/09/28/the-inaugural-hurwitz-associates-predictive-analytics-victory-index-is-complete/#comments</comments>
		<pubDate>Wed, 28 Sep 2011 13:32:49 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[BI innovation]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[Information Management]]></category>
		<category><![CDATA[Operational BI]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[predictive analytics survey findings]]></category>
		<category><![CDATA[predictive modeling]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=589</guid>
		<description><![CDATA[For more years than I like to admit, I have been focused on the importance of managing data so that it helps companies anticipate changes and therefore be prepared to take proactive action. Therefore, as I watched the market for predictive analytics really emerge I thought it was important to provide customers with a holistic [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=589&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://fbhalper.files.wordpress.com/2011/09/hurwitz_vlogov1.png"><img src="http://fbhalper.files.wordpress.com/2011/09/hurwitz_vlogov1.png?w=115&#038;h=150" alt="" title="Hurwitz_VLogoV" width="115" height="150" class="alignright size-thumbnail wp-image-630" /></a>For more years than I like to admit, I have been focused on the importance of managing data so that it helps companies anticipate changes and therefore be prepared to take proactive action.  Therefore, as I watched the market for predictive analytics really emerge I thought it was important to provide customers with a holistic perspective on the value of commercial offerings.  I was determined that when I provided this analysis it would be based on real world factors.  Therefore, I am delighted to announce the release of the Hurwitz &amp; Associates Victory Index for Predictive Analytics!  I’ve been working on this report for a quite some time and I believe that it will be very valuable tool for companies looking to understand predictive analytics and the vendors that play in this market.   </p>
<p>Predictive analytics has become a key component of a highly competitive company’s analytics arsenal. Hurwitz &amp; Associates defines predictive analytics as: </p>
<p><em>A statistical or data mining solution consisting of algorithms and techniques that can be used on both structured and unstructured data (together or individually) to determine future outcomes. It can be deployed  for prediction, optimization, forecasting, simulation, and many other uses.</em></p>
<p>So what is this report all about?  The Hurwitz &amp; Associates Victory Index is a market research assessment tool, developed by Hurwitz &amp; Associates that analyzes vendors across four dimensions: Vision, Viability, Validity and Value.  Hurwitz &amp; Associates takes a holistic view of the value and benefit of important technologies. We assess not just the technical capability of the technology but its ability to provide tangible value to the business.   For the Victory Index we examined more than fifty attributes including: customer satisfaction, value/price, time to value, technical value, breadth and depth of functionality, customer adoption, financial viability, company vitality, strength of intellectual capital, business value, ROI, and clarity and practicality of strategy and vision.  We also examine important trends in the predictive analytics market as part of the report and provide detailed overviews of vendor offerings in the space.  </p>
<p>Some of the key vendor highlights include:<br />
•	Hurwitz &amp; Associates named six vendors as Victors across two categories including SAS, IBM (SPSS), Pegasystems, Pitney Bowes, StatSoft and Angoss.<br />
•	Other vendors recognized in the Victory Index include KXEN, Megaputer Intelligence, Rapid-I, Revolution Analytics, SAP, and TIBCO. </p>
<p>Some of the key market findings include:<br />
•	Vendors have continued to place an emphasis on improving the technology’s ease of use, making strides towards automating model building capabilities and presenting findings in business context.<br />
•	Predictive analytics is no longer relegated to statisticians and mathematicians. The user profile for predictive analytics has shifted dramatically as the ability to leverage data for competitive advantage has placed business analysts in the driver’s seat.<br />
•	As companies gather greater volumes of disparate kinds of data, both structured and unstructured, they require solutions that can deliver high performance and scalability.<br />
•	The ability to operationalize predictive analytics is growing in importance as companies have come to understand the advantage to incorporating predictive models in their business processes. For example, statisticians at an insurance company might build a model that predicts the likelihood of a claim being fraudulent.</p>
<p>I invite you to find out more about the report by visiting our website: <a href="http://www.hurwitz.com">www.hurwitz.com</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/589/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/589/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/589/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=589&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/09/28/the-inaugural-hurwitz-associates-predictive-analytics-victory-index-is-complete/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2011/09/hurwitz_vlogov1.png?w=115" medium="image">
			<media:title type="html">Hurwitz_VLogoV</media:title>
		</media:content>
	</item>
		<item>
		<title>Informatica announces 9.1 and puts stake in the ground around big data</title>
		<link>http://fbhalper.wordpress.com/2011/06/09/informatica-announces-9-1-and-puts-stake-in-the-ground-around-big-data/</link>
		<comments>http://fbhalper.wordpress.com/2011/06/09/informatica-announces-9-1-and-puts-stake-in-the-ground-around-big-data/#comments</comments>
		<pubDate>Thu, 09 Jun 2011 17:40:05 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[HealthNow]]></category>
		<category><![CDATA[Informatica]]></category>
		<category><![CDATA[predictive analytics]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=584</guid>
		<description><![CDATA[Earlier this week, Informatica announced the release of the Informatica 9.1 Platform for Big Data. The company joins other data centric vendors such as EMC and IBM by putting its stake in the ground around the hot topic of Big Data. Informatica defines Big Data as, “all data, including both transaction and interaction data, in [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=584&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Earlier this week, <a href="http://www.informatica.com">Informatica</a> announced the release of the Informatica 9.1 Platform for Big Data.  The company joins other data centric vendors such as EMC and IBM by putting its stake in the ground around the hot topic of Big Data.  Informatica defines Big Data as, “all data, including both transaction and interaction data, in sets whose size or complexity exceeds the ability of commonly used technologies to capture, manage and process at a reasonable cost and timeframe. Indeed, Informatica ‘s stance is that Big Data is the confluence of the three technology trends including big transaction data, big interaction data and big data processing.”  In Informatica parlance the transactional data includes OLTP, OLAP, and data warehouse data; the interaction data might include social media data, call center records, click stream data, and even scientific data like that associated with genomics.  Informatica targets native, high performance connectivity and future integration with Hadoop, the Big Data processing platform.</p>
<p>In 9.1 Informatica is providing an updated set of capabilities around self-service, authoritative and trustworthy data (MDM and data quality), data services and data integration.  I wanted to focus on the data services here because of the connection to Big Data.  Informatica is providing a platform that companies can use to integrate transactional data (at petabyte scale and beyond in volume) and social network data from Facebook, LinkedIn, and Twitter.  Additionally, 9.1 provides the capability to move all data into and out of Hadoop in batch or real time using universal connectivity to including mainframe, databases, and applications which can help in managing unstructured data.</p>
<p>So, how will companies utilize this latest release?  I recently had the opportunity to speak with Rob Myers, an Informatica customer, who is the manager of BI architecture and data warehousing, MDM, enterprise integration for <a href="http://www.healthnow.org">HealthNow</a>.  HealthNow is a BlueCross/BlueShield provider for parts of western New York and the Albany area.  The company is expanding geographically and is also providing value added services such as patient portals.  It views its mission not simply as a claims processor but as a service provider to healthcare providers and patients.  According to Rob, the company is looking to offer other value added services to doctors and patients as part of its competitive strategy.  These offerings may include real time claims processing, identifying fraudulent claims, or analytics for healthier outcomes.  For example, HealthNow might provide a service where it identifies patients with diabetes and provide proactive services to them to help manage the disease.  Or, it might provide physicians with suggestions of tests they might consider for certain patients, given their medical records.   </p>
<p>Currently, the company utilizes Informatica PowerCenter and Informatica Data Services for data integration including ETL and data abstraction.  HealthNow has one large data warehouse and is currently building out a second. It is exposing data out to a logical model in data services tier. For example, its member portal utilizes data services to enable members to sign in and in real time, integrate 30-40 attributes around each member including demographic information, products, and eligibility for certain services into the portal.  In addition, the company’s actuaries, marketing groups, and health services group have been utilizing its data warehouses to perform their own analysis.  Rob doesn’t consider the data in these warehouses to be Big Data.  Rather they are just sets of relational data.   He views Big Data as some of the other data that the company currently has a hard time mining, for example data on social networks and the unstructured data in claims and medical notes.  The company is in the beginning phase of determining how to gather and parse through this text and expose it in a way that it can be analyzed.  For example, the company is interested in utilizing the data that they already have together with unstructured data and providing predictive analytics to its community.  HealthNow is exploring Hadoop data stores as part of this plan and is excited about the direction that Informatica is moving.  It views Informatica as the middleware that can get the trusted data out of the various silos and integrated in a way that it can then be analyzed or used in other value-added services.  </p>
<p>It is certainly interesting to see what end-users have in mind for Big Data and, for that matter, how they define Big Data.  Rob clearly views Big Data as high volume and disparate in nature (i.e. including structured and unstructured data).  There seems to be a time dimension to it.  He also made the point that its not just about having Big Data, it’s about doing something that he couldn’t do before with it – like processing and analyzing it.  This is an important point that vendors and end-users are starting to pick up on.  If Big Data were simply about volume of different kinds of data, then it would be a moving target.  Really, an important aspect of Big Data is about is being able to perform activities on the data that weren’t possible before.  I am glad to companies thinking about their use cases for Big Data and vendors such as Informatica putting a stake in the ground around the subject.  </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/584/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/584/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/584/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=584&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/06/09/informatica-announces-9-1-and-puts-stake-in-the-ground-around-big-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>EMC and Big Data- Observations from EMC World 2011</title>
		<link>http://fbhalper.wordpress.com/2011/05/19/emc-and-big-data-observations-from-emc-world-2011/</link>
		<comments>http://fbhalper.wordpress.com/2011/05/19/emc-and-big-data-observations-from-emc-world-2011/#comments</comments>
		<pubDate>Thu, 19 May 2011 16:09:00 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[content analytics]]></category>
		<category><![CDATA[Content Management]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Greenplum]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hurwitz & Associates]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=577</guid>
		<description><![CDATA[I attended EMC’s User Conference last week in Las Vegas. The theme of the event was Big Data meets the Cloud. So, what’s going on with Big Data and EMC? Does this new strategy make sense? EMC acquired Greenplum in 2010. At the time EMC described Greenplum as a “shared nothing, massively parallel processing (MPP) [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=577&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I attended <a href="http://www.emc.com">EMC’s</a> User Conference last week in Las Vegas.  The theme of the event was Big Data meets the Cloud.   So, what’s going on with Big Data and EMC?  Does this new strategy make sense?</p>
<p>EMC acquired Greenplum in 2010.  At the time EMC described Greenplum as a “shared nothing, massively parallel processing (MPP) data warehousing system.”   In other words, it could handle pedabytes of data.  While the term data warehouse denotes a fairly static data store, at the user conference, EMC executives characterized big data as a high volume of disparate data, which is structured and unstructured, it is growing fast, and it may be processed in real time.  Big data is becoming increasingly important to the enterprise not just because of the need to store this data but also because of the need to analyze it. Greenplum has some of its own analytical capabilities but recently the company formed a partnership with SAS to provide more oomph to its analytical arsenal.  At the conference, EMC also announced that it has now included Hadoop as part of its Greenplum infrastructure to handle unstructured information.</p>
<p>Given EMC’s strength in data storage and content management, it is logical for  EMC to move into the big data arena.  However, I am left with some unanswered questions. These include questions related to how EMC will make storage, content management, data management, and data analysis all fit together.  </p>
<p>•	Data Management.  How will data management issues be handled (i.e. quality, loading, etc.)? EMC has a partnership with Informatica and SAS has data management capabilities, but how will all of these components work together?<br />
•	Analytics.  What analytics solutions will emerge from the partnership with SAS?  This is important since EMC is not necessarily known for analytics.  SAS is a leader in analytics and can make a great partner for EMC.  But, its partnership with EMC is not exclusive.   Additionally, EMC made a point of the fact that 90% most enterprises’ data is unstructured.  EMC has incorporated Hadoop into Greenplum, ostensibly to deal with unstructured data.  EMC executives mentioned that the open source community has even begun developing analytics around Hadoop.   EMC Documentum also has some text analytics capabilities as part of Center Stage.  SAS also has text analytics capabilities.  How will all of these different components converge into a plan?<br />
•	Storage and content management. How do the storage and content management parts of the business fit into the big data roadmap?   It was not clear from the discussions at the meeting how EMC plans to integrate its storage platforms into an overall big data analysis strategy. In the short term we may not see a cohesive strategy emerge. </p>
<p>EMC is taking on the right issues by focusing on customers’ needs to manage big data.  However, it is a complicated area and I don’t expect EMC to have all of the answers today.  The market is still nascent.  Rather, it seems to me that EMC is putting its stake in the ground around big data.  This will be an important stake for the future. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/577/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/577/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/577/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=577&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/05/19/emc-and-big-data-observations-from-emc-world-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>What is Networked Content and Why Should We Care?</title>
		<link>http://fbhalper.wordpress.com/2011/04/12/what-is-networked-content-and-why-should-we-care/</link>
		<comments>http://fbhalper.wordpress.com/2011/04/12/what-is-networked-content-and-why-should-we-care/#comments</comments>
		<pubDate>Tue, 12 Apr 2011 12:20:50 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[Content Management]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[classification]]></category>
		<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[content analytics]]></category>
		<category><![CDATA[TEMIS]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[networked content]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=573</guid>
		<description><![CDATA[This is the first in a series of blogs about text analytics and content management. This one uses an interview format. I recently had an interesting conversation with Daniel Mayer, from TEMIS regarding his new paper, the Networked Content Manifesto. I just finished reading it and found it to be insightful in terms of what [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=573&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><em>This is the first in a series of blogs about text analytics and content management.  This one uses an interview format.</em></p>
<p>I recently had an interesting conversation with Daniel Mayer, from TEMIS regarding his new paper, the Networked Content Manifesto.   I just finished reading it and found it to be insightful in terms of what he had to say about how enriched content might be used today and into the future.  </p>
<p>So what is networked content?  According to the Manifesto, networked content, “creates a network of semantic links between documents that enable new forms of navigation and improves retrieval from a collection of documents. “  It uses text analytics techniques to extract semantic metadata from documents. This metadata can be used to link documents together across the organization, thus providing a rich source of connected content for use by an entire company.  Picture 50 thousand documents linked together across a company by enriched metadata that includes people, places, things, facts, or concepts and you can start to visualize what this might look like.  </p>
<p>Here is an excerpt of my conversation with Daniel:</p>
<p>FH:  <strong>So, what is the value of networked content?</strong></p>
<p>DM:  Semantic metadata creates a richer index than was previously possible using techniques such as manual tagging.  There are five benefits that semantic metadata provides.  The first two benefits are that it makes content more findable and easier to explore.  You can’t find what you don’t know how to query.  In many cases people don’t know how they should be searching.  Any advanced search engine with facets is a simple example of how you can leverage metadata to enhance information access by enabling exploration.  The third benefit is that networked content can boost insight into a subject of interest by revealing context and placing it into perspective. Context is revealed by showing what else there is around your precise search – for example related documents. Perspective is typically reached through analytics.  That is, attaining a high level of insight into what can be found in a large amount of documents, like articles or call center notes.  The final two benefits are more future looking.   The first of these benefits is something we call “proactive delivery”.  Up to now, people mostly access information by using search engines to return documents associated with a certain topic.  For example, I might ask, “What are all of the restaurants in Boston?”  But by leveraging information about your past behavior, your location, or your profile, I can proactively send you alerts about relevant restaurants you might be interested in.  This is done by some advanced portals today, and the same principle can be applied to virtually any forms of content.  The last benefit is tight integration with workflow applications.  Today, people are used to searching Google or other search engines which require a dedicated interface.  If you are writing a report and need to go to the web to look for more information, this interferes with your workflow.  But instead, it is possible to pipe content directly to your workflow so that you don’t need to interrupt your work to access it. For example, we can foresee how in the near future, when typing a report in a word processing application such as MS Word, , right in the interface, you will be able to receive bits of information related contextually to what you are typing.  As a chemist, , you might receive suggestions of scientific articles based on the metadata extracted from the text you are typing. Likewise, Content management interfaces in the future will be enriched with widgets that provide related documents and analytics.</p>
<p>FH:  How is networked content different from other kinds of advanced classification systems provided by content management vendors today?</p>
<p>DM: Networked Content is ultimately a vision for how content can be better managed and distributed by leveraging semantic content enrichment. This vision is underpinned by an entire technical ecosystem, of which the Content Management System is only one element. Our White Paper illustrates how text analytics engines such as the Luxid® Content Enrichment Platform are a key part of this emerging ecosystem.  </p>
<p>Making a blanket comparison is difficult, but generally speaking, Networked Content can leverage a level of granularity and domain specificity that the classification systems you are referring to don’t generally support.</p>
<p>FH:  <strong>Do you need a taxonomy or ontology to make this work?</strong></p>
<p>DM: I’d like to make sure we use caution when we use these terms. A taxonomy or ontology can be helpful, certainly.  If a customer wants to improve navigation in content and already has an enterprise taxonomy, it will undoubtedly help by providing guidance and structure. However, in most cases it is not sufficient in and of itself to perform content enrichment. To do this you need to build an actual engine that is able to process text and identify within it some characteristics that will trigger the assignment of metadata (either by extracting concepts from the text itself or by judging the text as a whole) In the news domain, for example, the standard IPTC taxonomy is used to categorize news articles into topic areas such as economy, politics, or sports, and into subcategories like economy/economic policy or economy/macroeconomics, etc… You can think of this as a file cabinet where you ultimately want to file every article. What the IPTC taxonomy does is that it tells you the structure the file cabinet should have. But it doesn’t do the filing for you. For that, you need to build the metadata extraction engine. That’s  where we come in. We provide a platform that includes standard extraction engines – that we call Skill Cartridges® as well as the full development environment to customize them, extend their coverage, and develop new ones from the ground up if needed. </p>
<p>FH:  <strong>I know that TEMIS is heavily into the publishing industry and you cite publishing examples in the Manifesto.  What other use cases do you see?</strong></p>
<p>DM: The Life Sciences industry (especially Pharma and Crop Science) has been an early adopter of this technology for applications such as scientific discovery, IP management, knowledge management, pharmacovigilance,.  These are typical use cases for all research-intensive sectors. Another group of common use cases for this technology in the private sector is what we call Market Intelligence: understanding your competitors and complementors (Competitive Intelligence), your customers (Voice of the Customer) and/or what is being said about you (Sentiment Analysis) You can think of all of these as departmental applications in the sense that primarily serve the needs of one department: R&amp;D,  Marketing, Strategy, etc…</p>
<p>Furthermore, we believe there is an ongoing trend for the Enterprise to adopt Networked Content transversally, beyond departmental applications, as a basic service of its core information system.  There, content enrichment can act as the glue between content management, search, and BI, and can bring productivity gains and boost insight throughout the organization. This is what has led us to deploy within EMC Documentum and Microsoft SharePoint 2010 In the future all the departmental applications will become even more ubiquitous thanks to such deployments.</p>
<p>FH:  <strong>How does Networked Content relate to the Semantic Web?</strong></p>
<p>DM:  They are very much related. The Semantic Web has been primarily concerned with how information that is available on the Web should be intelligently structured to facilitate access and manipulation by machines. Networked Content is focused on corporate – or private – content and how it can be connected with other content, either private, or public. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/573/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/573/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/573/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=573&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/04/12/what-is-networked-content-and-why-should-we-care/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
		<item>
		<title>Five vendors committed to content analytics for ECM</title>
		<link>http://fbhalper.wordpress.com/2011/04/05/five-vendors-committed-to-content-analytics-for-ecm/</link>
		<comments>http://fbhalper.wordpress.com/2011/04/05/five-vendors-committed-to-content-analytics-for-ecm/#comments</comments>
		<pubDate>Tue, 05 Apr 2011 16:00:22 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[Content Management]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[OpenText]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[TEMIS]]></category>
		<category><![CDATA[content analytics]]></category>
		<category><![CDATA[Enterprise Content Management]]></category>
		<category><![CDATA[content insight]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=557</guid>
		<description><![CDATA[In 2007, Hurwitz &#38; Associates fielded one of the first market studies on text analytics. At that time, text analytics was considered to be more of a natural extension to a business intelligence system than a content management system. However, in that study, we asked respondents who were planning to use the software, whether they [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=557&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In 2007, Hurwitz &amp; Associates fielded one of the first market studies on text analytics.  At that time, text analytics was considered to be more of a natural extension to a business intelligence system than a content management system.  However, in that study, we asked respondents who were planning to use the software, whether they were planning to deploy it in conjunction with their content management systems.  It turns out that a majority of respondents (62%) intended to use text analytics software in this manner. Text analytics, of course, is the natural extension to content management and we have seen the market evolve to the point where several  vendors have included text analytics as part of the their offerings to enrich content management solutions.<br />
<a href="http://fbhalper.files.wordpress.com/2011/03/slide1.jpg"><img src="http://fbhalper.files.wordpress.com/2011/03/slide1.jpg?w=468" alt="" title="Slide1"   class="aligncenter size-full wp-image-558" /></a></p>
<p>Over the next few months, I am going to do a deeper dive into solutions that are at the intersection of text analytics and content management; three from content management vendors <a href="http://www.emc.com">EMC</a>, <a href="http://www.ibm.com">IBM</a>, and <a href="http://www.opentext.com">OpenText</a> as well as solutions from text analytics vendor <a href="http://www.temis.com">TEMIS</a> and analytics vendor <a href="http://www.sas.com">SAS</a>.  Each of these vendors is actively offering solutions that provide insight into content stored in enterprise content management systems.  Many of the solutions described below also go beyond providing insight for content stored in enterprise content management systems to include insight over other content both internal and external to an organization.  A number of solutions also integrate structured data with unstructured information.</p>
<p>•  <strong>EMC</strong>:  EMC refers to its content analytics capability as Content Intelligence Services (CIS).   CIS supports entity extraction as well as categorization.    It enables advanced search and discovery over a range of platforms including ECM systems such as EMC’s Documentum, Microsoft SharePoint, and others.</p>
<p>•  <strong>IBM</strong>: IBM offers a number of products with text analytics capabilities. Its goal is to provide rapid and deep insight into unstructured data. The IBM Content Analytics solution provides integration into IBM ECM (FileNet) solutions such as IBM Case Manager, its big data solutions (Netezza) and integration technologies (DataStage). It also integrates securely with other ECM solutions such as SharePoint, Livelink, Documentum and others.</p>
<p>•  <strong>OpenText</strong>:  OpenText acquired text analytics vendor Nstein in 2010 in order to invest in semantic technology and expand its semantic coverage.  Nstein semantic services are now integrated with OpenText’s ECM suite.  This includes automated content categorization and classification as well as enhanced search and navigation. The company will soon be releasing additional analytics capabilities to support content discovery.  Content Analytics services can also be integrated into other ECM systems.  </p>
<p>•  <strong>SAS</strong>: SAS Institute provides a number of products for unstructured information access and discovery as part of its vision for the semantically integrated enterprise.  These include SAS Enterprise Content Categorization, SAS Ontology Management (both for improving document relevance) and SAS Sentiment Analysis and SAS Text Miner for knowledge discovery.  The products integrate with structured information; with Microsoft SharePoint, FAST ESP, Endeca, EMC Documentum; as well as with both Teradata and Greenplum. </p>
<p>• <strong>TEMIS</strong>: TEMIS recently released its Networked Content Manifesto, which describes its vision of a network of semantic links connecting documents to enable new forms of navigation and retrieval from a collection of documents.  It uses text analytics techniques to extract semantic metadata from documents that can then link documents together.  Content Management systems form one part of this linked ecosystem. TEMIS integrates into ECM systems including EMC Documentum and Centerstage, Microsoft SharePoint 2010 and MarkLogic.  </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/557/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/557/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/557/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=557&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/04/05/five-vendors-committed-to-content-analytics-for-ecm/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2011/03/slide1.jpg" medium="image">
			<media:title type="html">Slide1</media:title>
		</media:content>
	</item>
		<item>
		<title>Four Findings from the Hurwitz &amp; Associates Advanced Analytics Survey</title>
		<link>http://fbhalper.wordpress.com/2011/02/23/highlights-from-the-hurwitz-associates-advanced-analytics-survey/</link>
		<comments>http://fbhalper.wordpress.com/2011/02/23/highlights-from-the-hurwitz-associates-advanced-analytics-survey/#comments</comments>
		<pubDate>Wed, 23 Feb 2011 18:09:16 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[advanced analytics survey]]></category>
		<category><![CDATA[predictive analytics survey findings]]></category>
		<category><![CDATA[predictive analytics use cases]]></category>
		<category><![CDATA[survey results]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=541</guid>
		<description><![CDATA[Hurwitz &#38; Associates conducted an online survey on advanced analytics in January 2011. Over 160 companies across a range of industries and company size participated in the survey. The goal of the survey was to understand how companies are using advanced analytics today and what their plans are for the future. Specific topics included: - [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=541&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Hurwitz &amp; Associates conducted an online survey on advanced analytics in January 2011. Over 160 companies across a range of industries and company size participated in the survey.  The goal of the survey was to understand how companies are using advanced analytics today and what their plans are for the future. Specific topics included:</p>
<p>-	Motivation for advanced analytics<br />
-	Use cases for advanced analytics<br />
-	Kinds of users of advanced analytics<br />
-	Challenges with advanced analytics<br />
-	Benefits of the technology<br />
-	Experiences with BI and advanced analytics<br />
-	Plans for using advanced analytics</p>
<p><strong>What is advanced analytics ? </strong><br />
Advanced analytics provides algorithms for complex analysis of either structured or unstructured data. It includes sophisticated statistical models, machine learning, neural networks, text analytics, and other advanced data mining techniques. Among its many use cases, it can be deployed to find patterns in data, prediction, optimization, forecasting, and for complex event processing. Examples include predicting churn, identifying fraud, market basket analysis, and analyzing social media for brand management. Advanced analytics does not include database query and reporting and OLAP cubes.</p>
<p>Many early adopters of this technology have used predictive analytics as part of their marketing efforts.  However, the diversity of use cases for predictive analytics is growing.  In addition to marketing related analytics for use in areas such as market basket analysis, promotional mix, consumer behavior analysis, brand loyalty, churn analysis, companies are using the technology in new and innovative ways. For example, there are newer industry use cases emerging including reliability assessment (i.e. predicting failure in machines), situational awareness, behavior (defense), investment analysis, fraud identification (insurance, finance), predicting disabilities from claims (insurance), and finding patterns in health related data (medical)</p>
<p>The two charts below illustrate several key findings from the survey on how companies use advanced analytics and who within the organization is using this technology. </p>
<p>•	Figure 1 indicates that the top uses for advanced analytics include finding patterns in data and building predictive models. </p>
<p>•	Figure 2 illustrates that users of advanced analytics in many organizations have expanded from statisticians and other highly technical staff to include business analysts and other business users. Many vendors anticipated this shift to business users and enhanced their offerings by adding new user interfaces, for example, which suggest or dictate what model should be used, given a certain set of data.</p>
<p><a href="http://fbhalper.files.wordpress.com/2011/02/slide3.jpg"><img src="http://fbhalper.files.wordpress.com/2011/02/slide3.jpg?w=468" alt="" title=""   class="aligncenter size-full wp-image-542" /></a></p>
<p><a href="http://fbhalper.files.wordpress.com/2011/02/slide41.jpg"><img src="http://fbhalper.files.wordpress.com/2011/02/slide41.jpg?w=468" alt="" title=""   class="aligncenter size-full wp-image-547" /></a></p>
<p>Other highlights include: </p>
<p>•	Survey participants have seen a huge business benefit from advanced analytics.  In fact, over 40% of the respondents who had implemented advanced analytics believed it had increased their company’s top-line revenue.  Only 2% of respondents stated that advanced analytics provided little or no value to their company.<br />
•	Regardless of company size, the vast majority of respondents expected the number of users of advanced analytics in their companies to increase over the next six to 12 months.  In fact, over 50% of respondents currently using the technology expected the number of users to increase over this time period. </p>
<p>The final report will be published in March 2011.  Stay tuned!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/541/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/541/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/541/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=541&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/02/23/highlights-from-the-hurwitz-associates-advanced-analytics-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>

		<media:content url="http://fbhalper.files.wordpress.com/2011/02/slide3.jpg" medium="image" />

		<media:content url="http://fbhalper.files.wordpress.com/2011/02/slide41.jpg" medium="image" />
	</item>
		<item>
		<title>Five basic questions to ask before leaping into low cost social media monitoring</title>
		<link>http://fbhalper.wordpress.com/2011/01/18/five-basic-questions-to-ask-before-leaping-into-low-cost-social-media-monitoring/</link>
		<comments>http://fbhalper.wordpress.com/2011/01/18/five-basic-questions-to-ask-before-leaping-into-low-cost-social-media-monitoring/#comments</comments>
		<pubDate>Tue, 18 Jan 2011 16:36:42 +0000</pubDate>
		<dc:creator>fbhalper</dc:creator>
				<category><![CDATA[advanced analytics]]></category>
		<category><![CDATA[Business Analytics]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[text mining]]></category>
		<category><![CDATA[Fern Halper]]></category>
		<category><![CDATA[keyword search]]></category>
		<category><![CDATA[low-cost social media tools]]></category>
		<category><![CDATA[monitoring buzz]]></category>
		<category><![CDATA[social media buzz]]></category>
		<category><![CDATA[social media monitoring]]></category>

		<guid isPermaLink="false">http://fbhalper.wordpress.com/?p=534</guid>
		<description><![CDATA[I just finished testing two low cost (&#60;$50.00/mo) social media monitoring tools. They were both easy to use with clean user interfaces. Both had some nice features, especially around reaching back out to people making comments in social media. However, running these two services side by side brought home some issues with these kinds of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=534&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I just finished testing two low cost (&lt;$50.00/mo) social media monitoring tools. They were both easy to use with clean user interfaces.  Both had some nice features, especially around reaching back out to people making comments in social media.  However, running these two services side by side brought home some issues with these kinds of offerings &#8211; specifically in the area of coverage, sentiment analysis, and analytics.   Note that I am not naming names because I believe the issues I ran into are not unique to these specific tools, but to the low cost social media monitoring market, in general.  Some of these issues will also apply to higher priced offerings!</p>
<p><strong>What I did:</strong></p>
<p>I ran an analysis using the term “advanced analytics” including the term itself as well as companies (as additional topics) in the space.  I made sure to be as specific and clear as I could be, since I knew topics were keyword based.  I let the services run side by side for several weeks, interacting with the tools on a regular basis.  </p>
<p><strong>What I noticed:</strong></p>
<p>1.	Topic specification.  Tools will vary in how the end user can input what he or she is looking for.  Some will let you try to refine your keyword search (and these are keyword based, they won’t let you discover topics per se), other won’t.  Some will only allow search across all languages, others will allow the user to specify the language.  Find out how you can be sure that you are getting what you are searching for.  For example, does the tool allow you be very specific about words that should and should not be included (i.e. SAS is the name of an analytics company and also an airline)?  Since these low cost tools often don’t provide a way to build a thesaurus, you need to be careful.<br />
2.	Completeness of Coverage. The coverage varied tremendously between the services.   Nor was the coverage the same for the same day for the same company name that I was tracking (and I was pretty specific, although see #1 above).   In fact it seemed to vary by at least an order of magnitude.  I even compared this manually in twitter streams.   When I asked, I was told by one company that if they weren’t picking up everything, it must be a bug and it should be reported (!?). The other company told me all of my content came to me in one big fire hose, because there had been a problem with it, before (!?).  In both cases, there still seemed to be a problem with the completeness of content.  The amount of content just didn’t add up between the two services.  In fact, one company told me that since I was on a trial, I wasn’t getting all of the content – yet even with the firehose effect, the numbers didn’t make sense.  Oh.  And don’t forget to ask if the service can pull in message boards, and which message boards (i.e. public vs. private). For an analysis, all of these content issues might mean that completeness of buzz might be misrepresented which can lead to problems.<br />
3.	Duplicate Handling.  What about the amount of buzz?  I thought that part of my content counting discrepancy might be due to how the company was dealing with duplicates.  So beware.  Some companies count duplicates (such as retweets) as buzz and some do not.  However, be sure to ask when duplicate content is considered duplicate and when it is not.  One company told me that retweets are not counted in buzz, but are included in the tag cloud (!?).<br />
4.	Sentiment analysis. The reality is that most of the low cost tools are not that good at analyzing sentiment.  Even though the company will tell you they are 75% accurate the reality is more like 50%.  Case in point:  on one offering, one job listing was rated positive and another job posting listed as negative.  In looking at the two postings, it wasn’t clear why (shouldn’t a job post be neutral anyway) Note, however, that many of these tools provide a means to change the sentiment from +/-/neutral (if they don’t then don’t buy it).  So, if sentiment is a big deal to you then be prepared to wade through the content and change sentiment, if need be.  Also ask how the company does sentiment analysis and find out at what level it does the analysis (article, sentence, phrase)<br />
5.	Analysis.  Be prepared to ask a lot of questions about how the company is doing its analysis.  For example, sometimes I could not map the total buzz to other analytics numbers (was it duplicates handling or something else).  Additionally, some social media monitoring tools will break down buzz by gender.  How is this determined?  Some companies determine gender based on a name algorithm, while others use profile information from facebook or twitter (obviously not a complete view of buzz by gender, since not all information sources are twitter and facebook like).  Additionally, some of the tools will only show a percentage (a no-no in this case), while others may show the number and the percent.  Ditto with geolocation information.  If the data is incomplete (and isn’t representative of the whole) then there could be a problem with using it for analytical purposes.</p>
<p><strong>What this means</strong></p>
<p>Certainly, the lure of low cost social media platforms is strong, especially for small and medium businesses.   However, I would caution people to do their homework and ask the right questions, before purchasing even a low cost product.  I would also suggest testing a few products (running them side by side for the same time period, even if you have to pay for it for a month or so) to compare the tools in terms of coverage, sentiment, and analysis.  </p>
<p>The reality is that you can end up with an analysis that is completely wrong if you don’t ask the right questions of the service provider.  The amount of buzz might not be what you think it is, how your company compares to another company might be wrong based on how you specified the company name, the sentiment might be entirely wrong if you don’t check it, and the analysis may be misleading unless you understand how it was put together.  </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/fbhalper.wordpress.com/534/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/fbhalper.wordpress.com/534/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/fbhalper.wordpress.com/534/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=fbhalper.wordpress.com&amp;blog=2214430&amp;post=534&amp;subd=fbhalper&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://fbhalper.wordpress.com/2011/01/18/five-basic-questions-to-ask-before-leaping-into-low-cost-social-media-monitoring/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/b5ac81563adee11f86027093fa2247da?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fern</media:title>
		</media:content>
	</item>
	</channel>
</rss>
