<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Findability blog &#187; Research</title>
	<atom:link href="http://blog.findwise.com/category/research/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.findwise.com</link>
	<description>The enterprise search and findability blog</description>
	<lastBuildDate>Wed, 09 May 2012 17:59:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Architecture of Search Systems and Measuring the Search Effectiveness</title>
		<link>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/</link>
		<comments>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 10:03:28 +0000</pubDate>
		<dc:creator>Pawel Wroblewski</dc:creator>
				<category><![CDATA[Academia]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Lecture]]></category>
		<category><![CDATA[Presentation]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Analytics]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3063</guid>
		<description><![CDATA[Lecture made at the 19th of April 2012, at the Warsaw University of Technology. This is the 9th lecture in the regular course for master grade studies, &#8220;Introduction to text mining&#8221;. View more presentations from Findwise Keywords: Search, search effectiveness, search architecture &#160;&#8226;&#160;]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><div id="__ss_12598871" style="width: 510px;">Lecture made at the 19th of April 2012, at the Warsaw University of Technology. This is the 9th lecture in the regular course for master grade studies, &#8220;Introduction to text mining&#8221;. <object id="__sse12598871" width="510" height="426" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="wmode" value="transparent" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=w9-searcharchitecture-120419021838-phpapp01&amp;rel=0&amp;stripped_title=architecture-of-search-systems-and-measuring-the-search-effectiveness&amp;userName=findwise" /><param name="allowscriptaccess" value="always" /><param name="allowfullscreen" value="true" /><embed id="__sse12598871" width="510" height="426" type="application/x-shockwave-flash" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=w9-searcharchitecture-120419021838-phpapp01&amp;rel=0&amp;stripped_title=architecture-of-search-systems-and-measuring-the-search-effectiveness&amp;userName=findwise" allowFullScreen="true" allowScriptAccess="always" wmode="transparent" allowscriptaccess="always" allowfullscreen="true" /> </object></p>
<div style="padding: 5px 0 12px;">View more presentations from <a href="http://www.slideshare.net/findwise">Findwise</a></div>
</div>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">Search, search effectiveness, search architecture</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/"><meta itemprop="datePublished" content="2012-04-24T11:03:28+00:00"><meta itemprop="dateModified" content="2012-04-24T11:05:57+00:00"><meta itemprop="dateCreated" content="2012-04-24T10:51:51+00:00"><meta itemprop="wordCount" content="35"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A look at European Conference on Information Retrieval (ECIR) 2012</title>
		<link>http://blog.findwise.com/a-look-at-european-conference-on-information-retrieval-ecir-2012/</link>
		<comments>http://blog.findwise.com/a-look-at-european-conference-on-information-retrieval-ecir-2012/#comments</comments>
		<pubDate>Wed, 18 Apr 2012 13:25:34 +0000</pubDate>
		<dc:creator>Paula Petcu</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[information retriaval]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3053</guid>
		<description><![CDATA[The 34th European Conference on Information Retrieval was held  1-5 April 2011, in the lovely but crowded city of Barcelona, Spain. The core conference attracted over 100 attendees, with a total of 35 accepted full papers, 28 posters, and 7 demos being presented. As opposed to the previous year, which had 2 parallel sessions, this [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><div>The 34th <a href="http://ecir2012.upf.edu/index.html">European Conference on Information Retrieval</a> was held  1-5 April 2011, in the lovely but crowded city of Barcelona, Spain. The core conference attracted over 100 attendees, with a total of 35 accepted full papers, 28 posters, and 7 demos being presented. As opposed to the <a href="http://blog.findwise.com/ecir-2011-in-retrospect/">previous year</a>, which had 2 parallel sessions, this year’s conference included a single running session. The accepted papers covered a diverse range of topics, and were divided into query representation, blog and online-community search, semi-structured retrieval, applications, evaluation, retrieval models, classification, categorisation and clustering, image and video retrieval, and systems efficiency.</div>
<div>
<p>The best paper award went to Guido Zuccon, Leif Azzopardi, Dell Zhang and Jun Wang for their work entitled &#8220;<a href="http://www.dcs.bbk.ac.uk/~dell/publications/dellzhang_ecir2012.pdf">Top-k Retrieval using Facility Location Analysis</a>&#8221; and presented by Leif Azzopardi during the retrieval models session. The authors propose using facility location analysis taken from the discipline of operations research to address the top-k retrieval problem of finding “the optimal set of k documents from a number of relevant documents given the user&#8217;s query”.</p>
<p>Meanwhile, &#8220;<a href="http://staff.science.uva.nl/~tsagias/wp-content/uploads/2012/01/ecir2012-imdb.pdf">Predicting IMDB Movie Ratings using Social Media</a>&#8221; by Andrei Oghina, Mathias Breuss, Manos Tsagkias and Maarten de Rijke won the best poster award. With a different goal from the best paper, the authors of the poster experiment with a prediction model for rating movies using a set of qualitative and quantitative features extracted from the stream of two social media channels, YouTube and Twitter. Their findings show that the highest predictive performance is obtained by combining features from both channels, and propose as future work to include other social media channels.</p>
<p>The conference was preceded by a full day of workshops and tutorials running in parallel. I attended two workshops: Information Retrieval Over Query Sessions (SIR) during the morning and Task-Based and Aggregated Search (TBAS) in the afternoon. The second workshop ended with an interactive discussion. A third, full-day workshop was Searching 4 Fun!.</p>
<p>The last day was the Industry Day. Only 2 papers here, plus 5 oral contributions, and around 50 attendees. A strong focus of the talks given at the industry day was on opinion-mining: four of the six participating companies/institutions presented work on sentiment analysis and opinion mining from social media streams. Jussi Karlgren, from <a href="http://www.gavagai.se/">Gavagai</a>, argued that sentiment analysis from social media can be used by companies for example in finding reviews or comments made about their product or service, analyse their market position, and predict price movements. Rianne Kaptein, from <a href="http://www.oxyme.com/">Oxyme</a>, backed this up by adding that businesses are interested by what the consumers say about their brand, products or campaigns on social media streams. Furthermore, Hugo Zaragoza from <a href="http://websays.com/">Websays</a> identified two basic needs inside a company: a need for help in reading so that someone can act, and a need for help in explaining so that it can convince. Very interesting topic indeed, and research in this direction will advance as companies become more aware of the business gains from opinion mining of social media.</p>
<p>Overall, ECIR 2012 was a very inspiring conference. It also seemed a very friendly conference, offering many opportunities to network with the fellow attendees. Despite that, several participants said that the number of attendees at this year’s conference has decreased in comparison with previous years. The workshops and the core conference gave me the impression that it has a strong focus on young researchers, as many of the accepted contributions had a student as a first author and presenter at the conference. The fact that there was only one session running at a time was a good decision in my opinion, as the attendees were not forced to miss presentations. Nevertheless, the workshops and tutorials were running in parallel, and although the proceedings of the workshops will be made freely available, I still feel that I missed something that day. The industry day was very exciting, offering the opportunity to share ideas between academia and industry. However, there were not so many presentations, and the topics were not as diverse. I propose that next year Findwise will be among the speakers at the Industry track!</p>
<p><a href="http://ecir2013.org/">ECIR 2013</a> will be held in Moscow, Russia, between 24-28 March. See you there!</p>
</div>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Accountable Person:</b> </span>
    <span class="schema_property_value" itemprop="accountablePerson" content="">Paula Petcu</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Author:</b> </span>
    <span class="schema_property_value" itemprop="author" content="">Paula Petcu</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">information retrieval</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/a-look-at-european-conference-on-information-retrieval-ecir-2012/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/a-look-at-european-conference-on-information-retrieval-ecir-2012/"><meta itemprop="datePublished" content="2012-04-18T14:25:34+00:00"><meta itemprop="dateModified" content="2012-04-18T14:28:47+00:00"><meta itemprop="dateCreated" content="2012-04-18T14:19:42+00:00"><meta itemprop="keywords" content="conference,information retriaval,Research"><meta itemprop="wordCount" content="684"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/a-look-at-european-conference-on-information-retrieval-ecir-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic Search Engine &#8211; What is the Meaning?</title>
		<link>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/</link>
		<comments>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/#comments</comments>
		<pubDate>Fri, 30 Mar 2012 07:59:12 +0000</pubDate>
		<dc:creator>Pawel Wroblewski</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Semantic]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Information science]]></category>
		<category><![CDATA[open source technology]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[Semantic search]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[Technology/Internet]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2867</guid>
		<description><![CDATA[The shortest dictionary definition of semantics is: the study of meaning. The more complex explanation of this term would lead to a relationship that maps words, terms and written expressions into common sense and understanding of objects and phenomena in the real world. It is worthy to mention that objects, phenomena and relationships between them [...]]]></description>
			<content:encoded><![CDATA[<strong itemprop="description"></strong><br /><span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>The shortest dictionary definition of semantics is: <em>the study of meaning.</em> The more complex explanation of this term would lead to a relationship that maps words, terms and written expressions into common sense and understanding of objects and phenomena in the real world. It is worthy to mention that objects, phenomena and relationships between them are language independent. It means that the same semantic network of concepts can map to multiple languages which is useful in automatic translations or cross-lingual searches.</p>
<h2>The approach</h2>
<p>In the proposed approach semantics will be modeled as a defined ontology making it possible for the web to &#8220;understand&#8221; and satisfy the requests and intents of people and machines to use the web content. The ontology is a model that encapsulates knowledge from specific domain and consists of hierarchical structure of classes (taxonomy) that represents concepts of things, phenomena, activities etc. Each concept has a set of attributes that represent the mapping of that particular concept to words and phrases that represents that concepts in written language (as shown at the top of the figure below). Moreover, the proposed ontology model will have horizontal relationships between concepts, e.g. the linguistic relationships (synonymy, homonymy etc.) or domain specific relationships (medicine, law, military, biological, chemical etc.). Such a defined ontology model will be called a <strong>Semantic Map </strong>and will be used in the proposed search engine. An exemplar part of an enriched ontology of beverages is shown in the figure below. The ontology is enriched, so that the concepts can be easily identified in text using attributes such as the representation of the concept in the written text.</p>
<h2>Semantic Map</h2>
<p>The Semantic Map is an ontology that is used for bidirectional mapping of textual representation of concepts into a space of their meaning and associations. In this manner, it becomes possible to transform user queries into concepts, ideas and intent that can be matched with indexed set of similar concepts (and their relationships) derived from documents that are returned in a form of result set. Moreover, users will be able to precise and describe their intents using visualized facets of concept taxonomy, concept attributes and horizontal (domain) relationships. The search module will also be able to discover users’ intents based on the history of queries and other relevant factors, e.g. ontological axioms and restrictions. A potentially interesting approach will retrieve additional information regarding the specific user profile from publicly available information available in social portals like Facebook, blog sites etc., as well as in user’s own bookmarks and similar private resources, enabling deeper intent discovery.</p>
<p><a href="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/pomost/" rel="attachment wp-att-3060"><img itemprop="image" class="aligncenter size-full wp-image-3060" title="Semantic Search" src="http://blog.findwise.com/wp-content/uploads/2012/03/pomost.jpg" alt="" width="740" height="578" /></a></p>
<h2>Semantic Search Engine</h2>
<p>The search engine will be composed of the following components:</p>
<ul>
<li><strong>Connector</strong> – This module will be responsible for acquisition of data from external repositories and pass it to the search engine. The purpose of the connector is also to extract text and relevant metadata from files and external systems and pass it to further processing components.</li>
<li><strong>Parser</strong> – This module will be responsible for text processing including activities like: tokenization (breaking text into lexems – words or phrases), lemmatization (normalization of grammar forms), exclusion of stop-words, paragraph and sentence boundary detector. The result of parsing stage is structured text with additional annotations that is passed to semantic Tagger.</li>
<li><strong>Tagger</strong> – This module is responsible for adding semantic information for each lexem extracted from the processed text. Technically it refers to addition of identifiers to relevant concepts stored in the Semantic Map for each lexem. Moreover phrases consisting of several words are identified and disambiguation is performed basing on derived contexts. Consider the example illustrated in the figure.</li>
<li><strong>Indexer</strong> – This module is responsible for taking all the processed information, transformation and storage into the search index. This module will be enriched with methods of semantic indexing using ontology (semantic map) and language tools.</li>
<li><strong>Search index </strong>– The central storage of processed documents (document repository) structured properly to manage full text of the documents, their metadata and all relevant semantic information (document index). The structure is optimized for search performance and accuracy.</li>
<li><strong>Search </strong>– This module is responsible for running queries against the search index and retrieval of relevant results. The search algorithms will be enriched to use user intents (complying data privacy) and the prepared Semantic Map to match semantic information stored in the search index.</li>
</ul>
<p>What do you think? Please let us know by writing a comment.</p>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Accountable Person:</b> </span>
    <span class="schema_property_value" itemprop="accountablePerson" content="">Pawel Wroblewski</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>About:</b> </span>
    <span class="schema_property_value" itemprop="about" content="">Semantic Search</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Description:</b> </span>
    <span class="schema_property_value" itemprop="description" content="">A proposal for a semantic search engine.</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">semantic search engine</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/"><meta itemprop="datePublished" content="2012-03-30T08:59:12+00:00"><meta itemprop="dateModified" content="2012-04-18T17:37:24+00:00"><meta itemprop="dateCreated" content="2012-01-26T21:54:34+00:00"><meta itemprop="keywords" content="Information science,open source technology,search engine,Semantic search,semantic web,Technology/Internet"><meta itemprop="wordCount" content="724"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Searching for Zebras: Doing More with Less</title>
		<link>http://blog.findwise.com/searching-for-zebras-doing-more-with-less/</link>
		<comments>http://blog.findwise.com/searching-for-zebras-doing-more-with-less/#comments</comments>
		<pubDate>Wed, 15 Feb 2012 13:50:44 +0000</pubDate>
		<dc:creator>Paula Petcu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[British Medical Journal]]></category>
		<category><![CDATA[correct disease]]></category>
		<category><![CDATA[diseases]]></category>
		<category><![CDATA[genetic diseases]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[medical diagnostic]]></category>
		<category><![CDATA[New England Journal]]></category>
		<category><![CDATA[only rare and genetic disease]]></category>
		<category><![CDATA[open-source search engine]]></category>
		<category><![CDATA[rare diseases]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[the New England Journal of Medicine]]></category>
		<category><![CDATA[web resources]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2876</guid>
		<description><![CDATA[There is a very controversial and highly cited 2006 British Medical Journal (BMJ) article called &#8220;Googling for a diagnosis &#8211; use of Google as a diagnostic aid: internet based study&#8221; which concludes that, for difficult medical diagnostic cases, it is often useful to use Google Search as a tool for finding a diagnosis. Difficult medical [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>There is a very controversial and highly cited 2006 British Medical Journal (BMJ) article called &#8220;<a href="http://www.bmj.com/content/early/2005/12/31/bmj.39003.640567.AE.abstract">Googling for a diagnosis &#8211; use of Google as a diagnostic aid: internet based study</a>&#8221; which concludes that, for difficult medical diagnostic cases, it is often useful to use Google Search as a tool for finding a diagnosis. Difficult medical cases are often represented by rare diseases, which are diseases with a very low prevalence.</p>
<p>The authors use 26 diagnostic cases published in the New England Journal of Medicine (NEJM) in order to compile a short list of symptoms describing each patient case, and use those keywords as queries for Google. The authors, blinded to the correct disease (a rare diseases in 85% of the cases), select the most &#8216;prominent&#8217; diagnosis that fits each case. In 58% of the cases they succeed in finding the correct diagnosis.</p>
<p>Several other articles also point to Google as a tool often used by clinicians when searching for medical diagnoses.</p>
<p>But is that so convenient, is that enough, or can this process be easily improved? Indeed, two major advantages for Google are the clinicians&#8217; familiarity with it, and its fresh and extensive index. But how would a vertical search engine with focused and curated content compare to Google when given the task of finding the correct diagnosis for a difficult case?</p>
<p>Well, take an open-source search engine such as <a href="http://lemurproject.com">Indri</a>, index around 30,000 freely available medical articles describing rare or genetic diseases, use an off-the-shelf retrieval model, and there you have <a href="http://findzebra.com">Zebra</a>. In medicine, the term &#8220;zebra&#8221; is a slang for a surprising diagnosis. In comparison with a search on Google, which often returns results that point to unverified content from blogs or content aggregators, the documents from this vertical search engine are crawled from 10 web resources containing only rare and genetic disease articles, and which are mostly maintained by medical professionals or patient organizations.</p>
<p>Evaluating on a set of 56 queries extracted in a similar manner to the one described above, Zebra easily beats Google. Zebra finds the correct diagnosis in top 20 results in 68% of the cases, while Google succeeds in 32% of them. And this is only the performance of the Zebra with the baseline relevance model — imagine how much more could be done (for example, displaying results as a network of diseases, clustering or even ranking by diseases, or automatic extraction and translation of electronic health record data).</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/searching-for-zebras-doing-more-with-less/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/searching-for-zebras-doing-more-with-less/"><meta itemprop="datePublished" content="2012-02-15T14:50:44+00:00"><meta itemprop="dateModified" content="2012-02-25T15:16:38+00:00"><meta itemprop="dateCreated" content="2012-02-07T12:43:41+00:00"><meta itemprop="keywords" content="British Medical Journal,correct disease,diseases,genetic diseases,Google,medical diagnostic,New England Journal,only rare and genetic disease,open-source search engine,rare diseases,search engine,the New England Journal of Medicine,web resources"><meta itemprop="wordCount" content="394"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/searching-for-zebras-doing-more-with-less/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise search &#8211; market overview 2011</title>
		<link>http://blog.findwise.com/enterprise-search-market-overview-2011/</link>
		<comments>http://blog.findwise.com/enterprise-search-market-overview-2011/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 07:53:55 +0000</pubDate>
		<dc:creator>Caroline Abrahamsson</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Market trends]]></category>
		<category><![CDATA[Research]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2738</guid>
		<description><![CDATA[A few weeks ago Forrester research released a report with an overview of the 12 leading Enterprise search vendors on the global market (Attivio, Autonomy, Coveo, Endeca, Exalead, Fabasoft, Google, IBM, ISYS Search, Microsoft, Sinequa and Vivisimo). When I wrote about the Gartner report, readers commented on the fact that open source solutions were not [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>A few weeks ago Forrester research released a report with an overview of the 12 leading Enterprise search vendors on the global market (Attivio, Autonomy, Coveo, Endeca, Exalead, Fabasoft, Google, IBM, ISYS Search, Microsoft, Sinequa and Vivisimo).</p>
<p>When I wrote about the <a href="http://findabilityblog.se/gartner-and-the-magic-quadrants-%E2%80%93-crowning-the-leaders-of-enterprise-search/">Gartner report</a>, readers commented on the fact that open source solutions were not part of the scope, even though their market share is increasing rapidly. The Forrester report has the same approach, except it includes vendors offering their products stand-alone as well as those with products integrated in portal/ECM solutions.</p>
<p>So why the exclusion of open source? Well, it appears difficult to decide on <strong>how</strong> to evaluate open source, especially when it comes to more advanced appliances.</p>
<p>Looking at the Forrester report, it includes some familiar conclusions but also a few new insights. Leslie Owen from Forrester concludes that<em> “Google, Autonomy, and Microsoft are the most well-known names; they own a large portion of the existing market”.</em> Hence, these vendors are still standing strong, even though they are challenged in various areas.</p>
<p>More surprisingly, some niche players get higher scores than the giants in core areas such as “Indexing and connectivity”, “Interface flexibility” and “Social and collaborative features”.</p>
<p>Vivisimo is seen as somewhat of a leader (with a slightly lower score on Mobile support and Semantics/text analysis). In the Gartner report, Vivisimo was excluded from the information access evaluation due to the fact that they were ”<em>focusing on specialized application categories, such as customer service</em>”.</p>
<p style="text-align: center;"><a href="http://findabilityblog.se/enterprise-search-market-overview-2011/forrester/" rel="attachment wp-att-2742"><img itemprop="image" class="size-medium wp-image-2742 aligncenter" src="http://media.findabilityblog.se//2011/09/forrester-300x201.png" alt="Search vendor overview" width="300" height="201" /></a></p>
<p>An interesting reflection from Forrester is that “<em>in the next few years, we expect prices to rise as specialized vendors wax poetic on the transformative power of search in order to distinguish their products from Google and Microsoft FAST Search for SharePoint”. </em>On the Nordic market, we have not seen a shift to such a strategy, but rather the opposite, since open source (with zero license fees) is becoming accepted in an Enterprise environment to a larger extent.</p>
<p>The vendors that provide integrated solutions (to CMS/WCM etc) still remains strong, whereas the stand-alone solutions becomes exposed to completion in new ways. It will be interesting to follow the US and Nordic market to see how this evolves within the next year. It might be that the market differs when it comes to open source adaption.</p>
<p>If you wish to read the full report it can be <a href="http://vivisimo.com/landing/download-forresterwave.html">downloaded</a> from Vivisimo through a simple registration.<br />
To get a complete overview of vendors, I recommend reading both the Gartner and Forrester report.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/enterprise-search-market-overview-2011/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/enterprise-search-market-overview-2011/"><meta itemprop="datePublished" content="2011-09-26T07:53:55+00:00"><meta itemprop="dateModified" content="2011-09-26T07:53:55+00:00"><meta itemprop="dateCreated" content="2011-09-23T07:39:31+00:00"><meta itemprop="wordCount" content="418"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/enterprise-search-market-overview-2011/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ECIR 2011 in retrospect</title>
		<link>http://blog.findwise.com/ecir-2011-in-retrospect/</link>
		<comments>http://blog.findwise.com/ecir-2011-in-retrospect/#comments</comments>
		<pubDate>Wed, 27 Apr 2011 07:25:39 +0000</pubDate>
		<dc:creator>Svetoslav Marinov</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findwise]]></category>
		<category><![CDATA[Internet search]]></category>
		<category><![CDATA[Relevancy]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Artificial Artificial Intelligence]]></category>
		<category><![CDATA[Dublin]]></category>
		<category><![CDATA[Evgeniy Gabrilovich]]></category>
		<category><![CDATA[hard advertising]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Oscar Täckström]]></category>
		<category><![CDATA[retrieval systems]]></category>
		<category><![CDATA[search performace]]></category>
		<category><![CDATA[search results]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[social network]]></category>
		<category><![CDATA[Thorsten Joachims]]></category>
		<category><![CDATA[Web search results]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2516</guid>
		<description><![CDATA[The European Conference on Information Retrieval (ECIR) 2011 took place in Dublin last week, 18-21 April. In this blogpost I would try to highlight some of the papers and talks from the conference which caught my attention and back it up with what other attendees said about it. First, I was intrigued by the session [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>The <a href="http://www.ecir2011.dcu.ie/">European Conference on Information Retrieval</a> (ECIR) 2011 took place in Dublin last week, 18-21 April. In this blogpost I would try to highlight some of the papers and talks from the conference which caught my attention and back it up with what other attendees said about it.</p>
<p>First, I was intrigued by the session on evaluation for IR and especially the topic of Croudsourcing. In my opition, the paper <a href="http://www.springerlink.com/content/j8nvr881n3686161/">A Methodology for Evaluating Aggregated Search Results</a>, which also got the prize for best student paper, was among the most pedagogically presented ones. It deals with the task of incorporating search results from a number of different sources, called verticals, into Web search results. By using a small number of human judgements for a given query the authors present the way to evaluate any possible permutation of verticals in the result presentation. I think that this methodology should be adopted in the world of Enterprise search, since it is exactly there where we crawl, index and present information from a number of different sources &#8211; Web, databases, fileshares, etc. The prerequisites are really minimal and low cost but the return value, the user experience, seems quite high.</p>
<p><a href="https://www.mturk.com/mturk/welcome">Amazon Mechanical Turk</a>, or the Artificial Artificial Intelligence, which is the marketplace for Croudsourcing, provides a way for a ridiculously small sum of money to perform evaluation, relevance assessment or any task for which you would need humans to give you some judgements. Leaving aside ethical issues, two papers in the conference presented ways of how you can utilize this service for some IR tasks.</p>
<p><a href="http://www.cs.technion.ac.il/~gabr/">Evgeniy Gabrilovich</a> from Yahoo! Research, who won the Karen Sparck Jones award for 2010, gave a very interesting keynote talk on Computational Advertising. Up to now, it has never struck me how hard advertising in Information Retrieval systems is actually. I liked one of his points on the future of Ads &#8211; by using product feeds, one can automatically create product description via Text Summarization and Natural Language Generation and index this, thus avoiding bid words.</p>
<p>Another interesting and very pedagogically presented paper was about the <a href="http://nlp.fi.muni.cz/projekty/gensim/">gensim package</a> by Radim Řehůřek. I definitely think we can use it in some of our projects. In general, text categorization and IR for social network were the dominant tracks. In one of the social networks tracks, Oscar Täckström presented a neat way of discovering fine-grained sentiment where some coarse-grained supervision is available. It really hooked me on trying it for any of our customers where sentiment analysis is required.</p>
<p><a href="http://www.cs.cornell.edu/people/tj/">Thorsten Joachims</a>, the last of the keynote speakers, gave a very inspiring talk on The Value of User Feedback. He put forward the idea of designing retrieval systems for feedback. In stead of just looking at the clicklogs <em>post factum</em> one can think of a system which uses the clicks feedback to learn, thus creating a better ranker for a given query and a given user need. In a single session, we can use click feedback to disambiguate the query and deliver results on the run which are of immediate benefit to the users.</p>
<p>Unfortunately, I guess I could have missed other interesting presentations but with two parallel sessions and several workshops there was a limit to what I could devour. What surprised me though, was that there were very few papers by the industry. We do try to solve exactly the same problems and tackle the same issues as academia. We, at Findwise, have constantly flagged the huge benefit of good, relevant Metadata for the task of achieving better search performace, which was also touched upon in the paper &#8220;Topic Classification in Social Media using Metadata from Hyperlinked Objects&#8221;.</p>
<p>It was really great to visit Dublin and attent ECIR 2011. It was an inspiring conference and I do believe that at next ECIR we, from Findwise, can be on the podium, sharing our knowledge and hands-on experience on Enterprise search and IR.</p>
<p><strong>Sláinte!</strong></p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/ecir-2011-in-retrospect/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/ecir-2011-in-retrospect/"><meta itemprop="datePublished" content="2011-04-27T08:25:39+00:00"><meta itemprop="dateModified" content="2011-04-27T08:25:39+00:00"><meta itemprop="dateCreated" content=""><meta itemprop="keywords" content="Amazon,Artificial Artificial Intelligence,Dublin,Evgeniy Gabrilovich,hard advertising,Metadata,Oscar T&Atilde;&curren;ckstr&Atilde;&para;m,retrieval systems,search performace,search results,social media,social network,Thorsten Joachims,Web search results,Yahoo"><meta itemprop="wordCount" content="651"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/ecir-2011-in-retrospect/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Delivering information where it&#8217;s needed</title>
		<link>http://blog.findwise.com/delivering-information-where-its-needed/</link>
		<comments>http://blog.findwise.com/delivering-information-where-its-needed/#comments</comments>
		<pubDate>Thu, 07 Apr 2011 05:21:08 +0000</pubDate>
		<dc:creator>David Ronnqvist</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[cellular telephone]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[niche product]]></category>
		<category><![CDATA[search results]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2508</guid>
		<description><![CDATA[I recently started working at Findwise after having finished my thesis on location-based information delivery in a mobile phone. The purpose of my thesis was to: Investigate how location-based information (as opposed to fixed locations) could be connected to search results Improve quality of location-based information by considering the course and velocity of the user [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>I recently started working at Findwise after having finished my thesis on location-based information delivery in a mobile phone. The purpose of my thesis was to:</p>
<ul>
<li>Investigate how <em>location-based information</em> (as opposed to fixed locations) could be connected to search results</li>
<li>Improve quality of location-based information by considering the course and velocity of the user</li>
</ul>
<p>To start with, I created an iPhone application with a location-based reminder system. The reminders described location constraints and users could create reminders with single locations (<em>at home</em>) or groups of locations (<em>at any pharmacy</em>). To find these groups of locations, the system searched for locations with associated information (like nearby pharmacies) and delivered this information without users having to click <em>Search</em> repeatedly.</p>
<p>This is an unusual approach to search as the user is passive, instead the system is performing searches for the user. However, to make search results relevant one has to add contextual constraints to describe <em>when, where</em> and to <em>whom</em> a piece of information is relevant. When all constraints are met, information should be relevant. If not, the system lacks some crucial contextual constraints.</p>
<p>When search is automated, the importance of relevant search results increases and the more you know of the users world, the better you can adjust the results. However, traditional search can also benefit from contextual information. It can be used as a filter where search results that are irrelevant in the current context are removed. Alternatively it could be a part of the relevance model, improving search results by reordering them according to context. Hence, whereas automatic information delivery is probably undesirable for many types of information &#8211; contextual constraints can still be of good use!</p>
<p>The people who tested my application created 25% of their reminders as groups of locations and found it useful as it helped them find places they weren’t aware of, facilitating opportunistic behavior. The course and velocity information reduced the number of false-positive information deliveries. Overall, the system worked well as a niche product.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/delivering-information-where-its-needed/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/delivering-information-where-its-needed/"><meta itemprop="datePublished" content="2011-04-07T06:21:08+00:00"><meta itemprop="dateModified" content="2011-04-07T06:21:08+00:00"><meta itemprop="dateCreated" content=""><meta itemprop="keywords" content="cellular telephone,iPhone,niche product,search results"><meta itemprop="wordCount" content="330"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/delivering-information-where-its-needed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gartner and the magic quadrants – crowning the leaders of Enterprise Search</title>
		<link>http://blog.findwise.com/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/</link>
		<comments>http://blog.findwise.com/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/#comments</comments>
		<pubDate>Tue, 25 Jan 2011 21:42:40 +0000</pubDate>
		<dc:creator>Caroline Abrahamsson</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Market trends]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Watch]]></category>
		<category><![CDATA[Vendors]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2444</guid>
		<description><![CDATA[For years Gartner, the research and advisory company, has been publishing their magic quadrants – and their verdict of everything from ECM-systems to Data Warehouse and E-commerce plays a big role in many company’s decision to choose the right tools. Simply put, the vendors are presented in a matrix measuring the different players by ability [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>For years <a title="Gartner" href="http://www.gartner.com/technology/home.jsp" target="_blank">Gartner</a>, the research and advisory company, has been publishing their <a title="Gartners' magic quadrants" href="http://www.gartner.com/it/products/mq/mq_ms.jsp" target="_blank">magic quadrants </a>– and their verdict of everything from ECM-systems to Data Warehouse and E-commerce plays a big role in many company’s decision to choose the right tools.<br />
Simply put, the vendors are presented in a matrix measuring the different players by ability to execute <em>(product, overall viability, customer experience etc.) </em>and the completeness of their vision <em>(offering strategy, innovation etc.)</em>. The vendors are then positioned as niche players (a rather crowded spot), visionaries, challengers and leaders.</p>
<p>At the end of last year Gartner decided to retire their old “Information Access Quadrant” and introduce “<a title="Enterprise Search MarketScope" href="http://www.gartner.com/technology/media-products/reprints/microsoft/vol14/article9/article9.html" target="_blank">Enterprise Search MarketScope</a>” due to a more mature market. A number of vendors (such as Vivisimo and Recommind) were removed, in order to exclude those whose businesses were not entirely search driven.</p>
<p>The evaluation criteria’s for MarketScope cover: offering (product) strategy, Innovation, Overall viability (business unit, financial, strategy, and organization), Customer experience, Market understanding and business model.<br />
To summarize: the criteria’s are to a large extent the same, but the two areas “<em>overall viability</em>” and <em>&#8220;customer experience</em>” are weighted higher than the rest. This is most likely a result of the last years discussion around user friendly interfaces, easier administration and the fact that some customers have suffered quite bad when vendors do not survive (one example in Northen Europe is the <a title="SurfRay " href="http://www.jboye.com/blogpost/surfray-goes-bankrupt-what-it-means-for-customers/" target="_blank">Danish vendor</a> that went bankrupted for some time)</p>
<p>The yearly fight between the three leaders; <a title="Microsoft search" href="http://sharepoint.microsoft.com" target="_self">Microsoft</a>, <a title="Endeca search" href="http://www.endeca.com/en/home.html" target="_blank">Endeca</a> and <a title="Autonomy" href="http://autonomy.com/">Autonomy</a> has been somewhat disrupted and Microsoft, Endeca and <a title="Google Enterprise Search" href="http://www.google.com/enterprise/" target="_blank">Google</a> are now seen as the leaders.<br />
Microsoft has got a very broad product line, which stretches from low-price and less functionality to Enterprise Search built on the former FAST technology. Endeca follow the same trend, as Gartner puts it their “products (are) intended to serve organizations seeking to develop general search installations..(..) broadly applicable for a variety of different search challenges”.<br />
In the old quadrant, Google remained a “challenger” for quite some time – but never made it to the “leaders” corner. Ease of administration and “user friendly” are two words that keeps being repeated. That, in combination with a profit of $ 7290000000 during the last quarter of 2010 makes Google a player that easily can continue to develop their Enterprise business.</p>
<div id="attachment_2446" class="wp-caption aligncenter" style="width: 490px"><a rel="attachment wp-att-2446" href="http://findabilityblog.se/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/marketscope/"><img itemprop="image" class="size-full wp-image-2446" title="Marketscope" src="http://media.findabilityblog.se//2011/01/Marketscope1.gif" alt="" width="480" height="288" /></a><p class="wp-caption-text">Gartner&#39;s MarketScope for Enterprise Search </p></div>
<p style="text-align: center;">&nbsp;</p>
<p>Autonomy should still not be disregarded, the main reason for it falling a bit behind the three others seem to be conquerable problems with support and pricing transparency. It will be interesting to see how Autonomy chooses to handle these issues during 2011.</p>
<p>To put it short: the new MarketScope is good reading with quite few surprises. If you wish to get a better understanding of the development going on at the different vendors, start with <a title="Gartner MarketScope for Enterprise Search" href="http://www.gartner.com/technology/media-products/reprints/microsoft/vol14/article9/article9.html" target="_blank">Gartner</a> and continue to search among <a title="Findability blog" href="http://findabilityblog.se/" target="_blank">our blog posts</a>.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/"><meta itemprop="datePublished" content="2011-01-25T22:42:40+00:00"><meta itemprop="dateModified" content="2011-06-30T12:28:39+00:00"><meta itemprop="dateCreated" content="2011-01-25T22:42:40+00:00"><meta itemprop="wordCount" content="468"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/gartner-and-the-magic-quadrants-%e2%80%93-crowning-the-leaders-of-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Better search engines and other stuff about information practices in workplaces</title>
		<link>http://blog.findwise.com/better-search-engines-and-other-stuff-about-information-practices-in-workplaces/</link>
		<comments>http://blog.findwise.com/better-search-engines-and-other-stuff-about-information-practices-in-workplaces/#comments</comments>
		<pubDate>Wed, 20 Oct 2010 12:53:19 +0000</pubDate>
		<dc:creator>Katriina Bystrom</dc:creator>
				<category><![CDATA[Information seeking behaviour]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[ASIS&T]]></category>
		<category><![CDATA[enterprise search engine]]></category>
		<category><![CDATA[Henrik Strindberg]]></category>
		<category><![CDATA[Pennsylvania]]></category>
		<category><![CDATA[Pittsburgh]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[search engine plays]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Swedish Foundation for Strategic Research]]></category>
		<category><![CDATA[United States]]></category>
		<category><![CDATA[University of Borås]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2350</guid>
		<description><![CDATA[During this year I have worked on a research project that aims to facilitate the development and implementation of an enterprise search engine. By understanding the use and value of information at the workplace, we hope to create even better preconditions for optimizing a search engine to the requirements of a specific organization. We use [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>During this year I have worked on a research project that aims to facilitate the development and implementation of an enterprise search engine. By understanding the use and value of information at the workplace, we hope to create even better preconditions for optimizing a search engine to the requirements of a specific organization.</p>
<p>We use a work-task based research approach where we study information practices – that is, the normalized ways we use to recognize information needs, look for information, and how it is valued and used. By studying such practices in real-life work tasks, we can outline the role that a search engine plays in relation to other work tasks as well as to other ways of finding information. In short, being engaged in a creativity-oriented work task initiates different types of information practices compared to the practices we use in everyday, routine-based work tasks …</p>
<p>The creativity-oriented work tasks involve a dimension of innovation, and concepts such as learning and development are often used to describe these activities. Uncertainty is something that is associated with curiosity and may be seen as a driving force behind information seeking. Information that is rich in nuances and that offers different, even contradictory explanations or descriptions is usually appreciated, and the task outcome is only vaguely discerned at first. Routine-oriented tasks, on the other hand, are focused on increasing effectiveness and reducing uncertainty as quickly as possible in the task outcome, which itself may be sketched out relatively clearly from the beginning. Information seeking is often directed to readily available facts. All this means that a search engine must support a variety of information practices at any given workplace!</p>
<p>The “we” in this project is myself together with a <a href="http://www.findwise.se">Findwise</a> colleague Henrik Strindberg. The project is financially supported by the Swedish Foundation for Strategic Research, and while I am not working with the present project I am employed by the University of Borås.</p>
<p>Just now I am finalizing a presentation of the project for the <a href="http://www.ickm-2010.org/">ICKM conference</a> in Pittsburgh, PA, USA, next week. The presentation is entitled “Interrelated use and value of information sources”, and will be available through the conference proceedings in due time.</p>
<p>Very exciting … and while there I will also attend the board meetings of the ASIS&amp;T’s Board of Directors as a newly appointed Director-at-Large. Very exciting, too!</p>
<p>The 73rd Annual Meeting of <a href="http://www.asis.org/">ASIS&amp;T</a> focuses on “Navigation Streams in an Information Ecosystem”.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/better-search-engines-and-other-stuff-about-information-practices-in-workplaces/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/better-search-engines-and-other-stuff-about-information-practices-in-workplaces/"><meta itemprop="datePublished" content="2010-10-20T13:53:19+00:00"><meta itemprop="dateModified" content="2010-10-20T13:53:19+00:00"><meta itemprop="dateCreated" content=""><meta itemprop="keywords" content="ASIS&amp;amp;T,enterprise search engine,Henrik Strindberg,Pennsylvania,Pittsburgh,search engine,search engine plays,search engines,Swedish Foundation for Strategic Research,United States,University of Bor&Atilde;&yen;s"><meta itemprop="wordCount" content="407"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/better-search-engines-and-other-stuff-about-information-practices-in-workplaces/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why is search easy and hard?</title>
		<link>http://blog.findwise.com/why-is-search-easy-and-hard/</link>
		<comments>http://blog.findwise.com/why-is-search-easy-and-hard/#comments</comments>
		<pubDate>Thu, 16 Sep 2010 07:39:46 +0000</pubDate>
		<dc:creator>Maria Johansson</dc:creator>
				<category><![CDATA[Future development]]></category>
		<category><![CDATA[Information seeking behaviour]]></category>
		<category><![CDATA[Interaction Design]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[Gene Golovchinsky]]></category>
		<category><![CDATA[information access systems]]></category>
		<category><![CDATA[the New York Times]]></category>
		<category><![CDATA[Yahoo Labs]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2266</guid>
		<description><![CDATA[Last year my colleague Lina and I went to the Workshop on Human Computer Interaction and Information Retrieval (HCIR) in Washington DC. This year we did not have the possibility to attend but since all the material is available online I took part remotely any way. I wanted to share with you what I found [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>Last year my colleague Lina and I went to the <a href="http://www.hcir.info/">Workshop on Human Computer Interaction and Information Retrieval</a> (HCIR) in Washington DC. This year we did not have the possibility to attend but since all the material is available online I took part remotely any way. I wanted to share with you what I found most interesting this year. (Daniel Tunkelang who was one of the organizers also posted a <a href="http://thenoisychannel.com/2010/08/27/hcir-2010-bigger-and-better-than-ever/">good overview of the event</a> on his blog.)</p>
<p>This years keynote speaker was Dan Russell, a researcher from Google. He talked about Search Quality and user happiness; Why search is easy and hard. The point I found most interesting in his presentation was how improvement is not only needed when it comes to tools and data but also improving the users&#8217; search skills. My own experience from various search projects is similar; users are not good at searching. Even though they are looking for a specific version of a technical documentation for a specific product they might just enter the name of the product, or even the product family. (It&#8217;s a bit like searching for &#8216;camera&#8217; when you expect to find support documentation on your Dioptric lens for you Canon EOS 60D.) So I agree that users need better search skills. In his presentation Russell also presented some ideas on how a search application can help users improve their search skills.</p>
<div id="__ss_5065727" style="width: 425px; text-align: center;"><strong style="display: block; margin: 12px 0 4px;"><a title="Dan Russell - Search Quality and User Happiness" href="http://www.slideshare.net/dtunkelang/dan-russell-search-quality-and-user-happiness">Dan Russell &#8211; Search Quality and User Happiness</a></strong></div>
<p style="text-align: center;"><object id="__sse5065727" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="531" height="444" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=hcir-keynote-talk-russell-aug-22-2010-100827000301-phpapp01&amp;stripped_title=dan-russell-search-quality-and-user-happiness" /><param name="name" value="__sse5065727" /><param name="allowfullscreen" value="true" /><embed id="__sse5065727" type="application/x-shockwave-flash" width="531" height="444" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=hcir-keynote-talk-russell-aug-22-2010-100827000301-phpapp01&amp;stripped_title=dan-russell-search-quality-and-user-happiness" name="__sse5065727" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>Search is both easy and hard. Perhaps this is one of the reasons for the introduction of the HCIR Challenge as a new part of the workshop . From the HCIR website:</p>
<blockquote><p>The aims of the challenge are to encourage researchers and practitioners to build and demonstrate information access systems satisfying at least one of the following:</p>
<ul>
<li> Not only deliver relevant documents, but provide facilities for making meaning with those documents.</li>
<li> Increase user responsibility as well as control; that is, the systems require and reward human effort.</li>
<li>Offer the flexibility to adapt to user knowledge / sophistication / information need.</li>
<li>Are engaging and fun to use.</li>
</ul>
</blockquote>
<p>The winner of the challenge was a team of researchers from Yahoo Labs who presented <a href="http://sites.google.com/site/hcirworkshop/hcir-2010/proceedings/Matthews_cr32.pdf?attredirects=0">Searching Through Time in the New York Times</a>. The Time Explorer features a results page with an interactive time line that illustrates how the volume of articles (results) have changed over time. I recommend that you read the article in <a href="http://www.technologyreview.com/computing/26113/">tech review</a> to learn more about the project, or try out the <a href="http://fbmya01.barcelonamedia.org:8080/future/index.jsp">Time explorer demo</a> yourself. You can also learn more about the challenge in this <a href="http://palblog.fxpal.com/?p=4477">blog post</a> by Gene Golovchinsky.</p>
<p>All the papers and posters from the workshop can be found on the new <a href="http://www.hcir.info/">website</a>.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/why-is-search-easy-and-hard/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/why-is-search-easy-and-hard/"><meta itemprop="datePublished" content="2010-09-16T08:39:46+00:00"><meta itemprop="dateModified" content="2010-09-16T08:39:46+00:00"><meta itemprop="dateCreated" content=""><meta itemprop="keywords" content="Gene Golovchinsky,information access systems,the New York Times,Yahoo Labs"><meta itemprop="wordCount" content="443"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/why-is-search-easy-and-hard/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

