<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Findability blog &#187; Search</title>
	<atom:link href="http://blog.findwise.com/category/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.findwise.com</link>
	<description>The enterprise search and findability blog</description>
	<lastBuildDate>Wed, 09 May 2012 17:59:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Video: Search Analytics in Practice</title>
		<link>http://blog.findwise.com/video-search-analytics-in-practice/</link>
		<comments>http://blog.findwise.com/video-search-analytics-in-practice/#comments</comments>
		<pubDate>Wed, 09 May 2012 10:43:10 +0000</pubDate>
		<dc:creator>Kristian Norling</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findwise]]></category>
		<category><![CDATA[Information seeking behaviour]]></category>
		<category><![CDATA[Intranet]]></category>
		<category><![CDATA[Presentation]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Analytics]]></category>
		<category><![CDATA[User Experience]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3096</guid>
		<description><![CDATA[Search Analytics in Practice from Findwise on Vimeo. This presentation is about how to use search analytics to improve the search experience. A small investment in time and effort can really improve the search on your intranet or website. You will get practical advice on what metrics to look at and what actions can be [...]]]></description>
			<content:encoded><![CDATA[<strong itemprop="description"></strong><br /><span itemprop="mainContentOfPage"><span itemprop="articleBody"><p><iframe src="http://player.vimeo.com/video/41761041" frameborder="0" width="425" height="239"></iframe></p>
<p>Search Analytics in Practice from <a href="http://vimeo.com/findwise">Findwise</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
<p>This presentation is about how to use search analytics to improve the search experience. A small investment in time and effort can really improve the search on your intranet or website. You will get practical advice on what metrics to look at and what actions can be taken as a result of the analysis.</p>
<p><a href="http://video.findwise.com/video/41761040">Video in swedish &#8220;Sökanalys i praktiken&#8221;.</a></p>
<p>The presentation was recorded in Gothenburg on the 4th of May 2012.</p>
<p>The presentation featured in the video:</p>
<div id="__ss_11811924" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Search Analytics in Practice" href="http://www.slideshare.net/findwise/search-analytics-in-practice" target="_blank">Search Analytics in Practice</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/11811924?rel=0" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></p>
<div style="padding: 5px 0 12px;">View more presentations from <a href="http://www.slideshare.net/findwise" target="_blank">Findwise</a></div>
</div>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Description:</b> </span>
    <span class="schema_property_value" itemprop="description" content="">Search analytics done right helps to improve the search experience. A small investment in time can make your search function a lot better. </span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>About:</b> </span>
    <span class="schema_property_value" itemprop="about" content="">Search Analytics</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Accountable Person:</b> </span>
    <span class="schema_property_value" itemprop="accountablePerson" content="">Kristian Norling</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Author:</b> </span>
    <span class="schema_property_value" itemprop="author" content="">Kristian Norling</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">Search analytics, search,</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/video-search-analytics-in-practice/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/video-search-analytics-in-practice/"><meta itemprop="datePublished" content="2012-05-09T11:43:10+00:00"><meta itemprop="dateModified" content="2012-05-09T18:59:56+00:00"><meta itemprop="dateCreated" content="2012-05-09T11:42:25+00:00"><meta itemprop="wordCount" content="95"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/video-search-analytics-in-practice/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Data and Search Going Big?</title>
		<link>http://blog.findwise.com/data-and-search-going-big/</link>
		<comments>http://blog.findwise.com/data-and-search-going-big/#comments</comments>
		<pubDate>Wed, 25 Apr 2012 10:40:13 +0000</pubDate>
		<dc:creator>Martin Johansson</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Watch]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[scandev]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3072</guid>
		<description><![CDATA[A few enterprise search specialists from Findwise recently attended the Scandinavian Developer Conference 2012. One of the tracks was Big Data, which is very much related to search. It had some interesting talks about how to handle large amounts of data in an efficient way. Special thanks to Theo Hultberg, Jim Webber and Tim Berglund! The theme was [...]]]></description>
			<content:encoded><![CDATA[<strong itemprop="description"></strong><br /><span itemprop="mainContentOfPage"><span itemprop="articleBody"><p><a href="http://findwise.com">A few enterprise search specialists from Findwise</a> recently attended the <a href="http://www.scandevconf.se/2012/conference/">Scandinavian Developer Conference 2012</a>. One of the tracks was Big Data, which is very much related to search. It had some interesting talks about how to handle large amounts of data in an efficient way. Special thanks to <a href="http://www.scandevconf.se/2012/conference/speakers/theo-hultberg/">Theo Hultberg</a>, <a href="http://www.scandevconf.se/2012/conference/speakers/jim-webber/">Jim Webber</a> and <a href="http://www.scandevconf.se/2012/conference/speakers/tim-berglund/">Tim Berglund</a>!</p>
<p>The theme was that you should choose a storage system which is well suited for the task. This may seem like an obvious point, but for a long time this was simply ignored; I&#8217;m talking about the era of relational databases. Don&#8217;t get me wrong, sometimes a relational database is the very best for the job, but in many cases it isn&#8217;t.</p>
<p>Data is jagged by nature, i.e. not all objects have the same properties. This is why we shouldn&#8217;t force them to fit into a square table, instead everything should be denormalized! The application accessing the data will be aware of the information structure and will handle it accordingly. This will also avoid expensive assembly operations (such as joins) to get the data in the format we want when retrieving it. Why should you split up your data if you are going to assemble it over and over again? Also remember that disk space is cheap, pre-compute as much as possible. The design of a Big Data system should be governed by how the data will be retrieved.</p>
<p>Another step away from the relational databases is the relaxation of some of the <a href="http://en.wikipedia.org/wiki/ACID">ACID properties: Atomicity, Consistency, Isolation and Durability</a>. Again, this is along the lines of choosing the components best suited for the system. Decide which properties are a must have and which are not so important.</p>
<p>Relaxing the ACID properties, such as consistency, can give great performance gains. The NoSQL database Cassandra is eventually consistent and its <a href="http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html">write performance scales linearly up to 288 nodes (and probably even higher) which gives a write performance of over 1 million writes per second!</a></p>
<p>However, relaxation of these properties is not a new concept in the world of search engines. When indexing a document, it will usually take a number of seconds before it is searchable. This is called eventual consistency, i.e. the state of the search engine will be brought from one valid state to another, within a sufficiently long period of time. Do we really need documents that were just submitted to the search engine to be<br />
searchable instantly? Most likely, no. Isolation is another property that is not crucial to a search engine. Since a document in an index doesn&#8217;t have any explicit relations to any other documents in the same index, there isn&#8217;t a great need for isolation. If two writes for the same document are submitted at the same time, there is probably something wrong in another part of the system.</p>
<p>So what does all this mean for search? There is an interesting challenge in storing jagged data in large amounts and then making good use out of it. To search in vast amounts jagged data, you need a lot of querytime field mappings (to make relevant data searchable) &#8230; or do you? There is also the issue of retaining a good relevancy model, which is absolutely vital to a search engine. How do you measure the relevance of arbitrary metadata and then weigh it all together? Maybe we need to think in new ways about relevance all together?</p>
<p>Whomever can solve these problems in a good way with a minimum amount of manual labor, is a name we&#8217;ll be hearing from a lot in the future.</p>
</span></span><meta itemprop="inLanguage" content="en"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Description:</b> </span>
    <span class="schema_property_value" itemprop="description" content="">Big Data, which is very much related to search</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>About:</b> </span>
    <span class="schema_property_value" itemprop="about" content="">Big data and Search</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Author:</b> </span>
    <span class="schema_property_value" itemprop="author" content="">Martin Johansson</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">search, big data, enterprise search, conference</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/data-and-search-going-big/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/data-and-search-going-big/"><meta itemprop="datePublished" content="2012-04-25T11:40:13+00:00"><meta itemprop="dateModified" content="2012-04-25T11:47:14+00:00"><meta itemprop="dateCreated" content="2012-04-25T11:27:59+00:00"><meta itemprop="keywords" content="big data,conference,data,Enterprise Search,scandev,search"><meta itemprop="wordCount" content="601"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/data-and-search-going-big/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Architecture of Search Systems and Measuring the Search Effectiveness</title>
		<link>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/</link>
		<comments>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 10:03:28 +0000</pubDate>
		<dc:creator>Pawel Wroblewski</dc:creator>
				<category><![CDATA[Academia]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Lecture]]></category>
		<category><![CDATA[Presentation]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Analytics]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3063</guid>
		<description><![CDATA[Lecture made at the 19th of April 2012, at the Warsaw University of Technology. This is the 9th lecture in the regular course for master grade studies, &#8220;Introduction to text mining&#8221;. View more presentations from Findwise Keywords: Search, search effectiveness, search architecture &#160;&#8226;&#160;]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><div id="__ss_12598871" style="width: 510px;">Lecture made at the 19th of April 2012, at the Warsaw University of Technology. This is the 9th lecture in the regular course for master grade studies, &#8220;Introduction to text mining&#8221;. <object id="__sse12598871" width="510" height="426" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="wmode" value="transparent" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=w9-searcharchitecture-120419021838-phpapp01&amp;rel=0&amp;stripped_title=architecture-of-search-systems-and-measuring-the-search-effectiveness&amp;userName=findwise" /><param name="allowscriptaccess" value="always" /><param name="allowfullscreen" value="true" /><embed id="__sse12598871" width="510" height="426" type="application/x-shockwave-flash" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=w9-searcharchitecture-120419021838-phpapp01&amp;rel=0&amp;stripped_title=architecture-of-search-systems-and-measuring-the-search-effectiveness&amp;userName=findwise" allowFullScreen="true" allowScriptAccess="always" wmode="transparent" allowscriptaccess="always" allowfullscreen="true" /> </object></p>
<div style="padding: 5px 0 12px;">View more presentations from <a href="http://www.slideshare.net/findwise">Findwise</a></div>
</div>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">Search, search effectiveness, search architecture</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/"><meta itemprop="datePublished" content="2012-04-24T11:03:28+00:00"><meta itemprop="dateModified" content="2012-04-24T11:05:57+00:00"><meta itemprop="dateCreated" content="2012-04-24T10:51:51+00:00"><meta itemprop="wordCount" content="35"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/architecture-of-search-systems-and-measuring-the-search-effectiveness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update on The Enterprise Search and Findability Survey</title>
		<link>http://blog.findwise.com/update-on-the-the-enterprise-search-and-findability-survey/</link>
		<comments>http://blog.findwise.com/update-on-the-the-enterprise-search-and-findability-survey/#comments</comments>
		<pubDate>Thu, 12 Apr 2012 11:51:47 +0000</pubDate>
		<dc:creator>Kristian Norling</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Digital Workplace]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Market trends]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Watch]]></category>
		<category><![CDATA[Survey]]></category>
		<category><![CDATA[business]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=3033</guid>
		<description><![CDATA[A quick update on the status of the Enterprise Search survey. We now have well over a hundred respondents. The more respondents the better the data will be, so please help spreading the word. We&#8217;d love to have  several hundred more. The survey will now be open until the end of April. But most important [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>A quick update on the status of the Enterprise Search survey.</p>
<p>We now have well over a hundred respondents. The more respondents the better the data will be, so please help spreading the word. We&#8217;d love to have  several hundred more. The survey will now be open until the end of April.</p>
<p>But most important of all, if you haven&#8217;t already, have a cup of coffee and <a href="http://www.surveymonkey.com/s/fsblog">fill in the survey</a>.</p>
<h2>A Few Results from the Survey about Enterprise Search</h2>
<p>More than 60% say that the amount of searchable content in their organizations today are less or far less than needed. And in three years time 85% say that the amount of searchable content in the organisation will increase och increase significantly.</p>
<p><strong>75% say that it is critical to find the right information</strong> to support their organizations business goals and success. But the interesting to note is that over 70% of the respondents say that users don&#8217;t know <strong>where to find</strong> the right information or <strong>what to look for</strong> &#8211; and about 50% of the respondents say that it is not possible to search more than one source of information from a single search query.</p>
<p>In this context it is interesting that the primary goal for using search in organisations (where the answer is imperative or signifact) is to:</p>
<ul>
<li>Improve re-use of information and/or knowledge) &#8211; 59%</li>
<li>Accelerate brokering of people and/or expertise &#8211; 55%</li>
<li>Increase collaboration &#8211; 60%</li>
<li>Raise awareness of “What We Know” &#8211; 57%</li>
<li>and finally to eliminate siloed repositories &#8211; 59%</li>
</ul>
<p>In many organisations search is owned either by IT (60%) or Communication (27%), search has no specified budget (38%) and has less than 1 dedicated person working with search (48%).  More than 50% have a search strategy in place or are planning to have one in 2012/13.</p>
<p>These numbers I think are interesting, but definitely need to be segmented and analyzed further. That will of course be done in the report which is due to be ready in June.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/update-on-the-the-enterprise-search-and-findability-survey/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/update-on-the-the-enterprise-search-and-findability-survey/"><meta itemprop="datePublished" content="2012-04-12T12:51:47+00:00"><meta itemprop="dateModified" content="2012-04-12T17:45:04+00:00"><meta itemprop="dateCreated" content="2012-04-12T09:46:27+00:00"><meta itemprop="keywords" content="business,Enterprise Search,findability,search,survey"><meta itemprop="wordCount" content="316"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/update-on-the-the-enterprise-search-and-findability-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic Search Engine &#8211; What is the Meaning?</title>
		<link>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/</link>
		<comments>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/#comments</comments>
		<pubDate>Fri, 30 Mar 2012 07:59:12 +0000</pubDate>
		<dc:creator>Pawel Wroblewski</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Semantic]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Information science]]></category>
		<category><![CDATA[open source technology]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[Semantic search]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[Technology/Internet]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2867</guid>
		<description><![CDATA[The shortest dictionary definition of semantics is: the study of meaning. The more complex explanation of this term would lead to a relationship that maps words, terms and written expressions into common sense and understanding of objects and phenomena in the real world. It is worthy to mention that objects, phenomena and relationships between them [...]]]></description>
			<content:encoded><![CDATA[<strong itemprop="description"></strong><br /><span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>The shortest dictionary definition of semantics is: <em>the study of meaning.</em> The more complex explanation of this term would lead to a relationship that maps words, terms and written expressions into common sense and understanding of objects and phenomena in the real world. It is worthy to mention that objects, phenomena and relationships between them are language independent. It means that the same semantic network of concepts can map to multiple languages which is useful in automatic translations or cross-lingual searches.</p>
<h2>The approach</h2>
<p>In the proposed approach semantics will be modeled as a defined ontology making it possible for the web to &#8220;understand&#8221; and satisfy the requests and intents of people and machines to use the web content. The ontology is a model that encapsulates knowledge from specific domain and consists of hierarchical structure of classes (taxonomy) that represents concepts of things, phenomena, activities etc. Each concept has a set of attributes that represent the mapping of that particular concept to words and phrases that represents that concepts in written language (as shown at the top of the figure below). Moreover, the proposed ontology model will have horizontal relationships between concepts, e.g. the linguistic relationships (synonymy, homonymy etc.) or domain specific relationships (medicine, law, military, biological, chemical etc.). Such a defined ontology model will be called a <strong>Semantic Map </strong>and will be used in the proposed search engine. An exemplar part of an enriched ontology of beverages is shown in the figure below. The ontology is enriched, so that the concepts can be easily identified in text using attributes such as the representation of the concept in the written text.</p>
<h2>Semantic Map</h2>
<p>The Semantic Map is an ontology that is used for bidirectional mapping of textual representation of concepts into a space of their meaning and associations. In this manner, it becomes possible to transform user queries into concepts, ideas and intent that can be matched with indexed set of similar concepts (and their relationships) derived from documents that are returned in a form of result set. Moreover, users will be able to precise and describe their intents using visualized facets of concept taxonomy, concept attributes and horizontal (domain) relationships. The search module will also be able to discover users’ intents based on the history of queries and other relevant factors, e.g. ontological axioms and restrictions. A potentially interesting approach will retrieve additional information regarding the specific user profile from publicly available information available in social portals like Facebook, blog sites etc., as well as in user’s own bookmarks and similar private resources, enabling deeper intent discovery.</p>
<p><a href="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/pomost/" rel="attachment wp-att-3060"><img itemprop="image" class="aligncenter size-full wp-image-3060" title="Semantic Search" src="http://blog.findwise.com/wp-content/uploads/2012/03/pomost.jpg" alt="" width="740" height="578" /></a></p>
<h2>Semantic Search Engine</h2>
<p>The search engine will be composed of the following components:</p>
<ul>
<li><strong>Connector</strong> – This module will be responsible for acquisition of data from external repositories and pass it to the search engine. The purpose of the connector is also to extract text and relevant metadata from files and external systems and pass it to further processing components.</li>
<li><strong>Parser</strong> – This module will be responsible for text processing including activities like: tokenization (breaking text into lexems – words or phrases), lemmatization (normalization of grammar forms), exclusion of stop-words, paragraph and sentence boundary detector. The result of parsing stage is structured text with additional annotations that is passed to semantic Tagger.</li>
<li><strong>Tagger</strong> – This module is responsible for adding semantic information for each lexem extracted from the processed text. Technically it refers to addition of identifiers to relevant concepts stored in the Semantic Map for each lexem. Moreover phrases consisting of several words are identified and disambiguation is performed basing on derived contexts. Consider the example illustrated in the figure.</li>
<li><strong>Indexer</strong> – This module is responsible for taking all the processed information, transformation and storage into the search index. This module will be enriched with methods of semantic indexing using ontology (semantic map) and language tools.</li>
<li><strong>Search index </strong>– The central storage of processed documents (document repository) structured properly to manage full text of the documents, their metadata and all relevant semantic information (document index). The structure is optimized for search performance and accuracy.</li>
<li><strong>Search </strong>– This module is responsible for running queries against the search index and retrieval of relevant results. The search algorithms will be enriched to use user intents (complying data privacy) and the prepared Semantic Map to match semantic information stored in the search index.</li>
</ul>
<p>What do you think? Please let us know by writing a comment.</p>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Accountable Person:</b> </span>
    <span class="schema_property_value" itemprop="accountablePerson" content="">Pawel Wroblewski</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>About:</b> </span>
    <span class="schema_property_value" itemprop="about" content="">Semantic Search</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Description:</b> </span>
    <span class="schema_property_value" itemprop="description" content="">A proposal for a semantic search engine.</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">semantic search engine</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/"><meta itemprop="datePublished" content="2012-03-30T08:59:12+00:00"><meta itemprop="dateModified" content="2012-04-18T17:37:24+00:00"><meta itemprop="dateCreated" content="2012-01-26T21:54:34+00:00"><meta itemprop="keywords" content="Information science,open source technology,search engine,Semantic search,semantic web,Technology/Internet"><meta itemprop="wordCount" content="724"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/semantic-search-engine-what-is-the-meaning/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Bryan, Brian, Briane, Bryne, or &#8230; what was his name again?</title>
		<link>http://blog.findwise.com/bryan-brian-briane-bryne-or-what-was-his-name-again/</link>
		<comments>http://blog.findwise.com/bryan-brian-briane-bryne-or-what-was-his-name-again/#comments</comments>
		<pubDate>Wed, 21 Mar 2012 13:06:35 +0000</pubDate>
		<dc:creator>Svetoslav Marinov</dc:creator>
				<category><![CDATA[Content refinement]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Information quality]]></category>
		<category><![CDATA[Language support]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Text Analytics]]></category>
		<category><![CDATA[Callie]]></category>
		<category><![CDATA[Daitch-Mokotoff Soundex]]></category>
		<category><![CDATA[George Bernard Shaw]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Kelly]]></category>
		<category><![CDATA[Lawrence Philips]]></category>
		<category><![CDATA[Linguistics]]></category>
		<category><![CDATA[massage]]></category>
		<category><![CDATA[Metaphone]]></category>
		<category><![CDATA[Pattern matching]]></category>
		<category><![CDATA[phonetic algorithm]]></category>
		<category><![CDATA[phonetic algorithms]]></category>
		<category><![CDATA[phonetic search]]></category>
		<category><![CDATA[Soundex]]></category>
		<category><![CDATA[XML]]></category>

		<guid isPermaLink="false">http://blog.findwise.com/?p=2992</guid>
		<description><![CDATA[Let the spelling loose &#8230; What do Callie and Kelly have in common (except for the double &#8216;l&#8217; in the middle)? What about &#8220;no&#8221; and &#8220;know&#8221;, or &#8220;Ceasar&#8217;s&#8221; and &#8220;scissors&#8221; and what about &#8220;message&#8221; and &#8220;massage&#8221;? You definitely got it &#8211; Callie and Kelly, &#8220;no&#8221; and &#8220;know&#8221;, &#8220;Ceasar&#8217;s&#8221; and &#8220;scissors&#8221; sound alike, but are spelled quite differently. &#8220;message&#8221; and [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><h2>Let the spelling loose &#8230;</h2>
<p>What do Callie and Kelly have in common (except for the double &#8216;l&#8217; in the middle)? What about &#8220;no&#8221; and &#8220;know&#8221;, or &#8220;Ceasar&#8217;s&#8221; and &#8220;scissors&#8221; and what about &#8220;message&#8221; and &#8220;massage&#8221;? You definitely got it &#8211; Callie and Kelly, &#8220;no&#8221; and &#8220;know&#8221;, &#8220;Ceasar&#8217;s&#8221; and &#8220;scissors&#8221; sound alike, but are spelled quite differently. &#8220;message&#8221; and &#8220;massage&#8221; on the other hand differ by only one vowel (&#8220;a&#8221; vs &#8220;e&#8221;) but their pronunciation is not at all the same.</p>
<p>It&#8217;s a well known fact for many languages that ortography does not determine the pronunciation of words. English is a classic example. George Bernard Shaw was the attributed author of &#8220;ghoti&#8221; as an alternative spelling of &#8220;fish&#8221;. And while phonology often reflects the current state of the development of the language, orthography may often lag centuries behind. And while English is notorious for that phenomenon it is not the only one. Swedish, French, Portuguese, among others, all have their ortography/pronunciation discrepancies.</p>
<h2>Phonetic Algorithms</h2>
<p>So how do we represent things that sound similar but are spelled different? It&#8217;s not trivial but for most cases it is not impossible either. Soundex is probably the first algorithm to tackle this problem. It is an example of the so called phonetic algorithms which attempt to solve the problem of giving the same encoding to strings which are pronounced in a similar fashion. Soundex was designed for English only but has its limits. DoubleMetaphone (DM) is one of the possible replacements and relatively successful. Designed by Lawrence Philips in the beginning of 1990s it not only deals with native English names but also takes proper care of foreign names so omnipresent in the language. And what is more &#8211; it can output two possible encodings for a given name, hence the &#8220;Double&#8221; in the naming of the algorithm, &#8211; an anglicised and a native (be that Slavic, Germanic, Greek, Spanish, etc.) version.</p>
<p>By relying on DM one can encode all the four names in the title of this post as &#8220;PRN&#8221;. The name George will get two encodings &#8211; JRJ and KRK, the second version reflecting a possible German pronunciation of the name. And a name with Polish origin, like Adamowicz, would also get two encodings &#8211; ATMTS and ATMFX, depending on whether you pronounce the &#8220;cz&#8221; as the English &#8220;ch&#8221; in &#8220;church&#8221; or &#8220;ts&#8221; in &#8220;hats&#8221;.</p>
<p>The original implementation by Lawrence Philips allowed a string to be encoded only with 4 characters. However, in most subsequent<br />
implementations of the algorithm this option is parameterized or just omitted.</p>
<p>Apache Commons Codec has an implementation of the DM among others (Soundex, Metaphone, RefinedSoundex, ColognePhonetic, Coverphone, to<br />
name just a few.) and here is a tiny example with it:</p>
<p><code> import org.apache.commons.codec.language.DoubleMetaphone;</code></p>
<p><code>public class DM {<br />
</code><br />
<code>public static void main(String[] args) {<br />
</code><br />
<code>String s = "Adamowicz";<br />
</code><br />
<code>DoubleMetaphone dm = new DoubleMetaphone();<br />
</code><br />
<code>// Default encoding length is 4!<br />
</code><br />
<code>// Let's make it 10<br />
</code><br />
<code>dm.setMaxCodeLen(10);<br />
</code><br />
<code>System.out.println("Alternative 1: " + dm.doubleMetaphone(s) +<br />
</code><br />
<code>// Remember, DM can output 2 possible encodings:<br />
</code><br />
<code>"\nAlternative 2: " + dm.doubleMetaphone(s, true));<br />
</code><br />
<code>}</code><br />
<code>}</code></p>
<p>The above code will print out:</p>
<p>Alternative 1: ATMTS</p>
<p>Alternative 2: ATMFX</p>
<p>It is also relatively straightforward to do phonetic search with Solr. You just need to ensure that you add the phonetic analysis to a field which contains names in your schema.xml:</p>
<h2>Enhancements</h2>
<p>While DM does perform quite well, at first sight, it has its limitations. We should know that it still originated from the English language and although it aims to tackle a variety of non-native borrowings most of the rules are English-centric. Suppose you work on any of the Scandinavian languages (Swedish, Danish, Norwegian, Icelandic) and one of the names you want to encode is &#8221;Örjan&#8221;. However, &#8220;Orjan&#8221; and &#8220;Örjan&#8221; get different encodings &#8211; ARJN vs RJN. Why is that? One look under the hood (the implementation in DoubleMetaphone.java) will give you the answer:</p>
<p><code> private static final String VOWELS = "AEIOUY";<br />
</code></p>
<p>So the Scandinavian vowels &#8220;ö&#8221;, &#8220;ä&#8221;, &#8220;å&#8221;, &#8220;ø&#8221; and &#8220;æ&#8221; are not present. If we just add these then compile and use the new version of the DM implementation we get the desired output &#8211; ARJN for both &#8220;Örjan&#8221; and &#8220;Orjan&#8221;.</p>
<p>Finally, if you don&#8217;t want to use DM or maybe it is really not suitable for your task, you still may use the same principles and create your own encoder by relying on regular expressions for example. Suppose you have a list of bogus product names which are just (mis)spelling variations of some well known names and you want to search for the original name but get back all ludicrous variants. Here is one albeit very naïve way to do it. Given the following names:</p>
<p>CupHoulder</p>
<p>CappHolder</p>
<p>KeepHolder</p>
<p>MacKleena</p>
<p>MackCliiner</p>
<p>MacqQleanAR</p>
<p>Ma&#8217;cKcle&#8217;an&#8217;ar</p>
<p>and with a bunch of regular expressions you can easily encode them as &#8221;cphldR&#8221; and &#8220;mclnR&#8221;.</p>
<p><code> String[] ar = new String[]{"CupHoulder", "CappHolder", "KeepHolder",<br />
"MacKleena", "MackCliiner", "MacqQleanAR", "Ma'cKcle'an'ar"};</code></p>
<p><code>for (String a : ar) {</code><br />
<code>a = a.toLowerCase();</code><br />
<code>a = a.replaceAll("[ae]r?$", "R");</code><br />
<code>a = a.replaceAll("[aeoiuy']", "");</code><br />
<code>a = a.replaceAll("pp+", "p");</code><br />
<code>a = a.replaceAll("q|k", "c");</code><br />
<code>a = a.replaceAll("cc+", "c");</code><br />
<code>System.out.println(a);</code><br />
<code>}</code></p>
<p>You can now easily find all the ludicrous spellings of &#8220;CupHolder&#8221; och &#8221;MacCleaner&#8221;.</p>
<p>I hope this blogpost gave you some ideas of how you can use phonetic algorithms and their principles in order to better discover names and entities that sound alike but are spelled unlike. At Findwise we have done a number of enhancements to DM in order to make it work better with Swedish, Danish and Norwegian.</p>
<h2>References</h2>
<p>You can learn more about Double Metaphone from the following article by the creator of the algorithm:<br />
<a href="http://drdobbs.com/cpp/184401251?pgno=2"> http://drdobbs.com/cpp/184401251?pgno=2</a></p>
<p>A German phonetic algorithm is the Kölner Phonetik:<br />
<a href="http://de.wikipedia.org/wiki/Kölner_Phonetik"> http://de.wikipedia.org/wiki/Kölner_Phonetik</a></p>
<p>And SfinxBis is a phonetic algorithm based on Soundex and is Swedish specific:<br />
<a href="http://www.swami.se/projekt/sfinxbis.68.html">http://www.swami.se/projekt/sfinxbis.68.html</a></p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/bryan-brian-briane-bryne-or-what-was-his-name-again/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/bryan-brian-briane-bryne-or-what-was-his-name-again/"><meta itemprop="datePublished" content="2012-03-21T14:06:35+00:00"><meta itemprop="dateModified" content="2012-03-27T21:01:15+00:00"><meta itemprop="dateCreated" content="2012-03-21T13:53:24+00:00"><meta itemprop="keywords" content="Callie,Daitch-Mokotoff Soundex,George Bernard Shaw,Java,Kelly,Lawrence Philips,Linguistics,massage,Metaphone,Pattern matching,phonetic algorithm,phonetic algorithms,phonetic search,Soundex,XML"><meta itemprop="wordCount" content="969"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/bryan-brian-briane-bryne-or-what-was-his-name-again/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Video interview: How to Improve the Search Experience</title>
		<link>http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/</link>
		<comments>http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/#comments</comments>
		<pubDate>Thu, 15 Mar 2012 21:15:33 +0000</pubDate>
		<dc:creator>Kristian Norling</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Digital Workplace]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findwise]]></category>
		<category><![CDATA[Intranet]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[Copenhagen]]></category>
		<category><![CDATA[Kristian Norling]]></category>
		<category><![CDATA[search experience]]></category>
		<category><![CDATA[Technology/Internet]]></category>
		<category><![CDATA[VGR]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2970</guid>
		<description><![CDATA[Video interview with Kristian Norling at the Intrateam Event in Copenhagen 2012. Kristian talks about his former work at VGR and what he thinks is important for improving the search experience. Watch the video Description: Improving the search experience on the intranet or your website is important. Kristian gives advice on how to improve the [...]]]></description>
			<content:encoded><![CDATA[<strong itemprop="description"></strong><br /><span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>Video interview with <a href="http://se.linkedin.com/in/kristiannorling">Kristian Norling</a> at the <a href="http://www.intrateam.dk/gb/node/9704">Intrateam Event in Copenhagen 2012</a>. Kristian talks about his former work at VGR and what he thinks is important for improving the search experience.</p>
<p><a href="http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/kristian-norling-intrateam-interview/" rel="attachment wp-att-3027"><img itemprop="image" class="aligncenter size-full wp-image-3027" title="kristian-norling-intrateam-interview" src="http://blog.findwise.com/wp-content/uploads/2012/03/kristian-norling-intrateam-interview.jpg" alt="Kristian Norling" width="509" height="257" /></a></p>
<p><a href="http://www.intrateam.dk/gb/node/9704">Watch the video</a></p>
</span></span><meta itemprop="inLanguage" content="en"><meta itemprop="isFamilyFriendly" content="Y"><div class="schema_property_wrap">
<span class="schema_property">
    <span class="schema_property_name"><b>Description:</b> </span>
    <span class="schema_property_value" itemprop="description" content="">Improving the search experience on the intranet or your website is important. Kristian gives advice on how to improve the search experience.</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>About:</b> </span>
    <span class="schema_property_value" itemprop="about" content="">Kristian Norling is interviewed on how to improve the search experience.</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Accountable Person:</b> </span>
    <span class="schema_property_value" itemprop="accountablePerson" content="">Kristian Norling</span>
</span>&nbsp;&bull;&nbsp;

<span class="schema_property">
    <span class="schema_property_name"><b>Keywords:</b> </span>
    <span class="schema_property_value" itemprop="keywords" content="">search experience intranet intrateam</span>
</span>&nbsp;&bull;&nbsp;
</div><meta itemprop="url" content="http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/"><meta itemprop="datePublished" content="2012-03-15T22:15:33+00:00"><meta itemprop="dateModified" content="2012-04-04T08:18:24+00:00"><meta itemprop="dateCreated" content="2012-03-15T22:14:31+00:00"><meta itemprop="keywords" content="Copenhagen,Kristian Norling,search experience,Technology/Internet,VGR"><meta itemprop="wordCount" content="33"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/videointerview-how-to-improve-the-search-experience/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mobile clients and Enterprise Search – What are the Implications?</title>
		<link>http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/</link>
		<comments>http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/#comments</comments>
		<pubDate>Wed, 14 Mar 2012 13:23:29 +0000</pubDate>
		<dc:creator>HakanKjellman</dc:creator>
				<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Future development]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[faceted search filters]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Greg Nudelman]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[www.uxmatters.com]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2938</guid>
		<description><![CDATA[As we all know the smartphone user base is growing explosively. According to www.statcounter.com, internet access from handheld mobile devices has doubled yearly since 2009 adding up to 8,5 % of all page views globally in January 2012. And mobile users want to be able to do all the same things that they are able [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>As we all know the <strong>smartphone user base</strong> is <strong>growing explosively</strong>. According to <a href="http://www.statcounter.com">www.statcounter.com</a>, internet access from handheld mobile devices has doubled yearly since 2009 adding up to 8,5 % of all page views globally in January 2012. And mobile users want to be able to do all the same things that they are able to do on their PC. And that includes access to the company’s Enterprise Search solution!</p>
<p>The benefits of the sales force being able to search for vital customer information before a meeting or for field service personnel being able to find documentation quickly are quite obvious. So how can an organization tweak its search solution in order to provide convenient access for the mobile users? And above all, what will it cost?</p>
<p>Well, to answer the last question first: much less than you think. Providing for the mobile user is mainly about creating a new front end/UI. The main bulk of your search solution remains the same; indexing, metadata structure and content publishing, for instance, remain essentially unaffected.</p>
<p>But you do need to provide a quite different UI in order for the user interaction to work smoothly considering the specific characteristics of the mobile client primarily when it comes to screen size/resolution and text input. But the smartphone also has a lot of features that the PC lacks – it is always available and it knows exactly where you are, it always has a camera, microphone, speaker, possibly a magnetometer and accelerometer and of course a touchscreen with motions like pinching and swiping etc. And many of these features can be quite useful as the following examples prove:</p>
<p style="text-align: center;"><a href="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/figure_6_google_mobile/" rel="attachment wp-att-2959"><img itemprop="image" class="aligncenter size-full wp-image-2959" title="Google mobile" src="http://blog.findwise.com/wp-content/uploads/2012/03/figure_6_google_mobile.png" alt="" width="320" height="480" /></a></p>
<p align="center">Illustration 1. Google Mobile Voice Search on the iPhone. Courtesy of UX Matters, <a href="http://www.uxmatters.com">www.uxmatters.com</a></p>
<ul>
<li><strong>Google Mobile App for iPhone</strong>: in this app, the iPhone senses when the phone is lifted towards the ear and hence knows when to listen for a search command. Since the phone also knows where the user is, a search for “restaurant” automatically generates hits with restaurants in your vicinity.</li>
<li><strong>Scanning a Barcode or QR-code</strong>: scanning a Barcode or QR-code with your phone is another way of entering a search string. An example could be a product in a store where the customer could open a price-search-engine and scan the QR-code of the product and see where the best price is.</li>
</ul>
<p>As you can see, there are plenty of opportunities for those who want to be creative. But for the most part, the I/O will still be done via the screen. At UX Matters there is a great article by Greg Nudelman describing the considerations when implementing search for mobile clients and suggestions for various design patterns that can be efficient (see <a href="http://www.uxmatters.com/mt/archives/2010/04/design-patterns-for-mobile-faceted-search-part-i.php">http://www.uxmatters.com/mt/archives/2010/04/design-patterns-for-mobile-faceted-search-part-i.php</a>). I have included a brief summary below together with illustrations courtesy of UX Matters. But first, some <strong>general considerations for mobile clients:</strong></p>
<ul>
<li><strong>Use Javascript</strong> code to detect what type of device is accessing your search solution and if it is a mobile client you display the mobile interface.</li>
<li><strong>Native App or Mobile Web App:</strong> Creating a Mobile Web App is easier and cheaper than creating a native App – for one thing you don’t have to create multiple versions for different OS’s (although you still need to test your solution with different browsers/resolutions). Performance wise there isn’t a big difference between Native Apps and Web Apps and mobile browsers are increasingly gaining access to most of the phones hardware as well.</li>
<li><strong>Authentication:</strong> SSO for mobile web applications works the same as for desktop browsers.  There are also new solutions currently being launched enabling usage of the company’s existing Active Directory infrastructure. One example is <em>Centrify Directcontrol for Mobile</em> enabling a centralized administration within Active Directory of all device security settings, profiles, certificates and restrictions.</li>
<li><strong>Use HTML5 instead of FLASH:</strong> iPhones don’t support FLASH but HTML5 is a very capable alternative</li>
<li><strong>Testing:</strong> How the design looks for different resolutions can be tested through various emulators but it is always recommendable to test on a limited set of real smartphones as well.</li>
<li><strong>Access needs to be quick and simple:</strong> user interaction is more cumbersome on a phone than on a PC. Normally try to avoid solutions that require more than 3 input actions.</li>
<li><strong>Menu navigation:</strong> links on the right side are normally used to drill down in the menu hierarchy and left up/towards the home screen</li>
<li><strong>Gestures:</strong> is a very powerful toolbox that can be used in many different ways to create an efficient UI. For example, use &#8220;pinch to show more&#8221; if you want to expand the summary information of a specific item in the search hit list or “swipe” to expose the metadata (or whatever action you want to assign to that gesture).</li>
<li><strong>Be creative:</strong> the mobile client is inherently different from a PC, limited in some ways but more powerful in others. So if you just try to adopt design solutions from the PC and fit them into a mobile UI you are missing out on a lot of powerful design solutions that only make sense on a mobile client and you are definitely not giving the users the best possible search experience. Also, since mobile design is still evolving you don’t need to be limited by conventions and expectations as much as on the PC side – make the most of this freedom to be creative!</li>
<li><strong>W3C mobile:</strong> for more information about mobile web development, see <a href="http://www.w3.org/Mobile/">http://www.w3.org/Mobile/</a> which also includes a validating scheme to assess the readiness of content for the mobile web</li>
</ul>
<h2>Design patterns for mobile UI (with courtesy of Greg Nudelman/UX Matters)</h2>
<p>Mobile faceting can be tricky but by using design patterns like “4 Corners”, “Modal Overlays”, “Watermarks” and “Teaser Design” the UI can become both intuitive and easy to learn as well as providing reasonably powerful functionality. As mentioned, these techniques are summaries from an article written by Greg Nudelman for UX Matters. If you are eager to learn more, feel free to check out Greg’s website and his upcoming workshops focused on mobile design <a href="http://www.designcaffeine.com/category/workshops/">http://www.designcaffeine.com/category/workshops/</a></p>
<p><strong>4 Corners</strong>: instead of stealing scarce real estate by adding faceting options directly on the screen together with the search result, semitransparent buttons are available in each corner enabling the user to bring up a faceting menu by tapping in a corner (see illustration 2).</p>
<p><strong>Modal Overlays</strong>: the modal overlay is displayed on top of the original page. The modal overlay works well together with the 4 corners design – tapping a corner opens up the overlay containing faceting functions like filtering and sorting (see illustration 2).</p>
<p style="text-align: center;"><a href="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/figure_4_four_corners/" rel="attachment wp-att-2958"><img itemprop="image" class="aligncenter size-full wp-image-2958" title="Four corners" src="http://blog.findwise.com/wp-content/uploads/2012/03/figure_4_four_corners.jpg" alt="" width="473" height="332" /></a></p>
<p align="center">Illustration 2. Four Corners and Modal Overlay patterns. Courtesy of UX Matters, <a href="http://www.uxmatters.com">www.uxmatters.com</a></p>
<p><strong>Watermarks</strong>: a great technique for guiding users and showing the possibility of using new functions. The watermarks, possibly animated, show a symbol for the available action, for instance arrows indicating that a swiping gesture could be used (see illustration 3).</p>
<p><strong>Full-Page Refinement Options Pattern:</strong> gives the user plenty of refinement options to choose from (see illustration 3).</p>
<p style="text-align: center;"><a href="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/figure_6_splash_screen/" rel="attachment wp-att-2960"><img itemprop="image" class="aligncenter size-full wp-image-2960" title="Splash screen" src="http://blog.findwise.com/wp-content/uploads/2012/03/figure_6_splash_screen.jpg" alt="" width="473" height="656" /></a></p>
<p align="center">Illustration 3. Two variations of the Watermark pattern and a Refinement Options pattern. Courtesy of UX Matters, <a href="http://www.uxmatters.com">www.uxmatters.com</a></p>
<p><strong>Teaser Design</strong>: show part of the next available content so that the user is aware that there is more content available (see illustration 4).</p>
<p><a href="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/figure3_teasers_wireframe/" rel="attachment wp-att-2961"><img itemprop="image" class="aligncenter size-full wp-image-2961" title="Teasers wireframe" src="http://blog.findwise.com/wp-content/uploads/2012/03/figure3_teasers_wireframe.png" alt="" width="473" height="383" /></a></p>
<p align="center">Illustration 4. Teaser design pattern facilitates the discovery of faceted search filters. Courtesy of UX Matters, <a href="http://www.uxmatters.com">www.uxmatters.com</a></p>
<p><strong>Persistent Status Bar:</strong> always maintain a persistent status bar containing the search string together with applied filters in the search result page. This helps the user maintain orientation. Note that all of the illustrations above have a persistent status bar.</p>
<h2><strong>Conclusion</strong></h2>
<p><strong></strong>Although Best Practices for mobile UI design are still evolving, plenty of progress has already been made and there are several solutions and design patterns to choose from depending on the specific circumstances at hand. So an implementation project need not be rocket science, as long as you learn the right tricks…</p>
<p>Bringing enterprise information to the field, readily available in a mobile handset or tablet, will mobilize your employees. The UI requires rethinking as we have seen. And security needs to be addressed properly to avoid having sensitive data compromised. But other than that, you are ready to go!</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/"><meta itemprop="datePublished" content="2012-03-14T14:23:29+00:00"><meta itemprop="dateModified" content="2012-04-02T18:44:17+00:00"><meta itemprop="dateCreated" content="2012-03-01T22:33:43+00:00"><meta itemprop="keywords" content="faceted search filters,Google,Greg Nudelman,iPhone,www.uxmatters.com"><meta itemprop="wordCount" content="1378"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/mobile-clients-and-enterprise-search-what-are-the-implications/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Automated Testing of Enterprise Search</title>
		<link>http://blog.findwise.com/automated-testing-of-enterprise-search/</link>
		<comments>http://blog.findwise.com/automated-testing-of-enterprise-search/#comments</comments>
		<pubDate>Thu, 08 Mar 2012 14:11:28 +0000</pubDate>
		<dc:creator>Mickel Gronroos</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Governance]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Testing]]></category>
		<category><![CDATA[author]]></category>
		<category><![CDATA[common author]]></category>
		<category><![CDATA[enterprise search setting]]></category>
		<category><![CDATA[enterprise search solution]]></category>
		<category><![CDATA[information systems]]></category>
		<category><![CDATA[RAM]]></category>
		<category><![CDATA[search application]]></category>
		<category><![CDATA[search index]]></category>
		<category><![CDATA[search index stays]]></category>
		<category><![CDATA[software components]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2911</guid>
		<description><![CDATA[Quality assuring an enterprise search solution is challenging, yet important. The challenge is to be able to do continuous follow-up of the quality of the solution during implementation but also after release, when the solution is in production and operated by an operations team. Testing is important, but it is also costly – unless it [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>Quality assuring an enterprise search solution is challenging, yet important. The challenge is to be able to do continuous follow-up of the quality of the solution during implementation <em>but also after release</em>, when the solution is in production and operated by an operations team. Testing is important, but it is also costly – unless it can be automated.</p>
<p>So what kind of testing is specific for a search application? And what of that can be automated?</p>
<p>The whole idea of Enterprise Search is to provide the right information to the right people at the right time. The information made findable is normally stored in many different information systems and the information in these systems is constantly changing. In the end, every enterprise search solution operates in a context where the requirements of the end-users and the available content changes on a daily basis. In other words, assuring the quality of enterprise search is about assuring the quality of the information and the way that information is accessed by and delivered to the end-users.</p>
<p>During our engagements over the years, we have set routines and developed tools for automated testing of enterprise search. What we specifically want to track in an automated fashion is:</p>
<ul>
<li>Completeness</li>
<li>Freshness</li>
<li>Access restrictions</li>
<li>Metadata quality</li>
<li>Performance</li>
<li>Relevance</li>
</ul>
<p>Allow me to take a few moments and describe what this means.</p>
<h2>Completeness testing</h2>
<p>Completeness tests aim to make sure that the search index is complete – that all information objects (such as web pages and documents) that are supposed to be searchable are really searchable. In addition, completeness testing provides proof that the correct parts of the information objects are indexed for retrieval, e.g. all pages in a multi-page document, as well as titles and other searchable metadata. It is also important to monitor that information that should not be searchable is indeed not indexed, e.g. headers and footers of web pages.</p>
<h2>Freshness testing</h2>
<p>Freshness tests aim to make sure that the search index is up to date, i.e. new content that has been added to a source (such as a document management system) becomes searchable, deleted content is removed automatically from the search index and updated content is updated in the search index – all in due time.</p>
<h2>Testing access restrictions</h2>
<p>If an enterprise search solution provides access to access-controlled information, it is of uttermost importance to be able to prove that security is never compromised. Testing access restrictions aim to do precisely that. What one needs to monitor is that existing document-level security works, i.e. that people who should have access to an information object really has access and that people who shouldn’t have access, don’t have access. The tricky part is to monitor that a change in access privileges in for instance Active Directory or in the access restrictions (the ACL) for a particular document is handled in the search index as well in due time.</p>
<h2>Testing metadata quality</h2>
<p>Each information object in the search index contains a set of fields containing metadata and text, e.g. a title, the text body, an author, a timestamp containing last modification date, information on file format, a keywords field and many more.</p>
<p>In an enterprise search setting, many different information models implemented in the source systems need to be harmonized into one common domain model (schema/index profile/information model) in the search index. This means information regarding a creator of an information object in one system and a publisher of an information object in another system can be stored in a common <em>author</em> metadata field in the search index in a common, defined format such as <em>Firstname Lastname</em> regardless of formatting in the source system<strong><em>. </em></strong>Unless you have a common model in the index, you can’t provide features like cross-system filtering with facets.</p>
<p>So how do you track that the metadata<strong><em> </em></strong>in the search index stays in good shape? This is the aim of metadata testing. The test cases provided for metadata testing need to check that the metadata in the search index conforms to the defined domain model and formatting even when the underlying content changes in the source systems.</p>
<h2>Performance testing</h2>
<p>Performance testing is probably the easiest type of tests you can create and run. In the end you will have a threshold or pain limit in milliseconds under which a query in the enterprise search solution will be required to provide an answer even under peak times with high query loads. Normally you will also be monitoring issues like RAM and processor capacity usage of the software components of your solution to be able to generate automatic alerts to the maintenance team if the hardware is under too much pressure.</p>
<h2>Relevance testing</h2>
<p>Quality assuring the relevance model of an enterprise search solution is tricky. Largely because relevance in a result set is to some extent subjective. However, when implementing search, one does need to set a relevance model that presupposes a set of business rules for what type of content is to be deemed more important than other. For example, when making documents in a document management system searchable, a typical business rule would be that documents tagged with Status=Approved must always be deemed more important than documents with any other status (such as Preliminary or Deprecated). Another typical rule is that a document for which a query term can be found in the title or in the keywords metadata field is most likely more important than documents where the query term is found elsewhere in the text body.</p>
<p>What it all boils down to is the definition of the business rules for relevance. Once you have defined the rules that govern how the results are to be ranked, you can also create test cases, i.e. associate query terms with information objects that must be returned as top results given these terms.</p>
<h2>Automating it all</h2>
<p>Once you have defined you test cases for all the above mentioned types of tests in a test plan, you are ready to automate, i.e. enter the test plan into a test automation framework. The beauty of it all is that you can automate regression testing during the implementation phase of an enterprise search solution, i.e. continuously test that new development does not break such parts of the solution that worked as intended before. This is in particular important if you add new information sources to your enterprise search solution, when there is a high risk that the relevance model that worked fine yesterday all of the sudden gets out of order. In addition, after the release of the enterprise search solution, the test automation framework will assist the operations team in monitoring that the solution behaves as expected even after the implementation team has left the building. All in all this leads to continuously good quality of the solution while lowering the costs for monitoring.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/automated-testing-of-enterprise-search/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/automated-testing-of-enterprise-search/"><meta itemprop="datePublished" content="2012-03-08T15:11:28+00:00"><meta itemprop="dateModified" content="2012-03-07T12:51:39+00:00"><meta itemprop="dateCreated" content="2012-02-16T18:08:29+00:00"><meta itemprop="keywords" content="author,common author,Enterprise Search,enterprise search setting,enterprise search solution,information systems,RAM,search application,search index,search index stays,software components"><meta itemprop="wordCount" content="1146"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/automated-testing-of-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Findability, our holistic approach to implementing search technology</title>
		<link>http://blog.findwise.com/findability-our-holistic-approach-to-implementing-search-technology/</link>
		<comments>http://blog.findwise.com/findability-our-holistic-approach-to-implementing-search-technology/#comments</comments>
		<pubDate>Fri, 24 Feb 2012 10:36:21 +0000</pubDate>
		<dc:creator>Mattias Ellison</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Findwise]]></category>
		<category><![CDATA[Information management]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Search Business Analysis]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[implementing search technology]]></category>
		<category><![CDATA[search project]]></category>
		<category><![CDATA[search solutions]]></category>
		<category><![CDATA[search technology]]></category>
		<category><![CDATA[search technology investments]]></category>
		<category><![CDATA[Vimeo channel]]></category>

		<guid isPermaLink="false">http://findabilityblog.se/?p=2922</guid>
		<description><![CDATA[We are proud to present the first video on our new Vimeo channel. Enjoy! Successful search project does not only involve technology and having the most skilled developers, it is not enough. To utilise the full potential and receive return on search technology investments there are five main dimensions (or perspectives) that all need to [...]]]></description>
			<content:encoded><![CDATA[<span itemprop="mainContentOfPage"><span itemprop="articleBody"><p>We are proud to present the first video on our new Vimeo channel. Enjoy!</p>
<p><iframe src="http://player.vimeo.com/video/37332486?color=919191" frameborder="0" width="640" height="360"></iframe></p>
<p>Successful search project does not only involve technology and having the most skilled developers, it is not enough. To utilise the full potential and receive return on search technology investments there are five main dimensions (or perspectives) that all need to be in focus when developing search solutions, and that require additional competencies to be involved.</p>
<p>This holistic approach to implementing search technology we call Findability by Findwise.</p>
</span></span><div class="schema_property_wrap"></div><meta itemprop="url" content="http://blog.findwise.com/findability-our-holistic-approach-to-implementing-search-technology/"><meta itemprop="discussionUrl" content="http://blog.findwise.com/findability-our-holistic-approach-to-implementing-search-technology/"><meta itemprop="datePublished" content="2012-02-24T11:36:21+00:00"><meta itemprop="dateModified" content="2012-02-24T11:38:11+00:00"><meta itemprop="dateCreated" content="2012-02-24T11:26:54+00:00"><meta itemprop="keywords" content="implementing search technology,search project,search solutions,search technology,search technology investments,Vimeo channel"><meta itemprop="wordCount" content="82"><meta itemprop="blogPosts" content="http://blog.findwise.com">]]></content:encoded>
			<wfw:commentRss>http://blog.findwise.com/findability-our-holistic-approach-to-implementing-search-technology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

