Archive for the ‘Findability’ Category

Mahmood Ahmad

Findability on an e-commerce site

March 13 - 2011 | Mahmood Ahmad

Findability on any e-commerce site is a beast all on its own. What if visitors’ searches return no results? Will they continue to search or did you lose your chance at a sale?

While product findability is a key factor of success in e-commerce, it is predominantly enabled by simple search alone. And while simple search usually doesn’t fulfill complex needs among users, website developers and owners still regard advanced search as just another boring to-do item during development. Owners won’t go so far as to leave it out, because every e-commerce website has some kind of advanced search functionality, but they probably do not believe it brings in much revenue.

Research shows:
-    50% of online buyers go straight to the search function
-    34% of visitors leave the site if they can’t find an (available) product
-    Buyers are more likely than Browsers to use search (91%)

What can’t be found, can’t be bought:
-    Search is often mission critical in e-commerce
-    Users don’t know how to spell
-    Users often don’t even know how to describe it

First of all, Findability can accelerate the sales process. And faster sales can increase conversions, because you will not be losing customers who give up trying to find products. Furthermore, fast, precise and successful searches increase your customers’ trust.

On both e-commerce and shopping comparison sites, users can find products in two different ways: searching and browsing. Searching obviously means using the site search whilst browsing involves drilling down through the categories provided by the website. The most common location for a site search on e-commerce sites is at the top of the page, and generally on the right side. Many e-commerce sites have a site search, user login, and shopping cart info all located in the same general area. Keeping the site search in a location that is pretty common will help it to be easier to find for some of your visitors who are accustomed to this trend.

Faceted search should be the de facto standard for an e-commerce website. When a user performs a simple search first, but then on the results page, he or she can narrow the search through a drill-down link (for a single choice) or a check box selection (for multiple non-overlapping choices). The structure of the search results page must also be crystal clear. The results must be ranked in a logical order (i.e. for the user, not for you) by relevance. Users should be able to scan and comprehend the results easily. Queries should be easy to refine and resubmit, and the search results page should show the query itself.

Spell-check is also crucial. Many products have names that are hard to remember or type correctly. Users might think to correct their misspelling when they find poor results, but they will be annoyed at having to do that… or worse, they might think that the website either doesn’t work properly or does not have their product.

Query completion can decrease the problems caused by mistyping or not knowing the proper terminology. Queries usually start with words; so unambiguous character inputting is crucial.

Search analytics, contextual advertisement and behavioral targeting is more than just finding a page or a product. When people search they tell you something about their interests, time, location and what is in demand right now, they say something about search quality by the way they navigate and click in result pages and finally what they do after they found what they were looking for.

A good e-commerce solution uses search technology to:

-    Dynamically tailor a site to suit the visitors’ interests
-    Help the user to find and explore
-    Relate information and promote up- and cross sales
-    Improve visitor satisfaction
-    Increase stickiness
-    Increase sales of related products or accessories
-    Inspire visitors to explore new products/areas
-    Provide-increased understanding of visitor needs/preferences

–> Convert visitors into returning customers!

Tobias Berg

The difference between Search and Find

January 23 - 2011 | Tobias Berg

Is “Findability” only a buzzword to describe the same thing as before when talking about search solutions, or does it bring something new to the discussion? I’d like to think the latter and this week I read a blog post describing the difference between search and findability in a very good way. I couldn’t have written it better myself :)

For the lazy one, I’ve picked a quote that is the key element in the post:

Findability: introducing the robot waiter

Imagine you’re in a futuristic restaurant and when the robot waiter approaches, you ask for ‘ham and cheese omelette’. In response he just shrugs his robotic shoulders and says ‘not found – please try again.’ You then have to keep guessing until you find a match for something you’d like to order.

Now imagine a second futuristic restaurant where the robot waiter says ‘Mr Grimes, how lovely to see you, the last time you visited you had A and B and gave them a 5 star rating. People who ordered x, also ordered y and found that the wines a, b and c went really well with it.’At first restaurant the menu was searchable (though regretably the ‘ham and cheese omelette’ query didn’t match anything), at the second restaurant the menu was findable.

To me, this analogy is spot-on. I dare to say that making content searchable is more of a technical issue while reaching great findability requries understanding of the business. Why is that?

Well, making a content repository searchable you “only” need to hook up a connector, index the repository and display a search box to the users. To succeed with this, it doesn’t matter if the content is movie reviews, user manuals, reciepes, a product catalog or whatever. What you need to know is the format of the repository (is it a SQL database, filesystem, ECM, etc.).

But if you want your users to find what they want in your repositories, business knowledge is a requirement. It’s true that you help your users find information by implementing technical stuff likequery completion, facets, did-you-mean, synonym dictionaries, etc. But if they are to be of any help you need to present facets that are useful, populate the synonym dictionary with terms used in your organisation,etc. For example, a good synonym file targeted towards nurses and doctors would be very different compared to one targeted at employees at an insurance company.

Caroline Abrahamsson

Findability blog: Wrapping up 2010

December 23 - 2010 | Caroline Abrahamsson

Christmas is finally here and at Findwise we are taking a few days off to spend time with family and friends.

During 2010 we’ve delivered more than 25 successful projects, arranged breakfast seminars to talk about customer solutions (based on Microsoft, IBM, Autonomy and Open source), meet-ups in a number of cities as well as networking meetings for profound Findability discussions and moving in parties for our new offices.

At our Findability blog we have been discussing technology and vendor solutions (Microsoft and FAST, Autonomy, IBM, Google and open source), reasearchconferences, customized solutions and how to find a balance between technology and people.

Some of our posts have resulted in discussions, both on our own blog and in other forums. Please get involved in some of the previous ongoing discussions on “Solr Processing Pipeline”,  “Search and Business Intelligence” or “If a piece of content is never read, does it exist?”  if you have thoughts to share.

Findability blog is taking a break and we will be back with new posts is January.

If you have some spare time during the vacation some of customers run their own blogs, and good reading tips within Findability are the blogs driven by Kristian Norling (VGR) and Alexandra Larsson (Swedish armed forces).

Merry Christmas and a Happy New Year to you all!

Maria Johansson

Tagging, social networks and interaction

December 20 - 2010 | Maria Johansson

Events the past days has got me thinking about the power of social tagging and its connection to findability. Thoughts that commend me to writing my most personal (and perhaps off topic) post yet on this blog. (All thoughts expressed in this post are my own and do not necessarily reflect the opinions of my employer.)

Rumors about the shut down of Delicious have been circling the web. Even though it is still unconfirmed from Yahoo, my Twitter feed has been filled with comments about how to save your bookmarks, export bookmarks to other services, petitions to Yahoo about saving Delicious or making it open source.

Traditionally when talking about user tagging of content the topic is re-finding things. Users tag information on the web or an intranet in order to be able to find their way back to them. However most of the comments that I’ve seen about Delicious being shut down has nothing to do with this. As I see it, users don’t claim to be missing the bookmarks themselves, but the social network, research, collaboration and search capabilities that came with the bookmarking service. Delicious seems to have emerged from a service that helps you bookmark your things for re-finding them to a service that helps you find new things based on the tagging of others. Tagging, or social bookmarking may very well have started as a way of re-finding your information but has grown into a new way of discovering information, in parallel to search. (Maybe that is an explanation to the tweets wishing for Google to buy delicious from Yahoo?)

So, tagging can not only help you re-find your own stuff but also explore new things and spread information. One good example of this is what is currently going on in the swedish Twitterverse. It all started with one journalist’s discussion with her friends about the disbelief towards the women accusing Julian Assange of sexual assault. It quickly turned into so much more; a profound discussion about the fine lines of sexuality, what is OK, what we want and like and how to say no. Using the hash tag #prataomdet swedish twitter users are writing about and discussing their experiences in an effort to change the cultural climate so that people talk about it, start communicating with each other about sexuality. You can easily follow all the tweets real time and read blog posts on the topic at prataomdet.se. Many of the major news sites have now started reporting on this as well after the massive activity on twitter. (For non-swedish speaking readers an effort has also been made to start discussions in English as well at #talkaboutit on twitter.)

The feed in itself is thought provoking and can easily keep you busy for hours. Besides the content and openness of the discussions I find something else amazing. In a matter of hours this one tag joined together users, many of whom have never interacted with each other before, helping them share and find new information about something that was unspoken of earlier. Combining the power of social networks and tagging made this possible.

I usually write very different sorts of blog posts at this blog. This one time I just wanted to revel over the amazing possibilities for interaction that technology offers us today. Then maybe the next step is to think about how to tap into this power of interaction and how findability within the enterprise can benefit from this as well. In the mean time I recommend reading about What social networks reveal about interaction or how Västra Götalands Region are currently working on incorporating user tagging into their metadata.

Mattias Brunnert

If a piece of content is never read, does it exist?

December 10 - 2010 | Mattias Brunnert

Since ancient times, information technology has developed from carvings in rock and wood to cell phones and Facebook. Still, the basic purpose remains the same; to facilitate communication between people separated by space and time. Therefore one can measure the successfulness of any information tool by two axes: how easy it is to create information and how easy it is to consume it. Being a Findability expert, I spend a large part of my life focusing on the latter. Therefore it troubles me that so many organizations wait so long when they are introducing new content management systems before looking at search. If I had a nickel for every time I heard “we are currently busy with building our new intranet/web page/collaboration tool and will look at search when the project is finished” I would definitely have had a few quarters by now.

I like to say that I am in the information marketing business. What I mean by that is that Findability is all about marketing information so that the consumers, your employees, can find the piece of information they need. And just as an industrialist would not construct a factory before doing a marketing plan, you should not build a new information repository without thinking about how the content created in that repository will reach its target audience. When marketing information, search is one of your most important channels.

While a search solution can definitely smooth out imperfections in information structure and quality using intelligent algorithms, spending a little time thinking about how you can make it easier for a search engine to deliver relevant results presented in a user friendly way can really make it shine. Some questions you can ask yourself are:

  • How can we make tagging so convenient that we have good metadata for presenting and filtering results using facets? Many search solutions have automated tagging functionality to take load off users.
  • How can we use search as an integration platform to pull in content from other sources instead of making costly one-time integrations?
  • How will the new information repository fit into an existing search solution, for example are we changing the metadata model and how should the documents be ranked compared to other sources?
  • Should we migrate content from an old system to the new one or just freeze information creation in the old one and have a search box that let’s the user find information from both?
  • Can we use search to avoid creating duplicate information by encouraging users to make searches before typing new content or even doing implicit searches while the user is typing?

So does a piece of content that no one ever reads exist? Well in terms of bits on a disk in a data center, yes, but in terms of business value definitely no. Designing your information repository for Findability will have great returns in improved efficiency and user satisfaction.

Maria Johansson

Bridging the gap between people and technology

December 6 - 2010 | Maria Johansson

Tony Russell-Rose recently wrote about the changing face of search, a post that summed up the discussion about the future of search that took part at the recent search solutions conference. This is indeed an interesting topic. My colleague Ludvig also touched on this topic in his recent post where he expressed his disappointment in the lack of visionary presentations at this year’s KMWorld conference.

At our last monthly staff meeting we had a visit from Dick Stenmark, associate professor of Informatics at the Department of Applied IT at Gothenburg University. He spoke about his view on the intranets of the future. One of the things he talked about was the big gap in between the user’s vague representation of her information need (e.g. the search query) and the representation of the documents indexed by the intranet search engine. If a user has a hard time defining what it is she is looking for it will of course be very hard for the search engine to interpret the query and deliver relevant results. What is needed, according to Dick Stenmark, is a way to bridge the gap between technology (the search engine) and people (the users of the search engine).

As I see it there are two ways you can bridge this gap:

  1. Help users become better searchers
  2. Customize search solutions to fit the needs of different user groups

Helping users become better searchers

I have mentioned this topic in one of my earlier posts. Users are not good at describing which information they are seeking, so it is important that we make sure the search solutions help them do so. Already existing functionalities, such as query completion and related searches, can help users create and use better queries.

Query completion often includes common search terms, but what if we did combine them with the search terms we would have wanted them to search for? This requires that you learn something about your users and their information needs. If you do take the time to learn about this it is possible to create suggestions that will help the user not only spell correctly, but also to create a more specific query. Some search solutions (such as homedepot.com) also uses a sort of query disambiguation, where the user’s search returns not only results, but a list of matching categories (where the user is asked to choose which category of products her search term belongs). This helps the search engine return not only the correct set of results, but also display the most relevant set of facets for that product category. Likewise, Google displays a list of related searches at the bottom of the search results list.

These are some examples of functionalities that can help users become better searchers. If you want to learn some more have a look at Dan Russells presentation linked from my previous post.

Customize search solutions to fit the needs of different user groups

One of the things Dick Stenmark talked about in his presentation for us at Findwise was how different users’ behavior is when it comes to searching for information. Users both have different information needs and also different ways of searching for information. However, when it comes to designing the experience of finding information most companies still try to achieve a one size fits all solution. A public website can maybe get by supporting 90% of its visitors but an intranet that only supports part of the employees is a failure. Still very few companies work with personalizing the search applications for their different user groups. (Some don’t even seem to care that they have different user groups and therefore treat all their users as one and the same.) The search engine needs to know and care more about its’ users in order to deliver better results and a better search experience as a whole. For search to be really useful personalization in some form is a must, and I think and hope we will see more of this in the future.

Ludvig Johansson

Search is a journey not a destination

December 2 - 2010 | Ludvig Johansson

Two weeks ago me, Ludvig Johansson and Christopher Wallström attended KMWorlds quadruple conference in Washington D.C. The conference consisted of four different conferences; KMWorld, Enterprise Search Summit, Taxonomy Bootcamp and SharePoint Symposium. I focused on Enterprise Search Summit and SharePoint Symposium and Christopher mainly covered Taxonomy Bootcamp as well as the Enterprise Search Summit. (Christopher will soon write a blog post about this as well.)

During the conferences there where some good quality content, however most of it was old news with speakers mainly focusing on outputs of their own products. This was disappointing since I had hoped to see the newest and coolest solutions within my area. Speakers presented systems from their corporations, where the newest and coolest functionality they described was shallow filters on a Google Search Appliance. From my perspective this is not new or cool. I would rather consider this standard functionality in today’s search solutions.

However, some sessions where really good. Daniel W. Rasmus talked about the Evolution of Search in quite a fun and thoughtful way. One thing he wanted to see in the near future was more personalization of search. Search needs to know the user and adapt to him/her and not simply use a standardized algorithm. As Rasmus sad it: “my search engine is not that in to me”. This is, as I would put it, spot on how we see it at Findwise. Today’s customer wants standard search with components that have existed for years now. It’s time for search to take the next step in the evolution and for us to start deliver Findabillity solutions adapted to your needs as an individual. In the line of this, Rasmus ended with another good quote: “Don’t let your search vendors set your exceptions to low”. I think this speaks for it self more or less. If we want contextual search then we should push the vendors out there to start deliver!

Another good session was delivered by Ellen Feaheny on how to utilize both old and new systems smarter. It was from this session the title of this post origins, “It’s a journey not a destination”. I thought this sums up what we feel everyday in our projects. It’s common that customers want to see projects to have a clear start and end. However with search and Findability we see it as a journey. I can even go as far to say it’s a journey without an end. We have customers coming and complaining about their search; saying “It doesn’t work anymore” or “The content is old”, to give two examples. The problem is that search is not a one time problem that you solve and then never have to think about again. If you don’t work with your search solution and treat search as a journey, continually improve relevance, content and invest time in search analytics your solution will soon get dusty and not deliver what your employees or customers wants.

Search is a journey not a destination.

Tobias Berg

LDAP connector for Openpipeline

November 23 - 2010 | Tobias Berg
Finding people within your organisation, also denoted as People Search, is something that is a key ingredient in a findability solutions. People catalogs are often based on an LDAP structure which holds the important information about each employee.

The LDAP connector for Openpipeline is the result of the latest activity at the Findwise development department which makes it easy to make the LDAP structure searchable. As always with a connector, you get direct access to the source which ensures a very efficent indexing and good control over the indexed information.

The LDAP connector has a number of features, some noted below:

  • SSL support – Supports LDAP over SSL
  • Pagination – LDAP entries can be retrieved in batches if the LDAP server supports the PagedResultControl. This increases performance and reduces memory consumption drastically
  • Incremental indexing – If the LDAP server flag each update to an entry with a timestamp, the connector can use this timestamp to only fetch updated entries.
  • Delete entries – LDAP entries that has been removed since the last run will be removed from the index
  • Attribute specification – Specify what attributes that should be returned for each entry. By only retrieving the attributes you need, performance is increased.

Interested of knowing more about the connector, or do you have any experience you like to share when indexing LDAP directories? Please drop a comment!

Anders Rask

Apache Nutch making use of Open Pipeline

November 11 - 2010 | Anders Rask

During the last couple of months I’ve been working on a project for Uppsala University. The project’s goal is to improve the findability on the university web site. The solution that we are working on is based on Apache Nutch 1.1 in conjunction with Apache Solr 1.4. Nutch provides us with a robust web crawler that scales very well and also gives us a page rank for each page that we can use for relevance tuning. Besides the web information crawled by Nutch, the search application will also be used to search people and organizational information that we index from another source. I thought that I would share some details on how we are using Nutch.

We have made two extensions to Nutch, one is a parser plug-in that can run Open Pipeline embedded in it. This was an important extension in order to get better control of the information that we index to Solr and also to be able to reuse our different Open Pipeline components. The main stages of the pipeline are the following:

  1. Extract the encoding of a web page
  2. Extract all links from a web page
  3. Extract all headings (hx) from a web page
  4. Remove all tags that don’t contain complete sentences on a web page
  5. Extract text and metadata from different types of documents with Tika
  6. Do some metadata mapping and cleaning
  7. Populate facets according to metadata and/or URL
  8. Do static URL ranking
  9. Replace certain common titles with the largest heading of the web page

The other extension we made to Nutch is an indexing filter that makes sure all our metadata fields are indexed to Solr.

So far so good. The fetching, parsing and indexing works well now and currently our largest challenge is tuning all the different relevance parameters we have, as well as harmonizing the relevance of web information to that of people and organizational information. I will have to get back to you on how that went!

Maria Johansson

Metadata in focus for our Findability solution

October 28 - 2010 | Maria Johansson

Last week Kristian Norling wrote a blog post about how they work with metadata at Västra Götalands Regionen (VGR). In the beginning of his post he states that metadata is boring, but extremely useful. A teacher in statistics that I had in college used to say that statistics is the most boring thing there is. It’s the things that you can do with statistics that makes it really interesting. So I agree with both of them, the metadata (or statistics) in itself is quite boring, but the things you can do with it is what makes it all worth it. The quality and structure of information must also be in focus when creating Findability solutions that aim to provide easy access to all information inside and outside the firewall.

Findwise is currently working on improving our findability solution which is our intranet. When we investigated our own business and user needs we learned that there is a need for a more flexible way of organizing information so it can be found from different entrypoints as well as in different contexts. Therefore one of the things at the heart of our intranet (except the search functionality off course) is metadata. As a fast growing  (and changing) company we find it hard to create and maintain one single information hierarchy that is intuitive and self-evident to all our employees.  Instead we are working with a taxonomy with a simple set of categories and concepts. All content is tagged with what, where and who.

Who describes which people or groups are allowed to see a document. It can be everyone, a single person or a group of people such as the finance department, or a project team. Since knowledge sharing is very important for our organization most of the information is open for everyone to see and use.

Where describes which sites the content should be visible on. A single document can be visible on several sites. So if contact details for a customer is relevant to show on several projects for that customer the same content can be displayed on all the different project sites, without us having to store duplicate versions of the content.

What describes the concepts the content relates to. These concepts include customers, projects, products & competences, information types as well as categories that are created through the means of user generated tagging. This way one single document does not have to belong to one specific site or folder, but can be displayed in several different and all relevant locations on the intranet. Thanks to this use of metadata it is also possible to use the different categories for search and faceted navigation. For example I can view all design specifications from different customer projects that include the concept faceted navigation, or all information about how to work with search analytics with the search platform Autonomy IDOL. The concepts and the information becomes the focus instead of the location where it is stored.

In the first stage this will be done manually as content is added to the intranet. In the future it would also be of interest for us to utilize the same type of service that we developed for VGR, for our own content. But instead of using controlled vocabularies such as MeSH we use our own taxonomy and the power of search technology to suggest or automatically add appropriate customers, projects and categories for a document. A first step in this will probably be to use entity extraction techniques to identify and automatically tag already existing documents with concepts such as customers and search platforms.

We hope to share our experiences from this project with you in the future. In the mean while I recommend that you read Kristian’s post about how they use different types of keyword metadata at VGR.