« The General Motors of publishing? | Main | What is the difference between RSuite CMS and MarkLogic Server? »

December 29, 2008

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83453675c69e2010536a2cb5e970c

Listed below are links to weblogs that reference "Taxonomies are dead. Long live metadata!":

Comments

Michael Puscar

In my opinion, this definitely depends upon the context in which the user is browsing/searching content. Personalized metadata tagging is a way of tagging and identifying content that has already been discovered. However, unified controlled vocabularies are necessary for undiscovered content.

Another perspective is that taxonomies and ontologies provide value because of their relationships. Sure, a map of Botswana could be tagged with "Ohio" because it is relevant to a particular user, but that only provides discoverability through searches that use the word "Ohio". This is the difference between metadata keyword tagging and metadata concepts tagging.

Ontologies allow more abstract searches. Take, for example, the case where the map of Botswana is tagged with the keyword "Botswana" or "Africa". Because it is a type "map" in an ontology, search terms such as "Kalahari Desert" and "Limpopo River" should also produce this result. You could also make the argument that search terms which match the Botswana culture should also produce this result.

It is not possible to allow flat metadata word tagging and then to layer in relationships, because the English language is riddled with multi-use words and context is of critical importance.

In my opinion, the multi-tiered relationships that taxonomies and ontologies allow users to leverage in search are difficult to replace. The biggest challenge, and the reason why publishers have sought alternatives, is building reliable and unified taxonomies that have enough topical coverage. There are several ways to adress this, but it is clearly an ongoing challenge.

Michael Josefowicz

Some of the new tech might imply that the reliance on words only as the pointed end of search is being supplemented by searching on sounds or images.

One example is the app for an iPhone that allows "listening" to a song and being taken to a site with ? (the information about that song? the next place that the band is playing? the closest place to buy the record?)

The other example is the app that allows a photo to be taken of a product, then being taken to a website that ? (more details? cross sell? instructions for huse?)

While sounds and images have always been a critical way that humans search in the physical world, it's only now that they can be computerized.

Is there work being done on an ontology of music and art?

annvmichael

Michael P - Not being an expert (some of my colleagues are, but not me) I would say that a mix of both are required - especially for professional content (physicians, lawyers, accountants, etc). It would be neat if individuals could maintain their own "portable" tag set and so if Ohio is meaningful to me (related to a picture of Botswana) I can save that without confusing the larger searching public.

Michael J - I love Shazam (the song recognition software that I have on my iPhone). I will see if anyone knows of anything being done with music and art ontologies. It would be so cool if we could actually store a visual or audio pattern as a tag and recognize it where it occurs.

Andrea Laue

I agree that simple-minded applications of taxonomies are outmoded. In the same breath I'd argue that the increasing sophistication of tagging and text-mining technologies are best applied in parallel with taxonomies. I see the two as being good at different things.

Taxonomies are analytical tools for representing the content and (via additional processing) structure of a set of resources. Social tagging is good at providing cues to the temporal and contextual significance of a given resource. Automatic extraction of terms provides a hybrid of content and contextual information, providing a snapshot of a resource based on a vocabulary current at the time of authorship.

Taxonomies, particularly when applied by a subject matter expert, are excellent educational tools. Parallel implementations of taxonomy, thesauri and NLP tools can teach a vocabulary while also facilitating discovery -- an experience that will improve my current and future search results.

A taxonomy can also provide an invaluable bridge between languages. Although machine translation is improving, a translated taxonomy does a much better job of capturing domain-specific and nuanced terms. (For instance, see the National Agriculutral Library Thesaurus (NALT), a robust and constantly updated -- and backward compatible -- vocabulary in English *and* Spanish.)

And, taxonomies can be an invaluable resource when building browse tools. While building a usable visualization of a large repository remains non-trivial, an applied taxonomy offers tested and tried structures to visualize.

On the topic of taxonomies for art, see Iconclass.

annvmichael

Thanks Andrea!

Sarah Bourne

I agree that we need room for both user-generated supplied tags in conjunction with more formal taxonomies/ontologies. Why? Because they provide different findability affordances.

We In Massachusetts have been working to consolidate the hundreds of standalone websites into a single "portal" - http://www.mass.gov: hundreds of thousands of web pages and download documents, with millions of users trying to find particular information using different approaches. Our primary topic-tree navigation is well used, most likely by people who may not have a precise idea of what they are looking for. Search is heavily used, and I would guess that this is heavily represented by people who are looking for something very specific, based on prior knowledge.

As our collection grows (and our budget doesn't) augmenting search with user-contributed keywords and phrases looks very attractive, as does mapping search terms and results to formal structures to enrich the search experience and perhaps to replace some of our manual navigation-building. The most opportunities are possible by embracing metadata from as many places as you can get it.

annvmichael

Thanks Sarah -

You bring up a great point about scalability in the face of limited budgets.

Even if not faced with limited budgets, it seems an impossible task to internally retain the ability to tag the continuously and rapidly growing amount of information that needs it.

Happy New Year!
Ann

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.

Our Company