Cuisinarts, E-Commerce, and ... Controlled Vocabularies



January 01, 2002
URL:http://www.drdobbs.com/cuisinarts-e-commerce-and-controlled-vo/184412772

WebReview.com: Cuisinarts, E-Commerce, and ... Controlled Vocabularies

My friend related this interesting e-commerce experience to me recently.

He wanted to buy a food processor. Being the sensible guy that he is, he wanted quality but didn't want to pay an arm and a leg for it. A used Cuisinart would fit the bill quite nicely. So my pal fired up his browser and went on his happy way to eBay, where he proceeded to enter a search for exactly what he thought he was looking for:

"Cusinart."

Now, my friend is a pretty bright guy. He has a PhD and a host of publications to his credit. He's been a successful entrepreneur and academic. Nowadays he's helping the federal government get up to speed on e-commerce. So misspelling Cuisinart is obviously independent of intelligence—something that could happen to anyone.

It gets better. He submits his search and, voila, eBay has a "Cusinart" available. He bids successfully, and the misspelled "Cusinart" shows up on his doorstep a few days later.

You think that's the end of story. It's not. My friend eventually realizes his mistake and out of curiosity goes back to eBay. This time he searches for the correctly spelled "Cuisinart" and finds lots of them listed—as well as lots of bids pending. More bidders mean higher prices. The upshot is that if my friend had spelled the word correctly in the first place, he'd probably would have spent roughly triple the amount for a "Cuisinart" rather than his bargain-priced "Cusinart."

So what's my point? Well, it's not so much about how to make the most of your eBay experience (besides, there are already whole books on playing eBay to your advantage). Instead, it's just another illustration of how using controlled vocabularies can be the difference between a productive, efficient site and ... well, giving away your old Cuisinart at a third of its worth.

Controlled vocabularies and the guessing game

Whether you realize it or not, you're already familiar with controlled vocabularies. The Library of Congress subject headings and Yahoo's search criteria are a couple of examples. So, as you've probably guessed by now, controlled vocabularies are predetermined sets of terms that fit together to describe a specific domain such as kitchen appliances, nuclear engineering, or dirt biking.

The terms are standardized because language is ambiguous. People use different terms to say the same thing all the time. Or, worse yet, the same terms can mean different things. Sometimes folks just honestly screw up—like my friend did.

By predetermining the terms that make up a controlled vocabulary, and using those terms to describe your site's content, you can minimize the negative effects that variants, synonyms, and various other annoyances can have on your site and its users.

Here's an example. Let's say you're a webmaster at AT&T. Your site describes a huge host of products; one of them is the One Rate plan. There are many pages in your site that deal with One Rate. The problem is that there isn't a standardized spelling for it. So, some pages are about the "One Rate" plan, others describe it as "1 Rate" and on and on. Here are some of the possible references:

In this case, a patient user might eventually guess the right variant. But, as we all know, patience is a rare commodity on the Web. Without an effective controlled vocabulary strategy, users who enter the wrong term would find nothing. Consequently AT&T would lose a number of new business opportunities. This is where e-commerce degrades into e-guessing.

The guessing gets a lot more difficult with synonyms. Let's say a user visits a financial services site to find information on "income dividends." Would he or she have guessed that much of the content was listed under the following synonymous terms?

It's unrealistic to expect users to even bother trying to make guesses. Which, if you've invested any amount of time or money into building your web site, is really bad news.

Users who are browsing a site benefit from controlled vocabularies because they will find the information they need in one place under one heading. It also makes your life as a webmaster easier since you don't have to make up a new label for each piece of information you want to add to your site—just choose from the controlled vocabulary.

An even better approach (though more work) is to consider expanding your controlled vocabulary into a thesaurus that includes variant, related, broader, and narrower terms, as well as glossary definitions. That way, a user browsing your site could, for example, look for "investment income" and be directed to content indexed under the standard term, "income dividends." Searching can be similarly improved—the user's query, "1Rate," would be enriched to automatically include "One Rate" and the variants that have already been mapped to "One Rate," terms that the user hadn't considered.

Resources

If you are going to explore using a controlled vocabulary, here are a few resources. Visit the American Society of Indexers web site, which includes a listing of web-based resources. You might even consider joining ASI, or hiring its members. You might also read Peter Morville's Web Architect column on developing a thesaurus where he defines "a controlled vocabulary that leverages synonymous, hierarchical, and associative relationships among terms to help users find the information they need."

Tips

Junk in, junk out

Looking around the Web and you'll find a wasteland of chaotic vocabularies. It's almost amusing: with all the hype about e-commerce these days (somehow I remember commercial web sites existing long before I ever heard that term), webmasters are going nuts for taxonomies to describe their products and services. But you don't hear too much about what terms should be used to actually populate those taxonomies.

Same thing goes for the corporate portal and the Yahoo-ized intranet. Vendors of portal software, XML-based approaches, and other products espouse metadata as solutions to the challenges of searching and browsing. Again, their solutions are only halfway there. They provide you with descriptive metadata fields, but you'll still need standardized terms to enter into those fields. Otherwise, it's simply another case of junk in, junk out.

Terms of Service | Privacy Statement | Copyright © 2024 UBM Tech, All rights reserved.