Remember Me
forgot your password?

Web Design Articles - Captcha: Concept & Interpretation

CAPTCHAs are generally one or two words presented as graphics, overlaid with some kind of distortion, and they function as a test that relies on your human ability to recognize them. CAPTCHA stands for "Completely Automated Public Turing test to tell Computers and Humans Apart." The CAPTCHA innovation was pioneered by developers at Carnegie Mellon University. The idea behind it was to develop a means of distinguishing between people and web robots, so that web sites could offer their resources to individual humans without being exploited by robots.

The Need for CAPTCHA

Site owners face a number of unique challenges in protecting their resources from automated harvesting. These include:

* Resources may be expensive to provide, and machines can consume far more data far more quickly than humans. Therefore, services that are machine-accessible may prove prohibitively expensive to maintain.

* Allowing bots to post comments and user-generated content opens a floodgate for spammers, which inevitably results in massive volumes of spam -- often to the point where a service becomes unusable.

* Data may be highly sensitive, such as personal medical or financial information, and needs to be sufficiently protected to prevent against attacks from data-mining robots.

* Interactions with a system may have fundamental implications for society as a whole; consider the issues that would arise in the case of electronic voting.

The Problem with CAPTCHA

CAPTCHA systems create a significant accessibility barrier, since they require the user to be able to see and understand shapes that may be very distorted and difficult to read. A CAPTCHA is therefore difficult or impossible for people who are blind or partially sighted, or have a cognitive disability such as dyslexia, to translate into the plain text box.

And of course there can be no plain-text equivalent for such an image, because that alternative would be readable by machines and therefore undermine the original purpose.

Since users with these disabilities are unable to perform critical tasks, such as creating accounts or making purchases, the CAPTCHA system can clearly be seen to fail this group.

Such a system is also eminently crack able. A CAPTCHA can be understood by suitably sophisticated scanning and character recognition software, such as that employed by postal systems the world over to recognize handwritten zip or postal codes. Or images can be aggregated and fed to a human, who can manually process thousands of such images in a day to create a database of known images -- which can then be easily identified.

Recent high-profile cases of bots cracking the CAPTCHA system on Windows Live Hotmail and Gmail have highlighted the issue, as spammers created thousands of bogus accounts and flooded the systems with junk. Even more recently, security firm Websense Security Labs have reported that the Windows Live CAPTCHA can be cracked in as little as 60 seconds.

One CAPTCHA-cracking project, called PWNtcha ("Pretend We're Not a Turing Computer but a Human Antagonist"), reports success rates between 49% and 100% at cracking some of the most popular systems, including 99% for the system used by LiveJournal, and 88% for that employed by PayPal.

Thus, the growth and proliferation of CAPTCHA systems should be taken less as evidence of their success than as evidence of the human propensity to be comforted by things that provide a false sense of security.

It's ironic that CAPTCHA can be defeated by those who are sufficiently motivated, when they're the very same people the test is designed to protect against. Just like DRM, CAPTCHA systems ultimately fail to protect against the original threat, while simultaneously inconveniencing ordinary users.

Alternatives to CAPTCHA

Non use of CAPTCHA would increase the volume of unwanted traffic, sometimes to an unmanageable extent.

Clearly there's a need for something. So what are the alternatives to CAPTCHA?

Non-linguistic Visual Tests

Tests that use images other than words may be generally easier for users, since all they have to do is comprehend an undistorted picture, rather than decode distorted language.
The system shows you a set of nine images, three of which are kittens. You have to identify the three kittens in order to pass authentication.

Audio Tests

An alternative to a visual CAPTCHA test is an audio test, where a series of words or letters are spoken out loud and offered to users as an audio file; this audio is also overlaid with distortion of some kind, in the same attempt to prevent programmatic decoding.

However, such tests have exactly the same issues as visual CAPTCHAS. They solve the visual issue, sure, but they do so by introducing another, equally problematic barrier. People who are deaf and blind, who work in a noisy environment, lack the necessary hardware for sound output, or are unable to understand the sound due to a cognitive disability, or even a language barrier, are no better supported than with a conventional visual test.

Also, audio tests are as equally vulnerable to being cracked by suitably motivated bot programmers as visual ones.

Logical or Semantic Puzzles

Eric Meyer's Gatekeeper plugin for WordPress works by asking a simple question, framed in such a way as to make it extremely difficult for machines to understand while blatantly obvious to humans. Would you get this one?
Other questions might be "What color is an orange?" or "How many sides does a trangle have?"

The major lacuna of this system is its scope. It has a limited number of questions and answers and is therefore vulnerable to brute-force attack. That problem can be reduced -- but not solved completely -- using flood-control (preventing a single user from making multiple attempts within a certain timeframe) and by ensuring that the selection of questions is large and frequently changed.

But the system is also underpinned by assumptions of knowledge. Ideally, the questions should be so simple that a child could answer them easily -- as is certainly the case in this example. But for every question, we still have to assume that any human can answer it, which may not be true, especially when you factor cognitive disability or language barriers into the equation.

Individual Authentication

For the highest level of security, individual authorization is always required. To log in to online banking, pay a credit-card bill, or vote, the system needs to know not just that you're a human, but that you're a specific human.

This kind of authentication could be harnessed to provide a lower level of certainty in more general applications, as authentication for a system where your specific identify is not required -- only that you're a person.

The simplest approach here is to require users to register before being able to comment, post, or add content to a site. This certainly reduces the amount of casual spam that a system might get, but it does nothing to put off a determined spammer who's prepared to take the time to create an account.

It's not difficult to find large numbers of people prepared to do this kind of work for next to nothing, given the wide range of living costs across the world economy. It would be trivially cheap for a spammer in a rich country to pay people in a poor country to do this kind of work all day.

Centralized Sign-on

A system of centralized sign-on can mitigate the potential for abuse by putting all the impetus on a single system to authenticate users once, and then give them free rein thereafter.

Systems such as Microsoft Passport offer this kind of centralization; however, they also create significant privacy questions, as you have to be prepared to trust your personal data to a single, commercial entity (quite apart from the fact that Passport uses CAPTCHA authentication!).

However, a most promising alternative to this has recently begun to gain traction, in the form of OpenID. The OpenID system avoids privacy issues because it isn't limited to a single authentication provider -- you can pick and choose, and change at any time, who you trust to hold your authentication information. This information in turn is not revealed to the site you're visiting; therefore, it offers a convenient means of centralized authentication without the attendant privacy issues.

The weak point of the system is how you obtain an OpenID in the first place, since some form of authentication is going to be required there. Simply having an OpenID is not enough to prove that you're a legitimate user, so the onus would end up being on individual sites or OpenID providers to police the use of OpenID; for example, by banning OpenIDs that are known to be spammers. This in itself could end up being a minefield for disputes.

OpenID is a good idea, and is bound to catch on, but in itself does not address the issue at hand any better than individual authentication.

Non-interactive Solutions

We've looked at a number of interactive solutions now, and seen how none of them are entirely perfect, either for protection from robot attack, or for reliably identifying humans without introducing accessibility barriers.

Perhaps the solution lies with non-interactive solutions. These analyze data as its being submitted, rather than relying on users to authenticate themselves.

Honey Traps

The idea here is that you include a form field, which is hidden with CSS, and give it a name that encourages spam bots to fill it in, such as "email2." The human user will never fill it in because they don't know it's there, but the bot won't be able to tell the difference. Therefore, if that field contains any value when the form is submitted, the submission is rejected.

The problem is that assistive technologies may not be able to tell the difference either, and so their users may not know not to fill it in. That possibility could be reduced with descriptive text, such as "do not complete this field," but doing that may be very confusing, as well as being recognizable by a bot.

Another variant of this is a simple trap that asks human users to confirm they're not robots. This could take the form of a checkbox, like this one.


In both these examples, however, bots could learn to recognize the trap and thereby circumvent it. It's one of those things that only works as long as not many people are using it -- as soon as it became prevalent, on high-traffic sites like Digg or Facebook, the spammers would simply adapt.

Session Keys

A partial solution for form submission is to generate a session key on the fly when building the original form, and then check that session key when the form is submitted. This will prevent bots that bypass the form and post directly to its target, but it does nothing to stop bots that go through the regular web form.

Limited-use Accounts

One way for a system such as free email to limit abuse by robots is to deliberately throttle new accounts for a period of time; for example, by only allowing ten emails to be sent per day for the first month.

However, this approach may not ultimately help. It may reduce the incidence of abuse on a per-account basis, but it doesn't prevent abuse entirely. There's also nothing to stop a spammer from simply signing up for thousands of accounts and sending ten spam emails from each one. And of course, such a limitation may affect legitimate users as well, but legitimate users aren't going to be inclined to sign up for multiple accounts.

Conclusion

Don't make users take responsibility for our problems.

Bots, and the damage they cause, are not the fault or responsibility of individual users, and it's totally unfair to expect them to take the responsibility. They're not the fault of site owners either, but like it or not they are our responsibility -- it's we who suffer from them, we who benefit from their eradication, and therefore we should shoulder the burden. Moreover, the common theme with all interactive alternatives is that they fail users who have a cognitive disability, or don't understand the same cultural cues as the author, or use assistive technologies. The more stringent the system, the higher the bar is raised and therefore the greater the chance of failing to recognize or admit a real human.

The Future

It's clear that both interactive and non-interactive tests will continue to be used by site owners for the foreseeable future. Developers will try to come up with new and better tests, and spammers will continue to find ways of cracking them; it's very much a vicious circle.

Perhaps, at some point in the future, somebody will come up with a test that is truly reliable and uncrackable -- something that identifies humans in a way that cannot be faked. Maybe biometric data such as fingerprints or retina scans could factor into that somewhere; perhaps we'll have direct neural interfaces that identify the presence of brain activity.

So go gaga about CAPTCHA!

Anantait

Head SEO, Marketing at AIT India

Rate this Article: 0 / 5 stars - 0 vote(s)
Print Email Re-Publish

Add new Comment



Captcha

  • Latest Web Design Articles
  • More from Anantait

Web Development India: Dabble Into Pure Web Development

By: Arun Kumar | 12/11/2009
In today’s internet dominated world, website development has become a much patronized field – credit goes to the never-ceasing progress of technology. Web developed saw its first seeds sown in the development of platforms for virtual communication and knowledge base germinated on websites.

Flash Web Design: The Unanimous Winner

By: Arun Kumar | 12/11/2009
Flash is that sort of a website development platform that comes cheap despite its awe-inspiring results. A product from the world renowned Macromedia Group, a California based graphics, web development and software firm, Flash is now taken over by Macromedia’s own rival Adobe Systems which in fact controls the line of all Macromedia products.

Understanding Many Aspects of Website Development

By: spinxwebdesign | 12/11/2009
A website development process consist many stages and there are many aspects which needs to be taken care from start till the end. These stages behave as milestones to achieve your final goal and satisfy your business needs.

Create a marvelous website with easy website builders

By: Mantoo Joinx | 12/11/2009
With the Easy Website Builders one can have the website built in the best manner. With the Easy Website Builders you can have the website built along with its placement on the reliable servers. You will also be assisted in the website promotion.

Choose the right seo website design company

By: Chris Genge | 12/11/2009
It is absolutely essential for anyone to select an expert seo website design company for your website because you can not handover your source of earning to anyone. It should be taken care of well by only expert hands.

Contribution of Graphic Designers in Designing Process in Kelowna

By: Michael Horny | 12/11/2009
Graphic designers are the people who carry out the graphic arts in a professional manner, who congregate the images, symbols and words to convey the message in a different manner.

Flash Website Designing - Usage in e-Business

By: Amya Adora | 12/11/2009
Flash Website Designing helps the web today, Which is a gamut of websites that are designed in varied creative techniques which make them appear attractive and definitely more interactive. Businesses that wish to acquire a stronger presence in the internet, are readily adopting flash website designing

Quick Reference Links To Fight Iframe Injections

By: Joseph Schembri | 11/11/2009
I have had many requests from people reading my articles on combatting iframe injection to create a quick start guide with the various links one can use to detect and recover from iframe injection attacks.

Developing Internet Business

By: Anantait | 13/11/2008 | Internet Marketing
The Web has brought opportunities that most of us never even imagined. This includes the ability to quickly start a business with a little investment. The Web has created many self-made millionaires. Many of these made their money from the comfort of home.

SEO Secrets Revealed

By: Anantait | 13/11/2008 | SEO
It is a legitimate question, whether you should go for pay-per-click or for SEO for organic search engines. It is no more a point of research that search engines drive a huge traffic to your website. By doing that it will be a reinventing the wheel! As all of you know search engine marketing is necessary to get quality traffic to your website.

How to Get Free Inbound Links to Your Website

By: Anantait | 01/11/2008 | Link Popularity
If you want your website to be popular on the Internet then it simple cannot be isolated. You must find ways to make other websites link back to your website. Lets look at how you can create inbound links to your website. There are many reasons why inbound links matter. First of all, they build your page rank or popularity. Page rank is basically a scale in which your site is ranked which is based on the number of links coming into your website. The more links, the higher the page rank.

Buy Links or Buy Trouble?

By: Anantait | 01/11/2008 | Link Popularity
The big debate raging on the Internet these days focuses on Google‘’s recent campaign to ensure that paid links aren”t part of the determination of popularity of a website. Their take is that if you buy links, you”re trying to skew the results of any search in order to increase your presence in the rankings. But wait, isn”t that what any successful company wants to do? Increase their presence and popularity?

Increase the Strength of Links to Your Site

By: Anantait | 01/11/2008 | Link Popularity
When it comes to promoting your site online it can really be quite a drag. There are many resources out there to help you but this either means spending lots of time building links manually or using spamming software to litter you link all over the net. Personally I tend to delete spam and so does the mighty google. So what do you do spam or spend loads of time looking for link partners, it seems like a lose, lose to me.

Affiliate Marketing: Finding the Golden Needle in the Internet Haystack

By: Anantait | 29/10/2008 | Affiliate Programs
Finding the affiliate marketing program can be a daunting task. There are hundreds if not thousands of products, services and marketing plans to choose from, so how does someone get started finding which is best for them. Choosing a program on profit or percentage may seem like a good thing to consider first but it is not the best place to start. Personality and interests or hobbies, should be the starting point in deciding which program is right for you.

Finding Affiliate Program Providers

By: Anantait | 29/10/2008 | Affiliate Programs
There are at least 10,000 affiliate program providers out there, if you consider each company or individual which markets products through affiliates. They should provide you with tools to sell their products, such as text ads and banners.

Affiliate Marketing: the Importance of the Giveaway Item

By: Anantait | 29/10/2008 | Affiliate Programs
Here, we are going to assess the value of giveaway items in affiliate marketing programs and why you should make them a part of your own marketing plan. You can easily figure out from the first sentence that I do believe that using giveaways is a good idea and I”ll tell you why. Also, I”ll tell you what to look for and what not to do in your own marketing efforts.

Submit Your Articles Free: Signup
Article Categories




Use of this web site constitutes acceptance of the Terms Of Use and Privacy Policy | User published content is licensed under a Creative Commons License.
Copyright © 2005-2008 Free Articles by ArticlesBase.com, All rights reserved. (0.93, 1, w1)