We’re going to learn how how CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) works and how it minimizes automatic sign-up of forms. We will also be creating a simple CAPTCHA script in PHP to illustrate this.
Basically CAPTCHA works in the following manner:
- Create Random Value: Some random string is generated, random values are often hard to guess and predict.
- Generate an Image: Images are used as these are generally a lot harder to read for computers while being nice and readable to humans. This is also the most important step as simple text in images can be read (and CAPTCHA cracked) quite easily. To make it difficult for them, developers employ different techniques so that the text in the image becomes hard to read for computers. Some create zig-zag lines for background while others twist-and-turn individual characters in the image. Possibilities are many and new techniques are being developed all the time as crackers are always into finding ways to break them.
- Store it: The random string generated (which is also in the image) is stored for matching the user input. The easiest way to do so is to use the Session variables.
- Matching: After the above step, the CAPTCHA image is generated and shown on some form which we want to protect from being abused. The users fills in the form along with the CAPTCHA text and submits it. Now we have the following:
- All submitted form data.
- CAPTCHA string (from form), input by user.
- CAPTCHA string (real one, generated by us), from session variable. Session variable is generally used as it can keep stored values across page requests. Here, we needed to preserve stored values from one page (form page) to another (action page-that receives form data).
- If both match, it’s okay otherwise not, in that case we can give the user a message that the CAPTCHA they had entered was wrong and their form could not be submitted. You could also ask them to verify it again.
From the above image it’s quite clear that when someone requests the form page, the CAPTCHA text is generated and sent back to requesting user, but only in the form of an image. If the requester is a human he’d not have much difficulty reading the image and inputting the text when asked but if it’s a bot it might face difficulties guessing whats in the image. In the next step when we match the string generated and the one the user had input, we can restrict automated form submissions.
The following is the code that does this, it’ll just output the CAPTCHA image to the browser when the script is requested:
/********************************************************* File: captcha.php ** Author: Snehal Masne ** Date: 12-Mar-2009 ** Description: This file can be embedded as image ** to show CAPTCHA/ *********************************************************/// The number of characters you// want your CAPTCHA text to havedefine('CAPTCHA_STRENGTH', 5);/***************************** INITIALISE *****************************/// Tell PHP we're going to use// Session varssession_start();// Md5 to generate the random string$random_str = md5(microtime());// Trim required number of characters$captcha_str = substr($random_str, 0, CAPTCHA_STRENGTH);// Allocate new image$width = (CAPTCHA_STRENGTH * 10)+10;$height = 20;$captcha_img =ImageCreate($width, $height);// ALLOCATE COLORS// Background color-black$back_color = ImageColorAllocate($captcha_img, 0, 0, 0);// Text color-white$text_color = ImageColorAllocate($captcha_img, 255, 255, 255);// Line color-red$line_color = ImageColorAllocate($captcha_img, 255, 0, 0);/***************************** DRAW BACKGROUND & ** LINES *****************************/// Fill background colorImageFill($captcha_img, 0, 0, $back_color);// Draw lines accross the x-axisfor($i = 0; $i < $width; $i += 5)ImageLine($captcha_img, $i, 0, $i, 20, $line_color);// Draw lines accross the y-axisfor($i = 0; $i < 20; $i += 5)ImageLine($captcha_img, 0, $i, $width, $i , $line_color);/***************************** DRAW AND OUTPUT ** IMAGE *****************************/// Draw the random stringImageString($captcha_img, 5, 5, 2, $captcha_str, $text_color);// Carry the data (KEY) through session$_SESSION['key'] = $captcha_str;// Send data typeheader("Content-type: image/jpeg");// Output image to browserImageJPEG($captcha_img);// Free-Up resourcesImageDestroy($captcha_img);