Scunthorpe problem
From Wikipedia, the free encyclopedia
The Scunthorpe problem occurs when a spam filter or search engine blocks e-mails or search results because their text contains a string of letters that are shared by an obscene word. While computers can easily identify strings of text within a document, broad blocking rules will result in a false positive, causing innocent phrases to be blocked.
Contents |
[edit] Origin and history
The problem was named after an incident in 1996 in which America Online's dirty-word filter prevented residents from the town of Scunthorpe, North Lincolnshire, England from creating accounts with AOL, because the town's name contains the substring cunt. Years later, Google's filters apparently made the same mistake, preventing residents from searching for local businesses that included Scunthorpe in their names. Residents of Penistone, South Yorkshire, experienced problems because the town's name includes the substring penis, while Lightwater in Surrey suffered similarly because its name contains the substring twat. Gareth Roelofse, the designer of the website RomansInSussex.co.uk, noted: "We found many library Net stations, school networks and Internet cafes block sites with the word 'sex' in" the domain name. This was a challenge for RomansInSussex.co.uk because its target audience is school children." [1]
Problems can also occur with the words socialism or socialist because they contain the substring Cialis, the brand name for an erectile dysfunction medication commonly advertised in spam e-mails. A similar problem can occur with the word specialist, typically blocking mailed résumés and other material including job descriptions.[2]
[edit] Other examples
Mistaken decisions by obscenity filters include:
- In April 1998 Jeff Gold attempted to register the domain name
shitakemushrooms.com
, but was blocked by a InterNIC filter prohibiting the "seven dirty words" which was active between 1996 and the transfer of control to ICANN.[3] - In October 2004, it was reported that the Horniman Museum in London was failing to receive some of its e-mail due to filters mistakenly deciding that its name was a version of the words horny man.[4][5]
- In February 2004, Craig Cockburn of Scotland reported that he was unable to use his surname (pronounced "Coburn") with Hotmail, Yahoo! or his workplace computers. He discovered that his e-mails would be delivered if he spelled his name C0ckburn (with a zero instead of the letter "o").[6]
- Web searches on the Nigerian-born footballer Nwankwo Kanu may be blocked due the substring wank in his name.[citation needed]
- In February 2006 Linda Callahan, a resident of Ashfield, Massachusetts was initially prevented from registering her name with Yahoo! as an e-mail address as it contained the substring Allah. Yahoo! later reversed the ban.[7]
- The filter of the free wireless service of the town of Whakatane in New Zealand blocked searches involving the town's own name, because the phonetic analysis used by the filter deemed the "whak" to sound like fuck.[8]
- In July 2008, Dr. Herman I. Libshitz was initially unable to get the e-mail address he wanted from Verizon because it contained the substring "shit". A spokesperson commented: "As a general rule (since 2005) Verizon doesn't allow questionable language in e-mail addresses, but we can, and do, make exceptions based on reasonable requests. The one from Dr. and Mrs. Libshitz certainly is reasonable and we regret the inconvenience and frustration they've been caused." [9]
Additionally there can be cases of whole word filtering, showing that computer obscenity filters may fail to take into account the context in which a word appears:
- In May 2006 Ray Kennedy from Manchester in the UK found that e-mails that he had written to his local council to complain about a planning application had been blocked as they contained the word erection when referring to a structure.[10]
- In October 2004, e-mails advertising the pantomime Dick Whittington sent by a teacher from Norwich in the UK were blocked by school computers due to the use of the word Dick.[11]
- In February 2003 Members of Parliament at the British House of Commons found that a new spam filter was blocking e-mails containing references to the Sexual Offences Bill then under debate, as well as some messages relating to a discussion about censorship.[12]
- Résumés of magna cum laude graduates have been blocked by spam filters due to inclusion of the word cum.[13]
- In June 2008, a news site run by the American Family Association censored an Associated Press article on sprinter Tyson Gay, replacing instances of "gay" with homosexual, thus rendering his name as "Tyson Homosexual".[14]
[edit] See also
[edit] References
- ^ Google's chastity belt too tight CNET article on the Scunthorpe Problem.
- ^ http://www.pekingduck.org/archives/002371.php Example of a website that blocks the substring Cialis.[dead link]
- ^ Paul Festa (April 27, 1998). "Food domain found "obscene"". News.com. http://www.news.com/2100-1023-210566.html.
- ^ Name Of Museum Is Confused With Porn (from News Shopper)
- ^ Horniman Museum
- ^ Print Article: How Mr C0ckburn fought spam
- ^ Kallahar's Place: Is Yahoo Banning Allah?
- ^ "F-word town censors itself on the internet". http://www.news.com.au/technology/story/0,25642,24112585-5014108,00.html. Retrieved on 2008-08-01.
- ^ "When your name gets turned against you". http://www.philly.com/philly/hp/news_update/26089374.html. Retrieved on 2008-08-03.
- ^ BBC E-mail filter blocks 'erection' 30 May 2006
- ^ Sam Jones Panto email falls foul of filth filter The Guardian 14 October 2004
- ^ BBC E-mail vetting blocks MPs' sex debate 4 February 2003
- ^ Maher, Kris. "Don't Let Spam Filters Snatch Your Resume". Career Journal. http://www.collegejournal.com/jobhunting/resumeadvice/20040426-maher.html. Retrieved on 2008-02-11.
- ^ Frauenfelder, Mark (June 30th, 2008). "Homophobic news site changes athlete Tyson Gay to Tyson Homosexual". BoingBoing. http://boingboing.net/2008/06/30/homophobic-news-site.html. Retrieved on 2008-12-22.
[edit] External links
- Article on autocorrecting from The Guardian
- Comment Headaches: article from pekingduck.org