Go to Making Light's front page.
Forward to next post: Hard-eyed enforcers of empty-minded clichés
Subscribe (via RSS) to this post's comment thread. (What does this mean? Here's a quick introduction.)
Search strings Google doesn’t like.
Well, that list is an education in itself.
I'm reassured that apparently, I HAVE pretty much seen it all.
Some days, I cry for the state of the world.
Today, I seem to be fiddling while Rome burns. What a lovely light.
Is the capitalization significant? Why was "marijuana" listed in all lower-case on the non-blacklist, but as "marijuanA" on the blacklist?
According to the note at the beginning of the list, the capital letter marks where in the process of typing the search term you trigger the reaction.
So you type m-a-r-i-j-u-a-n and have results being built up. Add that last a, and the page goes blank until you hit enter to confirm you really do want results for the Demon Weed.
I sent in to them that "buggerY" cuts off, and I guess they haven't gotten around to adding it. Club Buggery, an Australian comedy show, is not something one can get suggested even though it's got a lot of hits.
abi @ 4: Huh. I actually looked for an explanatory note a few times before posting that question, but never saw one. Now I see that I completely skipped over the line under the bolded title.
I fail at webpage-skimming. Oh well.
Tom Whitmore @ 5
Well, bugger me with a toasting fork.
(hmmm. On further consideration, please don't.)
Funny thing: when you type the final "a" in "marijuana", the suggestions go blank, but if you type a space after that, the suggestions come back.
I haven't checked to see if that's the case with anything of the other "banned" words, but it makes me wonder who the heck Google thinks they're fooling.
My employer's web filters feel this is a no-no site.
Sigh. "...with any of the other banned words." I used to be able to proofread my messages; I don't know what has happened.
I wrote about some of this behaviour when they launched Instant, in particular, that Irina Slutsky disappears:
However, the modern-day Bowdlers at Google don't white you out based on what you type, but on what they predict you're going to type.If I type 'blue-footed' - it predicts I'm typing 'blue-foooted booby' and as 'boobies' is an Official Google Smutty Word, my search goes white (in fact 'blue-foo' is enough).Similarly, typing 'turn again d' implies 'turn again Dick Whittington', and 'dick' is a an Official Google Smutty Word.The same is true for Irina -so shocking is her last name that all you have to type is 'irina sl' and the Google whiteout erases her from results.Weirdly, if you type 'who killed cock' it is completed to 'who killed cock-robin' with a hyphen inserted, which implies someone has edited the auto-complete list manually.
However, the modern-day Bowdlers at Google don't white you out based on what you type, but on what they predict you're going to type.
If I type 'blue-footed' - it predicts I'm typing 'blue-foooted booby' and as 'boobies' is an Official Google Smutty Word, my search goes white (in fact 'blue-foo' is enough).
Similarly, typing 'turn again d' implies 'turn again Dick Whittington', and 'dick' is a an Official Google Smutty Word.
The same is true for Irina -so shocking is her last name that all you have to type is 'irina sl' and the Google whiteout erases her from results.
Weirdly, if you type 'who killed cock' it is completed to 'who killed cock-robin' with a hyphen inserted, which implies someone has edited the auto-complete list manually.
Also the more subtle issues around Google predicting what you are going to write next, as warned of by Frayn and Orwell
Serge @9: I'm not surprised. 2600.com is well known for distributing information of dubious legality (e.g. they're famous for distributing the potentially-DMCA-violating DeCSS in the early days). Probably blocked from an "access to this site opens up potential legal issues" perspective.
From the list: philip kindRed dick
WTF? Just confirmed this, it really does blank out after philip kindr. philip k dick works just fine though. Just kindred dicK also triggers the filter, although not until it's finished. dick by itself doesn't.
Jules #12: yes, that's like my 'blue-foo' example; as soon as the projected phrase includes an Official Google Naughty Word™ then it all goes white.
I guess the obvious conclusion to come to is that whoever set up the search string filter thinks that "philip kindred dick" is filthier than "philip k dick", but isn't more likely that the string "philip kindred dick" just isn't common enough to trigger any quicksearch options?
NelC @14 — I don't know if that's just supposed to be a joke, but my best guess on the topic is that "philip k dick" completes to
philip k dick's valisphilip k dick's best novelphilip k dick's exegesisphilip k dick's the man in the high castlephilip k dick'
I find this list utterly unsurprising.
It's not a list of search terms google will blacklist.
It's a list of search completions that google won't throw in the face of someone who's begun typing them, among other things.
You can still search for this stuff; it just won't autocomplete on you ... or on J. Random Bluenose of Colorado Springs, who will have a hissy fit at the word "Breast" coming up in when Little Jimmy started to type in "Brest-Litovsk" for his history assignment and therefore try to have Google banned from schools.
I'm much more annoyed that Microsoft Word 2008 for the Mac's British English spelling checker dictionary contains American mis-spellings of common British English words (and still doesn't include "cunt", "fuck", or any similar useful punctuation/wildcard particles from routine Scottish conversation).
How strange. Oh well, I know where on the web to find most of the stuff that Google does not want to admit exists.
The list is, however, pretty appalling.
I think Charlie has the right of it, and the sense of, "censoring" is an artifact of seeing things "disappear". It's not that google doesn't want us to see things (I am not sure that "google" qua google cares).
It's that people who don't want to see (or whom some commercially dangerous segment of the population thinks need protecting, i.e. children; though some of it is nanny statism) are being protected from inadvertent offense.
Which is a different philosophical issue altogether.
Like, not cool, man, you know?
URL spam ahoy at #20
...except now that the spam at #20 has been deleted, my spam alert post now bears the number #20 and calls for its own removal.
Time paradox approaching!
blah, blah, blah
Comments containing more than seven URLs will be held for approval. If you want to comment on a thread that's been closed, please post to the most recent "Open Thread" discussion.
You can subscribe (via RSS) to this particular comment thread. (If this option is baffling, here's a quick introduction.)
<strong>Strong</strong> = Strong
<em>Emphasized</em> = Emphasized
<a href="http://www.url.com">Linked text</a> = Linked text
Tolkien. Minuscule. Gandhi. Millennium. Delany. Embarrassment. Publishers Weekly. Occurrence. Asimov. Weird. Connoisseur. Accommodate. Hierarchy. Deity. Etiquette. Pharaoh. Teresa. Its. Macdonald. Nielsen Hayden. It's. Fluorosphere. Barack. More here.
(You must preview before posting.)