Register | Login
Views: 19364387
Main | Memberlist | Active users | ACS | Commons | Calendar | Online users
Ranks | FAQ | Color Chart | Photo album | IRC Chat
11-02-05 12:59 PM
2 users currently in General Chat: Ailure, Dark Vampriel | 1 guest
Acmlm's Board - I2 Archive - General Chat - idea: a search blacklist | |
Add to favorites | "RSS" Feed | Next newer thread | Next older thread
User Post
Mutation

Buzzy Beetle
Level: 30

Posts: 264/389
EXP: 154861
For next: 11008

Since: 09-21-04

Since last post: 22 days
Last activity: 5 min.
Posted on 11-22-04 10:44 AM Link | Quote
You know how some sites do various things (e.g. putting gibberish strings of text within page content) so that they will be more likely to show up in the lists of results you get from search engines and stuff? Well, I think search engines should have some sort of "blacklist" for those sites; that way, they wouldn't ever be included among search results unless you specifically told the search engine to include them.
Discuss.


(edited by Mutation on 11-22-04 01:45 AM)
(edited by Mutation on 11-22-04 01:51 AM)
Karadur

Fire Snake



Level: 48

Posts: 124/1192
EXP: 786444
For next: 37099

Since: 11-02-04
From: Chatham, Ontario, Canada

Since last post: 1 day
Last activity: 15 hours
Posted on 11-22-04 11:02 AM Link | Quote
Have a frustrating experience with a search engine lately, Mutation? It sounds like it from your post I have to agree though. I remember reading something on bash.org, where someone did a search for 'man pages' (apparently it was for Linux stuff), and they got a bunch of prOn results

Even still sometimes today, when I can't find a ROM on the sites I usually look on, I do a google search, and half of the time there's a completely unrelated result. The ones I dislike the most are when you can tell just by looking at the url that's displayed on google that it's not what you're looking for Something like looking for a ROM, and a url that's displayed looks like 'http://whatever-you-want-here.com/theromyouwant.php'. Obviously that's not a URL that'd be returned in a search, but it gets the point across, I think.

Having a blacklist for those sites is a fine idea, but what would happen when a site got put on that list, and someone came along and needed information from it? Unless you think there should be different blacklists for all sorts of categories, and sites would be put into those respectively. That's a good idea as well, but I can't imagine all the different categories that'd need to be made.

I don't know if you know this yet or not, but you can make the search engine look for exactly the text you typed in by putting quote marks around it. That works quite well when you know exactly what you're looking for, but can't find anything with the 'normal' search.

I think a better idea would be to make the search engines just overlook long strings of random text in a page's content. Of course, I don't run a search engine, or know anyone who does
Xkeeper
The required libraries have not been defined.
Level: NAN

Posts: -3059/-863
EXP: NAN
For next: 0

Since: 03-15-04

Since last post: 2 hours
Last activity: -753366 sec.
Posted on 11-22-04 11:27 AM Link | Quote
It's called "keyword stuffing", and search engines tend to catch on quickly...

Though something bigger like "Google bombing" is getting worse (try a google serach for both of these)... it's why searches like "Miserable fKitten Yiffer" give GWB's homepage.
Kitten Yiffer

Purple wand
Furry moderator
Vivent l'exp����¯�¿�½������©rience de signalisation d'amusement, ou bien !
Level: 135

Posts: 6231/11162
EXP: 28824106
For next: 510899

Since: 03-15-04
From: Sweden

Since last post: 3 hours
Last activity: 4 min.
Posted on 11-22-04 11:50 AM Link | Quote
Half of the time I don't mind Google bombing, as long it's purpose isn't advertising.

And well, there is something I call crap porn sites which like to come up at ROM searches. I wish google could check the words the site use and if there is lack of verbs and punctation... then that site should get low rating.
Karadur

Fire Snake



Level: 48

Posts: 125/1192
EXP: 786444
For next: 37099

Since: 11-02-04
From: Chatham, Ontario, Canada

Since last post: 1 day
Last activity: 15 hours
Posted on 11-22-04 11:51 AM Link | Quote
The results you get from searching for 'Miserable fKitten Yiffer' have to be the funniest thing I've seen all day (or by now, morning )

I read the article over at this site, and I have to say, that's quite informative, but if I read the word 'blog' once more, I'm going to go crazy. I never thought that just by writing in a blog that I might be contributing to something like this Hah. They even have an All Your Base thing down at the bottom

The thing is, with the blogrolling thing they talk about on that page, short of ignoring all blog sites' front pages, how are they going to stop it?

Fortunately, I've not been 'victim' to very many google bombs yet. I ran across a couple back in high school, I remember, but I did'nt know what they were actually called. All I knew was that it was quite strange to see a result that had nothing to do with my search in the first page
Gavin

Fuzzy
Rhinoceruses don't play games. They fucking charge your ass.
Level: 43

Posts: 315/799
EXP: 551711
For next: 13335

Since: 03-15-04
From: IL, USA

Since last post: 13 hours
Last activity: 13 hours
Posted on 11-22-04 12:52 PM Link | Quote
most sophisticated search engines are able to check the Keyword Density and prevent pages that oversaturate from being returned high on the list. From what i recall, i think somewhere between 3-7% is a nice range for search engines to pick up on.
Ran-chan

Moldorm
eek, when are they going to stop growing...
Level: 143

Posts: 5330/12781
EXP: 35293588
For next: 538220

Since: 03-15-04
From: Nerima District, Tokyo - Japan

Since last post: 12 hours
Last activity: 12 hours
Posted on 11-22-04 03:22 PM Link | Quote
You can read about google bombing on this site: http://www.wordspy.com/words/Googlebombing.asp

I don
HyperLamer
<||bass> and this was the soloution i thought of that was guarinteed to piss off the greatest amount of people

Sesshomaru
Tamaranian

Level: 118

Posts: 2128/8210
EXP: 18171887
For next: 211027

Since: 03-15-04
From: Canada, w00t!
LOL FAD

Since last post: 2 hours
Last activity: 2 hours
Posted on 11-22-04 10:46 PM Link | Quote
Ugh. This is Google's #2 problem. I've seen quite a few that aren't just lists of words, but actually lists of links all leading to the same page but with different URLs. They're getting smarter, too... Some are actually making lists that look like real sentences. Instead of something like "n64 gamecube ps2 xbox rom gba roms" they say things like "the n64 roms for ps2 are the gba xbox"... things that still make no sense, but look like they do at first glance. The links are also interesting and they make me wonder if it's just a few people doing it... They come up with stuff like "www.download-free-games.com/discount-viagra.htm", and other links that would look legitimate if not that the page's name has NOTHING to do with the site's. (And on EVERY one of them, each word is separated by a dash like that. )

I think if nothing else, Google needs to have a system where you sign up or something and have a list of pages to filter from search results, with easy 'block' links next to each results. (Experts-exchange.com needs to die. )

But, still - the number one problem with Google is that its image search needs to check if the image actually [insert swear word here] exists! I am SO sick of finding just the right picture only to click it and get a 404 page!
Ran-chan

Moldorm
eek, when are they going to stop growing...
Level: 143

Posts: 5340/12781
EXP: 35293588
For next: 538220

Since: 03-15-04
From: Nerima District, Tokyo - Japan

Since last post: 12 hours
Last activity: 12 hours
Posted on 11-22-04 10:59 PM Link | Quote
I
Karadur

Fire Snake



Level: 48

Posts: 130/1192
EXP: 786444
For next: 37099

Since: 11-02-04
From: Chatham, Ontario, Canada

Since last post: 1 day
Last activity: 15 hours
Posted on 11-22-04 11:10 PM Link | Quote
I'll be the third person to say I've noticed that problem then. Like Trapster, I might've needed a picture for a school project. Went to do a google image search, and I found the *perfect* one. Click the picture to try to get the full version, and get a 404 Very annoying.

HyperHacker; I know what you mean when you say experts-exchange needs to die In fact, just today I was looking for a free alternative to ISOBuster, because I've got a CD I burned a while ago that I can no longer read anymore Wouldn't you know it, one of the results turned up was this, a page at experts-exchange Funny thing is, look down on that page underneath the 'sign up to see this solution link, and see the 3 ads. The bottom one offers a link to buy scratched CDs on eBay I managed to get the stuff off the CD I wanted though, so that's good there

Originally posted by HyperHacker
And on EVERY one of them, each word is separated by a dash like that.



At least it makes it easier to know which ones to ignore when you're skimming through the results
paragon
Newcomer
Level: 4

Posts: 2/5
EXP: 272
For next: 7

Since: 03-17-04

Since last post: 107 days
Last activity: 2 days
Posted on 11-23-04 01:38 AM Link | Quote
They really just need to have some form of a rating system on top of the pagerank. This takes a fair bit more work to implement but would result in higher quality of results. Just throw a little "rate" link or something beside each search result. If it's a good resource for what you searched for, then rate it up, if it was crap or completely unrelated then rate it down. From that and the search terms you used, certain results would be highlighted as either exceptional or unexceptional. The latter could be punished by having their pagerank lowered and the former have their pagerank increased.

Naturally this would require honesty from users and would no doubtedly be abused so it's not too practical, but would be a nice solution to the problem.

Unfortunately you'll never do away with pages like that really. People will always find a way around today's search engine techniques and flood it with their own advertisments, etc.
Add to favorites | "RSS" Feed | Next newer thread | Next older thread
Acmlm's Board - I2 Archive - General Chat - idea: a search blacklist | |


ABII


AcmlmBoard vl.ol (11-01-05)
© 2000-2005 Acmlm, Emuz, et al



Page rendered in 0.018 seconds.