Google.com

KISS_Racing · November 16, 2006

Has anyone else noticed that Google.com is logged in as a User?

Rosie · November 16, 2006

Strange...

**NickHolt** · November 17, 2006

Go through the member list. Lot's of strange names there.

Nick

abrungot · November 17, 2006

Google pings many sites to index the content.. to update their search engines.

direct-flo · November 17, 2006

a bot

tps48 · November 17, 2006

Maybe not a bot. I saw on user list Google.com contacting member via e-mail.

KISS_Racing · November 18, 2006

here is what i found interesting and why i asked. the Google.com is NOT underlined. it is almost like it's part of the New forum software.

oh well.....

on edit: the mystery continues. i looked as Nick suggested, Google.com isn't a member!

**NickHolt** · November 18, 2006

Kiss,

When you use a search engine, such as Google.Com, how do you suppose the information that you see presented to you all nice and neat got there? Google.com visits your website too - along with every other website it can locate.

The difference is that now you can see all the bots as well as all the other "guests" that visit TXSZ, whereas with the old software the bots were not showing up on the list of people online.

Nick

KISS_Racing · November 19, 2006

nick,

thanks for educating me on what goes on behind the scenes and what we aren't used to seeing. i can now add this to my resume and next time someone asks, i can act like i know what i'm talking about...

folks mystery solved thanks to nick...

oh nick, i always thought that a website owner/company had to pay somone to do submissions for them to search engines like google, lycos, dog pile and etc.... see i did learn something! thanks again.....

Crazyhorse · November 19, 2006

i'm sorry , i'm coming into the conversation late (and i'm illterate).............what the hell is a bot and should i be concerned?

kn1ghtblade · November 19, 2006

Just a little tid bit of info, I just did a test, and now when searching on google for texas auto racing this board is the very first link like it shows, and when just searching for Texas Racing its the first one on the 3rd page mainly due to all the horse and dog racing sites.... I know before it was alot further back in the search engine...

**Budman** · February 23, 2007

Well whoever he is, he don't never post nothin' on here. I'm pretty sure I caught a glimpse of him tryin' to sneak a peek over my firewall the other day. I wouldn't trust him if I were you! :lol: :lol:

CEL73 · February 23, 2007

I hate it when people try to peak over my firewall!!! <_<

KISS_Racing · February 23, 2007

this is one of the only forums I don't get port scans from.

**NickHolt** · February 23, 2007

this is one of the only forums I don't get port scans from.

That's because I hand screen every registration. I get between 10 and 25 registrations a day that I deny. Takes about 5 minutes to screen the IP address, look up the host, search various PHP sites for a match to the registration and run the screen name and email addy through the various search engines.

I also require email responses from most new registrations.

You will also notice that (knock on wood) that we have no problem with spammers for the same reason.

Nick

Rookie49 · February 24, 2007

Good Job Nick. Now I know why this site is so clean. Now if you can just get a spell checker for some of us.

jakdad · December 2, 2007

I didn't know that either. Thanks Again Nick!!

**NickHolt** · December 2, 2007

I didn't know that either. Thanks Again Nick!!

Unfortunately, the number of spammers trying to register at TXSZ has increased dramatically over the past couple of months. Apparently they're not happy with just messing up your email. They're moving on to where they can get multiple reads instead of just one at a time.

Here's one of the tools I use to check IP addresses.

http://www.fspamlist.com/files/export.txt

Nick

jakdad · December 4, 2007

Man, that list is endless! Being a computer illiterate, I truly appreciate anyone that can copy & paste.

:rolleyes:

WIKKED-RACING · May 14, 2008

To who has no computer skills at how google gets it's information its whats called a spyder. Excuse the coding at the bottom .. You wont understand it if you dont know how to code.

Spiders and robots are programs that browse the web automatically, usually for gathering and indexing links or other information.

XML and its grandparent SGML are attempts to instill meaningful order into information. With them, single documents become leaves of databases. A collection of pages can be displayed as HTML easily through conversion or used for indexed searching or even generating entirely new documents.

The Internet has always been full of data, just never with any real meta-organization. You can think of the Internet itself as the single most important database in existence, but without it all being in a formatted language like XML or some other rigid scheme, it’s not a valuable database. Information without order, indices and strong categorization, reduces quickly to noise.

The real value of the Internet is found in its surfeit of plain text, no offense to the porn industry. The one arena where no one debates the supremacy of Perl is text parsing and manipulating. Therefore, it’s no real stretch to set some Perl loose on the Internet, with the right instructions, and find the value in that great unkeyed DB.

So let’s do something really valuable with the WWW! Let’s find a celebrity’s birthday. We’ll pick Jimmy Page to dull the irony somewhat. We are using simple regexes to check for birthdays. Much better ones could be crafted for serious applications.

Code

#!/usr/bin/perl

use strict;

use warnings;

#---------------------------------------------------------------------

use WWW::Spyder; # our crawler

use URI::Escape; # to properly escape our query for the search engine

#---------------------------------------------------------------------

@ARGV == 2 or usage();

my $spyder = WWW::Spyder->new(sleep_base => 20,

exit_on => { pages => 30,

time => '1min'});

my $name = join(' ',@ARGV);

$spyder->terms($name, qr/birthdays?/i);

$spyder->seed( 'http://www.google.com/search?q=' .

uri_escape(qq{"$name"}) );

my $bday;

while ( my $page = $spyder->crawl ) {

print "Check-->> ", $page->url, "\n";

# try to extract the birthday here

( $bday ) = $page->text =~

m,$name\s+was born on ([^.]+\d\d+),sio;

last if $bday;

( $bday ) = $page->text =~

m,$name\'s\s+birthday is ([^.]+\d\d+),sio;

last if $bday;

}

if ( $bday ) {

print "\n ${name}'s birthday seems to be: $bday\n\n";

} else {

print "\n Sorry, couldn't find ${name}'s birthday quickly.\n\n";

}

exit 0;

#=====================================================================

sub usage {

my ( $tool ) = $0 =~ m,([^\/]+)$,;

die <<KettleChips;

----------------------------------------------------------------------

USAGE:

$tool [Proper Name]

I will try to find the birthday of someone famous if you will please

give me his/her name. I can only do two word names right now.

----------------------------------------------------------------------

KettleChips

}

#=====================================================================

Usage

jinx[96]>spyder-birthday Jimmy Page

Output

Check-->> http://www.google.com/search?q=%22Jimmy%20Page%22

Check-->> http://www.led-zeppelin.com/

Check-->>

http://directory.google.com/Top/Arts/Music/Bands_and...

Check-->> http://home.earthlink.net/~juliannwh/

Jimmy Page's birthday seems to be: January 9, 1944

Google.com

Recommended Posts

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Archived