
Is there any FREE guide or lesson how to write a robot?

 
sddreamweavers

posts: 260

Oct 25, 2007 6:38 PM ET

Irwan,

Please check out http://www.robotstxt.org/wc/norobots.html. This is the standard for writing and consuming robots.txt files, and the page also provides several good examples.

The robots.txt file is basically an instruction set that tells web crawlers what they should and shouldn't crawl on your site.
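To see how a crawler might consume those instructions, here is a minimal sketch using Python's standard `urllib.robotparser` module. The sample rules, the `MyCrawler` and `BadBot` agent names, the example.com URLs, and the helper names `build_parser` and `is_allowed` are all made up for illustration; real crawlers may interpret edge cases differently.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /tmp/

User-agent: BadBot
Disallow: /
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access needed)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    parser = build_parser(SAMPLE_ROBOTS_TXT)
    checks = [
        ("MyCrawler", "http://example.com/index.html"),
        ("MyCrawler", "http://example.com/private/page.html"),
        ("BadBot", "http://example.com/index.html"),
    ]
    for agent, url in checks:
        verdict = "allowed" if is_allowed(parser, agent, url) else "blocked"
        print(f"{agent:10} {url:40} {verdict}")
```

Keep in mind that robots.txt is only a request: well-behaved crawlers honor it, but nothing forces a crawler to do so.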

Some sites provide free online wizards for this task. This is a good one: http://www.mcanerin.com/EN/search-engine/robots-txt.asp

Enjoy,

David



Second that. If you want to look at the robots.txt file of a heavily used site, check out Wikipedia's robots.txt file.

As for the Google/Yahoo sitemap generator, try GSiteCrawler. This sucker will crawl a given website and create sitemaps for Google and Yahoo.
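For a rough idea of what such a generated sitemap contains, here is a small sketch that writes a bare-bones XML sitemap from a list of URLs. It does no crawling and is not how GSiteCrawler itself works; the `build_sitemap` helper and the example.com URLs are assumptions for illustration only. The namespace is the one published by the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a minimal sitemap document (as a string) listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        url_element = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as "&" when serializing.
        ET.SubElement(url_element, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical page list; a real generator would discover these by crawling.
    pages = [
        "http://www.example.com/",
        "http://www.example.com/about.html",
    ]
    print(build_sitemap(pages))
```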




-------------------------

Aaron Wood
CEO
San Diego Dream Weavers
http://www.sddreamweavers.com
awood@sddreamweavers.com

New and improved! Now with blogging goodness!
http://www.sddreamweavers.com/san-diego-seo-marketing-blog/
abdelrahman80

posts: 8

Dec 06, 2007 7:57 AM ET
The ROBOTS meta tag is used to block search engines from indexing pages, but many web authors use it to tell search engines to index a page. Here is an example:
<META NAME="robots" CONTENT="ALL">

This tag is a waste of time. If a search engine finds your page and wants to index it, and hasn't been blocked from doing so, it will. And if it doesn't want to index a page, it won't. Telling the search engine to index the page doesn't make a difference.

Here is a Google-specific meta tag that you can use in a couple of ways. Here's one example:
<META NAME="googlebot" CONTENT="nosnippet">

This meta tag tells Google not to show a snippet for this page, the piece of text it grabs from within a web page to use as the description in search results. Here is another example, which tells Google not to offer a cached copy of the page:
<META NAME="googlebot" CONTENT="noarchive">

Using the ROBOTS meta tag or the robots.txt file, you can tell the search engines to stay away. The meta tag looks like this:

<META NAME="robots" CONTENT="noindex, nofollow">
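To show how a crawler might read these tags, here is a rough sketch using Python's standard `html.parser` module. The `RobotsMetaParser` class and the `read_robots_meta` and `may_index` helpers are names invented for this example, and the logic is a simplification: it only looks at the `robots` and `googlebot` meta names and only checks for `noindex`.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> and <meta name="googlebot">."""

    WATCHED_NAMES = ("robots", "googlebot")

    def __init__(self):
        super().__init__()
        # Maps meta name to a set of directives, e.g. {"robots": {"noindex"}}.
        self.directives = {}

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta":
            return
        attr_map = {key: (value or "") for key, value in attrs}
        name = attr_map.get("name", "").strip().lower()
        if name not in self.WATCHED_NAMES:
            return
        content = attr_map.get("content", "")
        values = {part.strip().lower() for part in content.split(",") if part.strip()}
        self.directives.setdefault(name, set()).update(values)


def read_robots_meta(html: str) -> dict:
    """Return the robots-related meta directives found in an HTML string."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    return parser.directives


def may_index(directives: dict, bot: str = "googlebot") -> bool:
    """Simplified check: indexing is allowed unless 'noindex' applies to this bot."""
    combined = directives.get("robots", set()) | directives.get(bot, set())
    return "noindex" not in combined


if __name__ == "__main__":
    page = '<html><head><META NAME="robots" CONTENT="noindex, nofollow"></head></html>'
    found = read_robots_meta(page)
    print(found, "-> may index:", may_index(found))
```

Note that a crawler has to fetch the page before it can see a meta tag, whereas robots.txt is checked before the page is requested at all.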



-------------------------

AbdelRahman
http://trackthatad.com/?i=137822/f-pif4pabdou-startupnation