31 Step Success Building Starting with a PENNY
...

 

DAY 131

Update: Monday, 22-Aug-2005 11:52 AM

I am sure that by now you can see from the lack of postings that I have hit a "rough patch" in my work to keep doubling the pennies.

Actually, the period after the failure of my project maxsimo.com, which followed several programming, management and financial errors, just ended up draining me personally.

On top of that, I kept adding work onto what I am doing here, and in the end all of it 'exploded' on me. The whole project and MY LIFE entered a very sour two months.

But what to do? Just doing nothing will make it even worse, so what I elected to do and am doing now is to cut costs and re-tool all of my work.

Today I just wanted to get started and get my momentum moving again. That is all I wanted to get done today.

I got my work going by following the advice of the article below, and now I will expand that to a niche site that I created to build on the idea of business success, which I aimed to do by building a site that sold a product or service. When I first started on the idea, I decided to set up a part of p31.com and sell a membership to produce the revenue.

After some research I found out that I didn't have the traffic necessary to generate such revenue from this site. I quickly created http://www.businesscreditbuilding.com from all the research that I did while building my own business credit.

Frankly, I have been lost at best. I kept making pages and pages of plans, but all of that kept changing, and I have had a 'devil of a time' getting the plan to move forward in a straight line.

Let me give an example: I know that running a good-sized mailing list with ad placement can generate good revenue. But then I run into questions: a mailing list for which site? A mailing list with what content? I already have the software to run the mailing list and I have put up several boxes for subscribers. However, I have not moved beyond that.

There are some decisions that I obviously must make:
1. I have to go through the entire site and make sure 100% of the links work. I have linked to several outside articles and now the articles have been moved and the links don't work. That is a poor reflection of the site and my work.

2. I successfully moved to this new page format; however, I still have some clean up to do on the pages. While making sure that links work, I need to make sure that the pages are clean and the content is easy to read and makes sense.

3. Lastly, I need to make sure that I execute the plan that I come up with for business success.

Over the past two months, I have been a mess, and the planning and execution of the success I was building took a back seat.

NOW I need to get back on that path and make sure that I make it.

Today I got going again. I didn't know where to get re-started. So, I just grabbed the article below and I just started working.

I just wanted to do something to get going again...

Here is the article. I will now concentrate on the plan and the sites that I am working on, and build from there. In the two months since I "got lost" and had to re-tool, there have been some positive developments. Those too I will discuss in the coming days.

Search Engine Spiders Lost
Without Guidance - Post This Sign!

Robots.txt Signpost Warns Trespassers From Private Property, By Mike Banks Valentine (c) 2005

The robots.txt file is part of the robots exclusion standard. Well-behaved web crawlers/robots request it so that you can tell them which files and directories you want them to stay OUT of on your site. Not all crawlers/bots follow the exclusion standard, and some will continue crawling your site anyway. I like to call them "Bad Bots" or trespassers. We block them by IP exclusion, which is another story entirely.

This is a very simple overview of robots.txt basics for webmasters. For a complete and thorough lesson, visit Robotstxt.org.

To see the proper format for a somewhat standard robots.txt file look directly below. That file should be at the root of the domain because that is where the crawlers expect it to be, not in some secondary directory.

Below is the proper format for a robots.txt file ----->

User-agent: *
Disallow: /cgi-bin/
Disallow: /images/
Disallow: /group/

User-agent: msnbot
Crawl-delay: 10

User-agent: Teoma
Crawl-delay: 10

User-agent: Slurp
Crawl-delay: 10

User-agent: aipbot
Disallow: /

User-agent: BecomeBot
Disallow: /

User-agent: psbot
Disallow: /

--------> End of robots.txt file

This tiny text file is saved as a plain text document and ALWAYS with the name "robots.txt" in the root of your domain.


A quick review of the listed information from the robots.txt file above follows. The "User-agent: msnbot" entry is for the crawler from MSN, Slurp is from Yahoo and Teoma is from AskJeeves. The others listed are "Bad" bots that crawl very fast and to nobody's benefit but their own, so we ask them to stay out entirely. The * asterisk is a wild card that means "All" crawlers/spiders/bots should stay out of the group of files or directories listed under it.

Bots given the instruction "Disallow: /" should stay out entirely, and those given "Crawl-delay: 10" are the ones that crawled our site too quickly, causing it to bog down and overuse server resources. Google crawls more slowly than the others and doesn't require that instruction, so it is not specifically listed in the above robots.txt file. The Crawl-delay instruction is only needed on very large sites with hundreds or thousands of pages. The wildcard asterisk * covers every crawler, bot and spider that has no group of its own in the file, including Googlebot.
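
As a rough way to check how one parser reads the file shown above, the sketch below feeds it to Python's standard-library urllib.robotparser. The example.com address is a placeholder and not part of the article, and real crawlers may interpret the file differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt file from the article, reproduced as text.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /images/
Disallow: /group/

User-agent: msnbot
Crawl-delay: 10

User-agent: Teoma
Crawl-delay: 10

User-agent: Slurp
Crawl-delay: 10

User-agent: aipbot
Disallow: /

User-agent: BecomeBot
Disallow: /

User-agent: psbot
Disallow: /
"""


def load_rules(text):
    """Parse robots.txt text and return a parser that is ready to query."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)
SITE = "http://www.example.com"  # placeholder domain (assumption)

# Googlebot has no group of its own, so the * group applies to it.
print(rules.can_fetch("Googlebot", SITE + "/cgi-bin/form.cgi"))
print(rules.can_fetch("Googlebot", SITE + "/index.html"))
# psbot is told to stay out of the whole site.
print(rules.can_fetch("psbot", SITE + "/index.html"))
# Crawl-delay is reported in seconds, or None when no delay is set.
print(rules.crawl_delay("msnbot"))
print(rules.crawl_delay("Googlebot"))
```

One detail this parser makes visible: it applies only the most specific matching group, so under its reading msnbot, Teoma and Slurp follow just their own Crawl-delay line and not the Disallow lines listed under the asterisk. If those three should also stay out of /cgi-bin/, /images/ and /group/, repeating the Disallow lines inside each of their groups is the cautious approach.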

The bots we gave that "Crawl-delay: 10" instruction were requesting as many as 7 pages every second, so we asked them to slow down. The number you see is in seconds, and you can change it to suit your server capacity, based on their crawling rate. Ten seconds between page requests is far more leisurely and stops them from asking for more pages than your server can dish up.

(You can discover how fast robots and spiders are crawling by looking at your raw server logs, which show the pages requested along with precise request times to within a hundredth of a second. The logs are available from your web host, or you can ask your web or IT person. If you have server access, the logs can often be found in the root directory, and you can usually download compressed log files by calendar day right off your server. You'll need a utility that can expand compressed files to open and read those plain text raw server log files.)
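
To put those two rates side by side, here is a small back-of-the-envelope calculation. It assumes, purely for illustration, that a crawler keeps up the same pace for a full 24 hours, which the article does not claim.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def pages_per_day_at_rate(pages_per_second):
    """Pages requested in one day by a bot crawling at a steady rate."""
    return pages_per_second * SECONDS_PER_DAY


def pages_per_day_with_delay(crawl_delay_seconds):
    """Most pages a bot can request in one day if it waits this long between requests."""
    return SECONDS_PER_DAY // crawl_delay_seconds


print(pages_per_day_at_rate(7))      # 604800
print(pages_per_day_with_delay(10))  # 8640
```

Under that assumption, a ten second delay cuts the daily load from such a bot by a factor of 70.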
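
If you would rather not read the logs by eye, a short script can do the counting. The sketch below assumes the common Apache "combined" log format, where timestamps are recorded to the second; your host's format may differ, and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Matches the timestamp and the user agent in a combined-format log line, e.g.
# 1.2.3.4 - - [22/Aug/2005:11:52:01 -0400] "GET /a.html HTTP/1.1" 200 512 "-" "msnbot/1.0"
LINE_RE = re.compile(
    r'\[(?P<ts>[^\]]+)\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def peak_requests_per_second(lines):
    """Return {user_agent: highest number of requests seen within any one second}."""
    per_second = Counter()
    for line in lines:
        match = LINE_RE.search(line)
        if match:  # skip lines that do not fit the assumed format
            per_second[(match.group("agent"), match.group("ts"))] += 1
    peaks = {}
    for (agent, _timestamp), count in per_second.items():
        peaks[agent] = max(peaks.get(agent, 0), count)
    return peaks


# Made-up sample lines (assumption, not real log data).
SAMPLE = [
    '1.2.3.4 - - [22/Aug/2005:11:52:01 -0400] "GET /a.html HTTP/1.1" 200 512 "-" "msnbot/1.0"',
    '1.2.3.4 - - [22/Aug/2005:11:52:01 -0400] "GET /b.html HTTP/1.1" 200 734 "-" "msnbot/1.0"',
    '1.2.3.4 - - [22/Aug/2005:11:52:01 -0400] "GET /c.html HTTP/1.1" 200 290 "-" "msnbot/1.0"',
    '5.6.7.8 - - [22/Aug/2005:11:52:02 -0400] "GET /a.html HTTP/1.1" 200 512 "-" "Googlebot/2.1"',
]

print(peak_requests_per_second(SAMPLE))
```

To run it on a real file, pass an open file object instead of SAMPLE, since the function accepts any iterable of lines.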


To see the contents of any robots.txt file, just type /robots.txt after any domain name. If the site has that file up, you will see it displayed as a text file in your web browser. Follow the link below to see that file for Amazon.com

http://www.Amazon.com/robots.txt

You can see the contents of any website robots.txt file that way.
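
The same lookup can be scripted. This is a minimal sketch using only Python's standard library; the function names are my own, and some sites may refuse requests that come from a script rather than a browser.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen


def robots_url(page_url):
    """Build the robots.txt address for the site that hosts page_url."""
    parts = urlsplit(page_url)
    # The file always lives at the root of the domain, whatever the page path is.
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


def fetch_robots(page_url, timeout=10):
    """Download a site's robots.txt and return it as text (needs network access)."""
    with urlopen(robots_url(page_url), timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


print(robots_url("http://www.example.com/dir/page.html?x=1"))
```

Calling fetch_robots("http://www.Amazon.com/") would then return the same text the browser shows, provided the site answers the request.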

The robots.txt shown above is what we currently use at Publish101 Web Content Distributor, just launched in May of 2005. We did an extensive case study and published a series of articles on crawler behavior and indexing delays known as the Google Sandbox. That Google Sandbox Case Study is highly instructive on many levels for webmasters everywhere about the importance of this often ignored little text file.

One thing we didn't expect to glean from our research into indexing delays was how much robots.txt files matter to quick and efficient crawling by the spiders from the major search engines. We were also surprised by the number of heavy crawls from bots that do no earthly good to the site owner, yet crawl most sites extensively, straining servers to the breaking point with requests coming as fast as 7 pages per second.

We discovered in our launch of the new site that Google and Yahoo will crawl the site whether or not you use a robots.txt file, but MSN seems to REQUIRE it before they will begin crawling at all. All of the search engine robots seem to request the file on a regular basis to verify that it hasn't changed.

Then when you DO change it, they will stop crawling for brief periods and repeatedly ask for that robots.txt file during that time without crawling any additional pages. (Perhaps they had a list of pages to visit that included the directory or files you have instructed them to stay out of and must now adjust their crawling schedule to eliminate those files from their list.)

Most webmasters instruct the bots to stay out of "image" directories and the "cgi-bin" directory as well as any directories containing private or proprietary files intended only for users of an intranet or password protected sections of your site. Clearly, you should direct the bots to stay out of any private areas that you don't want indexed by the search engines.

The importance of robots.txt is rarely discussed by average webmasters, and I've even had webmasters at some of my client businesses ask me what it is and how to implement it when I tell them how important it is to both site security and efficient crawling by the search engines. (Keep in mind that the file is publicly readable and only advisory, so truly private areas still need password protection.) This should be standard knowledge for webmasters at substantial companies, but it illustrates how little attention is paid to the use of robots.txt.

The search engine spiders really do want your guidance and this tiny text file is the best way to provide crawlers and bots a clear signpost to warn off trespassers and protect private property - and to warmly welcome invited guests, such as the big three search engines while asking them nicely to stay out of private areas.

About The Author
Google Sandbox Case Study. Mike Banks Valentine operates Publish101.com Free Web Content Distribution for Article Marketers and provides content aggregation, press release optimization and custom web content for Search Engine Positioning. http://www.seoptimism.com/SEO_Contact.htm


 



Legal Note: I am not a Financial Advisor, Lawyer or Accountant. If you are going to do this on your own, seek professional help when you need it. Through this site, my BLOG, I am simply sharing my thoughts and goals and showing what I am doing: collectively, "INFORMATION" that you can read to follow along. Otherwise, go here...


Below are links to my thoughts, comments, actions, results and a BLOG as I take ONE PENNY (USD$0.01) and double it step by step through a 31-STEP Success Plan™ (the starting penny plus 30 doublings) in an effort to build the PENNY to USD $10,737,418.24:

STEP 01: $0.01
STEP 02: $0.02
STEP 03: $0.04
STEP 04: $0.08
STEP 05: $0.16
STEP 06: $0.32
STEP 07: $0.64
STEP 08: $1.28
STEP 09: $2.56
STEP 10: $5.12
STEP 11: $10.24
STEP 12: $20.48
STEP 13: $40.96
STEP 14: $81.92
STEP 15: $163.84
STEP 16: $327.68
STEP 17: $655.36
STEP 18: $1,310.72
STEP 19: $2,621.44
STEP 20: $5,242.88
STEP 21: $10,485.76
STEP 22: $20,971.52
STEP 23: $41,943.04
STEP 24: $83,886.08
STEP 25: $167,772.16
STEP 26: $335,544.32
STEP 27: $671,088.64
STEP 28: $1,342,177.28
STEP 29: $2,684,354.56
STEP 30: $5,368,709.12
STEP 31: $10,737,418.24
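
The amounts in the list above follow one simple rule, which can be checked with a few lines of Python. This is only a sketch of the arithmetic; the function names are my own, and amounts are held in whole cents to avoid rounding trouble.

```python
def step_amount_cents(step):
    """Amount at a given step in whole cents: step 1 is one penny, each later step doubles it."""
    if step < 1:
        raise ValueError("steps are numbered from 1")
    return 2 ** (step - 1)


def format_usd(cents):
    """Format a whole number of cents as dollars with thousands separators."""
    dollars, remainder = divmod(cents, 100)
    return f"${dollars:,}.{remainder:02d}"


for step in (1, 11, 31):
    print(f"STEP {step:02d}: {format_usd(step_amount_cents(step))}")
# STEP 01: $0.01
# STEP 11: $10.24
# STEP 31: $10,737,418.24
```

Step 31 is therefore 2 to the power of 30 cents, which is 1,073,741,824 pennies.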

 

31-STEP Success Plan™ © 2004, 2005, 2006 & 2007 P31.COM