[Pro] 404 Crawl Errors?

I just looked at the Google report for one of my websites and it identified several 404 errors. The errors look like they were old pages I used during the development process, which have since been deleted, but there must still be some link to them somewhere in the website.

What’s the easiest way to track down these type of errors in a website?


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

but there must still be some link to them somewhere in the website.

At the top of the FW ‘page’ there are headers Master, Page, Preview and Link Map

While on a page clicking the Link Map header will display all links in and out of that page.

If an outward link has a node on the end then clicking on that will show further links.

David


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Thanks David. The only problem is the link tab doesn’t show the page’s suffix. Upon closer review, it looks like the errors are from Perch enabled pages that used to be labeled with .html and now have .php.

I need to track down any links to the old .html pages.


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Edit>URLs will also show links in your pages.

You can go through them all and change any you find that are incorrect without having to do it on each individual page.

D


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Yeah, I’m not seeing anything wrong. Weird.


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

I figured out what the errors were. They were links from the old website. Now that I’m using a CMS, pages like client.html are now client.php. Is there anyway I can redirect the old client.html pages to the new .php pages? How can I do that?

Thanks!


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Use a 301 redirect

Apache .htaccess Singe Page Redirect

In order to use this method you will need to create a file named .htaccess or add the following to it if it exists already (not supported by Windows-based hosting) and place it on the root directory of your website, then add the code below to the file. (Remove the angle brackets from the URL if looking in the web view)

Redirect 301 /old/oldpage.html http://www.mysite.com/newpage.php

Rinse and repeat for each

David


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Visit this page to see one in action http://www.deltadesign.co/test.html - note the URLs


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Thanks David. I created the .htaccess file and saved it to my desktop and now I can’t find it. I’m assuming it’s invisible or hidden. How do I view invisible files?


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Oops! Never mind. Found a great little app that does that:


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Transmit shows invisible files too. :slight_smile:


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

I just got off the phone with GoDaddy. Apparently, their Apache servers don’t support the Redirect 301 command. They advised me to use this code instead, but I’m not sure how to set it up.

RewriteEngine on

rewritecond %{http_host} ^coolexample.com [nc]

rewriterule ^(.*)$ http://www.coolexample.com/$1 [r=301,nc]


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

I just got off the phone with GoDaddy. Apparently, their Apache servers don’t support the Redirect 301 command.

Another reason not to host with GD - non standard implementations!

As far as I understand it the code that they have given you only ensures that a request for yoursite.com goes to www.yoursite.com and is not for page specific redirects.

Try this instead in your htaccess file

RewriteEngine  on
RewriteBase    /~quux/
RewriteRule    ^foo\.html$  bar.html  [R]

Where foo is the old page name and bar is the new one.

D


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Thanks David. That worked perfectly, but ~quux needs to be the domain name.

Where can I find more information on the Rewrite command and its available attributes? Also, how do I get the green code box above when inserting code into threads?


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Which is written like this:

RewriteBase //


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Oops, my bad. The code should only have one forward slash, like this:

RewriteBase /


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Also discovered that to comment an .htaccess file you use the # symbol instead of // :slight_smile:


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Here’s an article I wrote back in March last year on the subject which should cover most of the basics of redirection;

Regards,
Tim.

On 30 Sep 2012, at 11:12, RavenManiac wrote:

Where can I find more information on the Rewrite command and its available attributes?


Experienced Freeway designer for hire - http://www.freewayactions.com


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Cool. Thanks Tim.


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options

Even though I’ve done a Rewrite to redirect old pages I’m still getting Google 404 Crawl Errors. When I click on the supposedly erroneous page the redirect is working perfectly.

Why am I still getting 404 errors even though the error has been corrected?


freewaytalk mailing list
email@hidden
Update your subscriptions at:
http://freewaytalk.net/person/options