| ||||||||||
![]() |
Shopping cart software Solutions for online shops and malls | |||||||||
![]() |
![]() |
|
X-Cart Home | ![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
gsite crawler taking forever!!! | |||
![]() |
|
|
Thread Tools | Search this Thread |
#1
|
|||||||
|
|||||||
![]() I started gsite crawler this morning around 9:30am I imported my robots.txt to make sure it wasn't going to crawl my entire site. I ended up with around 6400 files waiting and I'm now down to 4500 but it is 4:30pm!!!! Is it supposed to take that long???? Anybody know what settings I should have adjusted or any other tips?
thanks
__________________
www.uscandleco.com - X-Cart Version 4.7.11 Gold Plus php7.3 mods: reCaptcha running on UNIX www.keystonecandle.com X-Cart Gold Plus - Version 4.7.11 php7.2 mods: reCaptcha cdseo pro running on UNIX |
|||||||
#2
|
|||||||
|
|||||||
![]() We have something like
Quote:
![]() It takes about less than 5 minutes (2-3 minutes) for it to go through and process (generated 300ish links). Though you may want to limit it on a shared hosting since it can be pretty intense on your website. Also when your host locks your site for CPU overuse, GSiteCrawler doesn't realize that and continues to crawl.
__________________
4.1.8 |
|||||||
#3
|
|||||||||
|
|||||||||
![]() Some crawlers don't respect the base href tag so having CDSEO installed can cause problems if they don't.
If you find that the urls they are coming up with are "looping", it means that the base href tag is not being followed. You can solve this problem with the following code in .htaccess: Code:
|
|||||||||
#4
|
|||||||
|
|||||||
![]() take a look at http://www.xml-sitemaps.com/standalone-google-sitemap-generator.html . it can be automated on your server. works fine with CDSEO
__________________
----------------------- x-cart v4.7.6 [LIVE] x-cart v4.0.18 [retired 2004-2016] x-cart v3.5.13 [retired] x-cart v3.4.14 [retired] |
|||||||
#5
|
|||||||
|
|||||||
![]() Is this code in my .htaccess file the same as what you suggested? If so I already have it there.
<IfModule mod_rewrite.c> RewriteEngine on RewriteBase / RewriteCond %{HTTP_HOST} !^www\.uscandleco\.com [NC] RewriteRule ^(.*)$ http://www.uscandleco.com/$1 [R=301,L] RewriteCond %{SERVER_PORT} 80 #RewriteRule ^admin/ https://%{HTTP_HOST}%{REQUEST_URI} [R=301,L] </IfModule> I ran gsitecrawler again and it ran overnight and when I checked it there were150000 files waiting and it still was going. I shut it down. I'm assuming it was just looping again. Any help would be great.
__________________
www.uscandleco.com - X-Cart Version 4.7.11 Gold Plus php7.3 mods: reCaptcha running on UNIX www.keystonecandle.com X-Cart Gold Plus - Version 4.7.11 php7.2 mods: reCaptcha cdseo pro running on UNIX |
|||||||
#6
|
|||||||||
|
|||||||||
![]() The code isn't the same, please try the code I provided.
|
|||||||||
|
|||
X-Cart forums © 2001-2020
|