Sitemap, data directory and Google images indexing questions
December 24th, 2013, 01:00 PM
Hi, looking for ideas, input, suggestions, etc. My site has over 3000 images and is several years old. It looks like most, if not all, pages are indexed, although I have poor search results via Google (another story). My main Google concern right now is that although most of my pages are indexed, it seems that 0 actual images are indexed. I have reviewed several other posts that talk about this, talk about sitemaps, etc. Yet I cannot definitively determine why Google might not be indexing any of my images. All images have appropriate file names describing the image.
Based on what I read, I used the “create sitemap” command under admin tools. It created the sitemap just fine. I submitted this to Google using Webmaster Tools and it gives me over 3000 warnings: basically one warning for each image, as the sitemap references the image file under the data directory, which is blocked by the robots.txt file.
So, is there any reason that I should or should not change the robots.txt file to NOT disallow the “data” directory?
Or are there any other suggestions that might help figure out why Google does not index my actual images?
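For anyone unsure what a sitemap that references image files looks like, here is a rough sketch in Python. It is not the output of the admin tools "create sitemap" command. It only illustrates Google's image sitemap extension, where each page entry lists the image URLs that appear on it. The example.com URLs and the build_image_sitemap helper are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Standard sitemap namespace plus Google's image sitemap extension namespace.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def build_image_sitemap(pages: dict[str, list[str]]) -> str:
    """Return sitemap XML mapping each page URL to the images shown on it.

    Hypothetical helper for illustration only.
    """
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("image", IMAGE_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for page_url, image_urls in pages.items():
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = page_url
        for image_url in image_urls:
            # One <image:image> element per image file on the page.
            image = ET.SubElement(url, f"{{{IMAGE_NS}}}image")
            ET.SubElement(image, f"{{{IMAGE_NS}}}loc").text = image_url
    return ET.tostring(urlset, encoding="unicode")


# Made-up page and image URLs; the image lives under /data/ as in this thread.
example_pages = {
    "https://www.example.com/photo-page.html": [
        "https://www.example.com/data/500/sample-photo.jpg",
    ],
}
print(build_image_sitemap(example_pages))
```

If the image URLs listed this way sit under a directory that robots.txt disallows, a warning per image is what you would expect to see.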
December 24th, 2013, 02:50 PM
The sitemap is a relatively new thing, and yes, it requires an actual image link, which is how images get into Google. There are two separate things here: Google's page index and Google's image index.
If you want your images in Google Images, then yes, remove the data directory disallow in the robots.txt file.
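A quick way to see the effect of that disallow line is to test an image URL against the rules with Python's standard urllib.robotparser module. This is only a sketch: the robots.txt contents and the example.com URL are assumptions, not copied from any real site, and Python's parser is not guaranteed to match Googlebot's own rule matching in every case.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, agent: str = "Googlebot-Image") -> bool:
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Assumed "before" rules: the data directory is blocked for all crawlers.
BLOCKED = """User-agent: *
Disallow: /data/
"""

# Assumed "after" rules: the disallow line for the data directory is gone.
UNBLOCKED = """User-agent: *
Disallow:
"""

# Made-up image URL under the data directory.
IMAGE_URL = "https://www.example.com/data/500/sample-photo.jpg"

print("blocked rules allow image:", is_allowed(BLOCKED, IMAGE_URL))
print("unblocked rules allow image:", is_allowed(UNBLOCKED, IMAGE_URL))
```

Running the same check over every image URL in the sitemap is a cheap way to confirm nothing is still blocked before resubmitting it.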
December 24th, 2013, 04:09 PM
Thanks Chuck. When I was looking back through some of the other threads where people had image indexing issues, I thought I saw somewhere that you had indicated unblocking /data could cause issues, and that search engines did not need access to that directory as it held just images with no HTML, etc., thus the reason for the question. With that info, I thought I would also post my problem on one of the Google forums. Here is a link to that thread with some responses; the main/best response, from a Google person, is quoted below. Hope this helps others as well.
"Well, had to remove the link as I do not yet have 15 posts", but here is part of the response:
this appears to be a combination of two things, and one of them is fixed already :-).
1) Until recently you had blocked crawling of /data in your robots.txt file. This is the directory where your thumbnail images appear to be. Since it's no longer blocked now, we'll probably pick up on crawling those images soon.
At any rate, since the thumbnail URLs on their own should start getting picked up soon, maybe that's already enough for you & you don't actually need to make any changes with the full images -- that's completely up to you.
Hope it helps!
December 24th, 2013, 07:11 PM
John brings up another point. If you use the on-the-fly watermarking and the other image protection options in the program, which are there to stop others from leeching your content and are options you set, then in effect you are blocking Google from accessing your images, sure. ;)
December 30th, 2013, 05:57 PM
Although painfully slow, Google now does seem to be picking up some of my images. 16 out of 3,129 so far. Mostly thumbnail images with a couple of medium size. So, bottom line, making progress. Thanks
December 30th, 2013, 07:16 PM
Yeah, I have no control over Google or how fast things happen there. ;)
If you want Google Images to pick up your images, you need to turn off on-the-fly watermarking and serving up the images with watermark.php, then run the sitemap script and submit it to Google. Google will tell you how many it indexed.