Webmasters, Hacking Ball Z - http://www.hackingballz.com
Spy on search engine visits to your site
http://www.hackingballz.com/articulos/40/1/Espia-los-ingresos-de-motores-de-busqueda-a-tu-sitio/Page1.html
By Larry Hans Arroyo Vargas
Published on 28.07.08
 
This time we will develop an interesting piece of code that reports, once a day, the visits that search engines make to our website, with full detail of times and IP addresses.

Background on the original idea from LeeMiBlog and Código Geek

The other day, during my routine morning review of blogs, I stopped by one that I generally like a lot, called “Código Geek”. That day I came across a post with code that sent an email notification every single time a visit from Google occurred. Although I had seen similar code years ago, I thought it would be interesting to discuss it later in this PHP category.

Some time later, last week, I went back through my bookmarks and found the famous post, titled “Recibe un email cuando GoogleBot visite tu blog” (“Receive an email when GoogleBot visits your blog”). By then the comments had moved on, and users had noticed that the code is really quite crude in practice: it would flood your inbox with identical messages every time Google decided to visit.

With that picture in mind, I decided to write the following code, which in short does the following:

  1. Offers a demo mode that records a test entry in the log, so we can make sure everything was installed correctly.
  2. Uses a working directory of plain text files, so it needs no database.
  3. Covers not only GoogleBot but any number of other spiders, including those of Microsoft and Yahoo and several more.
  4. Looks for partial matches in the user agent to decide whether the robot is one we want, rather than requiring an exact match.
  5. On each robot visit, adds an entry to the text log, so that the history can be sent the following day.
  6. When a previous report exists, sends it and deletes the previous day's log.
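
Point 4 is what lets one short list cover many crawler variants. As a minimal sketch (the function name match_bot and the shortened fragment list are assumptions made for illustration, not the script's actual code), a case-insensitive substring search could look like this:

```php
<?php
// Hypothetical helper, not part of botspy.php: returns the first fragment
// found in the user agent (case-insensitive), or null if none matches.
function match_bot($agent, array $fragments)
{
    foreach ($fragments as $fragment) {
        if (stripos($agent, $fragment) !== false) {
            return $fragment;
        }
    }
    return null;
}

// Shortened list, for illustration only.
$fragments = array('googlebot', 'msnbot', 'yahoo');

// A full Googlebot user agent matches on the 'googlebot' fragment.
$bot = match_bot('Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)', $fragments);

// An ordinary browser user agent matches none of the fragments.
$browser = match_bot('Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.1) Gecko/2008070208 Firefox/3.0.1', $fragments);
```

Because only the user-agent string is inspected, any client that sends a matching string will be treated as a bot.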

With these features I think we end up with a much more interesting script than the one we started from, and you can decide whether to use the simple version shared by our colleagues at Código Geek or the version with “extras” that we have put together with great care.

Original version:

<?php
if ( strpos( $_SERVER['HTTP_USER_AGENT'], 'Googlebot' ) !== false )
{
// Your email address
$email_address = 'tu@tudominio.com ';

// Email yourself
mail($email_address,'Alerta de Googlebot', 'El Googlebot ha visitado tu pagina: '.$_SERVER['REQUEST_URI']);
}
?>

NOTE 1: Apart from translating its comments, we have not modified the original code in any way.
NOTE 2: Reading the Código Geek post, we see that they got the idea from the “LeeMiBlog” blog, so we likewise extend the corresponding credit for the original code.

Now, on the next page, we will see the code that brought us here.


Our "improved" code

OK, now that we have reached page two of this installment, let's dive straight into the new material.

We have named our implementation botspy.php.

botspy.php:

<?php

/*
*
* Receive one email per day telling you when a search engine visited your website.
*
* Original idea: http://www.codigogeek.com/2008/06/22/recibe-un-email-cuando-googlebot-visite-tu-blog/
* http://www.leemiblog.com/Articulos/Programacion/Recibe-un-email-cada-vez-que-Google-visita-tu-pgina.html
*
* Hacking Ball Z
* http://www.hackingballz.com/articulos/40/1/Espia-los-ingresos-de-motores-de-busqueda-a-tu-sitio/Page1.html
*/

// Demo option: requesting a page with ?demo=1 simulates a bot visit
if (isset($_GET['demo']) && $_GET['demo'] == 1) {

$_SERVER['HTTP_USER_AGENT'] = 'demo-googlebot';

}

// Email address
$abdy = 'tu@correo.com';

// Working directory
$botspy = '/path/completo/demo_robots_spider/botspy'; # CHMOD 777 on all files.

// "Recognized" bots
$bots = array('googlebot','msnbot','yahoo','teoma','gigabot','robozilla','nutch','ia_archiver','baiduspider');

/*
Search engine    User-agent fragment
Google           googlebot
MSN Search       msnbot
Yahoo            yahoo
Ask/Teoma        teoma
GigaBlast        gigabot
DMOZ Checker     robozilla
Nutch            nutch
Alexa/Wayback    ia_archiver
Baidu            baiduspider
*/

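// Define TODAY
$hoy = date('d-m-Y');

// Check the USER_AGENT to see whether it belongs to a recognized bot.
// The header may be missing, so fall back to an empty string.
$agente = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '';
$found_bot = false;

foreach ($bots as $val) {

    // Case-insensitive partial match
    if ($agente !== '' && stripos($agente, $val) !== false) {

        $found_bot = true;
        break;

    }

}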

// If it is a recognized bot, we proceed.

if($found_bot){

// Write the details to the log: user agent || IP || time
$cadena = $_SERVER['HTTP_USER_AGENT']. ' || '. $_SERVER['REMOTE_ADDR'] .' || '. date('h:i:s A') . "\r\n";
file_put_contents($botspy.'/'.$hoy.'.log',$cadena,FILE_APPEND);

}

// Once a day, send a message with the log from the previous day

if (!file_exists($botspy.'/'.$hoy.'.mail.log')) {

    $contenido = '';
    $num_lineas = 0;

    if ($gestor = opendir($botspy)) {
        while (false !== ($archivo = readdir($gestor))) {
            if ($archivo != "." && $archivo != ".." && $archivo != $hoy.'.log') {

                $log = fopen($botspy.'/'.$archivo, "r");
                if ($log === false) {
                    continue; // unreadable file: skip it
                }
                // fgets() returns false at end of file, so only real lines are counted
                while (($linea = fgets($log)) !== false) {
                    $contenido .= $linea;
                    $num_lineas++;
                }
                fclose($log);

                // Remove the file once its content has been collected
                unlink($botspy.'/'.$archivo);
            }
        }
        closedir($gestor);
    }

    $mensaje = ("

CURRENT DATE: $hoy

REPORT FOR YESTERDAY
===============================

Total visits from recognized bots: $num_lineas

LOG
===============================
$contenido

Hacking regards...

HACKING BALL Z
http://www.hackingballz.com
");

    // Send the report only if at least one visit was logged
    if ($num_lineas > 0) {

        mail($abdy, "Report of yesterday's robot visits.", $mensaje);

    }

    // Empty marker file: prevents the report from being sent again today
    file_put_contents($botspy.'/'.$hoy.'.mail.log','');

}

?>

Throughout the code we added comments at the most decisive parts of the process. In general it does not use particularly complicated functions, and each one can easily be looked up in its official manual page.
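
The once-a-day behavior in botspy.php relies on an empty marker file named after the current date with the suffix .mail.log. Isolated from the rest of the script, the idea can be sketched roughly as follows; the function names and the temporary demo directory are assumptions made for illustration only:

```php
<?php
// Hypothetical helpers illustrating the once-a-day marker; these names are not from botspy.php.

// True when no marker exists yet for the given day, i.e. the report is still pending.
function report_pending($dir, $day)
{
    return !file_exists($dir . '/' . $day . '.mail.log');
}

// Creates the empty marker so later requests on the same day skip the report.
function mark_report_sent($dir, $day)
{
    return file_put_contents($dir . '/' . $day . '.mail.log', '') !== false;
}

// Demo in a throwaway directory (assumes the system temp directory is writable).
$dir = sys_get_temp_dir() . '/botspy_demo_' . uniqid();
mkdir($dir);
$day = date('d-m-Y');

$before = report_pending($dir, $day); // no marker yet
mark_report_sent($dir, $day);
$after = report_pending($dir, $day);  // marker now present

// Clean up the demo files.
unlink($dir . '/' . $day . '.mail.log');
rmdir($dir);
```

The first request of each day finds no marker and triggers the report; every later request that day finds the marker and skips it.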

As usual for a script that works on a daily cycle, we ran the corresponding tests on a website that is a friend of Hacking Ball Z. On the next page we show, as a log, the results of 3 days of use, along with the installation details for the script.



Log samples
CURRENT DATE: 25-07-2008

Total visits from recognized bots: 48

LOG
===============================
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.104.156 || 01:53:25 PM
googlebot || 200.122.133.106 || 01:54:11 PM
googlebot || 66.249.70.232 || 01:54:16 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:54:37 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 01:56:25 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:03:51 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 02:07:47 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 02:40:22 PM
ia_archiver || 209.234.171.33 || 02:46:30 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 02:56:28 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 03:08:18 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 03:08:27 PM
Mozilla/5.0 (compatible; Ask Jeeves/Teoma; +http://about.ask.com/en/docs/about/webmasters.shtml) || 65.214.45.121 || 03:27:32 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 03:45:58 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 04:06:27 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 04:08:09 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 04:08:27 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 05:00:41 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 05:03:24 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 05:07:12 PM
Gigabot/3.0 (http://www.gigablast.com/spider.html) || 66.231.188.113 || 05:11:35 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 06:03:41 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 06:05:38 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:37:03 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:53:36 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 07:04:36 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 07:08:13 PM
Mozilla/5.0 (compatible; Ask Jeeves/Teoma; +http://about.ask.com/en/docs/about/webmasters.shtml) || 65.214.45.121 || 07:09:29 PM
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.1) Gecko/2008070208 Googlebot 2.1 || 201.195.130.198 || 07:36:00 PM
Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.1) Gecko/2008070208 Googlebot 2.1 || 201.195.130.198 || 07:36:02 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 08:00:38 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 08:03:23 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 08:04:10 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 09:03:42 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 09:05:06 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 09:26:47 PM
msnbot-media/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.230.229 || 09:34:35 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 09:35:19 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 09:56:48 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 10:03:23 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 10:03:59 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 11:05:32 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 11:07:22 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 11:19:27 PM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.25.136 || 11:31:38 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 11:55:37 PM


CURRENT DATE: 26-07-2008

Total visits from recognized bots: 99

LOG
===============================
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 12:04:07 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 12:07:32 AM
Mozilla/5.0 (compatible; Ask Jeeves/Teoma; +http://about.ask.com/en/docs/about/webmasters.shtml) || 65.214.45.121 || 12:19:42 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 12:40:44 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 01:03:20 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 01:08:25 AM
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm) || 65.55.212.214 || 01:13:52 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:33:24 AM
Mozilla/5.0 (compatible; Ask Jeeves/Teoma; +http://about.ask.com/en/docs/about/webmasters.shtml) || 65.214.45.121 || 01:45:32 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 02:02:57 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:05:08 AM
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm) || 65.55.212.214 || 02:05:13 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 02:57:04 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 03:04:11 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 03:04:36 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 03:15:19 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 03:19:16 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 03:27:07 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 04:07:06 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 04:08:51 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 04:22:49 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 04:50:56 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 04:57:12 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 05:04:09 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 05:05:08 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 05:54:45 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:00:44 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 06:03:19 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 06:06:18 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:29:01 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 06:29:28 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 07:04:10 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 07:08:40 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 07:24:45 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 07:57:23 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 08:03:37 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 08:08:25 AM
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm) || 65.55.212.214 || 08:31:36 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 08:51:59 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 08:52:30 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 08:57:27 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 09:01:51 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 09:03:13 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 09:03:25 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 10:04:47 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 10:05:13 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:30:22 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:52:57 AM
Gigabot/3.0 (http://www.gigablast.com/spider.html) || 66.231.188.113 || 10:57:10 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 11:06:22 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 11:06:26 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 11:30:09 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.31 || 11:44:03 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 12:01:27 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 12:06:27 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 12:07:36 PM
Baiduspider+(+http://www.baidu.com/search/spider_jp.html) || 119.63.193.55 || 12:09:38 PM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.232.19 || 12:17:28 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 01:04:25 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 01:06:12 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:17:22 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:07:32 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 02:08:14 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 02:23:10 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 02:27:43 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 02:52:52 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 03:02:56 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 03:05:25 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 04:07:25 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 04:10:12 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 04:32:07 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 04:51:08 PM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.209.240 || 04:56:20 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 05:03:06 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 05:04:01 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 05:51:42 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 06:03:12 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 06:05:04 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 06:57:56 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 07:06:32 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 07:07:34 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 07:30:07 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 07:57:59 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 08:07:30 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 08:08:34 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 08:42:38 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 08:51:47 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 09:03:12 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 09:08:33 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 09:58:05 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 10:05:03 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 10:06:19 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:24:23 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:34:24 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:58:36 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 11:02:26 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 11:06:54 PM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.104.12 || 11:36:21 PM


CURRENT DATE: 28-07-2008

Total visits from recognized bots: 91

LOG
===============================
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 12:07:09 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 12:07:24 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 12:31:37 AM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.25.136 || 12:32:15 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 12:59:43 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 01:03:48 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 01:04:04 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:41:55 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 02:04:24 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:07:43 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 03:01:28 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 03:07:18 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 03:08:06 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 03:15:15 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 03:59:52 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 04:04:25 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 04:08:11 AM
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm) || 65.55.212.164 || 04:54:53 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 05:01:46 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 05:02:00 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 05:03:36 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 05:30:00 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 05:44:10 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 06:02:33 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 06:07:45 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:28:39 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 06:30:07 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 07:03:43 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 07:07:17 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 07:58:15 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 08:04:40 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 08:05:51 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 08:28:43 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 08:30:16 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 09:03:47 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 09:05:35 AM
msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.211.139 || 09:21:26 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 09:30:19 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 09:34:04 AM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 10:00:22 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 10:06:52 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 10:07:37 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 11:02:38 AM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 11:03:59 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 11:17:01 AM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 12:03:41 PM
Baiduspider+(+http://www.baidu.com/search/spider_jp.html) || 119.63.193.55 || 12:05:49 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 12:07:25 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 12:08:15 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 12:52:31 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 01:04:49 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 01:04:58 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:15:04 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:05:06 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 02:07:15 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 02:08:41 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 02:30:29 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 03:05:41 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 03:05:51 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 03:43:03 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 04:05:23 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 04:07:59 PM
msnbot-media/1.0 (+http://search.msn.com/msnbot.htm) || 65.55.212.65 || 04:16:38 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 04:29:27 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 05:00:36 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 05:06:41 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 05:06:57 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 05:18:11 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 06:05:11 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 06:06:47 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 06:30:41 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 06:59:17 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 07:06:20 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 07:07:53 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 07:29:22 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 07:30:45 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 07:59:02 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 08:05:01 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 08:05:45 PM
Baiduspider+(+http://help.baidu.jp/system/05.html) || 119.63.194.85 || 08:30:51 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 09:02:42 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 09:02:55 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 09:32:06 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 10:07:43 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 10:07:48 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 10:26:08 PM
RAYSPIDER/Nutch-0.9 || 199.46.198.232 || 10:34:12 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 61.135.168.39 || 11:02:26 PM
Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 11:08:00 PM
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 11:25:23 PM


With the information above, we can confirm that the script works.
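
Since every log line follows the same "user agent || IP || time" layout, a report can also be summarized per bot. The following is only a sketch under that assumption; count_by_bot is a hypothetical helper, and it would miscount a user agent that itself contains " || ":

```php
<?php
// Hypothetical helper: counts log lines per bot fragment.
// Each line is expected as "user agent || IP || time", the layout botspy.php writes.
function count_by_bot(array $lines, array $fragments)
{
    $totals = array();
    foreach ($lines as $line) {
        $parts = explode(' || ', trim($line));
        if (count($parts) !== 3) {
            continue; // skip blank or malformed lines
        }
        foreach ($fragments as $fragment) {
            if (stripos($parts[0], $fragment) !== false) {
                $totals[$fragment] = isset($totals[$fragment]) ? $totals[$fragment] + 1 : 1;
                break;
            }
        }
    }
    return $totals;
}

// Three lines copied from the 25-07-2008 sample.
$lines = array(
    "msnbot/1.1 (+http://search.msn.com/msnbot.htm) || 65.55.104.156 || 01:53:25 PM\r\n",
    "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) || 66.249.70.232 || 01:54:37 PM\r\n",
    "Baiduspider+(+http://www.baidu.com/search/spider.htm) || 220.181.32.22 || 02:03:51 PM\r\n",
);

// Per-fragment totals for the three sample lines.
$totals = count_by_bot($lines, array('googlebot', 'msnbot', 'baiduspider'));
```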

We can include it in our website as follows:
<?php
include("path/demo_robotsspider/botspy.php");
?>
We must also remember that the containing directory, the PHP file, and the working directory must all be set to CHMOD 777.
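
A quick way to confirm that the permissions took effect is to ask PHP whether it can write to the working directory. This is a minimal sketch; check_work_dir is a hypothetical name, and the demo uses the system temp directory rather than the real botspy path:

```php
<?php
// Hypothetical pre-flight check; the function name is an assumption, not part of botspy.php.
function check_work_dir($dir)
{
    if (!is_dir($dir)) {
        return 'missing';
    }
    if (!is_writable($dir)) {
        return 'not writable';
    }
    return 'ok';
}

// Demo against the system temp directory and a path that should not exist.
$temp_status    = check_work_dir(sys_get_temp_dir());
$missing_status = check_work_dir(sys_get_temp_dir() . '/botspy_no_such_dir_' . uniqid());
```

In a real installation the argument would be the same path assigned to $botspy in the script.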

If you have any questions, please use the comment system.

Hacking regards…