问个服务器日志的正则怎么写

程序员文章站 2022-05-02 21:55:30

...

2013-06-23 04:33:51 W3SVC1539885 198.56.185.162 GET /robots.txt - 80 - 66.249.75.65 Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) 404 0 2

我想分别匹配日期2013-06-23/时间04:33:51/服务器ip198.56.185.162/文件地址robots.txt/蜘蛛ip66.249.75.65/蜘蛛信息Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)/状态码404 0 2/，这个如何精确匹配?

回复讨论(解决方案)

最好举一个特定的例子，然后给出你期望的结果，你的问题表示看不懂

最好举一个特定的例子，然后给出你期望的结果，你的问题表示看不懂
代码就是特定的例子，我想取的值标注在下面，就是想写一句话正则匹配，用pregmatch这种生成一个数组然后我再干点其他的事。

我想这个日期的格式应该是固定的，你可以按空格将它们分割，例如

$log = '2013-06-23 04:33:51 W3SVC1539885 198.56.185.162 GET /robots.txt - 80 - 66.249.75.65 Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) 404 0 2';var_dump( explode(' ', $log) ); /**array(14) {  [0]=>  string(10) "2013-06-23"  [1]=>  string(8) "04:33:51"  [2]=>  string(12) "W3SVC1539885"  [3]=>  string(14) "198.56.185.162"  [4]=>  string(3) "GET"  [5]=>  string(11) "/robots.txt"  [6]=>  string(1) "-"  [7]=>  string(2) "80"  [8]=>  string(1) "-"  [9]=>  string(12) "66.249.75.65"  [10]=>  string(72) "Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)"  [11]=>  string(3) "404"  [12]=>  string(1) "0"  [13]=>  string(1) "2"}*/

我想这个日期的格式应该是固定的，你可以按空格将它们分割，例如

$log = '2013-06-23 04:33:51 W3SVC1539885 198.56.185.162 GET /robots.txt - 80 - 66.249.75.65 Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) 404 0 2';var_dump( explode(' ', $log) ); /**array(14) {  [0]=>  string(10) "2013-06-23"  [1]=>  string(8) "04:33:51"  [2]=>  string(12) "W3SVC1539885"  [3]=>  string(14) "198.56.185.162"  [4]=>  string(3) "GET"  [5]=>  string(11) "/robots.txt"  [6]=>  string(1) "-"  [7]=>  string(2) "80"  [8]=>  string(1) "-"  [9]=>  string(12) "66.249.75.65"  [10]=>  string(72) "Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)"  [11]=>  string(3) "404"  [12]=>  string(1) "0"  [13]=>  string(1) "2"}*/

我想这个日期的格式应该是固定的，你可以按空格将它们分割，例如

$log = '2013-06-23 04:33:51 W3SVC1539885 198.56.185.162 GET /robots.txt - 80 - 66.249.75.65 Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html) 404 0 2';var_dump( explode(' ', $log) ); /**array(14) {  [0]=>  string(10) "2013-06-23"  [1]=>  string(8) "04:33:51"  [2]=>  string(12) "W3SVC1539885"  [3]=>  string(14) "198.56.185.162"  [4]=>  string(3) "GET"  [5]=>  string(11) "/robots.txt"  [6]=>  string(1) "-"  [7]=>  string(2) "80"  [8]=>  string(1) "-"  [9]=>  string(12) "66.249.75.65"  [10]=>  string(72) "Mozilla/5.0+(compatible;+Googlebot/2.1;++http://www.google.com/bot.html)"  [11]=>  string(3) "404"  [12]=>  string(1) "0"  [13]=>  string(1) "2"}*/

但是服务器日志不是每行都是这样的，有很多#开头的，所以才想做个正则过滤掉其他格式的。

这个分不能浪费了

日志文件一般都很大
你需要在循环中逐行读取，拆分成数组

相关标签：问个服务器日志的正则怎么写

上一篇： Python bsddb模块操作Berkeley DB数据库介绍

下一篇：序列化过的数据,经过post方式传值,数据消失

问个服务器日志的正则怎么写

回复讨论(解决方案)

钉钉怎么写工作日志? 钉钉提交工作日报的教程

获取https:到.jpg或者.gif的正则怎么写？即获取网络图片

区域内匹配的正则表达式应该怎么写？

：这个a标签的正则怎么写？

正则表达式 - 求助怎么写php的正则匹配

用正则表达式判断是不是指定的url,怎么写?

linux - 自己写的项目，如果服务器配置url重写了，那原来已经写好链接怎么办？还是 .php 啊，要进入代码一起改成重写后格式吗？

静态文件服务器A和web应用服务器B分开，怎么样在B服务器上传的图片，上传到静态文件服务器应用是PHP写的？

问个服务器日志的正则如何写

匹配页面上某个class的正则怎么写