PHP批量删除、清除UTF-8文件BOM头的代码实例
程序员文章站
2023-11-27 11:31:28
记得运行代码前先把文件备份一下哦,避免出现失败问题。
代码一:
function checkbom ($filename) {
globa...
记得运行代码前先把文件备份一下哦,避免出现失败问题。
代码一:
function checkbom ($filename) { global $auto; $contents = file_get_contents($filename); $charset[1] = substr($contents, 0, 1); $charset[2] = substr($contents, 1, 1); $charset[3] = substr($contents, 2, 1); if (ord($charset[1]) == 239 && ord($charset[2]) == 187 && ord($charset[3]) == 191) { if ($auto == 1) { $rest = substr($contents, 3); rewrite ($filename, $rest); return ("<font color=red>bom found, automatically removed.</font>"); } else { return ("<font color=red>bom found.</font>"); } } else return ("bom not found."); }
代码二:
<?php header('content-type: text/html; charset=utf-8'); if(isset($_get['dir'])){ //设置文件目录,如果没有设置,则自动设置为当前文件所在目录 $basedir=$_get['dir']; }else{ $basedir='.'; } $auto=1;/*设置为1标示检测bom并去除,设置为0标示只进行bom检测,不去除*/ echo '当前查找的目录为:'.$basedir.'当前的设置是:'; echo $auto?'检测文件bom同时去除检测到bom文件的bom<br />':'只检测文件bom不执行去除bom操作<br />'; checkdir($basedir); function checkdir($basedir){ if($dh=opendir($basedir)){ while (($file=readdir($dh)) !== false){ if($file != '.' && $file != '..'){ if(!is_dir($basedir.'/'.$file)){ echo '文件: '.$basedir.'/'.$file .checkbom($basedir.'/'.$file).' <br>'; }else{ $dirname=$basedir.'/'.$file; checkdir($dirname); } } } closedir($dh); } } function checkbom($filename){ global $auto; $contents=file_get_contents($filename); $charset[1]=substr($contents,0,1); $charset[2]=substr($contents,1,1); $charset[3]=substr($contents,2,1); if(ord($charset[1])==239 && ord($charset[2])==187 && ord($charset[3])==191){ if($auto==1){ $rest=substr($contents,3); rewrite($filename,$rest); return (' <font color=red>找到bom并已自动去除</font>'); }else{ return (' <font color=red>找到bom</font>'); } }else{ return (' 没有找到bom'); } } function rewrite($filename,$data){ $filenum=fopen($filename,'w'); flock($filenum,lock_ex); fwrite($filenum,$data); fclose($filenum); } ?>
代码三:
##把该文件放在需求去除bom头的目录下跑一下却可。 <?php if (isset ( $_get ['dir'] )) { // config the basedir $basedir = $_get ['dir']; } else { $basedir = '.'; } $auto = 1; checkdir ( $basedir ); function checkdir($basedir) { if ($dh = opendir ( $basedir )) { while ( ($file = readdir ( $dh )) !== false ) { if ($file != '.' && $file != '..') { if (! is_dir ( $basedir . "/" . $file )) { // 如果是文件 echo "filename: $basedir/$file " . checkbom ( "$basedir/$file" ) . " <br>"; } else { $dirname = $basedir . "/" . $file; // 如果是目录 checkdir ( $dirname ); // 递归 } } } closedir ( $dh ); } } function checkbom($filename) { global $auto; $contents = file_get_contents ( $filename ); $charset [1] = substr ( $contents, 0, 1 ); $charset [2] = substr ( $contents, 1, 1 ); $charset [3] = substr ( $contents, 2, 1 ); if (ord ( $charset [1] ) == 239 && ord ( $charset [2] ) == 187 && ord ( $charset [3] ) == 191) { // bom // 的前三个字符的ascii // 码分别为 // 239 // 187 // 191 if ($auto == 1) { $rest = substr ( $contents, 3 ); rewrite ( $filename, $rest ); return ("<font color=red>bom found, automatically removed.</font>"); } else { return ("<font color=red>bom found.</font>"); } } else return ("bom not found."); } function rewrite($filename, $data) { $filenum = fopen ( $filename, "w" ); flock ( $filenum, lock_ex ); fwrite ( $filenum, $data ); fclose ( $filenum ); } ?>
二、python
#!/usr/bin/env python #-*- coding: utf-8 -*- import os def delbom(): file_count = 0 bom_files = [] for dirpath, dirnames, filenames in os.walk('.'): if(len(filenames)): for filename in filenames: file_count += 1 file = open(dirpath + "/" + filename, 'r+') file_contents = file.read() if(len(file_contents) > 3): if(ord(file_contents[0]) == 239 and ord(file_contents[1]) == 187 and ord(file_contents[2]) == 191): bom_files.append(dirpath + "/" + filename) file.seek(0) file.write(file_contents[3:]) print bom_files[-1], "bom found. deleted." file.close() print file_count, "file(s) found.", len(bom_files), "file(s) have a bom. deleted." if __name__ == "__main__": delbom()
为了方便大家使用,这里小编分享一个bom工具方便大家检测。
下载地址:
上一篇: 用PHP和Shell写Hadoop的MapReduce程序
下一篇: java实现字符串排列组合问题