欢迎您访问程序员文章站本站旨在为大家提供分享程序员计算机编程知识!
您现在的位置是: 首页  >  数据库

【Oracle】undo损坏,无备份非常规恢复

程序员文章站 2022-06-17 15:16:01
...

客户的一个测试环境,主机异常断电,启动后发现undo文件损坏,无法启动,在open阶段报错如下: Errors in file /u01/app/oracle/diag/rdbms/cdrdb/CDRDB/trace/CDRDB_ora_4109.trc: ORA-01122: database file 3 failed verification check ORA-01110: data f

客户的一个测试环境,主机异常断电,启动后发现undo文件损坏,无法启动,在open阶段报错如下:

Errors in file /u01/app/oracle/diag/rdbms/cdrdb/CDRDB/trace/CDRDB_ora_4109.trc:

ORA-01122: database file 3 failed verification check

ORA-01110: data file 3: '/u01/app/oracle/oradata/CDRDB/undotbs01.dbf'

ORA-01210: data file header is media corrupt

ORA-1122 signalled during: ALTER DATABASE OPEN...

由于是测试环境,没有备份,但是又需要里边的一些数据,所以我尝试使用非常规恢复方法进行了尝试。

先冷备份现有环境!!!!!!!!!!!!!!!!

创建pfile文件:

create pfile from spfile;

在pfile中修改这两个参数

#*.undo_tablespace='UNDOTBS1'

*.undo_management= MANUAL

之后用这个pfile启动:

SYS@CDRDB>startup force pfile='/u01/app/oracle/product/11.2.0/dbhome_1/dbs/initCDRDB.ora';

ORACLE instance started.

Total System Global Area 523108352 bytes

Fixed Size 1337632 bytes

Variable Size 364906208 bytes

Database Buffers 150994944 bytes

Redo Buffers 5869568 bytes

Database mounted.

ORA-01122: database file 3 failed verification check

ORA-01110: data file 3: '/u01/app/oracle/oradata/CDRDB/undotbs01.dbf'

ORA-01210: data file header is media corrupt

仍然报错

之后尝试drop掉这个undo

SYS@CDRDB>alter database datafile 3 offline drop;

Database altered.

之后重新开库

SYS@CDRDB>alter database open;

Database altered.

之后创建新的undo表空间undotbs2

SYS@CDRDB>create undo tablespace undotbs2 datafile '/u01/app/oracle/oradata/CDRDB/undotbs02_01.dbf' size 100M;

create undo tablespace undotbs2 datafile '/u01/app/oracle/oradata/CDRDB/undotbs02_01.dbf' size 100M

*

ERROR at line 1:

ORA-00604: error occurred at recursive SQL level 1

ORA-01552: cannot use system rollback segment for non-system tablespace

'DATA_OL'

ORA-06512: at line 999

ORA-01552: cannot use system rollback segment for non-system tablespace

'DATA_OL'

产生报错如上

比较疑惑创建undo为什么会影响到DATA_OL表空间,所以做了个10046

SYS@CDRDB>oradebug event 10046 trace name context off

Statement processed.

SYS@CDRDB>oradebug tracefile_name

/u01/app/oracle/diag/rdbms/cdrdb/CDRDB/trace/CDRDB_ora_4279.trc

[oracle@centos-1 ~]$ tkprof /u01/app/oracle/diag/rdbms/cdrdb/CDRDB/trace/CDRDB_ora_4279.trc

output = 1.trm

查看1.trm,发现了原因:

"OGG".DDLReplication.dbQueried IS NULL THEN

SELECT database_role,

open_mode

INTO dbRole, dbOpenMode

FROM v$database;

"OGG".DDLReplication.dbQueried := TRUE;

END IF;

IF NOT (

^@ (dbRole = 'PRIMARY' OR dbRole = 'LOGICAL STANDBY')

AND dbOpenMode =

'READ WRITE'

)

THEN

-- do not write any trace even though it

should work as this is standby

"OGG"

.DDLReplication.setCtxInfo(-1,-1,-1,-1,-1);

RETURN; -- do not use

trigger if not read/write and primary/logical_standby

END IF;

EXCEPTION

......略

原来是因为这个库配置过OGG的DDL同步,有DDL产生时会产生insert操作,使用DATA_OL表空间。

原因找到~ 跑脚本关闭该库OGG的DDL配置即可

SYS@CDRDB>@ddl_disable.sql

Trigger altered.

之后再重新创建undotbs2表空间

SYS@CDRDB>create undo tablespace undotbs2 datafile '/u01/app/oracle/oradata/CDRDB/undotbs02_01.dbf' size 1000M;

Tablespace created.

这次很顺利的添加成功。

undotbs2添加完毕,接下来关闭数据库,修改pfile参数

*.undo_tablespace='UNDOTBS2'

*.undo_management=AUTO

重启数据库用pfile启动

SYS@CDRDB>startup pfile='/u01/app/oracle/product/11.2.0/dbhome_1/dbs/initCDRDB.ora';

ORACLE instance started.

Total System Global Area 523108352 bytes

Fixed Size 1337632 bytes

Variable Size 364906208 bytes

Database Buffers 150994944 bytes

Redo Buffers 5869568 bytes

Database mounted.

Database opened.

数据库成功启动,然而在很多业务表查询时依然会报错:

ERROR at line 1:

ORA-00376: file 3 cannot be read at this time

ORA-01110: data file 3: '/u01/app/oracle/oradata/CDRDB/undotbs01.dbf'

依然需要原来的undotbs01.dbf回滚数据库崩溃时未提交的事务

之后我先尝试了设置event 10513,来屏蔽smon的回滚

SYS@CDRDB>alter system set events '10513 trace name context forever, level 2';

System altered.

发现还是会报之前的错误,看来需要自己手工去屏蔽回滚段了。

SYS@CDRDB>SELECT SEGMENT_NAME,STATUS FROM DBA_ROLLBACK_SEGS;

SEGMENT_NAME STATUS

------------------------------ ----------------

SYSTEM ONLINE

_SYSSMU10_4131489474$ NEEDS RECOVERY

_SYSSMU9_1735643689$ NEEDS RECOVERY

_SYSSMU8_3901294357$ NEEDS RECOVERY

_SYSSMU7_3517345427$ NEEDS RECOVERY

_SYSSMU6_2897970769$ NEEDS RECOVERY

_SYSSMU5_538557934$ NEEDS RECOVERY

_SYSSMU4_1003442803$ NEEDS RECOVERY

_SYSSMU3_1204390606$ NEEDS RECOVERY

_SYSSMU2_967517682$ NEEDS RECOVERY

_SYSSMU1_592353410$ NEEDS RECOVERY

SEGMENT_NAME STATUS

------------------------------ ----------------

_SYSSMU30_244658789$ ONLINE

_SYSSMU29_1020880693$ ONLINE

_SYSSMU28_2912622077$ ONLINE

_SYSSMU27_747253598$ ONLINE

_SYSSMU26_560868814$ ONLINE

_SYSSMU25_1357066082$ ONLINE

_SYSSMU24_103440716$ ONLINE

_SYSSMU23_1006903361$ ONLINE

_SYSSMU22_2808190508$ ONLINE

_SYSSMU21_39626587$ ONLINE

21 rows selected.

之后把查询到的NEEDS RECOVERY的表加到以下两个参数中,屏蔽这些回滚段

_offline_rollback_segments/_corrupted_rollback_segments参数

*._offline_rollback_segments=(_SYSSMU10_4131489474$,_SYSSMU9_1735643689$,_SYSSMU8_3901294357$,_SYSSMU7_3517345427$,_SYSSMU6_2897970769$,_SYSSMU5_538557934$,_SYSSMU4_1003442803$,_SYSSMU3_1204390606$,_SYSSMU2_967517682$,_SYSSMU1_592353410$)

*._corrupted_rollback_segments=(_SYSSMU10_4131489474$,_SYSSMU9_1735643689$,_SYSSMU8_3901294357$,_SYSSMU7_3517345427$,_SYSSMU6_2897970769$,_SYSSMU5_538557934$,_SYSSMU4_1003442803$,_SYSSMU3_1204390606$,_SYSSMU2_967517682$,_SYSSMU1_592353410$)

使用这个pfile启动数据库

startup force pfile='/u01/app/oracle/product/11.2.0/dbhome_1/dbs/initCDRDB.ora';

之后对那些测试表都查询了一下,确认可以查询,之后导出帮他们导出数据,一切OK~