I have a Dataguard 11g using physical standby. After a long active time and some end of year batch jobs the size of UNDO tablespace so large. The result from checking show that the usage space is very little. So I decide resize the datafile of UNDO tablespace. The commands are success. And the file has a smaller size.
But on the Standby Database, the archivelogs are still recieved from Primary Database and cannot be applied. The alert log show the error:
Recovered data files to a consistent state at change 1443961551
Errors in file /oracle/app/diag/rdbms/proddr/PRODDR/trace/PRODDR_pr00_2930.trc:
ORA-00448: normal completion of background process
Errors in file /oracle/app/diag/rdbms/proddr/PRODDR/trace/PRODDR_mrp0_2918.trc:
ORA-00600: internal error code, arguments: [3020], [6], [11627], [25177451], [], [], [], [], [], [], [], []
ORA-10567: Redo is inconsistent with data block (file# 6, block# 11627, file offset is 95248384 bytes)
ORA-10564: tablespace UNDOTBS1
ORA-01110: data file 6: '/oradata/PRODDR/undotbs02.dbf'
ORA-10560: block type 'KTU UNDO BLOCK'
MRP0: Background Media Recovery process shutdown (PRODDR)
Errors in file /oracle/app/diag/rdbms/proddr/PRODDR/trace/PRODDR_pr00_2930.trc:
ORA-00448: normal completion of background process
Errors in file /oracle/app/diag/rdbms/proddr/PRODDR/trace/PRODDR_mrp0_2918.trc:
ORA-00600: internal error code, arguments: [3020], [6], [11627], [25177451], [], [], [], [], [], [], [], []
ORA-10567: Redo is inconsistent with data block (file# 6, block# 11627, file offset is 95248384 bytes)
ORA-10564: tablespace UNDOTBS1
ORA-01110: data file 6: '/oradata/PRODDR/undotbs02.dbf'
ORA-10560: block type 'KTU UNDO BLOCK'
MRP0: Background Media Recovery process shutdown (PRODDR)
I have faced the lost of sync on Standby Database before. I have resolved by guide on http://arup.blogspot.com/2009/12/resolving-gaps-in-data-guard-apply.html. But yesterday's error is different. The following are steps that I performed to fix the error.
Check the SCN on Primary:
SQL> select current_scn, scn_to_timestamp(current_scn) from v$database;
CURRENT_SCN SCN_TO_TIMESTAMP(CURRENT_SCN)
----------- ---------------------------------------------------------------------------
1447116920 02-FEB-16 10.43.58.000000000 AM
CURRENT_SCN SCN_TO_TIMESTAMP(CURRENT_SCN)
----------- ---------------------------------------------------------------------------
1447116920 02-FEB-16 10.43.58.000000000 AM
Check the SCN on Standby:
SQL> select current_scn from v$database;
CURRENT_SCN
-----------
1446590956
CURRENT_SCN
-----------
1446590956
SQL> select scn_to_timestamp(1446590956) from dual;
SCN_TO_TIMESTAMP(1446590956)
---------------------------------------------------------------------------
02-FEB-16 09.26.47.000000000 AM
Check archivelog on Standby: logs are transfered but cannot be applied
SQL> select dest_id, sequence#, applied, deleted, to_char(first_time,'dd/mm/yyyy hh24:mi:ss') first_time
2 from (
3 select dest_id, first_time, sequence#, applied, deleted from v$archived_log order by first_time desc, dest_id
4 )
5 where rownum<=5
6 ;
DEST_ID SEQUENCE# S APPLIED DEL FIRST_TIME
---------- ---------- - --------- --- -------------------
1 19220 A NO NO 02/02/2016 10:30:18
1 19219 A NO NO 02/02/2016 10:13:53
1 19218 A NO NO 02/02/2016 09:56:23
1 19217 A NO NO 02/02/2016 09:39:35
1 19216 A NO NO 02/02/2016 09:30:16
2 from (
3 select dest_id, first_time, sequence#, applied, deleted from v$archived_log order by first_time desc, dest_id
4 )
5 where rownum<=5
6 ;
DEST_ID SEQUENCE# S APPLIED DEL FIRST_TIME
---------- ---------- - --------- --- -------------------
1 19220 A NO NO 02/02/2016 10:30:18
1 19219 A NO NO 02/02/2016 10:13:53
1 19218 A NO NO 02/02/2016 09:56:23
1 19217 A NO NO 02/02/2016 09:39:35
1 19216 A NO NO 02/02/2016 09:30:16
Check process on Standby: MRP-apply process not running
SQL> select process, status from v$managed_standby;
PROCESS STATUS
--------- ------------
ARCH CLOSING
ARCH CLOSING
ARCH CONNECTED
ARCH CLOSING
RFS IDLE
RFS IDLE
PROCESS STATUS
--------- ------------
ARCH CLOSING
ARCH CLOSING
ARCH CONNECTED
ARCH CLOSING
RFS IDLE
RFS IDLE
On Primary: backup datafile 6 as shown in the alertlog & copy to Standby host:
rman> backup format '/BACKUP/undofile2%U' datafile 6
# scp /BACKUP/undofile2d8qsrpml_1_1 oracle@standby-host:/BACKUP/
# scp /BACKUP/undofile2d8qsrpml_1_1 oracle@standby-host:/BACKUP/
On Standby: catalog the new backuppiece; shutdown, mount DB and restore datafile 6:
rman> catalog start with '/BACKUP/undofile2d8qsrpml_1_1'
alter database recover managed standby database cancel
shutdown immediate;
startup mount;
RMAN> restore datafile 6; Starting restore at 01-FEB-16
using target database control file instead of recovery catalog
allocated channel: ORA_DISK_1
channel ORA_DISK_1: SID=577 device type=DISK
allocated channel: ORA_DISK_2
channel ORA_DISK_2: SID=768 device type=DISK
allocated channel: ORA_DISK_3
channel ORA_DISK_3: SID=959 device type=DISK
channel ORA_DISK_1: starting datafile backup set restore
channel ORA_DISK_1: specifying datafile(s) to restore from backup set
channel ORA_DISK_1: restoring datafile 00006 to /oradata/PRODDR/undotbs02.dbf
channel ORA_DISK_1: reading from backup piece /BACKUP/undofile2d8qsrpml_1_1
channel ORA_DISK_1: piece handle=/BACKUP/undofile2d8qsrpml_1_1 tag=TAG20160201T181341
channel ORA_DISK_1: restored backup piece 1
channel ORA_DISK_1: restore complete, elapsed time: 00:00:36
Finished restore at 01-FEB-16
alter database recover managed standby database cancel
shutdown immediate;
startup mount;
RMAN> restore datafile 6; Starting restore at 01-FEB-16
using target database control file instead of recovery catalog
allocated channel: ORA_DISK_1
channel ORA_DISK_1: SID=577 device type=DISK
allocated channel: ORA_DISK_2
channel ORA_DISK_2: SID=768 device type=DISK
allocated channel: ORA_DISK_3
channel ORA_DISK_3: SID=959 device type=DISK
channel ORA_DISK_1: starting datafile backup set restore
channel ORA_DISK_1: specifying datafile(s) to restore from backup set
channel ORA_DISK_1: restoring datafile 00006 to /oradata/PRODDR/undotbs02.dbf
channel ORA_DISK_1: reading from backup piece /BACKUP/undofile2d8qsrpml_1_1
channel ORA_DISK_1: piece handle=/BACKUP/undofile2d8qsrpml_1_1 tag=TAG20160201T181341
channel ORA_DISK_1: restored backup piece 1
channel ORA_DISK_1: restore complete, elapsed time: 00:00:36
Finished restore at 01-FEB-16
Starting apply process:
SQL> alter database recover managed standby database using current logfile disconnect from session
Now you can re-check the archivelog & MRP process on Standby as mentioned above.
Then taking a cup of coffe as the reward after a work hard.
Untimately, I want to say thank you to Bhavani and Arup Nanda. Yours' post help me to solve problems.
================================================================================
ReplyDeleteopmn id=hb-oraapp.hoabinh.com:6200
2 of 3 processes started.
ias-instance id=infra.hb-oraapp.hoabinh.com
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
ias-component/process-type/process-set:
OC4J/OC4J_SECURITY/default_island
Error
--> Process (pid=20457)
time out while waiting for a managed process to start
Log:
/u01/app/oracle/infra/opmn/logs/OC4J~OC4J_SECURITY~default_island~1
18/04/01 01:13:57 Initilize NLS utility...
ReplyDelete18/04/01 01:13:57 Reading SSO server policy....
18/04/01 01:13:57 SSO: Entered FilePolicyManager constructor ...
18/04/01 01:13:57 SSO: FilePolicyManager: done loading the /u01/app/oracle/infra/sso/conf/policy.properties file
18/04/01 01:13:57 SSO: Leaving FilePolicyManager constructor ...
18/04/01 01:13:57 SSODebug: Done loading the debug file, /u01/app/oracle/infra/sso/log/ssoServer.log
18/04/01 01:13:59 SSO: Initializing the encryption objects failed
18/04/01 01:13:59 oracle.security.sso.server.conf.ConfigurationException: Missing SSO server configuration
18/04/01 01:13:59 at oracle.security.sso.server.conf.DatabaseConfigReader.init(DatabaseConfigReader.java:286)
18/04/01 01:13:59 at oracle.security.sso.server.auth.AuthUtil.init(AuthUtil.java:172)
18/04/01 01:13:59 at oracle.security.sso.server.ui.SSOLoginServlet.init(SSOLoginServlet.java:237)
18/04/01 01:13:59 at javax.servlet.GenericServlet.init(GenericServlet.java:258)
18/04/01 01:13:59 at com.evermind.server.http.HttpApplication.loadServlet(HttpApplication.java:2354)
18/04/01 01:13:59 at com.evermind.server.http.HttpApplication.findServlet(HttpApplication.java:4795)
18/04/01 01:13:59 at com.evermind.server.http.HttpApplication.initPreloadServlets(HttpApplication.java:4889)
18/04/01 01:13:59 at com.evermind.server.http.HttpApplication.initDynamic(HttpApplication.java:1015)
18/04/01 01:13:59 at com.evermind.server.http.HttpApplication.(HttpApplication.java:549)
18/04/01 01:13:59 at com.evermind.server.Application.getHttpApplication(Application.java:890)
18/04/01 01:13:59 at com.evermind.server.http.HttpServer.getHttpApplication(HttpServer.java:707)
18/04/01 01:13:59 at com.evermind.server.http.HttpSite.initApplications(HttpSite.java:625)
18/04/01 01:13:59 at com.evermind.server.http.HttpSite.setConfig(HttpSite.java:278)
18/04/01 01:13:59 at com.evermind.server.http.HttpServer.setSites(HttpServer.java:278)
18/04/01 01:13:59 at com.evermind.server.http.HttpServer.setConfig(HttpServer.java:179)
18/04/01 01:13:59 at com.evermind.server.ApplicationServer.initializeHttp(ApplicationServer.java:2394)
18/04/01 01:13:59 at com.evermind.server.ApplicationServer.setConfig(ApplicationServer.java:1551)
18/04/01 01:13:59 at com.evermind.server.ApplicationServerLauncher.run(ApplicationServerLauncher.java:92)
18/04/01 01:13:59 at java.lang.Thread.run(Thread.java:534)
18/04/01 01:13:59 SSO: FAILED TO START SSO SERVER!
Many thanks
ReplyDeleteHi-I am trying to understand how the standby catches up with the primary after having restored the datafile in question. I mean what mechanism does it use to sync up after the datafile restore to get in sync with the Primary. Appreciate your feedback!
ReplyDelete