To track the data changes in Oracle database, I’m planning to use Timestamp based approach. During implementation Rajesh has taken my attention to an interesting case,
Where I have 2 Nodes in Oracle RAC, these 2 node can return different timestamp even if you fire the concurrent requests. This difference can be up to 2mins as Timestamp is dependent on machine clock cycle. Oracle RAC keeps timestamp in Sync with the help of NTP (Network Time Protocol), which triggers after every 15mins (http://www.oracledatabase12g.com/wp-content/uploads/html/RAC-Frequently%20Asked%20Questions.htm#A10074)
Solution to this problem could be :
1. Use of System Change Number (SCN) : A sequence number allocated by oracle to keep track of the changes. Oracle keep all the nodes and data changes in sync by allocating unique SCN number and uses this for backup and restore purpose. SCN gets allocated at Block level on Commit operation by default. To change this behaviour and to keep track of row change will have to enable the ROWDEPENDENCIES at table level. This was right and most reliable approach in terms of keeping track of all concurrent transactions. However, I couldn’t use this as SCN number changes if DBA dose Export/Import for backup and restore or Issues command RESTLOG. So I thought of finding some new approach to this,
2. Approach that I took,
- I have been saving the last successfully processed timestamp in a table say Timestamp1.
- Now when the process starts next time, find
o DiffInMinutes = Minutes (Current system Timestamp – TimeStamp1 ) – 5 (Or any number > 2 which will be sufficient to avoid the time drift issue).
o TimeStamp2 = TimeStamp1 + DiffInMinutes.
o Finally Select changed data from tables between TimeStamp1 and TimeStamp2.
Now, here
- I’m not restricting my upper boundary limit with currant systimestamp (which is Node specific), but just adding the reference value to Last processed time stamp saved in table which is common to all the node. So I don’t have to worry about which node is updating data and which node reading it …..
- Selection of the data older than 5 min’s (Or any number > 2) should avoid the interference of other online transactions which are uncommitted. ( Expecting all DML transaction should finish in 5 min’s) Calculative risk as if DML transaction runs for more than 5mins, users will throw away the system J
Comments
Post a Comment