on 09-23-2013 9:56 PM
Hi.
I know if I perform a DUMP DATABASE in an actively modified database, the pages might be written out in an essentially random order, because ASE will use an algorithm that gives priority to a page that a user is wanting to modify just then. Or something.
But what if the database is idle throughout the whole DUMP operation? Will ASE tend to write the pages out in a fixed order, like in sequential order by page number?
I ask because I'm wondering if it's technically feasible to reduce the file transfer size for a full dump, through use of a diff-type algorithm such as rsync. It seems like if the content of one full dump is wildly different from the content of the next one, due to ASE shuffling the order of the pages each time, then such a diff algorithm could not possibly work. The entire dump file would have to be transferred off each time. But what if one dump file was only slightly different from the last one? Would rsync be able to take advantage of that? And I realize that this could possibly make sense only with UNcompressed dumps.
Thanks.
- John.
John
I know if I perform a DUMP DATABASE in an actively modified database, the pages might be written out in an essentially random order, because ASE will use an algorithm that gives priority to a page that a user is wanting to modify just then. Or something.
Yes. And Backup Server is much more intelligent than that.
Think about the dump file written in, say, eight stripes.
But what if the database is idle throughout the whole DUMP operation? Will ASE tend to write the pages out in a fixed order, like in sequential order by page number?
AFAIK no order can be determined. BackupServer maps onto the Shared Memory Segment; it can read the caches directly. So if any order can be identified (noting that it would not be a programmable or predictable one), it would be a list of caches, and then physical order (not MRU-LRU, which is not a physical ordering); plus all the pages that are not in memory, read directly from disk, and there would be order in those; all spliced with its internal read/write extents algorithm, plus another level of splicing if stripes are used.
I ask because I'm wondering if it's technically feasible to reduce the file transfer size for a full dump, through use of a diff-type algorithm such as rsync. It seems like if the content of one full dump is wildly different from the content of the next one, due to ASE shuffling the order of the pages each time, then such a diff algorithm could not possibly work. The entire dump file would have to be transferred off each time. But what if one dump file was only slightly different from the last one? Would rsync be able to take advantage of that?
Now we get to the intent of the question, rather than the question itself. So what you want is really an efficient way of not storing pages that do not change, in the context of the large volume of full db dump files that are stored, archived, etc. Is that correct? And you are researching methods outside ASE that will reduce that volume.
Bit of background.
The next thing I knew, ASE 15.7 SP100 was delivered, a few months ago. I always read the New Features Guide for every release, and this one is 154 pages. There is no reference to the series of previous minor releases, so this is not a minor release. It is packed with a number of large new features. One of them is the promised Incremental Backup, with a full set of features around it. They have called it Cumulative Backup. I have not tested that release yet, but the doco is complete (still not synchronised, but let's not beat a dead horse).
This release is not a minor, or even minor-minor, release. AFAIC it should have been named 15.8, if not ASE 16. Since there is no list of what ASE 16 is, we can't state that it is, or is not, ASE 16. And they keep changing the internal and external release names. If I were to go by promises made five years ago, and the few posts about the subject in between, which is all I have, this is ASE 16. (Note that I am not a Sybase employee; I do not speak for Sybase, or wish to appear to do so. I am speaking for myself, as an old, battle-scarred Sybase hand.)
Please look into ASE 15.7 SP100, the New Features Guide. What you are seeking, at least on the face of it and unconfirmed, has been delivered, inside ASE. No need for anything outside ASE, or for post-dump processing.
Cheers
Derek
Hi Derek,
Thanks. I should have added that I'm using ASE 15.0.3. The Cumulative Backup feature sounds cool though.
Now we get to the intent of the question, rather than the question itself. So what you want is really an efficient way of not storing pages that do not change, in the context of the large volume of full db dump files that are stored, archived, etc. Is that correct? And you are researching methods outside ASE that will reduce that volume.
Yes, I think that is basically correct. I want to copy a periodic dump file over a slow network, and I want the receiving end to always hold the most recent dump file. The worst case is that I copy the entire dump file each time. Better would be to rely on rsync's ability to copy only the parts of a file that have changed since the last time, thereby minimizing the size of the network transfer.
Thanks.
- John.
I'm not sure if anyone will ever read this, but the new ASE roadmap seems to be here:
https://roadmaps.sap.com/board?range=CURRENT-LAST&PRODUCT=67837800100800005166#Q4%202024
(requires an SAP support login to view)
I'm using rsync with rsyncable gzip on 8 stripes. It works well for me on ASE 15.0.3.
rsync sends around 200M of a 2G rsyncable gzip file.
Further to my previous post, I've decided to give an example of how I do the dump, as that might be of assistance.
Example:
$ mkfifo sysprocs.1of2.fifo
$ mkfifo sysprocs.2of2.fifo
$ /usr/local/bin/gzip -1c --rsyncable <sysprocs.2of2.fifo >sysprocs.2of2.dgz &
$ /usr/local/bin/gzip -1c --rsyncable <sysprocs.1of2.fifo >sysprocs.1of2.dgz &
1> dump database sybsystemprocs to 'compress::0::/backup/sysprocs.1of2.fifo'
2> stripe on 'compress::0::/backup/sysprocs.2of2.fifo'
3> go
...
Backup Server: 4.188.1.1: Database sybsystemprocs: 126318 kilobytes (100%) DUMPED.
Backup Server: 3.42.1.1: DUMP is complete (database sybsystemprocs).
1>
[2] + Done /usr/local/bin/gzip -1c --rsyncable <sysprocs.1of2.fifo >sysprocs.1of2.dgz &
[1] + Done /usr/local/bin/gzip -1c --rsyncable <sysprocs.2of2.fifo >sysprocs.2of2.dgz &
Files at remote:
$ ll sysprocs*dgz
-rw-r----- 1 syb1503 sybase 8983974 Sep 25 14:18 sysprocs.1of2.dgz
-rw-r----- 1 syb1503 sybase 9010398 Sep 25 14:18 sysprocs.2of2.dgz
New local files:
$ ll sysprocs*dgz
-rw-r----- 1 syb1503 sybase 8983209 Sep 25 14:44 sysprocs.1of2.dgz
-rw-r----- 1 syb1503 sybase 9010320 Sep 25 14:44 sysprocs.2of2.dgz
Sending file 1 to remote with rsync:
building file list ...
done
delta-transmission enabled
sysprocs.1of2.dgz
total: matches=2995 hash_hits=3904 false_alarms=0 data=22169
sent 34294 bytes received 18060 bytes 34902.67 bytes/sec
total size is 8983209 speedup is 171.59
Sending file 2 to remote with rsync:
building file list ...
done
delta-transmission enabled
sysprocs.2of2.dgz
total: matches=2996 hash_hits=3928 false_alarms=0 data=22320
sent 34445 bytes received 18066 bytes 35007.33 bytes/sec
total size is 9010320 speedup is 171.59
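As a side note on reading those stats: the "speedup" figure rsync prints is simply the total file size divided by the bytes actually exchanged in both directions. A quick check against the file 2 figures above:

```python
# rsync's reported speedup = total size / (bytes sent + bytes received),
# using the stats printed above for file 2.
sent, received, total = 34445, 18066, 9010320
print(round(total / (sent + received), 2))  # -> 171.59, matching rsync's own figure
```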
pd123456 dreyer wrote:
I'm using rsync with rsyncable gzip on 8 stripes. It works well for me on ASE 15.0.3.
rsync sends around 200M of a 2G rsyncable gzip file.
That's great! It mystifies me how this could work so well though. Doesn't rsync have to depend on an assumption that the source file is sufficiently similar to the destination file? Otherwise it would just have to send the entire file each time. It seems like each of your 8 stripe files would tend to be radically different from one dump to the next, given the "randomness" of how ASE sends pages to the dump output.
Thanks.
- John.
John
That's great! It mystifies me how this could work so well though. Doesn't rsync have to depend on an assumption that the source file is sufficiently similar to the destination file? Otherwise it would just have to send the entire file each time. It seems like each of your 8 stripe files would tend to be radically different from one dump to the next, given the "randomness" of how ASE sends pages to the dump output.
I don't know about mystical, but it certainly brings what I posted into question.
My info is from a reputable Sybase employee in the very old days. Anyone with access to the codeline used to, and still should, correct incorrect posts. Backup Server has changed a lot in two decades. I used to say that the only value in a dump file is to load it, and fast. Some engineer may have come upon the idea themselves. AFAIC, the structure of the dump files should be identical to that of the devices we are loading into. It should rely on the issuer identifying the number of dump threads for the best load into production.
From the evidence (the predictability), it looks like it is doing that, or close to that.
But I have no idea what Backup Server does these days.
Cheers
Derek
rsyncable gzip info: http://beeznest.wordpress.com/2005/02/03/rsyncable-gzip/
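To see why delta transfer can work at all on these files, here is a toy sketch (my own illustration, not rsync's actual code) of the block-matching idea rsync uses. The receiver's old copy is summarised as per-block checksums, and the sender scans the new file for blocks that already exist anywhere in the old file, so only unmatched bytes cross the network. The --rsyncable gzip option complements this by keeping a small change in the input from rippling through the whole compressed stream.

```python
# A toy version of rsync's delta algorithm (illustrative only; real
# rsync also uses a cheap rolling weak checksum so the sliding window
# costs O(1) per byte, which is skipped here for clarity).
import hashlib

BLOCK = 64  # real rsync chooses a block size based on file size

def signatures(old):
    """Map each aligned BLOCK-sized chunk of the old file to its offset."""
    return {hashlib.md5(old[i:i + BLOCK]).digest(): i
            for i in range(0, len(old), BLOCK)}

def make_delta(new, sigs):
    """Encode `new` as ('copy', old_offset) refs plus ('data', bytes) literals."""
    ops, literal, i = [], bytearray(), 0
    while i < len(new):
        chunk = new[i:i + BLOCK]
        off = sigs.get(hashlib.md5(chunk).digest()) if len(chunk) == BLOCK else None
        if off is not None:
            if literal:
                ops.append(("data", bytes(literal)))
                literal = bytearray()
            ops.append(("copy", off))   # receiver already has these bytes
            i += BLOCK
        else:
            literal.append(new[i])      # unmatched byte must be transmitted
            i += 1
    if literal:
        ops.append(("data", bytes(literal)))
    return ops

def apply_delta(old, ops):
    """Rebuild the new file at the receiver from the old file plus the delta."""
    return b"".join(old[arg:arg + BLOCK] if kind == "copy" else arg
                    for kind, arg in ops)

old = bytes(range(256)) * 16                    # 4 KiB "previous dump"
new = old[:1000] + b"CHANGED" + old[1000:]      # a small local edit
ops = make_delta(new, signatures(old))
sent = sum(len(arg) for kind, arg in ops if kind == "data")
assert apply_delta(old, ops) == new
print(f"{sent} of {len(new)} bytes sent as literal data")
# -> 71 of 4103 bytes sent as literal data
```

Because matching is by content rather than position, even blocks that have shifted within the file are found again, which is why the dump files only need to be "mostly similar", not byte-for-byte aligned.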
I'm curious as to why you're using rsync/fifo/gzip instead of NFS or some other net file system (or clustered file system). What benefit do you receive over them?
jason
I create rsyncable gzip files on the local server and then use rsync to send the files to our remote site. The resulting compressed files are almost 17G in size, of which rsync only needs to send about 1.8G, taking a tenth of the time it would take to send the entire compressed file.
The network to the remote site is too slow to consider clustering and replication could not keep up during the end-of-day's bulk processing.
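A rough sanity check of those figures (overheads ignored, and assuming transfer time is proportional to bytes sent):

```python
# 17G full file vs ~1.8G of delta traffic is roughly a 9-10x reduction,
# consistent with the quoted "a 10th of the time" over the same link.
full_gb, delta_gb = 17, 1.8
print(round(full_gb / delta_gb, 1))  # -> 9.4
```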