
update Index Stats using Sampling

Former Member
0 Kudos

Hi Team, good afternoon.

We currently have a job that runs UIS (update index statistics) with sampling = 30, and it is failing on a table that has 293,412,179 rows and one CI. The error is that tempdb is full for the DB maintenance account, which is mapped to the DBA tempdb:

Can't allocate space for object 'temp worktable' in database 'DBA_tempdb' because 'system' segment is full/has no free extents. If you ran out of space in syslogs, dump the transaction log. Otherwise, use ALTER DATABASE to increase the size of the segment.

When we run the same job with sampling = 10 it works fine.

Questions: What are the risks associated with changing the sampling? Will that impact the query plans for the app using that table?

-Sid
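For reference, this is roughly what the maintenance job runs (the table name below is only a placeholder), along with the break-fix the error message itself suggests:

-- illustrative only; real table and device names differ
update index statistics big_table with sampling = 30 percent
go
-- the error's suggested break-fix: grow the maintenance tempdb
-- alter database DBA_tempdb on <device_name> = <size_in_MB>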

Accepted Solutions (0)

Answers (3)

former_member182259
Contributor
0 Kudos

If you are on 15.7 SP130+, use update statistics with hashing instead of update statistics with sampling.   My opinion is that it is more accurate than sampling, and it would definitely eliminate the issue with tempdb, as hashing uses next to nothing in tempdb space - as a result it is often much faster as well, due to not having to wait for IOs.
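As a rough illustration (the table name is a placeholder; verify the exact options against your release's documentation):

-- hash-based statistics gathering instead of sampling
update index statistics big_table with hashing
go
-- there is also a partial option that hashes only low-domain columns
-- update index statistics big_table with partial_hashing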

simon_ogden
Participant
0 Kudos

Definitely agree that you should move to hash-based stats, but I would very strongly recommend not doing this in production without some significant testing in UAT first. It is much more stable as a feature in later releases and should be safe to use, but given its history of occasional bad histograms and stack traces it should not be undertaken lightly, and only implemented after some significant testing.

Former Member
0 Kudos

Thank you Jeff, appreciated. We are still on SP122, but we plan to go to SP135 in the next 3 months.

What I wanted to understand is the query plan behavior with different sampling percentages. Adding space is a break-fix, but with the same space and a different sampling percentage the job does work - how would the queries behave differently? The concern from the developers is "Will that impact the query plans for the app using that table?" I have requested them to see if they can provide any details on the queries hitting that table, as it currently has only one view dependent on the table.

-Sid

former_member182259
Contributor
0 Kudos

You definitely should think of moving off SP122 quicker.....not a lot of changes in SP135 from where you are currently.   WRT sampling and stats, think of it this way.   <start simplified explanation>

Normally, without sampling, we scan the table (or the index, if updating stats only on a single index) and record the distinct values along with their frequency of occurrence. When finished, we construct the (default) 20 step values and compute the cell histograms. For example, let's say we have a table with 100 rows with an identity column with values 1-100. When finished, the stats would have 20 histogram cells and we would have a weight of 5/100 for each step (0.05).

With stats sampling at 10%, we start scanning the table and read every 10th row. For each such row, we record the values for the index keys in a work table...so we would read rows 10, 20, 30, 40, 50, 60, 70, 80, 90 & 100. In our case we have gotten lucky, as the max value (100) was detected during the sampling....but we might not be able to create 20 steps as we only have 10 values, so we only create 10 histogram cells with 10/100 (0.10) as the weighting for each step.

Now, let's increase to 30% sampling (or let's say 33% for fun, to make things easier). Now the values we read are 3, 6, 9, 12, 15, 18, ...., 90, 93, 96, 99. We will have ~33 distinct values from which we can construct 20 steps, and the weight of each histogram cell would be ~7/100. However, we do have more steps; consequently, queries with IN() or OR might do better, as the finer granularity is less likely to aggregate above the ~40% selectivity at which a table scan is chosen - for example, where col in (15,25,35,45,55) might result in a table scan with 10% sampling due to costing each of the 5 SARGs (with aggregation due to the OR logic), whereas with 30% sampling it might not. However, a query with col=100 will hit the out-of-range histogram (if enabled) with 30% sampling, whereas it hits a cell with 10% sampling. In that case we are lucky, as we got the last row with 10% sampling vs. say if the values were more interspersed.

So, I would say that generally 30% sampling gives *better* statistics, but there may be edge cases where it is worse. I think the lower the sampling percentage, the bigger the issue would be with low(er)-cardinality columns - particularly where the number of distinct values in the column times the sampling fraction is less than the number of histogram steps requested. For really high-cardinality columns such as names, it likely makes minimal difference. In either case, if you have issues, it may be as much the number of histogram cells (the 'using N values' clause of update index statistics) as the sampling %...... One can't say what the exact impact would be without seeing the actual stats that result, as well as the query predicates.
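If the concern is histogram granularity rather than the sampling rate itself, a hedged sketch of asking for more steps while keeping sampling (table name and step count are illustrative):

-- request 40 histogram steps instead of the default 20, still sampling at 30%
update index statistics big_table using 40 values with sampling = 30 percent
go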

One of the differences is that in scanning the table, we end up creating a worktable that has to be sorted for each column, and then the frequency cells/histogram steps are derived from it.   This is where tempdb (and proc cache) get hit the hardest - and where a lot of the slowness comes from, as the worktables often end up flushed to disk and need physical IO to re-read when sorting/aggregating the stats.   With hash stats, we use an in-memory hash table for the values, hence no tempdb and no sorting, and hence a lot less PIO involved on the tempdb side. It is likely to give you better stats, as it reads the entire table, but I have found it runs 5x+ faster, which should give you the speed you were after with sampling.
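A minimal way to see the difference on a test copy (table name is a placeholder) - compare the write counts and elapsed times between the two runs:

set statistics io on
set statistics time on
go
update index statistics big_table with sampling = 30 percent   -- worktable + sort path
go
update index statistics big_table with hashing                  -- in-memory hash path
go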

Former Member
0 Kudos

Hey Jeff, update (index) stats with hashing has been available since 2012 (v15.7 ESD#2); you stressed SP130+ - is this due to bugs or something else?

We did notice that high domain hashing might produce less accurate histograms, and hash-based statistics may use a significant amount of tempdb buffer cache.

We also saw high CPU/IO usage related to update stats; not sure if SAP has a fix for CR# 757246.

former_member182259
Contributor
0 Kudos

Update stats with hashing was added for LOW domain attributes only starting in ESD #2, but really wasn't very usable until about ESD #4 (we tried using it on an early project with mixed results until ESD #4).    Perhaps this is why you noted high domain hashing using a lot of tempdb buffer cache???   I started running a lot of tests in SP100+ & SP110 as HIGH domain hashing was finally added.   As far as why SP130+: I would avoid SP12x due to some known issues, and anything earlier is simply too old.   There were some reported issues with hash stats in the SP110 & SP120 range - I was never sure of the exact scenarios as I never hit any; however, I do know that after SP130+, reports of issues are minimal.   One thing I did that may make a difference is that I always ran them without any attempt to use worker processes/consumers, and ran in the threaded kernel (vs. process).   I will also state that I ran the tests with set statistics resource on and a few other traces and compared to normal update stats without hashing - and update stats with hashing almost always used multiple orders of magnitude fewer resources in tempdb and proc cache.   Will update stats with hashing use buffers in the tempdb cache - absolutely - but far, far less than normal update stats.
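Something like the following shows the server-wide default and the per-statement resource usage mentioned above (parameter/option names as I understand them for 15.7 SP100+; please verify on your release; the table name is a placeholder):

sp_configure 'update statistics hashing'        -- server-wide default: on / off / partial
go
set statistics resource on                      -- report proc cache and tempdb usage per statement
go
update index statistics big_table with hashing
go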

As far as accuracy, 'high domain hashing might produce less accurate histograms' was a caution in some of the early descriptions (similar to stats sampling) - but I have never seen anything horribly inaccurate.   In fact, I have found larger discrepancies and problems with folks running with the default histogram steps - and certainly sampling is much more prone to inaccurate stats.   One has to recognize, of course, that histogram stats are just that - it isn't an index with pointers to every value, but rather a weighting of the distribution of values within a range.....which is quickly outdated with the first DML and is therefore only an estimate/approximation of the values.

WRT CR 757246 - that is a problem with sp_sysmon and those that continue to rely on it for monitoring.   To *REALLY* monitor the threaded kernel, you need to use monThread.   Values in monEngine as well as sp_sysmon are merely derived values that are sometimes of dubious accuracy due to the methods used to approximate the real values from monThread.   The specific CR cited notes that sp_sysmon reports *INACCURATE* IO busy values, which is unrelated to update stats but rather due to outstanding IOs - of course update stats can be an IO stressor, so it is an easy repro.   It was likely fixed in some release - if you are still on ESD #2, you really, really need to consider upgrading to SP130+.....it is ~4 years old at this point....and tons of issues have been identified/fixed.
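A minimal sketch of what "use monThread" means in practice (column names vary across 15.7 SPs, so this just dumps the whole table for inspection):

-- one row per kernel thread in threaded mode, with its CPU/tick counters
select * from master..monThread
go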

Former Member
0 Kudos

Thanks Jeff for your explanation; we are running SP132 and SP135. That bug is related to @@io_busy, which over-weighted IO busy; this is annoying since many DBAs still use sp_monitor to get the CPU/IO usage.

We actually use ProactiveDBA and the MDA tables for monitoring; I don't think anybody would rely on sp_sysmon after ASE v12.5.

"update (index) stats with hashing" did consume much less tempdb space, however we don't see much improvement on time consumption against a 700M table.

Run Command                                   CPU Time (ms)    Elapsed Time (ms)
Update statistics xxxx                        1691914          3996041
Update index statistics xxxx                  2086711          4433156
Update index statistics xxx with hashing      2018926          3872431
Update statistics xxx with hashing            1461211          3278796

former_member182259
Contributor
0 Kudos

Sorry - this wacky editor on SCN is giving me fits on cut & pasting....will try again...

What you might want to do is look at monProcessWaits before/after the update stats - if you see most of the time is on WaitEvent=29 or 144, then the time is all spent on IO and not a lot can be done about it other than speeding up the IO subsystem (compare WaitTime to Waits to get ms/IO)...
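A hedged sketch of that check (the SPID filter is a placeholder for the session running the update stats):

select w.WaitEventID,
       i.Description,
       w.Waits,
       w.WaitTime,
       w.WaitTime * 1.0 / w.Waits as AvgWaitMs    -- ms per wait, roughly ms per IO for events 29/144
from master..monProcessWaits w,
     master..monWaitEventInfo i
where w.WaitEventID = i.WaitEventID
  and w.SPID = 123            -- replace with the update stats session's spid
  and w.Waits > 0
order by w.WaitTime desc
go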

It is also possible that the particular table only has low-domain values, which doesn't necessarily require a lot of tempdb space.   The real gains I noted were when the tempdb space was reduced - e.g. if you run with set statistics io on, check out the drop in writes.....if it isn't that much, then that is why there is no real gain - and likely low-cardinality attributes....(or else someone enabled stats hashing by default - make sure you test with no_hashing and not just the defaults, as hashing can be inherited as well as set via sp_configure)....
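To rule out the inherited/default-hashing possibility described above, a sketch along these lines (table name is a placeholder):

sp_configure 'update statistics hashing'               -- check whether hashing is already the default
go
update index statistics big_table with no_hashing      -- force the old worktable/sort method
go
update index statistics big_table with hashing         -- force hash-based gathering
go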

former_member89972
Active Contributor
0 Kudos

Siddharth

Maybe you can take a different approach.

At some of the places I worked, I have seen tempdb configured with log and data mixed, just like the master database, so the full tempdb is available for growing data and log.  This is done because, unlike a normal user database, we do not need to worry about tempdb recovery.

This architecture does have the danger of a runaway transaction filling up the whole tempdb - although the risk of a runaway process exists in all set-ups.

But it also has the benefit of space for the DBA maintenance work, where the whole tempdb is available for you to use for worktables and/or the transaction log as needed.
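A minimal sketch of such a DBA temporary database (device name, size, and login are made up; syntax per my understanding of ASE 15.x, so please verify):

-- no separate 'log on' clause, so data and log share the full space
create temporary database DBA_tempdb on tempdb_dev = 8192
go
-- bind the maintenance login to it so its worktables land there
sp_tempdb 'bind', 'lg', 'dbmaint_login', 'DB', 'DBA_tempdb'
go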

Think about it, test it, and if you find it suitable, implement it.

HTH

Avinash

simon_ogden
Participant
0 Kudos

For me, I don't think it will make a huge amount of difference to plan choice, unless you don't have non-sampled densities in place.

When you update statistics with sampling, the pages and values it reads are treated as the 'full' table from the density point of view. If you have 100 unique values and you sample at 10%, it will calculate that you have 10 unique values and give you a total density of 0.1 instead of 0.01. This can make a big difference to costings in some areas (unknowns/subqueries).  You might also end up with slightly less granular histograms.
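The arithmetic behind that, as a rough illustration (table and column names are made up; the actual stored densities can be checked with optdiag):

select count(distinct col1) from big_table
go
-- say it returns 100 distinct values:
--   non-sampled total density    ~ 1/100 = 0.01
--   10% sampling sees ~10 values -> total density ~ 1/10 = 0.10
--   i.e. the optimizer's rows-per-value estimate is ~10x higher under 10% sampling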

For this reason, any densities that have been calculated without sampling will not be replaced with sampled densities by default. If you've never gathered a non-sampled density, then you may end up degrading your density further (I wouldn't know without checking whether a density calculated with 30% sampling is replaced when you sample at 10%).

If you don't have a non-sampled density and don't have the resources to generate one, you should consider adjusting the density by a factor equal to 1/sampling percent - see sp_modifystats.
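A hedged sketch of that adjustment (database/table/column names and the factor are illustrative; check the current total density, e.g. with optdiag, before deciding the direction and value):

-- multiply the stored total density for col1 by 0.1 to compensate for 10% sampling
sp_modifystats 'mydb..big_table', 'col1', 'MODIFY_DENSITY', 'total', 'factor', '0.1'
go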

Former Member
0 Kudos

Hi Simon, we recently started using UIS with sampling; before that we were simply doing US, not even UIS, on this DB. I have requested the app team to see if they can provide any details on the queries hitting that table, as it currently has only one view dependent on the table. -Sid