Hello all,
Now if I understand clustered index correctly, they will store the index in the same order as the data there for speeding up grouping and aggregation.
Though one issue that I have seen is during data loads. (In an EDW environment)
I have several data load being posted via bulk insert into various tables and have.
My testing has shown a 50 % decrease in load time with table containing a clustered index verses those that do not contain one.
Now I understand that the clustered index has to re-org it's self for every record inserted. This included updates and deletes.
Given example:
Say we have a file source that has 5 million record in it and it need to be loaded into an exiting table with 20 million records.
TABLE-A has a clustered index on the primary key comprised of 4 columns.
TABLE-A record count 20 million
Would it be more efficient to drop the clustered index prior to performing the load or load with the index in place?
My tests showed neither one were done with in a acceptable time (hence the reason the cluster index has been removed)
Unfortunately here in lies the issue, I want to reap the rewards of a clustered index but don’t want to deal with the time it takes to work around them.
Does anyone have any experience with this type of issue, and type of solution did you find work for you.
Thanks
Talenx
Now if I understand clustered index correctly, they will store the index in the same order as the data there for speeding up grouping and aggregation.
Though one issue that I have seen is during data loads. (In an EDW environment)
I have several data load being posted via bulk insert into various tables and have.
My testing has shown a 50 % decrease in load time with table containing a clustered index verses those that do not contain one.
Now I understand that the clustered index has to re-org it's self for every record inserted. This included updates and deletes.
Given example:
Say we have a file source that has 5 million record in it and it need to be loaded into an exiting table with 20 million records.
TABLE-A has a clustered index on the primary key comprised of 4 columns.
TABLE-A record count 20 million
Would it be more efficient to drop the clustered index prior to performing the load or load with the index in place?
My tests showed neither one were done with in a acceptable time (hence the reason the cluster index has been removed)
Unfortunately here in lies the issue, I want to reap the rewards of a clustered index but don’t want to deal with the time it takes to work around them.
Does anyone have any experience with this type of issue, and type of solution did you find work for you.
Thanks
Talenx