First, let me say, I am not a DBA, we don't have a DBA, and we are learning this as we go, out of necessity.
We have 5 (identical in structure) sql databases that store large amounts of data. We are experiencing massive performance issues lately and I was hoping someone could help. These are for cell site statistics. Each cell site has data for itself, cell sites it talks to (neighbors), and for up to each of its 20 carriers (Frequencies). So, we have a heirarchal nightmare.
Database structure:
Table 1 - Date, time, siteID, approx. 30 columns of numerical data.
Indexed on date, time, siteid. There are no unique identifiers. Approximately 4000 individual siteIDs, 48 halfhours per day.
Table 2 - Date, time, siteID, carrier, approx. 10 columns of numericla data
Indexed on date, time, siteID, carrier
Table 3 - Date, time, siteID, neighborID, approx. 20 columns of numerical data
Indexed on date, time, siteID, neighborID
Table 4 - SiteID, SiteName, SiteNumber, other identifying information
Indexed on SiteID, SiteName, SiteNumber
Table 5 - 24 hour rollups of table 1, grouped by date and siteID
Table 6 - 24 hour rollups of table 2, grouped by date, siteID, and carrierID
Table 7 - 24 hour rollups of table 3, grouped by date, siteID, neighborID
What needs to be done regularly:
-Insert text files for halfhourly updates to Tables 1, 2, 3
-Daily removal of halfhour data from Tables 1, 2, 3 for data older than 10 days (approx 200,000 rows on table 1, 3.8 million rows on tables 2 and 3)
-24 hour rollups of tables 1, 2, 3 to create tables 5, 6, 7
In the meantime, users are *trying* to access the data via ColdFusion queries by accessing web pages. Some of these pages query multiple siteIDs, some query single siteIDs. Most require joins of tables 1, 2, 3 and 4, or some combination of them.
All of these things need to happen on each of the 5 databases, which are all stored on the same server because it's all we have.
The problem -
Queries are stalling, causing coldfusion to time out the pages (Eventually). Scheduled jobs are stalling and queuing up, resulting in missing halfhour data. Scheduled jobs for deleting the old data (run in batches of 500-1000) are failing or taking several hours to execute, then causing the halfhour updates to wait and miss data.
I have a lot of angry people asking me daily when the pages will be back and working properly again, and I don't know what to tell them anymore.
Can anyone help?
We have 5 (identical in structure) sql databases that store large amounts of data. We are experiencing massive performance issues lately and I was hoping someone could help. These are for cell site statistics. Each cell site has data for itself, cell sites it talks to (neighbors), and for up to each of its 20 carriers (Frequencies). So, we have a heirarchal nightmare.
Database structure:
Table 1 - Date, time, siteID, approx. 30 columns of numerical data.
Indexed on date, time, siteid. There are no unique identifiers. Approximately 4000 individual siteIDs, 48 halfhours per day.
Table 2 - Date, time, siteID, carrier, approx. 10 columns of numericla data
Indexed on date, time, siteID, carrier
Table 3 - Date, time, siteID, neighborID, approx. 20 columns of numerical data
Indexed on date, time, siteID, neighborID
Table 4 - SiteID, SiteName, SiteNumber, other identifying information
Indexed on SiteID, SiteName, SiteNumber
Table 5 - 24 hour rollups of table 1, grouped by date and siteID
Table 6 - 24 hour rollups of table 2, grouped by date, siteID, and carrierID
Table 7 - 24 hour rollups of table 3, grouped by date, siteID, neighborID
What needs to be done regularly:
-Insert text files for halfhourly updates to Tables 1, 2, 3
-Daily removal of halfhour data from Tables 1, 2, 3 for data older than 10 days (approx 200,000 rows on table 1, 3.8 million rows on tables 2 and 3)
-24 hour rollups of tables 1, 2, 3 to create tables 5, 6, 7
In the meantime, users are *trying* to access the data via ColdFusion queries by accessing web pages. Some of these pages query multiple siteIDs, some query single siteIDs. Most require joins of tables 1, 2, 3 and 4, or some combination of them.
All of these things need to happen on each of the 5 databases, which are all stored on the same server because it's all we have.
The problem -
Queries are stalling, causing coldfusion to time out the pages (Eventually). Scheduled jobs are stalling and queuing up, resulting in missing halfhour data. Scheduled jobs for deleting the old data (run in batches of 500-1000) are failing or taking several hours to execute, then causing the halfhour updates to wait and miss data.
I have a lot of angry people asking me daily when the pages will be back and working properly again, and I don't know what to tell them anymore.
Can anyone help?