NervousRex
Programmer
Which design would I be better to go with?
Table A
ID ID2 Col1 ... Col50
Col1 thru Col50, not every column will have data, and there will be at least 20k rows to start, growing by about 40 a day. The most used query would return 1 row which would be found by ID = ID
or
Table B
ID ID2 ID3 Col1
In this idea, a row only exists if Col1 has data, but now the table size would be estimated at least 10x larger (200k). Estimating that half of Col1 thru Col50 in the above (Table A) have data. And would grow at a rate of 400 a day based on the estimate. And now each query would return multiple rows 1 to 50.
To me Table A sounds much better, but I want to be sure to use the right one. Basically 1 - 50 are questions on a survey, they are not required to answer and are open text, so we actually need to see what they say, not just a count of who choose choice A.
Table A
ID ID2 Col1 ... Col50
Col1 thru Col50, not every column will have data, and there will be at least 20k rows to start, growing by about 40 a day. The most used query would return 1 row which would be found by ID = ID
or
Table B
ID ID2 ID3 Col1
In this idea, a row only exists if Col1 has data, but now the table size would be estimated at least 10x larger (200k). Estimating that half of Col1 thru Col50 in the above (Table A) have data. And would grow at a rate of 400 a day based on the estimate. And now each query would return multiple rows 1 to 50.
To me Table A sounds much better, but I want to be sure to use the right one. Basically 1 - 50 are questions on a survey, they are not required to answer and are open text, so we actually need to see what they say, not just a count of who choose choice A.