Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations Chriss Miller on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Data Mining Tek-Tips 2

Status
Not open for further replies.

jnicho02

Programmer
Jul 20, 1999
397
GB
Just a general question.

Would it be possible to data mine the back catalogue of Tek-Tips threads?...what would be the problems involved?....and how would you go about it?

The idea would be to produce a Tek-Tips Expert application. My home ----> visit me for Java and Data Warehousing resources
 
Yes, this falls under the heading of "text mining". At the least, one could run all of the past messages through a system like WizDoc (from WizSoft) or dtSearch (from the company of the same name). This would allow one to search very quickly and easily, even using conceptual searches (as opposed to just keyword AND/OR searching). There are other, fancier text mining systems but I don't know that they'd provide much more utility in this context than the above.
 
Tek-Tips have already indexed the forums, but I'm thinking about how they make a sort of 'ask Jeeves'.

Keyword searches tend to be a bit hit or miss, but then again it is quite complicated to parse sentences and extract sense from them.

Are 'text mining' systems essentially fancy indexers? My home ----> visit me for Java and Data Warehousing resources
 
Ask Jeeves is a conceptual search (which is available in both of the tools I mentioned): the system makes some attempt to interpret the user's query, instead of searching for keywords.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top