Allan, J., Carbonell, J., Doddington, G., Yamron, J., & Yang, T. (1998). Topic detection and tracking pilot study: Final report. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop.
Allan, J., Papka, R., & Lavrenko, V. (1998). On-line new event detection and tracking. In: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, pp. 37-45.
Aurora, P. P., Rafael, B. L., & Jose, R. S. (2007). Topic discovery based on text mining techniques. Information Processing & Management, 43, pp. 742-768.
Berry, M. W. (2004). Survey of text mining: Clustering, classification, and retrieval. Springer, pp. 185-224.
Bolelli, L., Ertekin, S., Zhou, D., & Giles, C. L. (2009). Finding topic trends in digital libraries. In: Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, pp. 69-72.
Chen, K. Y., Luesukprasert, L., & Chou, S. C. (2007). Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE Transactions on Knowledge and Data Engineering, 19(8), pp. 1016-1025.
Chou, T. C., & Chen, M. C. (2008). Using incremental PLSI for threshold-resilient online event analysis. IEEE Transactions on Knowledge and Data Engineering, 20(3), pp. 289-299.
Clifton, C., Cooley, R., & Rennie, J. (2004). TopCat: Data mining for topic identification in a text corpus. IEEE Transactions on Knowledge and Data Engineering, 16(8), pp. 949-964.
Cui, C., & Kitagawa, H. (2005). Topic activation analysis for document streams based on document arrival rate and relevance. In: Proceedings of the 2005 ACM symposium on applied computing, pp. 1089-1095.
Felix, M. A., Benjamin, V. Q., Zaida, C. R., Elena, C. A., Victor, H. S., & Francisco, J. M. F. (2005). Domain analysis and information retrieval through the construction of heliocentric maps based on ISI-JCR category cocitation. Information Processing & Management, 41(6), pp. 1521-1533.
Franz, M., & McCarley, J. C. (2001). Unsupervised and supervised clustering for topic tracking. In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 310-317.
Hatzivassiloglou, V., Gravano, L., & Maganti, A. (2000). An investigation of linguistic features and clustering algorithms. In: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pp. 224-231.
Jin, Y., Myaeng, S. H., & Jung, Y. (2007). Use of place information for improved event tracking. Information Processing & Management, 43, pp. 365-378.
Jo, Y., Lagoze, C., & Giles, C. L. (2007). Detecting research topics via the correlation between graphs and texts. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 370-379.
Joachims, T. (1998). Text categorization with Support Vector Machines: learning with many relevant features. In: Proceedings of the EMNLP Conference.
Kollios, G., Gunopulos, D., Koudas, N., & Berchtold, S. (2003). Efficient biased sampling for approximate clustering and outlier detection in large data sets. IEEE Transactions on Knowledge and Data Engineering, 15(5), pp. 1170-1187.
Kleinberg, J. (2002). Bursty and hierarchical structure in streams. In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 91-101.
Kuramochi, M., & Karypis, G. (2004). An efficient algorithm for discovering frequent subgraphs. IEEE Transactions on Knowledge and Data Engineering, 16(9), pp. 1038-1051.
Lee, C., Lee, G. G., & J, M. (2007). Dependency structure language model for topic detection and tracking. Information Processing & Management, 43, pp. 1249-1259.
Lee, Z., Gosain, S., & Im, I. (1997). Topics of interest in IS: evolution of themes and differences between research and practice. Information & Management, 36, pp. 233-246.
Liu, Y., Niculescu-Mizil, A., & Gryc, W. (2009). Topic-link LDA: joint models of topic and author community. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 665-672.
Malone, J., McGarry, K., & Bowerman, C. (2006). Automated trend analysis of proteomics data using an intelligent data mining architecture. Expert Systems with Applications, 30, pp. 24-33.
Manmatha, R., Feng, A., & Allan, J. (2002). A critical examination of TDT’s cost function. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 403-404.
Makkonen, J., Ahonen-Myka, H., & Salmenkivi, M. (2004). Simple semantics in topic detection and tracking. Information Retrieval, 7, pp. 347-368.
Morinaga, S., & Yamanishi, K. (2004). Tracking dynamics of topic trends using a finite mixture model. In: Proceedings of the 10th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 811-816.
Moulinier, I., Raskinis, G., & Ganascia, J. (1996). Text categorization: A symbolic approach. In: Annual Symposium on Document Analysis and Information Retrieval (SDAIR).
Nallapati, R., Ahmed, A., Xing, E. P., & Cohen, W. W. (2008). Joint latent topic models for text and citations. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 542-550.
Ontrup, J., Ritter, H., Scholz, S. W., & Wagner, R. (2008). Detecting, assessing and monitoring relevant topics in virtual information environments. IEEE Transactions on Knowledge and Data Engineering, 20(7).
Ozmutlu, H. C., & Cavdur, F. (2005). Application of automatic topic identification on Excite web search engine data logs. Information Processing & Management, 41, pp. 1243-1262.
Ozmutlu, S. (2006). Automatic new topic identification using multiple linear regression. Information Processing & Management, 42, pp. 934-950.
Porter, M. (1980). An algorithm for suffix stripping. Program (Automated Library and Information Systems), 14(3), pp. 130-137.
Rosen-Zvi, M., Chemudugunta, C., Griffiths, T., Smyth, P., & Steyvers, M. (2010). Learning author-topic models from text corpora. Transactions on Information Systems, 28(1).
Salton, G. (1989). Automatic text processing: The transformation, analysis and retrieval of information by computer, Addison-Wesley, Reading, MA.
Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), pp. 613-620.
Salton, G., & Buckley, C. (1988). Term weighting approaches in automatic text retrieval. Information Processing and Management, 24(5), pp. 513-523.
Salton, G., & McGill, M. J. (1983). Introduction to modern information retrieval. McGraw Hill Publishing Company.
Schultz, J. M., & Liberman, M. (1999). Topic detection and tracking using idf-weighted cosine coefficient. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop.
Schutze, H., Hull, D., & Pedersen, J. (1995). A comparison of classifiers and document representations for the routing problem. In: Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 229-237.
Steyvers, M., Smyth, P., & Griffiths, T. (2004). Probabilistic author topic models for information discovery. In: Proceedings of the 10th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 306-315.
Stokes, N., & Carthy, J. (2001). Combining semantic and syntactic document classifiers to improve first story detection. In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 424-425.
Swan, R., & Allan, J. (2000). Automatic generation of overview timelines. In: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pp. 49-56.
Tu, Y. N., & Seng, J. L. (2009). Research intelligence involving information retrieval: An example of conferences and journals. Expert Systems with Applications, 47(6).
Tu, Y. N., & Seng, J. L. (2010). Indices of Novelty for Emerging Topic Detection. (working paper).
Tan, P. N., Steinbach, M., & Kumar, V. (2006). Introduction to data mining. Addison-Wesley, pp. 69-84.
Thelwall, M. (2005). Scientific web intelligence: Finding relationships in university webs. Communications of the ACM, 48(7), pp. 93-96.
Thelwall, M., & Harries, G. (2004). Do better scholars’ Web publications have significantly higher online impact? Journal of the American Society for Information Science and Technology, 55(2), pp. 149-159.
Thelwall, M., Vaughan, L., Cothey, V., Li, X., & Smith, A. (2003). Which academic subjects have most online impact? A pilot study and a new classification process. Online Information Review, 27(5), pp. 333-343.
Tho, Q. T., Hui, S. C., & Fong, A. C. M. (2007). A citation-based document retrieval system for finding research expertise. Information Processing and Management, 43(1), pp. 248-264.
Walls, F., Jin, H., Sista, S., & Schwartz, R. (1999). Topic detection in broadcast news. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop.
Wang, X., Zhai, C., Hu, X., & Sproat, R. (2007). Mining correlated bursty topic patterns from coordinated text streams. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 784-793.
Wu, K., Chen, M., & Sun, Y. (2004). Automatic topics discovery from hyperlinked documents. Information Processing & Management, 40, pp. 239-255.
Yang, H. C., & Lee, C. H. (2004). A text mining approach on automatic generation of web directories and hierarchies. Expert Systems with Applications, 27, pp. 645-663.
Yang, H. C., & Lee, C. H. (2005). A text mining approach for automatic construction of hypertexts. Expert Systems with Applications, 29, pp. 723-734.
Yang, Y., Ault, T., Pierce, T., & Lattimer, C. W. (2000). Improving text categorization methods for event tracking. In: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pp. 65-72.
Yang, Y., & Pedersen, J. (1997). A comparative study on feature selection in text categorization. In: International Conference on Machine Learning.
Yang, Y., & Wilbur, J. (1996). Using corpus statistics to remove redundant words in text categorization. Journal of the American Society for Information Science, 47(5), pp. 357-369.
Yang, Y., Zhang, J., Carbonell, J., & Jin, C. (2002). Topic-conditioned novelty detection. In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 688-693.
Yang, Y., Yoo, S., Zhang, J., & Kisiel, B. (2005). Robustness of adaptive filtering methods in a cross-benchmark evaluation. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 98-105.
Zhang, Y., Callan, J., & Minka, T. (2002). Novelty and redundancy detection in adaptive filtering. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 81-88.
Zhang, Y., Surendran, A. C., Platt, J. C., & Narasimhan, M. (2008). Learning from multi-topic web documents for contextual advertisement. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1051-1059.