The effectiveness of all well-known search engines depends crucially on the quality of the underlying term weighting mechanism. In this talk, I will first briefly discuss the grand hypotheses that form the foundation for effective term weighting, followed by the limitations of state-of-the-art methods. I will then describe the development of a novel TF-IDF term weighting scheme. Finally, I will show the experimental results and compare them with those of state-of-the-art term weighting schemes. The talk will conclude with some potential future directions.
Jiaul Paik is a new CLIP postdoc. He earned his PhD in Computer Science from the Indian Statistical Institute, Kolkata, India. He has published a number of papers in ACM TOIS, ACM TALIP and ACM SIGIR. His research mainly focuses on challenges in information retrieval.