Web Search Sciences is responsible for constantly improving the search experience of Yahoo customers. Our scientists combine a diverse set of scientific disciplines, from information retrieval and machine learning to text- and data-mining, to create new algorithms and data models, for crawling, indexing, query and content understanding, ranking, and presenting results. Our scientists work with the engineering and product groups, and deliver innovation into Yahoo search impacting millions of users across the world, thousands of times every second.
Disciplines & Areas of Expertise
Scientific Disciplines include Information Retrival, Machine Learning, Data & Text Mining and Natural Language Processing. Learn More
Areas of Expertise include Ranking, Classification, Information Extraction and Summarization. Learn moreTeam
- Aitao Chen
- Alex Cozzi
- A. Rochette
- A. Popescul
- Alpa Jain
- A. Jagdish Agarwal
- Amit Madan
- Ana-Maria Popescu
- Anlei Dong
- Anne Zhang
- A. Raghuveer
- Ashwin Tengli
- Belle Tseng
- Ben Shahshahani
- Benoit Dumoulin
- Bill Pentney
- Bo Long
- Byron Dom
- Changsung Kang
- Charu Tiwari
- Chi-Hoon Lee
- Christina Yip
- Ciya Liao
- Daniel Boies
- Daniel Dechelotte
- Danny Levinson
- D. Farrar
- Deepa Paranjpe
- Dun Liu
- Emre Velipasaoglu
- Eric Crestan
- Fan Li
- Fernando Diaz
- Flavian Vasile
- Fuchun Peng
- George Mills
- Georges Dupret
- Gilad Mishne
- Gilbert Leung
- Hang Cui
- Harsha Khattri
- Hugues Bouchard
- Huihsin Tseng
- James Shaw
- Jan Pfeifer
- J. Crespo
- J. Paiement
- J. Langlois
- Jerry Ye
- Jiang Chen
- Jianzhang He
- Jing Bai
- John Blackmer
- Jon Degenhardt
- Jyh-Herng Chow
- Karolina Buchner
- Kimberly Farrell
- K. Tsioutsiouliklis
- Krishnan S. Kumar
- Lan Nie
- Larry Lai
- Lei Duan
- Longbin Chen
- M. Pennacchiotti
- M. Kshirsagar
- Mike Ching
- Mingrui Wu
- Nadia Ghamrawi
- N. Sadagopan
- Nicolas Stroppa
- Nicolas Torzec
- Pankaj Gulhane
- Patrick Pantel
- Pavel Dmitriev
- Pranam Kolari
- Priyanka Garg
- Rao Shen
- Remi Kwan
- Richard Kasperski
- Romain Vinot
- Rosie Jones
- Ruiqiang Zhang
- Rupesh Mehta
- S.R. Jeyashankher
- S. Bhuramal Satpal
- Scott Gaffney
- Seokkyung Chung
- Shailesh Kumar
- Shihao Ji
- Siva Gurumurthy
- Soo-Min Pantel
- Srihari Reddy
- Srinivas Vadrevu
- S. H. Sengamedu
- Su Chan
- Su-Lin Wu
- Subhajit Sanyal
- S. Lamkhede
- Suju Rajan
- S. Sellamanickam
- Taesup Moon
- Tanuja Bompada
- Tina Liu
- U. Kamlakar Sawant
- Umut Ozertem
- Xavier Dupre
- Xiangyu Jin
- Xiaofeng He
- Xin Li
- Xing Wei
- Yi Chang
- Yiping Zhou
- Yoshiyuki Inagaki
- Youssef Billawala
- Yumao Lu
- Zhaohui Zheng
- Zhenzhen Kou
- Zhigang Hua
- Ziming Zhuang
Publications
Joint categorization of queries and clips for web-based video search,
, Multimedia Information Retrieval 2006, (2006)
Internet-scale collection of human-reviewed data,
, WWW 2007, (2007)
Incorporating query difference for learning retrieval functions in world wide web search,
, CIKM 2006 , (2006)
A Regression Framework for Learning ranking functions using relative relevance judgments,
, SIGIR 2007, (2007)
A General Boosting Method and its Application to Learning Ranking Functions for Web Search,
, NIPS 2008, (2008)
Query-Level Learning to Rank Using Isotonic Regression,
, Proceedings of the 46th Annual Allerton Conference on Communication, Control and Computing 2008, (2008)
Enhancing Topical Ranking with Preferences from Click-Through Data,
, SIGIR 2009 poster , (2009)
Search Engine Adaptation by Feedback Control Adjustment for Time-sensitive Query,
, NAACL-HLT 2009, (2009)
Web Search Engine Metrics: Direct Metrics to Measure User Satisfaction,
, WWW (2009), Madrid, Spain, (2009)
Search result reranking by feedback control adjustment for time-sensitive query,
, HLT-NAACL 2009, (2009)
Document Preprocessing For Naive Bayes Classification and Clustering with Mixture of Multinomials,
, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining (KDD-2004), (2004)
A structure-sensitive framework for text categorization,
, Proceedings of the 14th International Conference on Information and Knowledge Management (CIKM-2005), (2005)
Linear prediction models with graph regularization for web-page categorization,
, The Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2006), (2006)
Yahoo! Answers: Applications of Machine Learning in Social Search,
, Proceedings of the ACM SIGIR 2007 Industry Event, (2007)
A Bayesian Technique for Estimating the Credibility of Question Answerers,
, Proceedings of the 2008 SIAM Conference on Data Mining (SDM08), (2008)
Global ranking by exploiting user clicks,
, In Proceedings of The 32nd Annual ACM SIGIR Conference, 07/2009, Boston, MA, (2009)
Threshold Selection for Web-Page Classification with Highly Skewed Class Distribution,
, Proceedings of the 18th International World Wide Web Conference (WWW 2009), 04/2009, Madrid, Spain, (2009)
Context sensitive stemming for web search,
, SIGIR, (2007)
Comparing Both Relevance and Robustness in Selection of Web Ranking Functions,
, SIGIR Poster, (2009)
Web search engine evaluation using clickthrough data and a user model,
, In WWW2007 workshop Query Log Analysis: Social and Technological Challenges, 2007, (2007)
A study of mobile search queries in japan,
, In WWW2007 workshop Query Log Analysis: Social and Technological Challenges, 2007, (2007)
Enhancing educational-material retrieval using authored lesson metadata,
, SPIRE 2007, (2007)
A user browsing model to predict search engine click data from past observations,
, In ACM Press, editor, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, (2008)
Threshold Selection for Web Page Classification with Highly Skewed Class Distribution,
, WWW, (2009)
An extension of precision-recall with user modelling (PRUM): Application to XML retrieval,
, Transactions on Information Systems (TOIS), (2007)
